- Accuracy
The degree of closeness an estimate has to the true value.
- acquiescence
Consistent and stable tendencies in response behavior which are not explainable by question content or presentation. These are considered to be a source of biased reporting. For example, extreme response style is the tendency to select the two extreme endpoints of a scale, midpoint response style refers to the consistent selection of the middle or neutral category of the scale, and acquiescent response style is the tendency to agree with or select the positive responses.
- acquiescent response style
Consistent and stable tendencies in response behavior which are not explainable by question content or presentation. These are considered to be a source of biased reporting. For example, extreme response style is the tendency to select the two extreme endpoints of a scale, midpoint response style refers to the consistent selection of the middle or neutral category of the scale, and acquiescent response style is the tendency to agree with or select the positive responses.
- acquiescent response styles
Consistent and stable tendencies in response behavior which are not explainable by question content or presentation. These are considered to be a source of biased reporting. For example, extreme response style is the tendency to select the two extreme endpoints of a scale, midpoint response style refers to the consistent selection of the middle or neutral category of the scale, and acquiescent response style is the tendency to agree with or select the positive responses.
- Adaptation
Changing existing materials (e.g., management plans,
contracts, training manuals, questionnaires, etc.) by deliberately altering some content or design component to make the resulting materials more suitable for another sociocultural context or a particular population. In the context of questionnaire translation, the distinction between translation and
adaptation is difficult to define, because almost every translation includes adaptation to a certain degree. Therefore, sometimes both terms are used in combination ('translation and adaptation') to refer to the process of making a questionnaire fit for use in another language and culture. For a discussion of adaptation and its different forms, see
Behr & Shishido (2016). See also
the Adaptation chapter for more information.
- Adaptive behavior
Interviewer behavior that is
tailored to the actual situation encountered.
- Adjudication
The translation evaluation step at which a translation is signed off and released for whatever follows next, such as
pretesting or final fielding (the 'A' in the TRAPD method; see
Translation). When all review and refinement procedures are completed, including any revisions after pretesting and copyediting, a final signing off/
adjudication is required. Thus, in any translation effort, there will be one or more additional sign-off steps ("ready to go to external assessment," "ready to go to client," and "ready to go to fielding agency," for example).
- Adjudicator
The person who signs off on a finalized version of a questionnaire ('adjudication').
- Adjustment error
Survey error (
variance and
bias) occurring due to post-data collection statistical adjustment.
- Advance translation
A translation is made from a
source questionnaire in order to find problems in the source text that only become apparent when translation is attempted. These insights are then used to modify the source questionnaire or plan for adaptation. We recommend carrying out the
advance translation using the team approach so as to receive input
comparable to that expected during the final translation phase. Comments made in the course of the advance translation typically concern both linguistic- and translation-related issues, as well as intercultural issues (see
Dorer (2011)).
- Anchoring vignette
A technique used to adjust for noncomparability in self-assessment questions caused by differences in response scale usage across groups. It relies on a set of (usually brief) descriptions of hypothetical people and situations to which self-assessment is calibrated (
King, Murray, Salomon, & Tandon, 2004).
- Annotation
Information appended to text in the source questionnaire to help clarify the intended meaning of a source text concept, phrase, or term. See
Translation: Overview, Appendices A,
B,
C, and
D for further detail and examples of the use of
annotations. Note that 'annotations' are sometimes referred to as 'footnotes.'
- Anonymity
Recording or storing information without a name or identifier, so that the respondent cannot be identified in any way by anyone. No one can link an individual person to their responses, including the investigator or interviewer. Face-to-face interviews are never anonymous, since the interviewer knows the address (and most likely the name) of the respondent. See also '
anonymization' and '
confidentiality.'
- Anonymization
Stripping all information from a survey data file that would allow for the re-identification of respondents (see '
anonymity' and 'confidentiality').
- ASCII file
Data files in the American Standard Code for Information Interchange (
ASCII) format, which is the most common format for text files on the Internet.
- Ask different questions (ADQ)
An approach to question design wherein researchers collect data across populations or countries based on using the most salient population-specific questions on a given construct/research topic. The questions and indicators used in each location are assumed (or better, have been shown) to tap a construct that is germane or shared across populations.
- Ask the same questions and translate (ASQT)
An approach to question design wherein researchers collect data across populations/countries by asking a shared set of questions. The most common way to do this is by developing a source questionnaire in one language and then producing whatever other language versions are needed on the basis of translation or translation and adaptation.
Decentering is a second way to 'ask the same questions' (ASQ), but this procedure is differently organized.
- Attitudinal question
A question asking about respondents' opinions, judgments, emotions, or perceptions. These cannot be measured by other means; we are entirely dependent on respondents' answers. Example: "Do you think smoking cigarettes is bad for the smoker's health?"
- Audio computer-assisted self-interview (A-CASI)
A
mode in which the respondent uses a computer that plays audio recordings of the questions to the respondent, who then enters their answers. The computer may or may not also display the questions on the screen.
- Audit trail
An electronic file in which computer-assisted and Web survey software captures
paradata about survey questions and user actions, including times spent on different questions and in different sections of a survey (
timestamps) and interviewer or respondent actions while proceeding through a survey. The file may contain a record of
keystrokes and function keys pressed, as well as
mouse clicks and actions.
- Auxiliary data
Data from an external source, such as census data, that is incorporated or linked in some way to the data collected by the study.
Auxiliary data is sometimes used to supplement collected data, to create weights, or in
imputation techniques.
- Base weight
The inverse of the probability of selection.
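In symbols, if $\pi_i$ is the probability of selection of element $i$, its base weight is

$$w_i = \frac{1}{\pi_i}.$$

For example, an element selected with probability 1/500 receives a base weight of 500.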
- Behavior code
Behavior codes are information about the interviewer's and respondent's verbal behaviors during the question-and-answer process of a survey interview. They are developed and recorded by human coders, not automatically
coded by computers. To obtain behavior codes ('
behavior coding'), interviews are audio recorded (generally digitally today, but cassette tapes have been used in the past), transcribed, and then coded by a set of at least two coders to identify relevant behaviors (
Olson & Parkhurst, 2013).
- Behavior coding
Systematic coding of the interviewer-respondent interaction in order to identify problems and sometimes to estimate the frequency of behaviors that occur during the question-answer process (see behavior codes).
- Behavioral question
A question asking respondents to report behaviors or actions. Example: "Have you ever smoked cigarettes?"
- Bias
The systematic difference over all conceptual trials between the expected value of the survey estimate of a population parameter and the true value of that parameter in the
target population.
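In symbols, for a survey estimator $\hat{\theta}$ of the population parameter $\theta$,

$$\mathrm{Bias}(\hat{\theta}) = E(\hat{\theta}) - \theta,$$

where $E(\hat{\theta})$ is the expected value of the estimator over all conceptual trials.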
- Bid
A complete proposal (submitted in competition with other bidders) to execute specified jobs within the prescribed time and budget, and not exceeding a proposed amount.
- Bilingual glossary
A glossary is a list of words or phrases used in a particular field alongside their definitions, and can often be found at the back of a specialist or academic book as an appendix to the text. A
bilingual glossary lists special terms used in a particular field in two languages. A key notion or concept present in one language for a given field may not have a ready single match in a given other language.
- Bottom-coding
A type of coding in which values that fall below the predetermined minimum value are reassigned to that minimum value or are recoded as missing data.
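A minimal sketch of the recode-to-the-floor variant, using a hypothetical floor value of 500:

```python
# Illustrative bottom-coding: values below a hypothetical floor (500)
# are reassigned to the floor itself.
def bottom_code(values, floor):
    return [max(v, floor) for v in values]

print(bottom_code([250, 800, 1200, 300], 500))  # [500, 800, 1200, 500]
```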
- Bridge language
A language, common to both interviewers and respondents, that is used for data collection but may not be the first language of either party.
- Cause-and-effect diagram
A fishbone-structured diagram for a process, used as a brainstorming tool to help understand or improve the process. The main bone represents the process (e.g., interviewer training), and bones coming off of the main bone are pre-identified factors (e.g., training materials) that may affect the
quality of the process. From there, potential causes (e.g., lack of resources and time) and effects (e.g., poor quality materials) can be discussed and solutions identified. Also known as a 'fishbone' or 'Ishikawa' diagram.
- Certification
Objective assessment of performance. Based on pre-established criteria, the interviewer either meets the requirements and may proceed to conduct the study interview, or does not meet the requirements and may either be permitted to try again or be dismissed from the study.
Certification outcomes should be documented and filed at the data collection agency.
- Closed-ended question
A survey question format that provides a limited set of predefined answer categories from which respondents must choose. Example: "Do you smoke?"
- Cluster
A grouping of units on the
sampling frame that is similar on one or more variables, typically geographic. For example, an interviewer for an in-person study will typically only visit households in a certain geographic area, with the geographic area being the '
cluster.'
- Cluster sampling
A sampling procedure wherein units of the sampling frame that are similar on one or more variables (typically geographic) are organized into larger groups ('clusters'), and a sample of groups is selected. The selected groups contain the units to be included in the sample. The sample may include all units in the selected clusters or a sub-sample of units in each selected cluster. The ultimate purpose of this procedure is to reduce interviewer travel costs for in-person studies by producing distinct groups of elements where the elements within each group are geographically close to one another.
- CMS
The software and procedures used to
capture, save, organize, and distribute information in digitalized form.
- code frame
A list of descriptions of variable categories and associated code numbers. Also referred to as '
code frame,' 'coding frame,' or 'codes.'
- code frames
A list of descriptions of variable categories and associated code numbers. Also referred to as 'code frame,' 'coding frame,' or 'codes.'
- Code structure
A list of descriptions of variable categories and associated code numbers. Also referred to as 'code frame,' 'coding frame,' or 'codes.'
- Codebook
A document that provides question-level
metadata that is matched to variables in a dataset. Metadata include the elements of a
data dictionary as well as basic study documentation, question text,
universe statements (the characteristics of respondents who were asked the question), the number of respondents who answered the question, and response frequencies or statistics.
- Coding
The translation of nonnumeric data into numeric fields.
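As a minimal sketch, coding a "yes/no" item might map answer strings to numeric codes drawn from a hypothetical code frame:

```python
# Hypothetical code frame mapping verbal responses to numeric codes.
code_frame = {"yes": 1, "no": 2, "don't know": 8, "refused": 9}
responses = ["yes", "no", "don't know"]
print([code_frame[r] for r in responses])  # [1, 2, 8]
```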
- coding frame
A list of descriptions of variable categories and associated code numbers. Also referred to as 'code frame,' 'coding frame,' or 'codes.'
- coding frames
A list of descriptions of variable categories and associated code numbers. Also referred to as 'code frame,' 'coding frame,' or 'codes.'
- Coefficient of variation
A measure of dispersion of a probability distribution or frequency distribution that describes the amount of variability relative to the mean.
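In symbols, for a distribution with standard deviation $\sigma$ and mean $\mu$,

$$CV = \frac{\sigma}{\mu}.$$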
- Cognitive interviewing (CI)
A pretesting method designed to uncover problems in survey items by having respondents think out loud while (or after) answering a question.
- Cohen's kappa
A statistical measure of agreement between coders that corrects for the degree of agreement expected by chance.
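In symbols, if $p_o$ is the observed proportion of agreement between coders and $p_e$ the proportion of agreement expected by chance,

$$\kappa = \frac{p_o - p_e}{1 - p_e}.$$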
- Comparability
The extent to which differences between survey statistics from different countries, regions, cultures, domains, time periods, etc., can be attributed to differences in population true values (Mohler & Johnson, 2010); in other words, whether the concepts are comparable or not. It is often referred to as 'equivalence,' 'functional equivalence,' 'similarity,' or some other frame of reference (Mohler & Johnson, 2010). Improving comparability implies that error due to translation has to be minimized. In terms of questionnaire translation for 3MC surveys, the aim is to achieve the defined statistical level of comparability across all local versions (expressed in minimized translation error).
- Comparativist
A person who carries out comparative studies, especially a student of comparative literature or comparative linguistics.
- Complex survey data/designs
Survey datasets or designs based on
stratified single-stage or multistage samples with
survey weights designed to compensate for unequal probabilities of selection or
nonresponse.
- Computer-assisted personal interview (CAPI)
A face-to-face interviewing mode in which a computer displays the questions onscreen and the interviewer reads them to the respondent and enters the respondent's answers directly into the computer.
- Computer-assisted recorded interview (CARI)
A system for audio recording of interviews (or parts of interviews) that allows for monitoring interviewer performance in the field/call center and detection of data fraud.
- Computer-assisted self-interview (CASI)
A mode in which a computer displays the questions on a screen to the respondent and the respondent then enters their answers into the computer.
- Computer-assisted telephone interview (CATI)
A telephone interviewing mode in which a computer displays the questions on a screen and the interviewer reads them to the respondent over the phone and enters the respondent's answers directly into the computer.
- Concurrent mixed-mode
A mixed-mode design in which one group of respondents completes the survey in one mode while another group completes it in a different mode during the same data collection period.
- Confidentiality
Securing the identity of, as well as any information provided by, the respondent in order to ensure that public identification of an individual participating in the study and/or their individual responses does not occur. See also 'anonymity' and 'anonymization.'
- Consistency
Consistency is achieved when the same term or phrase is used throughout a translation to refer to an object or entity that was referred to with one term or phrase in the source text. In many cases, consistency is most important with regard to technical terminology or to standard repeated components of a questionnaire. All references to a "show card" in a source questionnaire should be consistently translated, for example. The translation of instructions which are repeated in the source text should also be repeated (and not varied) in the target text.
- Construct validity
The degree to which a survey question adequately measures an intended hypothetical construct. This may be assessed by checking the correlation between observations from that question with observations from other questions expected on theoretical grounds to be related.
- Constructed variable
A recoded variable created by data producers or archives based on the data originally collected. Examples include age grouped into cohorts, income grouped into 7 categories, Goldthorpe-Index, or the creation of a variable called 'POVERTY' from information collected on the income of respondents.
- Contact attempt record
A written record of the time and outcome of each contact attempt to a
sample unit.
- Contact rate
The proportion of all elements in which some responsible member of the housing unit was reached by the survey.
- content management system
The software and procedures used to capture, save, organize, and distribute information in digitalized form.
- content management system (CMS)
The software and procedures used to capture, save, organize, and distribute information in digitalized form.
- content management systems
The software and procedures used to capture, save, organize, and distribute information in digitalized form.
- content management systems (CMS)
The software and procedures used to capture, save, organize, and distribute information in digitalized form.
- Content management
The software and procedures used to capture, save, organize, and distribute information in digitalized form.
- Context effect
The effect of question context, such as the order or layout of questions, on survey responses.
- Contract
A legally binding exchange of promises, or an agreement creating and defining the obligations between two or more parties (for example, a survey organization and the
coordinating center) written and enforceable by law.
- Convenience sample
A sample of elements that are selected because it is convenient to use them, not because they are representative of the target population.
- Conversational interviewing
An interviewing style in which interviewers read questions as they are worded but are allowed to use their own words to clarify the meaning of the questions.
- Conversion process
Data processing procedures used to create harmonized variables from original input variables.
- Cooperation rate
The proportion of all elements interviewed over all eligible units ever contacted.
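For example (hypothetical figures), 600 completed interviews out of 800 eligible units ever contacted give a cooperation rate of 0.75. Standard outcome-rate definitions, such as AAPOR's, distinguish several variants of this rate.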
- Coordinating center
A research center that facilitates and organizes cross-cultural or multi-site research activities.
- Copyeditor
The person who reviews a text and marks up any changes required to correct stylistic, punctuation, spelling, and grammatical errors. In many instances, the
copyeditor may also make the corrections needed.
- Coverage
The proportion of the target population that is accounted for on the sampling frame.
- Coverage bias
The systematic difference between the expected value (over all conceptual trials) of a statistic and the target population value that occurs because some elements in the target population do not appear on the sampling frame.
- Coverage error
Survey error (variance and bias) that is introduced when there is not a one-to-one correspondence between frame and target population units. Causes include when some units in the target population are not included on the sampling frame (undercoverage), when some units on the sampling frame are not members of the target population (out-of-scope), when more than one unit on the sampling frame corresponds to the same target population unit (overcoverage), and when one sampling frame unit corresponds to more than one target population unit.
- Coverage rate
The number of elements on the sampling frame divided by the estimated number of elements in the target population.
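For example (hypothetical figures), a frame listing 9,500 addresses for a target population estimated at 10,000 households has a coverage rate of 0.95.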
- Coversheet
Electronic or printed materials associated with each element that identify information about the element, e.g., the sample address, the
unique identification number associated with an element, and the interviewer to whom an element is assigned. The
coversheet often also contains an introduction to the study, instructions on how to screen sample members and randomly select the respondent, and space to record the date, time, outcome, and notes for every contact attempt.
- Crosswalk
A description, usually presented in tabular format, of all the relationships between variables in individual data files and their counterparts in the harmonized file.
- Cultural schema
A conceptual structure, shared by members of a cultural group and created from common experiences, by which objects and events can be identified and understood.
- data documentation initiative
An international effort to establish a standard for technical documentation describing social science data. The membership-based
DDI Alliance is developing the specification, which is written in
XML.
- Data capture
The process of converting data (from questionnaires, audio/visual recordings, samples, etc.) to an electronic file.
- Data dictionary
A document linking the survey instrument (questionnaire) to the dataset, or to more abstract question- or variable-level metadata, including question identifiers (variable names and labels), response category identifiers (value labels), and data types (e.g., F2.0, specifying that the response is a two-digit integer with zero decimal places).
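As a minimal illustration, one hypothetical data-dictionary entry might be expressed like this:

```python
# Hypothetical data-dictionary entry for a single variable.
entry = {
    "variable": "Q12_AGE",                       # variable name
    "label": "Respondent age at last birthday",  # variable label
    "type": "F2.0",                              # two-digit integer, no decimals
    "values": {98: "Don't know", 99: "Refused"}, # value labels for special codes
}
```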
- Data Documentation Initiative (DDI)
An international effort to establish a standard for technical documentation describing social science data. The membership-based DDI Alliance is developing the specification, which is written in XML.
- DDI
An international effort to establish a standard for technical documentation describing social science data. The membership-based DDI Alliance is developing the specification, which is written in XML.
- De-identification
- Decentering
An approach to designing questions in two languages in which neither of the languages or cultures involved is allowed to dominate. A 'ping-pong'-like process of formulation and comparison between the two languages is used to develop versions in each language. Any language or cultural obstacles met with are resolved, often by removing or changing wording in one or both languages. The question formulation in both languages then moves on from that modification. Since the process removes culture-specific elements from both versions, decentered questions may be vague and not especially salient for either target population.
- Design effect
The effect of the
complex survey design on
sampling variance, measured as the ratio of the sampling variance under the complex design to the sampling variance that a simple random sample of the same size would yield.
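In symbols, for an estimator $\hat{\theta}$,

$$\mathit{deff} = \frac{\mathrm{Var}_{\text{complex}}(\hat{\theta})}{\mathrm{Var}_{\text{SRS}}(\hat{\theta})}.$$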
- Differential item functioning (DIF)
Item bias occurring as a result of systematic differences in responses across cultures due to features of the item or measure itself, such as poor translation or ambiguous wording.
- Diglossic linguistic context
Diglossic linguistic contexts exist in single-language communities that use two or more markedly different varieties of a language, or two different languages, in different contexts. The variety used may be determined by whether the language is written or spoken in a given instance, or by the relationships between participants in a discourse. Considerations such as age, gender, social status, and the topic under discussion may all contribute to the form chosen in any given instance.
- Direct cost
An expense that can be traced directly to (or identified with) a specific cost center or is directly attributable to a cost object, such as a department, process, or product.
- Disclosure analysis/avoidance
The process of identifying and protecting the confidentiality of data. It involves limiting the amount of detailed information disseminated and/or masking data via noise addition, data swapping, generation of simulated or synthetic data, etc. For any proposed release of tabulations or
microdata, the level of risk of disclosure should be evaluated.
- Disposition code
A code that indicates the result of a specific contact attempt or the outcome assigned to a
sample element at the end of data collection (e.g.,
noncontact,
refusal, ineligible, complete interview).
- Document management system
A
document management system (DMS) is a computer program (or set of computer programs) used to track and store electronic documents and/or images of paper documents. The term has some overlap with the concept of a
content management system (CMS). DMSs are often viewed as a component of enterprise content management systems (ECM) and are related to digital asset management, document imaging, workflow systems, and records management systems.
- double translation
Each
translator translates all of the material to be translated, in contrast to
split translations. This refers to the first step in the TRAPD model, the 'T.' It is also referred to as 'double' or 'parallel' translation.
- double translations
Each translator translates all of the material to be translated, in contrast to split translations. This refers to the first step in the TRAPD model, the 'T.' It is also referred to as 'double' or 'parallel' translation.
- Double-barreled question
A survey question that inadvertently asks about two topics at once.
- dress rehearsal
A quantitative, miniature version of the survey data collection process, carried out before the actual data collection begins, that involves all procedures and materials that will be used during data collection. A pilot study is also known as a 'dress rehearsal.'
- dress rehearsals
A quantitative, miniature version of the survey data collection process, carried out before the actual data collection begins, that involves all procedures and materials that will be used during data collection. A pilot study is also known as a 'dress rehearsal.'
- Editing
Altering data recorded by the interviewer or respondent to improve the quality of the data (e.g., checking consistency, correcting mistakes, following up on suspicious values, deleting duplicates, etc.). Sometimes this also includes coding and imputation (the placement of a number into a field where data were missing).
- Eligibility rate
The number of eligible sample elements divided by the total number of elements on the sampling frame.
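For example (hypothetical figures), if 8,000 of the 10,000 elements on a sampling frame turn out to be eligible, the eligibility rate is 0.80.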
- Embedded experiment
An experiment included within the framework of an actual study, rather than in a study conducted solely for the purpose of conducting said experiment (a '
standalone experiment').
- equivalence
The extent to which differences between survey statistics from different countries, regions, cultures, domains, time periods, etc., can be attributed to differences in population true values (Mohler & Johnson, 2010); in other words, whether the concepts are comparable or not. It is often referred to as 'equivalence,' 'functional equivalence,' 'similarity,' or some other frame of reference (Mohler & Johnson, 2010). Improving comparability implies that error due to translation has to be minimized. In terms of questionnaire translation for 3MC surveys, the aim is to achieve the defined statistical level of comparability across all local versions (expressed in minimized translation error).
- ethics review board
A group or committee that is given the responsibility by an institution to review that institution's research projects involving human subjects. The primary purpose of the review is to assure the protection of the safety, rights and welfare of the human subjects.
- ethics review boards
A group or committee that is given the responsibility by an institution to review that institution's research projects involving human subjects. The primary purpose of the review is to assure the protection of the safety, rights and welfare of the human subjects.
- ethics review committee
A group or committee that is given the responsibility by an institution to review that institution's research projects involving human subjects. The primary purpose of the review is to assure the protection of the safety, rights and welfare of the human subjects.
- ethics review committees
A group or committee that is given the responsibility by an institution to review that institution's research projects involving human subjects. The primary purpose of the review is to assure the protection of the safety, rights and welfare of the human subjects.
- Event history calendar
A
conversational interviewing technique designed to improve recollection of complex sequences of personal events by using respondents' own past experiences as memory cues. This technique is sometimes also referred to as a '
life history calendar.'
- Ex-ante
The process of creating harmonized variables at the outset of data collection, based on using the same questionnaire or agreed-upon definitions in the harmonization process.
- Ex-post
The process of creating harmonized variables from data that already exists.
- Expectation question
A question asking about the respondent's expectation about the chances or probabilities that certain things will happen in the future.
- extreme response style
Consistent and stable tendencies in response behavior which are not explainable by question content or presentation. These are considered to be a source of biased reporting. For example, extreme response style is the tendency to select the two extreme endpoints of a scale, midpoint response style refers to the consistent selection of the middle or neutral category of the scale, and acquiescent response style is the tendency to agree with or select the positive responses.
- extreme response styles
Consistent and stable tendencies in response behavior which are not explainable by question content or presentation. These are considered to be a source of biased reporting. For example, extreme response style is the tendency to select the two extreme endpoints of a scale, midpoint response style refers to the consistent selection of the middle or neutral category of the scale, and acquiescent response style is the tendency to agree with or select the positive responses.
- Eye tracking
Eye tracking records the time, duration, and location of the eyes' fixations and the saccades between fixations. See
Li (2011).
- Fact sheet
A sheet, pamphlet, or brochure that provides important information about the study to assist respondents in making an informed decision about participation. Elements of a
fact sheet may include the following: the purpose of the study, its sponsorship, uses of the data, the role of the respondent, the sample selection procedures, any benefits and risks of participation, and confidentiality.
- Factual judgment question
A question that requires respondents to remember autobiographical events and use that information to make judgments. In principle, such information could also be obtained by other means of observation, such as comparing survey data with administrative records (if such records exist). Factual judgment questions can be about a variety of things, such as figure-based facts (e.g., date, age, weight), events (e.g., pregnancy, marriage), and behaviors (e.g., smoking, media consumption).
- Factual question
A question that aims to collect information about things for which there is a correct answer. In principle, such information could also be obtained by other means of observation, such as comparing survey data with administrative records (if such records exist).
Factual questions can be about a variety of things, such as figure-based facts (e.g., date, age, weight), events (e.g., pregnancy, marriage), and behaviors (e.g., smoking, media consumption).
- fishbone diagram
A fishbone-structured diagram for a process, used as a brainstorming tool to help understand or improve the process. The main bone represents the process (e.g., interviewer training), and bones coming off of the main bone are pre-identified factors (e.g., training materials) that may affect the quality of the process. From there, potential causes (e.g., lack of resources and time) and effects (e.g., poor quality materials) can be discussed and solutions identified. Also known as a 'fishbone' or 'Ishikawa' diagram.
- fishbone diagrams
A fishbone-structured diagram for a process, used as a brainstorming tool to help understand or improve the process. The main bone represents the process (e.g., interviewer training), and bones coming off of the main bone are pre-identified factors (e.g., training materials) that may affect the quality of the process. From there, potential causes (e.g., lack of resources and time) and effects (e.g., poor quality materials) can be discussed and solutions identified. Also known as a 'fishbone' or 'Ishikawa' diagram.
- fitness for purpose
The degree to which products conform to essential requirements and meet the needs of the users for which they are intended. In literature on quality, this is also known as '
fitness for use' or 'fitness for purpose.'
- fitness for use
The degree to which products conform to essential requirements and meet the needs of the users for which they are intended. In literature on quality, this is also known as 'fitness for use' or 'fitness for purpose.'
- Fitness for intended use
The degree to which products conform to essential requirements and meet the needs of the users for which they are intended. In literature on quality, this is also known as 'fitness for use' or 'fitness for purpose.'
- Fixed panel design
A
longitudinal study which attempts to collect survey data on the same sample elements at intervals over a period of time. After the initial sample selection, no additions to the sample are made.
- Fixed panel plus births
A longitudinal study in which a panel of individuals is interviewed at intervals over a period of time and elements that enter the target population after the initial selection (e.g., 'births') are added to the sample.
- Flowchart
A method used to identify the steps or events in a process. It uses basic shapes for starting and ending the process, taking an action, making a decision, and producing data and documentation. These are connected by arrows indicating the flow of the process. A
flowchart can help identify points at which to perform
quality assurance activities and produce indicators of quality that can be used in
quality control.
- Focus group
Small group discussions carried out under the guidance of a moderator which can be used to explore and develop topics and items for questions. They are often used in qualitative research to test survey questionnaires and survey protocols.
- footnote
Information appended to text in the source questionnaire to help clarify the intended meaning of a source text concept, phrase, or term. See
Translation: Overview, Appendices A,
B,
C, and
D for further detail and examples of the use of annotations. Note that 'annotations' are sometimes referred to as 'footnotes.'
- footnotes
Information appended to text in the source questionnaire to help clarify the intended meaning of a source text concept, phrase, or term. See
Translation: Overview, Appendices A,
B,
C, and
D for further detail and examples of the use of annotations. Note that 'annotations' are sometimes referred to as 'footnotes.'
- frequency scale
A response format requiring the respondent to select the option that best describes the frequency with which certain behaviors occur.
- frequency scales
A response format requiring the respondent to select the option that best describes the frequency with which certain behaviors occur.
- Frequency format
A response format requiring the respondent to select the option that best describes the frequency with which certain behaviors occur.
- Full translation
Each translator translates all of the material to be translated, in contrast to split translations. This refers to the first step in the TRAPD model, the 'T.' It is also referred to as 'double' or 'parallel' translation.
- functional equivalence
The extent to which differences between survey statistics from different countries, regions, cultures, domains, time periods, etc., can be attributed to differences in population true values (Mohler & Johnson, 2010); in other words, whether the concepts are comparable or not. It is often referred to as 'equivalence,' 'functional equivalence,' 'similarity,' or some other frame of reference (Mohler & Johnson, 2010). Improving comparability implies that error due to translation has to be minimized. In terms of questionnaire translation for 3MC surveys, the aim is to achieve the defined statistical level of comparability across all local versions (expressed in minimized translation error).
- Functional equivalence
Measures the degree to which a hypothetical construct serves similar functions within each society or cultural group. See
Johnson (1998) for more information.
- Hadamard matrix
- Hard consistency check
A signal warning that there is an inconsistency between the current response and a previous response; the interviewer or respondent cannot continue until the inconsistency is resolved (as opposed to a
soft consistency check).
- Hours per interview (HPI)
A measure of study efficiency, calculated as the total number of interviewer hours spent during production (including travel, reluctance handling,
listing, completing an interview, and other administrative tasks) divided by the total number of interviews.
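For example (hypothetical figures), 2,500 total interviewer hours spent to complete 500 interviews give an HPI of 5.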
- Human subjects review board
A group or committee that is given the responsibility by an institution to review that institution's research projects involving human subjects. The primary purpose of the review is to assure the protection of the safety, rights and welfare of the human subjects.
- Imputation
A computation method that, using some protocol, assigns one or more replacement answers for each missing, incomplete, or implausible data item.
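A minimal sketch of one very simple protocol, mean imputation (real studies typically use richer methods such as regression or multiple imputation):

```python
# Replace missing items (None) with the mean of the observed values.
def mean_impute(values):
    observed = [v for v in values if v is not None]
    mean = sum(observed) / len(observed)
    return [v if v is not None else mean for v in values]

print(mean_impute([4, None, 6, 8]))  # [4, 6.0, 6, 8]
```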
- Imputation variance
The component of overall variability in survey estimates that can be accounted for by imputation.
- Inconsistent responses
Inappropriate responses to branched questions. For instance, one question might ask if the respondent attended church last week; a response of 'no' should skip the questions about church attendance and code the answers to those questions as 'inapplicable.' If those questions were coded any other way than 'inapplicable,' this would be inconsistent with the skip patterns of the survey instrument.
- Indirect cost
An expense that is incurred in joint usage and is difficult to assign to or is not directly attributable to a specific department, process, or product.
- Informant
The person who supplies a list of the eligible elements within the selected unit. For example, many in-person surveys select a sample of housing units at the penultimate stage of selection. Interviewers then contact the housing unit with the aim of convincing the member of the housing unit who responded to the contact attempt to provide a list of housing unit members who are eligible for the study. The housing unit member who provides this list of all eligible housing unit members is the
informant. Informants can also be selected respondents, if they are eligible for the study and are chosen as the respondent during the within-household stage of selection.
- Informed consent
A process by which a sample member voluntarily confirms their willingness to participate in a study after having been informed of all aspects of the study that are relevant to the decision to participate.
Informed consent can be obtained orally or via a written consent form (or implied, if the respondent returns a mail survey), depending on the study protocol. In some cases, consent must be given by someone other than the respondent (e.g., an adult when interviewing children).
- Intention question
A question asking respondents to indicate their intention regarding some behavior.
- Interactive voice response (IVR)
A telephone interviewing method in which respondents listen to recordings of the questions and respond by using the telephone keypad or saying their answers aloud.
- Interface design
Aspects of computer-assisted survey design focused on the interviewer's or respondent's experience and interaction with the computer and instrument.
- Interpenetrated sample assignment
Randomized assignment of interviewers to subsamples of respondents in order to measure correlated response variance, arising from the fact that
response errors of persons interviewed by the same interviewer may be correlated.
Interpenetration allows researchers to disentangle the effects interviewers have on respondents from the true differences between respondents.
- interpenetration
Randomized assignment of interviewers to subsamples of respondents in order to measure correlated response variance, arising from the fact that response errors of persons interviewed by the same interviewer may be correlated. Interpenetration allows researchers to disentangle the effects interviewers have on respondents from the true differences between respondents.
- Interviewer design effect (deffint)
The extent to which
interviewer variance increases the variance of the sample mean of a simple random sample. Written as '
deffint' in equations.
- Interviewer effect
Measurement error, both systematic and variable, for which interviewers are responsible.
- Interviewer falsification
Intentionally departing from the designed interviewer guidelines that could result in the contamination of the data. Falsification includes: 1) fabricating all or part of an interview by recording data that were not provided by a designated survey respondent and reporting them as answers of that respondent; 2) deliberately misreporting
disposition codes and falsifying process data (e.g., recording a respondent refusal as ineligible for the sample or reporting a fictitious contact attempt); 3) deliberately miscoding the answer to a question in order to avoid follow-up questions; and 4) deliberately interviewing a nonsampled person in order to reduce effort required to complete an interview, or intentionally misrepresenting the data collection process to the survey management.
- Interviewer observations/evaluations
In interviewer-administered surveys, interviewers have long been asked to make general assessments about how engaged, cooperative, hostile, or attentive the respondent was during the interview. Additionally, interviewers record information about the interview-taking environment, such as whether other individuals were present or whether the respondent used headphones during an ACASI component. Unlike the previous sources of paradata, these
interviewer evaluations are questions asked directly of the interviewer, and are included as a few additional questions in the questionnaire (
Olson & Parkhurst, 2013).
- Interviewer variance
The component of overall variability in survey estimates that can be accounted for by the interviewers.
- Ishikawa diagram
A fishbone-structured diagram for a process, used as a brainstorming tool to help understand or improve the process. The main bone represents the process (e.g., interviewer training), and bones coming off of the main bone are pre-identified factors (e.g., training materials) that may affect the quality of the process. From there, potential causes (e.g., lack of resources and time) and effects (e.g., poor quality materials) can be discussed and solutions identified. Also known as a 'fishbone' or 'Ishikawa' diagram.
- Ishikawa diagrams
A fishbone-structured diagram for a process, used as a brainstorming tool to help understand or improve the process. The main bone represents the process (e.g., interviewer training), and bones coming off of the main bone are pre-identified factors (e.g., training materials) that may affect the quality of the process. From there, potential causes (e.g., lack of resources and time) and effects (e.g., poor quality materials) can be discussed and solutions identified. Also known as a 'fishbone' or 'Ishikawa' diagram.
- item missing
The lack of information on individual data items for a sample element where other data items were successfully obtained.
- item missing data
The lack of information on individual data items for a sample element where other data items were successfully obtained.
- item-missing
The lack of information on individual data items for a sample element where other data items were successfully obtained.
- item-missing data
The lack of information on individual data items for a sample element where other data items were successfully obtained.
- Item nonresponse
The lack of information on individual data items for a sample element where other data items were successfully obtained.
- Item response theory (IRT)
A theory that guides statistical techniques used to detect survey or test questions that have item bias or
differential item functioning (DIF).
IRT is based on the idea that the probability of a response an individual provides is a function of the person's traits and characteristics of the item.
- Keystroke
Keystroke files, sometimes called '
audit trails' or 'trace files,' are recorded when interviewers or respondents use specific keys during the survey. Keystroke files contain both response timing data and a record of the keystrokes pressed during the questionnaire administration (
Olson & Parkhurst, 2013).
- Latent class analysis (LCA)
Latent class analysis (LCA) is a subset of structural equation modeling, used to find groups or subtypes of cases in multivariate categorical data. These subtypes are called 'latent classes.'
- life history calendar
A conversational interviewing technique designed to improve recollection of complex sequences of personal events by using respondents' own past experiences as memory cues. This technique is sometimes also referred to as a 'life history calendar.'
- life history calendars
A conversational interviewing technique designed to improve recollection of complex sequences of personal events by using respondents' own past experiences as memory cues. This technique is sometimes also referred to as a 'life history calendar.'
- Listing
A procedure used in area probability sample designs to create a complete list of all elements or a cluster of elements within a specific set of geographic boundaries.
- Loaded question
A question that is worded such that it invites respondents to respond in a particular way.
- longitudinal survey
A study where elements are repeatedly measured over time.
- longitudinal surveys
A study where elements are repeatedly measured over time.
- Longitudinal study
A study where elements are repeatedly measured over time.
- Mean squared error (MSE)
The total error of a survey estimate; specifically, the sum of the variance and the bias squared.
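In symbols, for an estimator $\hat{\theta}$,

$$\mathrm{MSE}(\hat{\theta}) = \mathrm{Var}(\hat{\theta}) + \mathrm{Bias}(\hat{\theta})^2.$$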
- Measurement equivalence
Equivalence of the calibration system used in the questionnaire and the translation.
- Measurement error
Survey error (variance and bias) due to the measurement process; that is, error introduced by the survey instrument, the interviewer, or the respondent.
- Metadata
Information that describes data. The term encompasses a broad spectrum of information about the survey, from the study title to
sample design, details such as interviewer briefing notes, and contextual data and/or information such as legal regulations, customs, and economic indicators. Note that the term 'data' is used here in a technical definition. Typically, metadata are descriptive information, and data are the numerical values described.
- Microdata
Nonaggregated data that concern individual records for sampled units such as households, respondents, organizations, administrators, schools, classrooms, students, etc. Microdata may come from auxiliary sources (e.g., census or geographical data) as well as surveys. They are contrasted with macrodata, such as variable means and frequencies, gained through the aggregation of microdata.
- midpoint response style
Consistent and stable tendencies in response behavior which are not explainable by question content or presentation. These are considered to be a source of biased reporting. For example, extreme response style is the tendency to select the two extreme endpoints of a scale, midpoint response style refers to the consistent selection of the middle or neutral category of the scale, and acquiescent response style is the tendency to agree with or select the positive responses.
- midpoint response styles
Consistent and stable tendencies in response behavior which are not explainable by question content or presentation. These are considered to be a source of biased reporting. For example, extreme response style is the tendency to select the two extreme endpoints of a scale, midpoint response style refers to the consistent selection of the middle or neutral category of the scale, and acquiescent response style is the tendency to agree with or select the positive responses.
- Minority country
A country with high per capita income (the minority of countries).
- Mode
Method of data collection.
- Mouse click
Mouse click files record each action the respondent or interviewer takes using the computer's mouse, ranging from the presence or absence of simple single mouse clicks to the position of the mouse cursor at a specified time interval on an
x/
y coordinate of the survey page (
Olson & Parkhurst, 2013).
- Multitrait-multimethod (MTMM)
A technique that uses the correlations between multiple methods (i.e., modes) and multiple traits (i.e., variables) to assess the
validity of a measurement process.
- Non-interview
A sample element is selected, but an interview does not take place (for example, due to noncontact, refusal, or ineligibility).
- Noncontact
Sampling units that were potentially eligible but could not be reached.
- Nonresponse
- Nonresponse bias
The systematic difference between the expected value (over all conceptual trials) of a statistic and the target population value due to differences between respondents and nonrespondents on that statistic of interest.
- Nonresponse error
Survey error (variance and bias) that is introduced when not all sample members participate in the survey (unit nonresponse) or not all survey items are answered by a sample element (item nonresponse).
- Nonresponse followup
A supplemental survey of sampled survey nonrespondents.
Nonresponse followup surveys are designed to assess whether respondent data are biased due to differences between survey respondents and nonrespondents.
- Open-ended question
A survey question that allows the respondent to formulate an answer in their own words. Unlike a closed question format, it does not provide a limited set of predefined answers.
- Open tendering
A bidding process in which all the bidders are evaluated and then chosen on the basis of cost and technical merit.
- Outcome rate
A rate calculated based on the study's defined final disposition codes that reflect the outcome of specific contact attempts before the unit was finalized. Examples include
response rate,
cooperation rate, refusal rate, and
contact rate.
- Outlier
An atypical observation that does not appear to follow the distribution of the rest of a dataset.
- Overediting
Extensive
editing that becomes too costly for the amount of error that is being reduced.
- Overlap in split translations
A compromise solution between split and
full translations is to ensure that some overlap exists between materials divided among translators. The material is split up the way cards are dealt in many games, with everyone getting a spread of the material. Each translator could then receive the last one or two questions of another translator's 'piece.' This allows the review team members to have an increased sense of differences in translators' approaches to translation and understanding of source text components at the draft production level.
- Overrun
The exceeding of costs estimated in a contract.
- Paradata
Couper first introduced the term 'paradata' into the survey research methodology field in 1998 (
Groves & Couper, 1998), and the definition of paradata has vastly expanded since then. Paradata now refers to additional data that can be captured during the process of producing a survey statistic (
Kreuter, 2013). As discussed in the 2011 International Nonresponse Workshop (
Smith, 2011b), two main types of paradata are available. One is process paradata, which is collected during the process of data collection, such as timestamps and keystroke data. The other main type is related to observational information, such as the observed demographic information of respondents and observed neighborhood conditions.
- parallel translation
Each translator translates all of the material to be translated, in contrast to split translations. This refers to the first step in the TRAPD model, the 'T.' It is also referred to as 'double' or 'parallel' translation.
- parallel translations
Each translator translates all of the material to be translated, in contrast to split translations. This refers to the first step in the TRAPD model, the 'T.' It is also referred to as 'double' or 'parallel' translation.
- Pareto chart
A bar chart that displays the most frequent types of errors in a process, ordered by error type in descending frequency; for example, the five or six most frequent types of help desk calls from interviewers using computer-assisted interviewing.
- Performance measurement analysis
A technique used in quality control to determine whether quality assurance procedures have worked. An example of a technique would be analysis of routine measures of interviewer or coder performance.
- Personally identifiable information (PII)
Information that can be used to identify a respondent that minimally includes their name, address, telephone number, and identification number (such as a social security number or driver's license number), but may include other information as well, such as biometric data.
- Pilot study
A quantitative, miniature version of the survey data collection process, carried out before the actual data collection begins, that involves all procedures and materials that will be used during data collection. A pilot study is also known as a 'dress rehearsal.'
- Pledge of confidentiality
An agreement (typically in written or electronic form) to maintain the confidentiality of survey data that is signed by persons who have any form of access to confidential information.
- Portable file
A file that is coded in a non-proprietary format such as XML or ASCII, and thus can be used by a variety of software and hardware platforms.
- Post-survey adjustment
An adjustment made to reduce the impact of error on estimates.
- Poststratification adjustment
A statistical adjustment that assures that sample estimates of totals or percentages (e.g., the estimate of the percentage of men living in Mexico based on the sample) equal population totals or percentages (e.g., the estimate of the percentage of men living in Mexico based on census data). The adjustment cells for poststratification are formed in a similar way to
strata in sample selection, but variables can be used that were not on the original sampling frame at the time of selection.
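In one common form, the poststratified weight of element $i$ in adjustment cell $c$ is

$$w_{i,\text{post}} = w_i \cdot \frac{N_c}{\sum_{j \in c} w_j},$$

where $w_i$ is the element's pre-adjustment weight and $N_c$ is the known population count for cell $c$ (e.g., from census data).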
- Precision
A measure of how close an estimator is expected to be to the true value of a parameter, which is usually expressed in terms of imprecision and is related to the variance of the estimator. Less
precision is reflected by a larger variance.
- Precoding
- prescribed procedure
An interviewer behavior that must be carried out exactly as specified.
- prescribed procedures
An interviewer behavior that must be carried out exactly as specified.
- Prescribed behavior
An interviewer behavior that must be carried out exactly as specified.
- Pretesting
A collection of techniques and activities that allow researchers to evaluate survey questions, questionnaires, and/or other survey procedures before data collection begins.
- Primacy
A
context effect in which the placement of an item at the beginning of a list of
response options increases the likelihood that it will be selected by the respondent (see also '
recency').
- Primary sampling unit (PSU)
A cluster of elements sampled at the first stage of selection.
- Probability proportional to size (PPS)
A sampling method in which the probability of selecting a sampling unit (e.g., a cluster of elements) is proportional to a measure of its size (e.g., the number of elements it contains), so that larger units have a higher chance of selection.
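In symbols, for a single-stage PPS sample of $n$ units, the selection probability of unit $i$ with measure of size $M_i$ is

$$\pi_i = n \cdot \frac{M_i}{\sum_j M_j}.$$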
- Probability sampling
A sampling method wherein each element of the sampling frame has a known, non-zero chance of selection.
- Process analysis
The use of tools such as flowcharts to analyze processes, e.g., respondent tracking, computerized instrument programming and testing, coding, data entry, etc. The aim is to identify indicators or measures of product quality. Process analysis also is used to identify improvements that can be made to processes.
- Process improvement plan
- Process indicator
An indicator that refers to aspects of data collection (e.g., HPI, refusal rates, etc.).
- Processing error
Survey error (variance and bias) that arises during the steps between collecting information from the respondent and having the value used in estimation.
Processing errors can include all post-collection operations, as well as the printing of questionnaires. Most processing errors occur in data for individual units, although errors can also be introduced in the implementation of systems and estimates. In survey data, processing errors may include errors of transcription, errors of coding, errors of data entry, errors in the assignment of weights, errors in
disclosure avoidance, and errors of arithmetic in tabulation.
- Progress indicator
An indicator that refers to aspects relating to reaching the goal (e.g., number of complete interviews).
- Proxy interview
An interview with someone other than the person about whom information is being sought (e.g., a parent or spouse). There should be a set of rules specific to each survey that define who can serve as a proxy respondent.
- Public-use data file
An anonymized data file, stripped of respondent identifiers, that is distributed for the public to analyze.
- Quality
The degree to which a product's characteristics conform to the requirements as agreed upon by the producers and clients.
- Quality assurance
A planned system of procedures, performance checks, quality audits, and corrective actions to ensure that the products produced throughout the survey lifecycle are of the highest achievable quality. Quality assurance planning involves identifying key indicators of quality.
- Quality audit
A systematic examination of the quality system of an organization by an internal or external quality auditor or team. It assesses whether the quality management plan has clearly outlined quality assurance, quality control, corrective actions to be taken, etc., and whether they have been effectively carried out.
- Quality checklist
- Quality control
A planned system of process monitoring, verification, and analysis of quality indicators and updates to quality assurance procedures to ensure that quality assurance is working.
- Quality management plan
A document that describes the quality system an organization will use, including quality assurance and quality control techniques and procedures, requirements for documenting the results of those procedures, corrective actions taken, and process improvements made.
- Quality profile
A comprehensive report prepared by producers of survey data that provides information users will need to assess the quality of the data.
- Question-by-question objectives
Text associated with some questions in interviewer-administered surveys that provides information on the objectives of the questions.
- Questionnaire adaptation
The deliberate technical or substantive modification of some feature of a question, a response scale, or another part of a questionnaire to better fit a new sociocultural context or particular target population (e.g., updating language: 'radio' for 'wireless,' adapting an adult questionnaire for children: 'tummy' for 'stomach,' or tailoring for cultural needs: 'walk several blocks' versus 'walk 100 yards').
- Quota sampling
A non-probability sampling method that sets specific sample size quotas or target sample sizes for subclasses of the target population. The sample quotas are generally based on simple demographic characteristics (e.g., quotas for gender, age groups, and geographic region subclasses).
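As a minimal illustration (the quota cells and targets are hypothetical), recruitment continues until each cell's target is reached; selection within cells is left to the interviewer rather than controlled by a random mechanism:

```python
# Hypothetical quota targets by gender and age group.
quotas = {("female", "18-34"): 50, ("female", "35+"): 50,
          ("male", "18-34"): 50, ("male", "35+"): 50}
counts = {cell: 0 for cell in quotas}

def try_recruit(gender, age_group):
    """Accept a willing respondent only if their quota cell is not yet full."""
    cell = (gender, age_group)
    if cell in counts and counts[cell] < quotas[cell]:
        counts[cell] += 1
        return True
    return False

try_recruit("female", "18-34")  # True until that cell reaches 50 completes
```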
- random walk
For each randomly chosen sampling point (e.g., urban units, small cities, or voting districts), interviewers are assigned a starting location and provided with instructions on the random walking rules (e.g., which direction to start, on which side of the street to walk, and which crossroads to take). Households are selected by interviewers following these instructions. The routes end when the predefined number of respondents (or households) is achieved (Bauer, 2016). Since the probability of selecting a given household is unknown, this method is categorized as a non-probability sampling method (Bauer, 2016).
- Random-digit dialing (RDD)
A method of selecting telephone numbers in which the target population consists of all possible telephone numbers, and all telephone numbers have an equal probability of selection.
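A highly simplified sketch of the idea (the prefixes are hypothetical, and real RDD designs, such as list-assisted sampling, involve more machinery): because every prefix is followed by the same number of possible four-digit suffixes, drawing a prefix at random and then a random suffix gives every number in the frame the same selection probability.

```python
import random

# Hypothetical area-code/prefix combinations defining the set of possible numbers.
prefixes = ["734-555", "313-555", "248-555"]

def draw_rdd_number():
    """Draw one telephone number uniformly from all possible numbers in the prefixes."""
    prefix = random.choice(prefixes)       # each prefix covers 10,000 possible numbers
    suffix = random.randrange(10_000)      # 0000-9999, each equally likely
    return f"{prefix}-{suffix:04d}"

sample = [draw_rdd_number() for _ in range(5)]
```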
- Random route
For each randomly chosen sampling point (e.g., urban units, small cities, or voting districts), interviewers are assigned a starting location and provided with instructions on the random walking rules (e.g., which direction to start, on which side of the street to walk, and which crossroads to take). Households are selected by interviewers following these instructions. The routes end when the predefined number of respondents (or households) is achieved (Bauer, 2016). Since the probability of selecting a given household is unknown, this method is categorized as a non-probability sampling method (Bauer, 2016).
- Ranking format
A response format wherein respondents express their preferences by ordering persons, brands, etc. from top to bottom, i.e., generating a ranked order of a list of items or entities.
- Rating format
A response format requiring the respondent to select one position on an ordered scale of response options. Example: "To what extent do you agree or disagree with the following statement?"
- Recency
A context effect in which the placement of an item at the end of a list of response options increases the likelihood that it will be selected by the respondent (see also '
primacy').
- Recontact
To have someone other than the interviewer (often a supervisor) attempt to speak with the sample member after a screener or interview is conducted, in order to verify that it was completed according to the specified protocol.
- Refusal rate
The number of potentially eligible sampling units in which a respondent refuses to do an interview or breaks off an interview, divided by the number of all potentially eligible sampling units.
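One simple formulation consistent with this definition (ignoring cases of unknown eligibility, which the AAPOR refusal rate definitions handle explicitly) is:

$$\text{Refusal rate} = \frac{R}{I + P + R + NC + O}$$

where R = refusals and break-offs, I = complete interviews, P = partial interviews, NC = non-contacts, and O = other eligible nonresponse.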
- Reinterview
The process or action of interviewing the same respondent twice to assess reliability (simple response variance).
- Reliability
The consistency of a measurement, or the degree to which an instrument measures the same way each time it is used under the same condition with the same subjects.
- Reluctance aversion techniques
Techniques that can reduce potential respondents' reluctance to participate, thereby increasing the overall response rate.
- Repeated panel
A series of fixed panel surveys that may or may not overlap in time. Generally, each panel is designed to represent the same target population definition applied at a different point in time.
- Replicate
A systematic probability subsample of the full sample.
- Replicated question
A question which is repeated ('replicated') at a later stage in a study or in a different study. Replication assumes identical question wording. Questions which were used in one study, then translated and used in another, are also frequently spoken of as having been 'replicated.'
- Residency rule
A rule to help interviewers determine which persons to include in the household listing based on what the informant reports.
- Response distribution
A description of the values and frequencies associated with a particular question.
- Response error
The return of false or subjectively modified information from survey respondents.
- Response latency
A method of examining potential problems in responding to particular items, measured by recording the time between the interviewer asking a question and the response being given.
- Response options
The category, wording, and order of options given with the survey question. See
Questionnaire Design for more information.
- Response rate
The number of complete interviews with reporting units divided by the number of eligible reporting units in the sample.
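Using the same notation as under 'Refusal rate,' one simplified formulation (ignoring cases of unknown eligibility; the AAPOR response rate definitions give more detailed variants) is:

$$\text{Response rate} = \frac{I}{I + P + R + NC + O}$$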
- Response styles
Consistent and stable tendencies in response behavior which are not explainable by question content or presentation. These are considered to be a source of biased reporting. For example, extreme response style is the tendency to select the two extreme endpoints of a scale, midpoint response style refers to the consistent selection of middle or neutral category of the scale, and acquiescent response style is the tendency to agree with or select the positive responses.
- responsive survey design
Developed by Groves and Heeringa (2006), responsive design is an approach wherein researchers continually monitor selected paradata to inform the error-cost tradeoff in real time, as the basis for altering design features through interventions during the course of data collection or for subsequent waves, with the objective of reducing nonresponse and, ideally, nonresponse bias. Usually the following steps are included: first, researchers pre-identify a set of design features that are of interest (e.g., the tradeoff between cost and error); second, researchers identify and monitor these indicators; third, researchers intervene or alter the design features based on the pre-identified design decision rules. Most recently, with the development of technology, real-time responsive design can be achieved within a single survey, such as prompting respondents who show signs of satisficing in Web surveys.
- responsive survey designs
Developed by Groves and Heeringa (2006), responsive design is an approach wherein researchers continually monitor selected paradata to inform the error-cost tradeoff in real time, as the basis for altering design features through interventions during the course of data collection or for subsequent waves, with the objective of reducing nonresponse and, ideally, nonresponse bias. Usually the following steps are included: first, researchers pre-identify a set of design features that are of interest (e.g., the tradeoff between cost and error); second, researchers identify and monitor these indicators; third, researchers intervene or alter the design features based on the pre-identified design decision rules. Most recently, with the development of technology, real-time responsive design can be achieved within a single survey, such as prompting respondents who show signs of satisficing in Web surveys.
- Responsive design
Developed by Groves and Heeringa (2006), responsive design is an approach wherein researchers continually monitor selected paradata to inform the error-cost tradeoff in real time, as the basis for altering design features through interventions during the course of data collection or for subsequent waves, with the objective of reducing nonresponse and, ideally, nonresponse bias. Usually the following steps are included: first, researchers pre-identify a set of design features that are of interest (e.g., the tradeoff between cost and error); second, researchers identify and monitor these indicators; third, researchers intervene or alter the design features based on the pre-identified design decision rules. Most recently, with the development of technology, real-time responsive design can be achieved within a single survey, such as prompting respondents who show signs of satisficing in Web surveys.
- Restricted tendering
A bidding process in which only bidders prequalified through a screening process may participate in bidding. Bidders are evaluated and then chosen on the basis of cost and technical merit.
- Restricted-use data file
A file that includes information that can be related to specific individuals and is confidential and/or protected by law.
Restricted-use data files are not required to include variables that have undergone coarsening disclosure risk edits. These files are available to researchers under controlled conditions.
- Reviewer
- Rotating panel design
A study where elements are repeatedly measured a set number of times, then replaced by new randomly chosen elements. Typically, the newly chosen elements are also repeatedly measured the appropriate number of times.
- Sample design
Information on the target and final sample sizes, strata definitions, and the sample selection methodology.
- Sample element
A selected sampling unit of the target population that may be eligible or ineligible.
- Sample management system
A computerized and/or paper-based system used to assign and monitor sampling units and record documentation for sample records (e.g., time and outcome of each contact attempt).
- Sample person
A person selected from a sampling frame to participate in a particular survey.
- Sampling bias
The systematic difference between the expected value (over all conceptual trials) of an unweighted sample estimate and the target population value, occurring because some elements on the sampling frame have a higher chance of selection than other elements.
- Sampling error
Survey error (variance and bias) due to observing a sample of the population rather than the entire population.
- Sampling error computational units (SECUs)
Primary sampling units in one-PSU-per-stratum sampling designs that are grouped in pairs after data collection for the purpose of estimating approximate sampling variances.
- Sampling frame
A list or group of materials used to identify all elements (e.g., persons, households, establishments) of a survey population from which the sample will be selected. This list or group of materials can include maps of areas in which the elements can be found, lists of members of a professional association, and registries of addresses or persons.
- Sampling unit
An element, or a cluster of elements, considered for selection in some stage of sampling. For a sample with only one stage of selection, the sampling units are the same as the elements. In multi-stage samples (e.g., enumeration areas, then households within selected enumeration areas, and finally adults within selected households), different sampling units exist, while only the last is an element. The term 'primary sampling units' (PSUs) refers to the sampling units chosen in the first stage of selection. The term 'secondary sampling units' (SSUs) refers to sampling units within the PSUs that are chosen in the second stage of selection.
- Sampling variance
A measure of how much a statistic varies around its mean (over all conceptual trials) as a result of the sample design only. This measure does not account for other sources of variable error such as coverage and nonresponse.
- Satisficing
To answer survey questions optimally, four stages of cognitive processing are required: (1) interpret the question comprehensively, (2) retrieve information from memory, (3) form a judgment, and (4) map the judgment to the appropriate response category. However, to lower cognitive burden, instead of seeking to optimize, respondents may skip some of these steps when they answer survey questions. This behavior is called satisficing (see also 'satisficing behavior').
- Satisficing behavior
A decision-making strategy that entails searching through any available alternatives until an acceptability threshold is met.
- Secondary sampling unit (SSU)
A cluster of sample elements sampled at the second stage of selection.
- Sequential mixed-mode
A mixed-mode design in which additional modes are offered as part of a nonresponse followup program.
- Shared language harmonization
Shared language harmonization can be understood as the procedures and result of trying to harmonize, as much as possible, different regional varieties of a 'shared' language across countries, i.e., in terms of vocabulary and/or structure. An example would be to harmonize translations into Italian between questionnaires used in Italy and in Switzerland.
- Silent monitoring
Monitoring without the awareness of the interviewer.
- similarity
The extent to which differences between survey statistics from different countries, regions, cultures, domains, time periods, etc., can be attributed to differences in population true values (Mohler & Johnson, 2010); in other words, whether the concepts are comparable or not. It is often referred to as 'equivalence,' 'functional equivalence,' 'similarity,' or some other frame of reference (Mohler & Johnson, 2010). Improving comparability implies that error due to translation has to be minimized. In terms of questionnaire translation for 3MC surveys, the aim is to achieve the defined statistical level of comparability across all local versions (expressed in minimized translation error).
- Simple random sampling (SRS)
A procedure in which a sample of size n is drawn from a population of size N in such a way that every possible sample of size n has the same probability of being selected.
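In symbols, with a sample of size n drawn from a population of size N, there are $\binom{N}{n}$ possible samples, so:

$$P(\text{any particular sample}) = \frac{1}{\binom{N}{n}}, \qquad P(\text{any particular element is selected}) = \frac{n}{N}.$$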
- Social desirability bias
A tendency for respondents to overreport desirable attributes or attitudes and underreport undesirable attributes or attitudes.
- Socio-demographic question
A question typically asking about respondent characteristics such as age, marital status, income, employment status, and education.
- Soft consistency check
A signal warning that there is an inconsistency between the current response and a previous response. The soft consistency check should provide guidance on resolving the inconsistency, but the interviewer or respondent may continue the survey without resolving it (as opposed to a hard consistency check).
- Source document
The original document from which other (target) documents are translated or adapted as necessary.
- Source instrument
The original instrument from which other (target) instruments are translated or adapted as necessary.
- Source language
The language in which a questionnaire is available from which a translation is made. This is usually, but not always, the language in which the questionnaire was designed.
- Source questionnaire
The questionnaire taken as the text for translation. The source questionnaire would normally not be intended to be fielded as such, but would require local adaptation even if fielded in the source language: for instance, in the ESS, the source questionnaire is in English, but the questionnaires fielded in Ireland and the UK differ slightly from the source questionnaire.
- Source variable
Original variable(s) chosen as part of the harmonization process.
- Split panel
A design that contains a blend of cross-sectional and panel samples at each new wave of data collection.
- Split translation
Each translator translates only a part of the total material to be translated in preparation for a review meeting, in contrast to translating the entire text (see full translation).
- Standalone experiment
An experiment conducted as an independent research project (as opposed to an 'embedded experiment').
- standardized question-asking
An interviewing technique in which interviewers are trained to read every question exactly as worded and abstain from interpreting questions or responses or offering much in the way of clarification.
- standardized survey interview
An interviewing technique in which interviewers are trained to read every question exactly as worded and abstain from interpreting questions or responses or offering much in the way of clarification.
- standardized survey interviewing
An interviewing technique in which interviewers are trained to read every question exactly as worded and abstain from interpreting questions or responses or offering much in the way of clarification.
- standardized survey interviews
An interviewing technique in which interviewers are trained to read every question exactly as worded and abstain from interpreting questions or responses or offering much in the way of clarification.
- Standardized interviewing technique
An interviewing technique in which interviewers are trained to read every question exactly as worded and abstain from interpreting questions or responses or offering much in the way of clarification.
- Statistical process control (SPC) chart
A statistical chart that compares expected process performance (e.g., number of hours worked by interviewers in a week) against actual performance. For example, interviewers who perform outside upper and lower boundaries on this measure are flagged; if greater variation from expected performance for some interviewers in a certain location can be explained (e.g., a hurricane or a snow storm causing lower hours worked than expected), the process is in control; if not, corrective actions are taken.
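A minimal sketch of the flagging step (the baseline values, control limits, and interviewer hours below are hypothetical): expected performance defines the control limits, and interviewers falling outside them are flagged for review.

```python
# Hypothetical expected (baseline) process performance, e.g., from earlier weeks.
expected_mean, expected_sd = 40.0, 3.0
lower, upper = expected_mean - 3 * expected_sd, expected_mean + 3 * expected_sd  # 31.0, 49.0

# Hypothetical hours worked this week, one value per interviewer.
hours = {"int01": 38, "int02": 41, "int03": 12, "int04": 40, "int05": 52}

# Flag interviewers outside the control limits; flagged cases are then checked
# for explainable causes (e.g., a storm) before corrective action is taken.
flagged = {k: v for k, v in hours.items() if not (lower <= v <= upper)}
print(flagged)  # {'int03': 12, 'int05': 52}
```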
- Straightlining
Clicking the same answer for each item in a multi-numeric list.
- Strata
Mutually exclusive, homogenous groupings of population sample elements or clusters of elements that comprise all of the elements on the sampling frame. The groupings are formed prior to selection of the sample.
- Stratification
A sampling procedure that divides the sampling frame into mutually exclusive and exhaustive groups (or 'strata') and places each element on the frame into one of the groups. Independent selections are then made from each stratum, one by one, to ensure representation of each subgroup on the frame in the sample.
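A minimal sketch under assumed inputs (the stratum labels, frame, and per-stratum sample sizes are hypothetical): the frame is partitioned by stratum and an independent simple random selection is made within each stratum.

```python
import random
from collections import defaultdict

# Hypothetical sampling frame: each element carries a stratum label (e.g., region).
strata_labels = ["north", "south", "east"]
frame = [{"id": i, "stratum": strata_labels[i % 3]} for i in range(900)]
per_stratum_n = {"north": 20, "south": 20, "east": 20}

# Partition the frame by stratum, then select independently within each stratum.
by_stratum = defaultdict(list)
for element in frame:
    by_stratum[element["stratum"]].append(element)

sample = []
for stratum, elements in by_stratum.items():
    sample.extend(random.sample(elements, per_stratum_n[stratum]))
```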
- Substitution
A technique where each nonresponding sample element from the initial sample is replaced by another element of the target population, typically not an element selected in the initial sample. Substitution conceals nonresponse rather than reducing it and, most likely, increases nonresponse bias.
- Survey lifecycle
The lifecycle of a survey research study, from design to data dissemination.
- Survey population
The actual population from which the survey data are collected, given the restrictions from data collection operations.
- Survey weight
A statistical adjustment created to compensate for complex survey designs with features including, but not limited to, unequal likelihoods of selection, differences in response rates across key subgroups, and deviations from distributions on critical variables found in the target population from external sources, such as a national census.
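One typical construction of a final weight for respondent i, shown here as an illustration rather than a universal formula, multiplies a base weight by the adjustments listed above:

$$w_i = \frac{1}{\pi_i} \times a_i \times g_i$$

where $\pi_i$ is the selection probability (so $1/\pi_i$ is the base weight), $a_i$ is a nonresponse adjustment factor, and $g_i$ is a poststratification or calibration factor.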
- Systematic sampling
A procedure that selects every kth element on the sampling frame after a random start.
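A minimal sketch of the procedure (the frame and sample size are hypothetical), using a fractional interval so it also works when the frame size is not a multiple of n:

```python
import random

def systematic_sample(frame, n):
    """Select every k-th element from the frame after a random start."""
    k = len(frame) / n                    # sampling interval
    start = random.uniform(0, k)          # random start within the first interval
    return [frame[int(start + i * k)] for i in range(n)]

selected = systematic_sample(list(range(1, 101)), n=10)  # 10 elements, roughly evenly spaced
```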
- Tailoring
The practice of adapting interviewer behavior to the respondent's expressed concerns and other cues in order to provide feedback to the respondent that addresses their perceived reasons for not wanting to participate.
- Target language
The language into which a questionnaire is to be translated.
- Target population
The finite population for which the survey sponsor wants to make inferences using the sample statistics.
- Target variable
A variable created during the harmonization process.
- Taylor series variance estimation
A commonly used tool in statistics for handling the variance estimation of statistics that are not simple additions of sample values, such as odds ratios. The Taylor series handles this by converting a ratio into an approximation that is a function of the sums of the values.
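For example, for a ratio estimator $\hat{R} = \hat{Y}/\hat{X}$, the first-order Taylor (linearization) approximation gives:

$$\operatorname{var}(\hat{R}) \approx \frac{1}{\hat{X}^{2}}\left[\operatorname{var}(\hat{Y}) + \hat{R}^{2}\operatorname{var}(\hat{X}) - 2\hat{R}\,\operatorname{cov}(\hat{Y},\hat{X})\right]$$

so the variance of the ratio can be estimated from the variances and covariance of the two estimated totals.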
- Team translation
Team approaches to survey translation and translation assessment bring together a group of people with different talents and functions in the team so as to ensure the mix of skills and discipline expertise needed to produce an optimal translation version in the survey context. Each stage of the
team translation process builds on the foregoing steps and uses the documentation required for the previous step to inform the next. In addition, each phase of translation engages the appropriate personnel for that particular activity and provides them with relevant tools for the work at hand.
- Tender
A formal offer specifying activities to be completed within the prescribed time and budget.
- Timestamp
Time and date data recorded with survey data, indicating dates and times of responses, at the question and questionnaire section levels. They also appear in audit trails, recording times questions are asked, responses recorded, and so on.
- Top-coding
A type of coding in which values that exceed the predetermined maximum value are reassigned to that maximal value or are recoded as missing data.
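A minimal sketch of both variants described above (the income values and the top-code threshold are hypothetical):

```python
# Hypothetical incomes with a predetermined maximum (top-code) of 150,000.
TOP_CODE = 150_000
incomes = [42_000, 97_500, 310_000, 64_000, 1_250_000]

# Variant 1: reassign values above the maximum to the maximal value.
capped = [min(value, TOP_CODE) for value in incomes]

# Variant 2: recode values above the maximum to missing (None here).
recoded_missing = [value if value <= TOP_CODE else None for value in incomes]
```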
- Total survey error (TSE)
- Tracking
The process of attempting to locate a sample element whose contact information (e.g. address, telephone number, email address) has changed since the previous time the element's contact information was collected.
- Transformation algorithm
A mathematical operation that changes the values of variables.
- Translatability assessment
- Translator
The person who translates text from one language to another (e.g., French to Russian). In survey research, translators might be asked to fulfill other tasks as well, such as reviewing and copyediting.
- Trusted digital repository
A repository whose mission is to provide reliable, long-term access to managed digital resources to its designated community, both now and in the future.
- Undocumented code
A code that is not authorized for a particular question. For instance, if a question that records the sex of the respondent has documented codes of '1' for female, '2' for male, and '9' for missing data, a code of '3' would be an undocumented code.
- Unique identification number
A unique number that identifies an element (e.g., a serial number). The number remains attached to the element throughout the entire survey lifecycle and is published with the public dataset. It does not contain any information about the respondents or their addresses.
- Unit nonresponse
An eligible sampling unit that has little or no information because the unit did not participate in the survey.
- Universe statement
A description of the subgroup of respondents to which the survey item applies (e.g., 'Female, 45, Now working').
- Unwritten language
An unwritten language is one which does not have a standard written form used by the native speakers of the language.
- Usability testing
Evaluation of a computer-assisted survey instrument to assess the effect of design on interviewer or respondent performance. Methods of evaluation include review by usability experts and observation of users working with the computer and survey instrument.
- Validity
The extent to which a variable measures what it intends to measure.
- Variance
A measure of how much a statistic varies around its mean over all conceptual trials.
- Vignette
A brief story/scenario describing hypothetical situations or persons and their behaviors to which respondents are asked to react in order to allow the researcher to explore contextual influences on respondents' response formation processes.
- Vocal characteristics
- Weighting
- Within-country language harmonization
The process of harmonizing language versions within one multilingual country, such as harmonizing the Ukrainian and Russian versions within Ukraine.
- Word list
Word lists can serve various purposes: when regional varieties of a language need to be accommodated, a word list can be created featuring the words that are required for specific varieties of a language. Word lists can also be incorporated into computer applications of an instrument and can be a useful resource for interviewers. They cannot, however, address challenges faced when regional varieties differ in more radical and structural ways from one another. A word list can also serve a similar function to a glossary.
- Working group
Experts working together to oversee the implementation of a particular aspect of the survey lifecycle (e.g., sampling, questionnaire design, training, quality control, etc.).
- XML (eXtensible Markup Language)
XML (eXtensible Markup Language) is a flexible way to create common information formats and share both the format and the data on the Web, on intranets, and elsewhere. XML documents are made up of storage units called entities, which contain either parsed or unparsed data. Parsed data are made up of characters, some of which form character data, and some of which form markup, which encodes a description of the document's storage layout and logical structure. XML provides a mechanism to impose constraints on the storage layout and logical structure.
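A small illustration of the distinction between markup and character data (the fragment and field names are hypothetical), parsed here with Python's standard library:

```python
import xml.etree.ElementTree as ET

# Markup (tags, attributes) describes structure; character data carries the content.
doc = """<respondent id="001">
  <age>34</age>
  <country>Mexico</country>
</respondent>"""

root = ET.fromstring(doc)
print(root.tag, root.attrib["id"])   # respondent 001
print(root.find("age").text)         # 34
```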