Podcast
Questions and Answers
What type of error is indicated when respondents are unable to provide accurate answers due to misunderstandings?
What type of error is indicated when respondents are unable to provide accurate answers due to misunderstandings?
What does a high standard deviation in survey statistics indicate?
What does a high standard deviation in survey statistics indicate?
Which group is incorrectly represented if only iPhone users are included in a survey?
Which group is incorrectly represented if only iPhone users are included in a survey?
What aspect fails if the technology used for a survey is malfunctioning?
What aspect fails if the technology used for a survey is malfunctioning?
Signup and view all the answers
How can the adjustment error in survey responses be identified?
How can the adjustment error in survey responses be identified?
Signup and view all the answers
What is the implication of a poor sampling frame in a survey?
What is the implication of a poor sampling frame in a survey?
Signup and view all the answers
In survey design, what does validity assess?
In survey design, what does validity assess?
Signup and view all the answers
What is a major concern when evaluating if steps represent physical activity?
What is a major concern when evaluating if steps represent physical activity?
Signup and view all the answers
What is a major privacy concern related to sensor data?
What is a major privacy concern related to sensor data?
Signup and view all the answers
How does the level of privacy concern affect participation willingness?
How does the level of privacy concern affect participation willingness?
Signup and view all the answers
What is a potential risk associated with connecting multiple streams of data?
What is a potential risk associated with connecting multiple streams of data?
Signup and view all the answers
What was the total sample size for the hypothetical willingness to share sensor data from the LISS Panel?
What was the total sample size for the hypothetical willingness to share sensor data from the LISS Panel?
Signup and view all the answers
Which of the following methods does NOT collect big data?
Which of the following methods does NOT collect big data?
Signup and view all the answers
What aspect of digital trace data is introduced in the content?
What aspect of digital trace data is introduced in the content?
Signup and view all the answers
Which type of data is specifically mentioned as being at risk of unauthorized access?
Which type of data is specifically mentioned as being at risk of unauthorized access?
Signup and view all the answers
Increased willingness to share sensor data is likely correlated with which of the following?
Increased willingness to share sensor data is likely correlated with which of the following?
Signup and view all the answers
What characterizes non-probability sampling in survey research?
What characterizes non-probability sampling in survey research?
Signup and view all the answers
Which sampling method involves ensuring specific characteristics are proportionally represented?
Which sampling method involves ensuring specific characteristics are proportionally represented?
Signup and view all the answers
What is a key feature of the Era of Expansion in survey research?
What is a key feature of the Era of Expansion in survey research?
Signup and view all the answers
Which of the following is an example of digital trace data?
Which of the following is an example of digital trace data?
Signup and view all the answers
What is a potential issue with recruitment via targeted advertisements for surveys?
What is a potential issue with recruitment via targeted advertisements for surveys?
Signup and view all the answers
What is a primary characteristic of probability samples?
What is a primary characteristic of probability samples?
Signup and view all the answers
How can digital trace data be collected from survey participants?
How can digital trace data be collected from survey participants?
Signup and view all the answers
Which term best describes a database of potential respondents who agree to participate in future surveys?
Which term best describes a database of potential respondents who agree to participate in future surveys?
Signup and view all the answers
Which of the following statements is true about non-probability samples?
Which of the following statements is true about non-probability samples?
Signup and view all the answers
What is meant by 'river sampling' in surveys?
What is meant by 'river sampling' in surveys?
Signup and view all the answers
What is a disadvantage of digital trace data compared to surveys?
What is a disadvantage of digital trace data compared to surveys?
Signup and view all the answers
Which of the following is a common issue related to surveys?
Which of the following is a common issue related to surveys?
Signup and view all the answers
What is an example of an advantage of probability samples?
What is an example of an advantage of probability samples?
Signup and view all the answers
Why might non-probability samples be considered more convenient?
Why might non-probability samples be considered more convenient?
Signup and view all the answers
How does the collection approach of digital trace data differ from surveys?
How does the collection approach of digital trace data differ from surveys?
Signup and view all the answers
What can be a consequence of falling response rates in probability samples?
What can be a consequence of falling response rates in probability samples?
Signup and view all the answers
What is a potential source of bias when sampling respondents who only downloaded a mobile app?
What is a potential source of bias when sampling respondents who only downloaded a mobile app?
Signup and view all the answers
What does nonresponse error refer to in the context of surveys?
What does nonresponse error refer to in the context of surveys?
Signup and view all the answers
Which error arises from malfunctioning devices during data collection?
Which error arises from malfunctioning devices during data collection?
Signup and view all the answers
What limitation might arise from only collecting data from iPhone users in a survey?
What limitation might arise from only collecting data from iPhone users in a survey?
Signup and view all the answers
Adjustments made after data collection to correct for known biases are referred to as what?
Adjustments made after data collection to correct for known biases are referred to as what?
Signup and view all the answers
What characterizes adjustment errors in survey research?
What characterizes adjustment errors in survey research?
Signup and view all the answers
Why is it important to understand the characteristics of respondents when conducting surveys?
Why is it important to understand the characteristics of respondents when conducting surveys?
Signup and view all the answers
Which technique can enhance the reliability of survey data collection?
Which technique can enhance the reliability of survey data collection?
Signup and view all the answers
What was the primary concern of the researcher in Case 1?
What was the primary concern of the researcher in Case 1?
Signup and view all the answers
What percentage of young people did the researcher conclude were suffering from social isolation in Case 2?
What percentage of young people did the researcher conclude were suffering from social isolation in Case 2?
Signup and view all the answers
What was identified as a significant issue with using social media data as a replacement for official statistics?
What was identified as a significant issue with using social media data as a replacement for official statistics?
Signup and view all the answers
What do some researchers argue about social media data's ability to replace survey data?
What do some researchers argue about social media data's ability to replace survey data?
Signup and view all the answers
What are some potential reasons for inferential 'failures' in social media data analysis?
What are some potential reasons for inferential 'failures' in social media data analysis?
Signup and view all the answers
What did research by Conrad et al. (2021) determine about the relationship between social media data and traditional indexes?
What did research by Conrad et al. (2021) determine about the relationship between social media data and traditional indexes?
Signup and view all the answers
What element was noted as having a strong potential impact on research results in the analysis of social media data?
What element was noted as having a strong potential impact on research results in the analysis of social media data?
Signup and view all the answers
What can be inferred about the reliability of social media data based on the findings presented?
What can be inferred about the reliability of social media data based on the findings presented?
Signup and view all the answers
Study Notes
Introduction to Survey Research
- Survey research has evolved through three eras: Invention, Expansion, and the current era, which incorporates "organic data."
- The Era of Invention (1930-1960) focused on area probability sampling and face-to-face/mail surveys.
- The Era of Expansion (1990-present) saw the rise of non-probability sampling methods like computer-assisted online surveys and the integration of big data sources.
- "Designed data" is supplemented by "organic data"
Non-Probability Sample Surveys
- Online non-probability panels are databases of potential respondents who've stated they'll cooperate for future data collection.
- Quota sampling involves representing specific population characteristics proportionally in the sample.
- River sampling is used to invite website visitors to immediate surveys.
- Recruitment can occur via targeted advertisements on social media platforms like Facebook and Twitter.
- A potential problem with non-probability samples is that some parts of the population may be systematically excluded because participation is voluntary.
Digital Trace Data
- Digital trace data is a kind of data that is collected through various online activities.
- It is collected through sources like social media, web browsing history, applications and geographic location data.
- It can be collected directly from the web through APIs or web scraping.
- Data donation from survey participants, such as web browsing history, specific apps used, and activity data (e.g., Strava), is also a source of digital trace data.
- Designed big data and smart surveys also contribute to digital trace data; collected within or after a survey, through apps, and browsers.
- This data can be obtained by using data download packages (DDPs).
Introducing "Design" to Digital Trace Data
- Example workflow: Data collected with sensors. An initial population register feeds into a sample selection process.
- Respondents participating provide consent, and their data is processed, with sensor data captured.
Privacy Concerns
- Potential risks related to sensor data include interception by unauthorized parties.
- Connecting multiple data streams can potentially re-identify previously anonymous users.
- Information from sensor data can be used to negatively impact an individual's credit, employment, or insurability.
- Higher privacy concerns tend to correlate with decreased willingness to participate in studies.
Hypothetical Willingness from the LISS Panel
- The data from the LISS panel, which covers roughly 2,678 Dutch smartphone users, illustrates that willingness to share sensor data varies across the sharing of locations, videos, photos of a house and photos of yourself.
CBS Consent Survey
- Data on the willingness to share sensor data, from the CBS Consent Survey, for approximately 1,883 Dutch smartphone and tablet users, illustrates that willingness differs among types of data. GPS data has greater willingness than sharing photo of homes and videos.
Examples of New Forms of Data
- Smartphone sensors such as NFC, Bluetooth, thermometers, Wi-Fi, GPS, cellular networks, fingerprint sensors, barometers, accelerometers, pedometers, gyroscopes, and camera are examples.
Examples of Use of Sensor Data
- SurveyMotion: JavaScript-based and total acceleration.
- Completion Behaviour: Fitness Tasks.
- Wearables. Wrist-worn GENEActiv, Axivity ax3 at the upper thigh, total physical activity.
More on Sensors
- Several organizations are using cameras to scan receipts to learn about consumer spending patterns.
- Some work focuses on linking sensor data to administrative data, such as tax records, for research purposes.
Examples of App Data
- The Tabi app with Statistics Netherlands, focuses on travel mode and history.
Some Examples of Studies Using Sensors
- Presented are several examples, in a table format, of specific studies using sensors to investigate areas, such as, social networks, spatial segregation, urbanicity, mobility, and more.
Some Examples of Studies Collecting Biomeasures
- A table summarises studies collecting specific biomeasures, such as blood and saliva, from particular geographical areas, and associated methodologies.
Some Examples of Studies Using Linkage to Administrative Data
- A table summarises examples of research studies using linkages with administrative data.
Questions Concerning Study Conclusions
- Researchers analyzing tweets to gauge climate change worry levels among Germans. Researchers advertise a survey regarding social isolation on Instagram, with the aim of finding out the social isolation levels from this sample.
Social Media Use as a Replacement for Official Statistics
- Initially, social media was a promising alternative to official statistics due to its alignment with existing metrics.
- However, issues emerged, as social media usage patterns changed leading to a degradation of the correlation with traditional metrics. Subsequently, job loss trends in social media data were not aligned to actual official unemployment figures.
Concerns of using Social Media as a basis for study
- Micro-decisions during the analysis may strongly affect the results of the study
- Social media data may not be appropriate for replacing survey data, except under a limited set of highly controlled conditions.
Comparing Two Types of Online Survey Samples
- Opt-in samples, used in online surveys, are significantly less accurate than probability-based panel samples, which use random sampling methods.
Probability vs Non-Probability Sample Surveys
- Probability samples are preferred for generalizable inferences but are often expensive and time-consuming.
- Non-probability samples are usually more affordable, timely, and convenient but might not allow for generalization.
- Diverse methods like surveys, digital traces, and others are included in non-probability samples.
Probability Surveys vs Digital Trace Data
- Designed data is used for research purposes.
- Researchers have control over content and a large number of covariates in probability surveys, while organic data, from digital traces, is collected from different purposes with often little control.
What can we do if none of the sources are perfect?
- Combining data sources might be helpful. Combining the strengths of multiple data sources can improve inference while mitigating biases from any single source.
Two Principles of Data Integration
- Data integration (DI) is context dependent, i.e. it depends on the specific purpose of the data being integrated and the data in question.
- DI, considered as a puzzle, requires aligning data quality, such as timeliness, coverage, quality, and size.
Additional Reading
- This section provides further sources of scholarly material that support the themes and examples discussed in the presentation.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Related Documents
Description
This quiz explores essential concepts in survey design, including types of errors, validity, and the implications of sampling frames. It also addresses privacy concerns related to sensor data and the willingness of participants to share information. Test your knowledge on these critical aspects of data collection and analysis.