39 results on '"Matthew J. Salganik"'
Search Results
2. The origins of unpredictability in life trajectory prediction tasks.
- Author
-
Ian Lundberg, Rachel Brown-Weinstock, Susan Clampet-Lundquist, Sarah Pachman, Timothy J. Nelson, Vicki Yang, Kathryn Edin, and Matthew J. Salganik
- Published
- 2023
- Full Text
- View/download PDF
3. Consensus-based guidance for conducting and reporting multi-analyst studies
- Author
-
Balazs Aczel, Barnabas Szaszi, Gustav Nilsonne, Olmo R van den Akker, Casper J Albers, Marcel ALM van Assen, Jojanneke A Bastiaansen, Daniel Benjamin, Udo Boehm, Rotem Botvinik-Nezer, Laura F Bringmann, Niko A Busch, Emmanuel Caruyer, Andrea M Cataldo, Nelson Cowan, Andrew Delios, Noah NN van Dongen, Chris Donkin, Johnny B van Doorn, Anna Dreber, Gilles Dutilh, Gary F Egan, Morton Ann Gernsbacher, Rink Hoekstra, Sabine Hoffmann, Felix Holzmeister, Juergen Huber, Magnus Johannesson, Kai J Jonas, Alexander T Kindel, Michael Kirchler, Yoram K Kunkels, D Stephen Lindsay, Jean-Francois Mangin, Dora Matzke, Marcus R Munafò, Ben R Newell, Brian A Nosek, Russell A Poldrack, Don van Ravenzwaaij, Jörg Rieskamp, Matthew J Salganik, Alexandra Sarafoglou, Tom Schonberg, Martin Schweinsberg, David Shanks, Raphael Silberzahn, Daniel J Simons, Barbara A Spellman, Samuel St-Jean, Jeffrey J Starns, Eric Luis Uhlmann, Jelte Wicherts, and Eric-Jan Wagenmakers
- Subjects
multi-analyst ,metascience ,statistical practice ,science forum ,expert consensus ,analytical variability ,Medicine ,Science ,Biology (General) ,QH301-705.5 - Abstract
Any large dataset can be analyzed in a number of ways, and it is possible that the use of different analysis strategies will lead to different results and conclusions. One way to assess whether the results obtained depend on the analysis strategy chosen is to employ multiple analysts and leave each of them free to follow their own approach. Here, we present consensus-based guidance for conducting and reporting such multi-analyst studies, and we discuss how broader adoption of the multi-analyst approach has the potential to strengthen the robustness of results and conclusions obtained from analyses of datasets in basic and applied research.
- Published
- 2021
- Full Text
- View/download PDF
4. Bit by Bit: Social Research in the Digital Age
- Author
-
Matthew J. Salganik
- Published
- 2017
5. Privacy, ethics, and data access: A case study of the Fragile Families Challenge.
- Author
-
Ian Lundberg, Arvind Narayanan, Karen Levy, and Matthew J. Salganik
- Published
- 2018
6. Integrating explanation and prediction in computational social science
- Author
-
Tal Yarkoni, Alessandro Vespignani, Filiz Garip, Sendhil Mullainathan, Duncan J. Watts, Thomas L. Griffiths, Simine Vazire, Matthew J. Salganik, Jake M. Hofman, Susan Athey, Helen Margetts, and Jon Kleinberg
- Subjects
Multidisciplinary ,business.industry ,05 social sciences ,Big data ,02 engineering and technology ,Data science ,050105 experimental psychology ,Complement (complexity) ,law.invention ,law ,020204 information systems ,Schema (psychology) ,Causal inference ,0202 electrical engineering, electronic engineering, information engineering ,CLARITY ,0501 psychology and cognitive sciences ,Computational sociology ,Convergence (relationship) ,Construct (philosophy) ,business - Abstract
Computational social science is more than just large repositories of digital data and the computational methods needed to construct and analyse them. It also represents a convergence of different fields with different ways of thinking about and doing science. The goal of this Perspective is to provide some clarity around how these approaches differ from one another and to propose how they might be productively integrated. Towards this end we make two contributions. The first is a schema for thinking about research activities along two dimensions-the extent to which work is explanatory, focusing on identifying and estimating causal effects, and the degree of consideration given to testing predictions of outcomes-and how these two priorities can complement, rather than compete with, one another. Our second contribution is to advocate that computational social scientists devote more attention to combining prediction and explanation, which we call integrative modelling, and to outline some practical suggestions for realizing this goal.
- Published
- 2021
- Full Text
- View/download PDF
7. Experimental study of inequality and unpredictability in an artificial cultural market
- Author
-
Matthew J. Salganik, Peter Sheridan Dodds, and Duncan J. Watts
- Published
- 2022
- Full Text
- View/download PDF
8. The game of contacts: Estimating the social visibility of groups.
- Author
-
Matthew J. Salganik, Maeve B. Mello, Alexandre Hannud Abdo, Neilane Bertoni, Dimitri Fazito, and Francisco I. Bastos
- Published
- 2011
- Full Text
- View/download PDF
9. Web-Based Experiments for the Study of Collective Social Dynamics in Cultural Markets.
- Author
-
Matthew J. Salganik and Duncan J. Watts
- Published
- 2009
- Full Text
- View/download PDF
10. Measuring the predictability of life outcomes with a scientific mass collaboration
- Author
-
Kristin E. Porter, Malte Möser, Flora Wang, Bingyu Zhao, Wei Lee Woon, Yoshihiko Suhara, Adaner Usmani, Erik H. Wang, Kun Jin, Samantha Weissman, William Eggert, Hamidreza Omidvar, Andrew Or, Lisa M Hummel, Gregory Faletto, Ben Sender, Qiankun Niu, Viola Mocz, Antje Kirchner, Catherine Wu, Karen Ouyang, Ian Lundberg, Allison C. Morgan, Abdulla Alhajri, Arvind Narayanan, Khaled AlGhoneim, Louis Raes, Ilana M. Horwitz, Barbara E. Engelhardt, Ben Leizman, Crystal Qian, Drew Altschul, Guanhua He, Jeanne Brooks-Gunn, Ridhi Kashyap, Eaman Jahani, Ryan James Compton, Anna Filippova, Sara McLanahan, Tejomay Gadgil, Claudia V. Roberts, Muna Adem, Julia Wang, Jeremy Freese, Alexander T. Kindel, Daniel E Rigobon, Naijia Liu, Lisa P. Argyle, Mayank Mahajan, Jonathan D Tang, Moritz Hardt, Ethan Porter, Diana Mercado-Garcia, Andrew Halpern-Manners, Anahit Sargsyan, Duncan J. Watts, Alex Pentland, Sonia P Hashim, Dean Knox, Onur Varol, Ryan Amos, James M. Wu, Thomas Davidson, Emma Tsurkov, Bernie Hogan, Areg Karapetyan, William Nowak, Jingwen Yin, Livia Baer-Bositis, Landon Schnabel, Chenyun Zhu, Noah Mandell, Ahmed Musse, Yue Gao, Josh Gagné, Stephen McKay, Jennie E. Brand, Abdullah Almaatouq, Katy M. Pinto, Andrew E Mack, Austin van Loon, Bedoor K. AlShebli, Helge Marahrens, Xiafei Wang, Bryan Schonfeld, Sonia Hausen, Kengran Yang, Maria Wolters, Brandon M. Stewart, Naman Jain, Moritz Büchi, Nicole Bohme Carnegie, Redwane Amin, Caitlin Ahearn, Kirstie Whitaker, Bo-Ryehn Chung, Diana Stanescu, Thomas Schaffner, Patrick Kaminski, David Jurgens, Kivan Polimis, Kimberly Higuera, Zhilin Fan, Matthew J. Salganik, Debanjan Datta, Connor Gilroy, E H Kim, Katariina Mueller-Gastell, Karen Levy, Brian J. Goode, Zhi Wang, Tamkinat Rauf, Lundberg, Ian [0000-0002-1909-2270], Almaatouq, Abdullah [0000-0002-8467-9123], Altschul, Drew M [0000-0001-7053-4209], Carnegie, Nicole Bohme [0000-0001-7664-6682], Kashyap, Ridhi [0000-0003-0615-2868], McKay, Stephen [0000-0002-5080-8417], Morgan, Allison C [0000-0003-2926-2162], Raes, Louis [0000-0003-2640-7493], Argyle, Lisa P [0000-0003-3109-2537], Büchi, Moritz [0000-0002-9202-889X], Jin, Kun [0000-0002-0118-1021], Varol, Onur [0000-0002-3994-6106], Watts, Duncan J [0000-0001-5005-4961], Apollo - University of Cambridge Repository, Department of Economics, and Research Group: Economics
- Subjects
Male ,Adolescent ,Computer science ,Social Sciences ,02 engineering and technology ,Cohort Studies ,Machine Learning ,03 medical and health sciences ,0302 clinical medicine ,Life ,Multidisciplinary approach ,Predictive Value of Tests ,Benchmark (surveying) ,0202 electrical engineering, electronic engineering, information engineering ,Econometrics ,Humans ,Family ,Predictability ,mass collaboration ,Child ,life course ,Multidisciplinary ,Fragile Families ,Correction ,Infant ,Fragile Families and Child Wellbeing Study ,prediction ,Outcome (probability) ,Mass collaboration ,machine learning ,L310 Applied Sociology ,Child, Preschool ,Computational Social Science ,Life course approach ,020201 artificial intelligence & image processing ,Computational sociology ,Female ,030217 neurology & neurosurgery - Abstract
© This open access article is distributed under Creative Commons Attribution-NonCommercialNoDerivatives License 4.0 (CC BY-NC-ND). How predictable are life trajectories? We investigated this question with a scientific mass collaboration using the common task method; 160 teams built predictive models for six life outcomes using data from the Fragile Families and Child Wellbeing Study, a high-quality birth cohort study. Despite using a rich dataset and applying machine-learning methods optimized for prediction, the best predictions were not very accurate and were only slightly better than those from a simple benchmark model. Within each outcome, prediction error was strongly associated with the family being predicted and weakly associated with the technique used to generate the prediction. Overall, these results suggest practical limits to the predictability of life outcomes in some settings and illustrate the value of mass collaborations in the social sciences.
- Published
- 2020
- Full Text
- View/download PDF
11. Author response: Consensus-based guidance for conducting and reporting multi-analyst studies
- Author
-
Jeffrey J. Starns, Yoram K. Kunkels, Gustav Nilsonne, Balazs Aczel, Felix Holzmeister, Jörg Rieskamp, Andrea M Cataldo, Casper J. Albers, Ben R. Newell, Brian A Nosek, Raphael Silberzahn, Jean-François Mangin, Morton Ann Gernsbacher, Gary F. Egan, Alexandra Sarafoglou, Barnabas Szaszi, Matthew J. Salganik, Martin Schweinsberg, Alexander T. Kindel, Niko A. Busch, Daniel J. Simons, Emmanuel Caruyer, Eric-Jan Wagenmakers, Russell A. Poldrack, Dora Matzke, Noah van Dongen, Anna Dreber, Nelson Cowan, Barbara A. Spellman, Chris Donkin, Gilles Dutilh, Johnny van Doorn, Don van Ravenzwaaij, Marcus R. Munafò, Marcel A.L.M. van Assen, Rink Hoekstra, Juergen Huber, Samuel St-Jean, Laura F. Bringmann, Udo Boehm, Daniel J. Benjamin, Tom Schonberg, Eric Luis Uhlmann, D. Stephen Lindsay, Michael Kirchler, David R. Shanks, Jojanneke A. Bastiaansen, Kai J. Jonas, Andrew Delios, Olmo van den Akker, Jelte M. Wicherts, Sabine Hoffmann, Rotem Botvinik-Nezer, and Magnus Johannesson
- Published
- 2021
- Full Text
- View/download PDF
12. Wiki surveys: open and quantifiable social data collection.
- Author
-
Matthew J Salganik and Karen E C Levy
- Subjects
Medicine ,Science - Abstract
In the social sciences, there is a longstanding tension between data collection methods that facilitate quantification and those that are open to unanticipated information. Advances in technology now enable new, hybrid methods that combine some of the benefits of both approaches. Drawing inspiration from online information aggregation systems like Wikipedia and from traditional survey research, we propose a new class of research instruments called wiki surveys. Just as Wikipedia evolves over time based on contributions from participants, we envision an evolving survey driven by contributions from respondents. We develop three general principles that underlie wiki surveys: they should be greedy, collaborative, and adaptive. Building on these principles, we develop methods for data collection and data analysis for one type of wiki survey, a pairwise wiki survey. Using two proof-of-concept case studies involving our free and open-source website www.allourideas.org, we show that pairwise wiki surveys can yield insights that would be difficult to obtain with other methods.
- Published
- 2015
- Full Text
- View/download PDF
13. Computational social science: Obstacles and opportunities
- Author
-
Noshir Contractor, Alessandro Vespignani, Sinan Aral, Deen Freelon, Alondra Nelson, Alex Pentland, David Lazer, Duncan J. Watts, Susan Athey, Matthew J. Salganik, Helen Margetts, Claudia Wagner, Sandra González-Bailón, Markus Strohmaier, and Gary King
- Subjects
Multidisciplinary ,Management science ,MEDLINE ,Computational sociology ,Sociology - Abstract
Data sharing, research ethics, and incentives must improve
- Published
- 2020
- Full Text
- View/download PDF
14. Integrating explanation and prediction in computational social science
- Author
-
Jake M, Hofman, Duncan J, Watts, Susan, Athey, Filiz, Garip, Thomas L, Griffiths, Jon, Kleinberg, Helen, Margetts, Sendhil, Mullainathan, Matthew J, Salganik, Simine, Vazire, Alessandro, Vespignani, and Tal, Yarkoni
- Subjects
Data Science ,Humans ,Social Sciences ,Computer Simulation ,Models, Theoretical ,Goals ,Forecasting - Abstract
Computational social science is more than just large repositories of digital data and the computational methods needed to construct and analyse them. It also represents a convergence of different fields with different ways of thinking about and doing science. The goal of this Perspective is to provide some clarity around how these approaches differ from one another and to propose how they might be productively integrated. Towards this end we make two contributions. The first is a schema for thinking about research activities along two dimensions-the extent to which work is explanatory, focusing on identifying and estimating causal effects, and the degree of consideration given to testing predictions of outcomes-and how these two priorities can complement, rather than compete with, one another. Our second contribution is to advocate that computational social scientists devote more attention to combining prediction and explanation, which we call integrative modelling, and to outline some practical suggestions for realizing this goal.
- Published
- 2021
15. Wiki surveys: Open and quantifiable social data collection
- Author
-
Matthew J. Salganik and Karen E. C. Levy
- Published
- 2012
16. Improving metadata infrastructure for complex surveys: Insights from the Fragile Families Challenge
- Author
-
Maya Phillips, Shiva Rouhani, Kate Jaeger, Alexander T. Kindel, Thomas H. Hartshorne, Dawn Koffman, Sara McLanahan, Matthew J. Salganik, Kristin Catena, Ryan Vinh, and Vineet Bansal
- Subjects
050402 sociology ,Computer science ,Quantitative methodology ,05 social sciences ,SocArXiv|Social and Behavioral Sciences|Sociology|Methodology ,General Social Sciences ,Survey research ,01 natural sciences ,Data science ,SocArXiv|Social and Behavioral Sciences|Social Statistics ,Data sharing ,Metadata ,bepress|Social and Behavioral Sciences|Sociology ,SocArXiv|Social and Behavioral Sciences|Sociology ,010104 statistics & probability ,0504 sociology ,bepress|Social and Behavioral Sciences|Sociology|Quantitative, Qualitative, Comparative, and Historical Methodologies ,bepress|Social and Behavioral Sciences|Social Statistics ,bepress|Social and Behavioral Sciences ,Computational sociology ,Limit (mathematics) ,SocArXiv|Social and Behavioral Sciences ,0101 mathematics - Abstract
Researchers rely on metadata systems to prepare data for analysis. As the complexity of data sets increases and the breadth of data analysis practices grow, existing metadata systems can limit the efficiency and quality of data preparation. This article describes the redesign of a metadata system supporting the Fragile Families and Child Wellbeing Study on the basis of the experiences of participants in the Fragile Families Challenge. The authors demonstrate how treating metadata as data (i.e., releasing comprehensive information about variables in a format amenable to both automated and manual processing) can make the task of data preparation less arduous and less error prone for all types of data analysis. The authors hope that their work will facilitate new applications of machine-learning methods to longitudinal surveys and inspire research on data preparation in the social sciences. The authors have open-sourced the tools they created so that others can use and improve them.
- Published
- 2018
17. Privacy, ethics, and data access: A case study of the Fragile Families Challenge
- Author
-
Matthew J. Salganik, Ian Lundberg, Arvind Narayanan, and Karen Levy
- Subjects
FOS: Computer and information sciences ,050402 sociology ,Computer science ,business.industry ,05 social sciences ,Internet privacy ,General Social Sciences ,Face (sociological concept) ,01 natural sciences ,010104 statistics & probability ,Mass collaboration ,Computer Science - Computers and Society ,Data access ,0504 sociology ,Computers and Society (cs.CY) ,0101 mathematics ,business - Abstract
Stewards of social science data face a fundamental tension. On one hand, they want to make their data accessible to as many researchers as possible to facilitate new discoveries. At the same time, they want to restrict access to their data as much as possible in order to protect the people represented in the data. In this paper, we provide a case study addressing this common tension in an uncommon setting: the Fragile Families Challenge, a scientific mass collaboration designed to yield insights that could improve the lives of disadvantaged children in the United States. We describe our process of threat modeling, threat mitigation, and third-party guidance. We also describe the ethical principles that formed the basis of our process. We are open about our process and the trade-offs that we made in the hopes that others can improve on what we have done., 60 pages, 9 figures, 1 table
- Published
- 2018
18. Strengthening the Reporting of Observational Studies in Epidemiology for respondent-driven sampling studies: 'STROBE-RDS' statement
- Author
-
Wolfgang Hladik, Matthias Egger, Michael W. Spiller, Amy Drake, Avi J Hakim, Richard G. White, Kate K. Orroth, Carl Kendall, David Wilson, Matthew J. Salganik, Ligia Regina Franco Sansigolo Kerr, and Lisa G. Johnston
- Subjects
Research design ,Practice guidelines as topic ,congenital, hereditary, and neonatal diseases and abnormalities ,medicine.medical_specialty ,Cross-sectional study ,Epidemiology ,MEDLINE ,610 Medicine & health ,Review Article ,Strengthening the reporting of observational studies in epidemiology ,computer.software_genre ,Sampling Studies ,03 medical and health sciences ,0302 clinical medicine ,Observation/methods ,360 Social problems & social services ,Surveys and Questionnaires ,Biomedical research/methods ,medicine ,Humans ,030212 general & internal medicine ,030505 public health ,Publishing/standards ,business.industry ,Public health ,Epidemiologic studies ,Guidelines as topic ,respiratory system ,eye diseases ,Checklist ,respiratory tract diseases ,3. Good health ,Observational Studies as Topic ,Family medicine ,Respondent ,Cross-sectional studies ,sense organs ,Data mining ,0305 other medical science ,business ,Epidemiologic research design ,Guidelines as topic/standards ,computer - Abstract
OBJECTIVES Respondent-driven sampling (RDS) is a new data collection methodology used to estimate characteristics of hard-to-reach groups, such as the HIV prevalence in drug users. Many national public health systems and international organizations rely on RDS data. However, RDS reporting quality and available reporting guidelines are inadequate. We carried out a systematic review of RDS studies and present Strengthening the Reporting of Observational Studies in Epidemiology for RDS Studies (STROBE-RDS), a checklist of essential items to present in RDS publications, justified by an explanation and elaboration document. STUDY DESIGN AND SETTING We searched the MEDLINE (1970-2013), EMBASE (1974-2013), and Global Health (1910-2013) databases to assess the number and geographical distribution of published RDS studies. STROBE-RDS was developed based on STROBE guidelines, following Guidance for Developers of Health Research Reporting Guidelines. RESULTS RDS has been used in over 460 studies from 69 countries, including the USA (151 studies), China (70), and India (32). STROBE-RDS includes modifications to 12 of the 22 items on the STROBE checklist. The two key areas that required modification concerned the selection of participants and statistical analysis of the sample. CONCLUSION STROBE-RDS seeks to enhance the transparency and utility of research using RDS. If widely adopted, STROBE-RDS should improve global infectious diseases public health decision making.
- Published
- 2015
- Full Text
- View/download PDF
19. Diagnostics for Respondent-Driven Sampling
- Author
-
Lisa G. Johnston, Matthew J. Salganik, and Krista J. Gile
- Subjects
Statistics and Probability ,Economics and Econometrics ,Computer science ,Human immunodeficiency virus (HIV) ,Survey sampling ,Inference ,medicine.disease_cause ,computer.software_genre ,Article ,03 medical and health sciences ,0302 clinical medicine ,Sampling design ,medicine ,030212 general & internal medicine ,030505 public health ,Data collection ,Sampling (statistics) ,Data science ,3. Good health ,Exploratory data analysis ,Respondent ,Data mining ,Statistics, Probability and Uncertainty ,0305 other medical science ,computer ,Social Sciences (miscellaneous) - Abstract
Summary Respondent-driven sampling (RDS) is a widely used method for sampling from hard-to-reach human populations, especially populations at higher risk for human immunodeficiency virus or acquired immune deficiency syndrome. Data are collected through a peer referral process over social networks. RDS has proven practical for data collection in many difficult settings and has been adopted by leading public health organizations around the world. Unfortunately, inference from RDS data requires many strong assumptions because the sampling design is partially beyond the control of the researcher and not fully observable. We introduce diagnostic tools for most of these assumptions and apply them in 12 high risk populations. These diagnostics empower researchers to understand their RDS data better and encourage future statistical research on RDS sampling and inference.
- Published
- 2014
- Full Text
- View/download PDF
20. Successes and Struggles with Computational Reproducibility: Lessons from the Fragile Families Challenge
- Author
-
David M. Liu and Matthew J. Salganik
- Subjects
0303 health sciences ,03 medical and health sciences ,05 social sciences ,050602 political science & public administration ,General Social Sciences ,030304 developmental biology ,0506 political science - Abstract
Reproducibility is fundamental to science, and an important component of reproducibility is computational reproducibility: the ability of a researcher to recreate the results of a published study using the original author’s raw data and code. Although most people agree that computational reproducibility is important, it is still difficult to achieve in practice. In this article, the authors describe their approach to enabling computational reproducibility for the 12 articles in this special issue of Socius about the Fragile Families Challenge. The approach draws on two tools commonly used by professional software engineers but not widely used by academic researchers: software containers (e.g., Docker) and cloud computing (e.g., Amazon Web Services). These tools made it possible to standardize the computing environment around each submission, which will ease computational reproducibility both today and in the future. Drawing on their successes and struggles, the authors conclude with recommendations to researchers and journals.
- Published
- 2019
- Full Text
- View/download PDF
21. Introduction to the Special Collection on the Fragile Families Challenge
- Author
-
Sara McLanahan, Alexander T. Kindel, Matthew J. Salganik, and Ian Lundberg
- Subjects
Mass collaboration ,050402 sociology ,0504 sociology ,Computer science ,05 social sciences ,050602 political science & public administration ,Measure (physics) ,General Social Sciences ,Life course approach ,Predictability ,Data science ,0506 political science - Abstract
The Fragile Families Challenge is a scientific mass collaboration designed to measure and understand the predictability of life trajectories. Participants in the Challenge created predictive models of six life outcomes using data from the Fragile Families and Child Wellbeing Study, a high-quality birth cohort study. This Special Collection includes 12 articles describing participants’ approaches to predicting these six outcomes as well as 3 articles describing methodological and procedural insights from running the Challenge. This introduction will help readers interpret the individual articles and help researchers interested in running future projects similar to the Fragile Families Challenge.
- Published
- 2019
- Full Text
- View/download PDF
22. Quantity Versus Quality: A Survey Experiment to Improve the Network Scale-up Method
- Author
-
Aline Umubyeyi, Mary Mahy, Dennis M. Feehan, Matthew J. Salganik, and Wolfgang Hladik
- Subjects
Male ,social networks ,Generalization ,Practice of Epidemiology ,Epidemiology ,media_common.quotation_subject ,epidemiologic methods ,Human immunodeficiency virus (HIV) ,HIV Infections ,medicine.disease_cause ,Social Environment ,01 natural sciences ,Risk Assessment ,Medical and Health Sciences ,Mathematical Sciences ,Social Networking ,Personal network ,Drug Users ,010104 statistics & probability ,03 medical and health sciences ,0302 clinical medicine ,Surveys and Questionnaires ,population size estimation ,survey research ,Statistics ,Medicine ,Hidden populations ,Humans ,Quality (business) ,030212 general & internal medicine ,0101 mathematics ,Homosexuality, Male ,Substance Abuse, Intravenous ,media_common ,Sex Workers ,Social network ,business.industry ,Substance Abuse ,Rwanda ,HIV ,acquired immunodeficiency syndrome ,Homosexuality ,Survey experiment ,3. Good health ,Key (cryptography) ,Female ,network sampling ,business ,Intravenous - Abstract
The network scale-up method is a promising technique that uses sampled social network data to estimate the sizes of epidemiologically important hidden populations, such as sex workers and people who inject illicit drugs. Although previous scale-up research has focused exclusively on networks of acquaintances, we show that the type of personal network about which survey respondents are asked to report is a potentially crucial parameter that researchers are free to vary. This generalization leads to a method that is more flexible and potentially more accurate. In 2011, we conducted a large, nationally representative survey experiment in Rwanda that randomized respondents to report about one of 2 different personal networks. Our results showed that asking respondents for less information can, somewhat surprisingly, produce more accurate size estimates. We also estimated the sizes of 4 key populations at risk for human immunodeficiency virus infection in Rwanda. Our estimates were higher than earlier estimates from Rwanda but lower than international benchmarks. Finally, in this article we develop a new sensitivity analysis framework and use it to assess the possible biases in our estimates. Our design can be customized and extended for other settings, enabling researchers to continue to improve the network scale-up method.
- Published
- 2016
- Full Text
- View/download PDF
23. The game of contacts: Estimating the social visibility of groups
- Author
-
Alexandre Hannud Abdo, Dimitri Fazito, Matthew J. Salganik, Francisco Inácio Bastos, Neilane Bertoni, and Maeve Brito de Mello
- Subjects
Estimation ,Actuarial science ,Sociology and Political Science ,Population size ,Visibility (geometry) ,General Social Sciences ,Public policy ,Sample (statistics) ,Article ,Men who have sex with men ,Anthropology ,Scale (social sciences) ,Statistics ,Psychology ,Social network analysis ,General Psychology - Abstract
Estimating the sizes of hard-to-count populations is a challenging and important problem that occurs frequently in social science, public health, and public policy. This problem is particularly pressing in HIV/AIDS research because estimates of the sizes of the most at-risk populations—illicit drug users, men who have sex with men, and sex workers—are needed for designing, evaluating, and funding programs to curb the spread of the disease. A promising new approach in this area is the network scale-up method, which uses information about the personal networks of respondents to make population size estimates. However, if the target population has low social visibility, as is likely to be the case in HIV/AIDS research, scale-up estimates will be too low. In this paper we develop a game-like activity that we call the game of contacts in order to estimate the social visibility of groups, and report results from a study of heavy drug users in Curitiba, Brazil (n = 294). The game produced estimates of social visibility that were consistent with qualitative expectations but of surprising magnitude. Further, a number of checks suggest that the data are high-quality. While motivated by the specific problem of population size estimation, our method could be used by researchers more broadly and adds to long-standing efforts to combine the richness of social network analysis with the power and scale of sample surveys.
- Published
- 2011
- Full Text
- View/download PDF
24. Web-Based Experiments for the Study of Collective Social Dynamics in Cultural Markets
- Author
-
Duncan J. Watts and Matthew J. Salganik
- Subjects
Adult ,Male ,Linguistics and Language ,Adolescent ,Cognitive Neuroscience ,Culture ,Experimental and Cognitive Psychology ,Article ,Social group ,Young Adult ,Interpersonal relationship ,Web-based experiments ,Artificial Intelligence ,Humans ,Interpersonal Relations ,Social Behavior ,Social influence ,Internet ,Group Processes ,Human-Computer Interaction ,Social dynamics ,Social system ,Female ,Psychology ,Social psychology ,Algorithms ,Music ,Mechanism (sociology) ,Strengths and weaknesses - Abstract
Social scientists are often interested in understanding how the dynamics of social systems are driven by the behavior of individuals that make up those systems. However, this process is hindered by the difficulty of experimentally studying how individual behavioral tendencies lead to collective social dynamics in large groups of people interacting over time. In this study, we investigate the role of social influence, a process well studied at the individual level, on the puzzling nature of success for cultural products such as books, movies, and music. Using a "multiple-worlds" experimental design, we are able to isolate the causal effect of an individual-level mechanism on collective social outcomes. We employ this design in a Web-based experiment in which 2,930 participants listened to, rated, and downloaded 48 songs by up-and-coming bands. Surprisingly, despite relatively large differences in the demographics, behavior, and preferences of participants, the experimental results at both the individual and collective levels were similar to those found in Salganik, Dodds, and Watts (2006). Further, by comparing results from two distinct pools of participants, we are able to gain new insights into the role of individual behavior on collective outcomes. We conclude with a discussion of the strengths and weaknesses of Web-based experiments to address questions of collective social dynamics.
- Published
- 2009
- Full Text
- View/download PDF
25. Respondent-driven sampling as Markov chain Monte Carlo
- Author
-
Sharad Goel and Matthew J. Salganik
- Subjects
Statistics and Probability ,congenital, hereditary, and neonatal diseases and abnormalities ,Markov chain ,Epidemiology ,Computer science ,Monte Carlo method ,Markov process ,Sampling (statistics) ,Markov chain Monte Carlo ,Sample (statistics) ,Variance (accounting) ,respiratory tract diseases ,symbols.namesake ,Statistics ,symbols ,Econometrics ,Importance sampling - Abstract
Respondent-driven sampling (RDS) is a recently introduced, and now widely used, technique for estimating disease prevalence in hidden populations. RDS data are collected through a snowball mechanism, in which current sample members recruit future sample members. In this paper we present RDS as Markov chain Monte Carlo importance sampling, and we examine the effects of community structure and the recruitment procedure on the variance of RDS estimates. Past work has assumed that the variance of RDS estimates is primarily affected by segregation between healthy and infected individuals. We examine an illustrative model to show that this is not necessarily the case, and that bottlenecks anywhere in the networks can substantially affect estimates. We also show that variance is inflated by a common design feature in which the sample members are encouraged to recruit multiple future sample members. The paper concludes with suggestions for implementing and evaluating RDS studies.
- Published
- 2009
- Full Text
- View/download PDF
26. Wiki Surveys: Open and Quantifiable Social Data Collection
- Author
-
Karen Levy and Matthew J. Salganik
- Subjects
FOS: Computer and information sciences ,Computer science ,lcsh:Medicine ,Social Sciences ,Bioinformatics ,01 natural sciences ,Statistics - Applications ,010104 statistics & probability ,Computer Science - Computers and Society ,Surveys and Questionnaires ,Computers and Society (cs.CY) ,050602 political science & public administration ,Humans ,Applications (stat.AP) ,0101 mathematics ,lcsh:Science ,Internet ,Multidisciplinary ,Data collection ,business.industry ,Data Collection ,05 social sciences ,lcsh:R ,Survey research ,Statistical model ,Data science ,0506 political science ,Survey data collection ,The Internet ,Pairwise comparison ,lcsh:Q ,business ,Research Article - Abstract
In the social sciences, there is a longstanding tension between data collection methods that facilitate quantification and those that are open to unanticipated information. Advances in technology now enable new, hybrid methods that combine some of the benefits of both approaches. Drawing inspiration from online information aggregation systems like Wikipedia and from traditional survey research, we propose a new class of research instruments called wiki surveys. Just as Wikipedia evolves over time based on contributions from participants, we envision an evolving survey driven by contributions from respondents. We develop three general principles that underlie wiki surveys: they should be greedy, collaborative, and adaptive. Building on these principles, we develop methods for data collection and data analysis for one type of wiki survey, a pairwise wiki survey. Using two proof-of-concept case studies involving our free and open-source website www.allourideas.org, we show that pairwise wiki surveys can yield insights that would be difficult to obtain with other methods., Comment: 24 pages, 8 figures, 1 table
- Published
- 2015
27. How Many People Do You Know in Prison?
- Author
-
Andrew Gelman, Matthew J. Salganik, and Tian Zheng
- Subjects
Statistics and Probability ,education.field_of_study ,Population ,Negative binomial distribution ,Survey sampling ,Regression analysis ,Simple random sample ,Social group ,Overdispersion ,Statistics ,Econometrics ,Statistics, Probability and Uncertainty ,education ,Count data - Abstract
Networks—sets of objects connected by relationships—are important in a number of fields. The study of networks has long been central to sociology, where researchers have attempted to understand the causes and consequences of the structure of relationships in large groups of people. Using insight from previous network research, Killworth et al. and McCarty et al. have developed and evaluated a method for estimating the sizes of hard-to-count populations using network data collected from a simple random sample of Americans. In this article we show how, using a multilevel overdispersed Poisson regression model, these data also can be used to estimate aspects of social structure in the population. Our work goes beyond most previous research on networks by using variation, as well as average responses, as a source of information. We apply our method to the data of McCarty et al. and find that Americans vary greatly in their number of acquaintances. Further, Americans show great variation in propensity to form ...
- Published
- 2006
- Full Text
- View/download PDF
28. 5. Sampling and Estimation in Hidden Populations Using Respondent-Driven Sampling
- Author
-
Douglas D. Heckathorn and Matthew J. Salganik
- Subjects
Estimation ,education.field_of_study ,Sociology and Political Science ,Computer science ,05 social sciences ,Population ,050401 social sciences methods ,Sampling (statistics) ,Sample (statistics) ,Target population ,0506 political science ,0504 sociology ,Respondent ,Statistics ,050602 political science & public administration ,Trait ,Econometrics ,Hidden populations ,education - Abstract
Standard statistical methods often provide no way to make accurate estimates about the characteristics of hidden populations such as injection drug users, the homeless, and artists. In this paper, we further develop a sampling and estimation technique called respondent-driven sampling, which allows researchers to make asymptotically unbiased estimates about these hidden populations. The sample is selected with a snowball-type design that can be done more cheaply, quickly, and easily than other methods currently in use. Further, we can show that under certain specified (and quite general) conditions, our estimates for the percentage of the population with a specific trait are asymptotically unbiased. We further show that these estimates are asymptotically unbiased no matter how the seeds are selected. We conclude with a comparison of respondent-driven samples of jazz musicians in New York and San Francisco, with corresponding institutional samples of jazz musicians from these cities. The results show that some standard methods for studying hidden populations can produce misleading results.
- Published
- 2004
- Full Text
- View/download PDF
29. Commentary
- Author
-
Matthew J. Salganik
- Subjects
Epidemiology ,Computer science ,Extramural ,Statistics ,Respondent ,MEDLINE ,Sampling (statistics) - Published
- 2012
- Full Text
- View/download PDF
30. Generalizing the Network Scale-Up Method: A New Estimator for the Size of Hidden Populations
- Author
-
Dennis M. Feehan and Matthew J. Salganik
- Subjects
FOS: Computer and information sciences ,hidden populations ,social networks ,sampling ,Sociology and Political Science ,Computer science ,Population ,Sample (statistics) ,Machine learning ,computer.software_genre ,01 natural sciences ,Statistics - Applications ,Article ,010104 statistics & probability ,03 medical and health sciences ,0302 clinical medicine ,Sociology ,network scale-up method ,Applications (stat.AP) ,Social Sciences Methods ,030212 general & internal medicine ,0101 mathematics ,education ,Sampling frame ,Demography ,education.field_of_study ,business.industry ,Frame (networking) ,Estimator ,Artificial intelligence ,business ,computer - Abstract
The network scale-up method enables researchers to estimate the sizes of hidden populations, such as drug injectors and sex workers, using sampled social network data. The basic scale-up estimator offers advantages over other size estimation techniques, but it depends on problematic modeling assumptions. The authors propose a new generalized scale-up estimator that can be used in settings with nonrandom social mixing and imperfect awareness about membership in the hidden population. In addition, the new estimator can be used when data are collected via complex sample designs and from incomplete sampling frames. However, the generalized scale-up estimator also requires data from two samples: one from the frame population and one from the hidden population. In some situations these data from the hidden population can be collected by adding a small number of questions to already planned studies. For other situations, the authors develop interpretable adjustment factors that can be applied to the basic scale-up estimator. The authors conclude with practical recommendations for the design and analysis of future studies.
- Published
- 2014
31. Experimental Study of Inequality and Unpredictability in an Artificial Cultural Market
- Author
-
Duncan J. Watts, Matthew J. Salganik, and Peter Sheridan Dodds
- Subjects
Rest (physics) ,Internet ,Multidisciplinary ,Inequality ,media_common.quotation_subject ,Culture ,Consumer research ,Consumer Behavior ,Consumer satisfaction ,Interpersonal relationship ,Sociology ,Research Design ,Humans ,Interpersonal Relations ,Quality (business) ,Positive economics ,Social Behavior ,Psychology ,Music ,Consumer behaviour ,Forecasting ,media_common ,Social influence - Abstract
Hit songs, books, and movies are many times more successful than average, suggesting that “the best” alternatives are qualitatively different from “the rest”; yet experts routinely fail to predict which products will succeed. We investigated this paradox experimentally, by creating an artificial “music market” in which 14,341 participants downloaded previously unknown songs either with or without knowledge of previous participants' choices. Increasing the strength of social influence increased both inequality and unpredictability of success. Success was also only partly determined by quality: The best songs rarely did poorly, and the worst rarely did well, but any other result was possible.
- Published
- 2006
- Full Text
- View/download PDF
32. Leading the Herd Astray: An Experimental Study of Self-Fulfilling Prophecies in an Artificial Cultural Market
- Author
-
Matthew J. Salganik and Duncan J. Watts
- Subjects
Social Psychology ,media_common.quotation_subject ,Deception ,Popularity ,Article ,Value judgment ,Perception ,Self-fulfilling prophecy ,Singing ,Psychology ,Social psychology ,media_common ,Reputation ,Social influence - Abstract
Individuals influence each others' decisions about cultural products such as songs, books, and movies; but to what extent can the perception of success become a “self-fulfilling prophecy”? We have explored this question experimentally by artificially inverting the true popularity of songs in an online “music market,” in which 12,207 participants listened to and downloaded songs by unknown bands. We found that most songs experienced self-ful- filling prophecies, in which perceived—but initially false—popularity became real over time. We also found, however, that the inversion was not self-fulfilling for the market as a whole, in part because the very best songs recovered their popularity in the long run. Moreover, the distortion of market information reduced the correlation between appeal and popularity, and led to fewer overall downloads. These results, although partial and speculative, suggest a new approach to the study of cultural markets, and indicate the potential of web-based experiments to explore the social psychological origin of other macrosocio- logical phenomena.
- Published
- 2013
33. Assessing network scale-up estimates for groups most at risk of HIV/AIDS: evidence from a multiple-method study of heavy drug users in Curitiba, Brazil
- Author
-
Matthew J. Salganik, Alexandre Hannud Abdo, Neilane Bertoni, Maeve Brito de Mello, Francisco Inácio Bastos, and Dimitri Fazito
- Subjects
Gerontology ,medicine.medical_specialty ,social networks ,Epidemiology ,Practice of Epidemiology ,epidemiologic methods ,MEDLINE ,Developing country ,HIV Infections ,Disease ,Risk Assessment ,03 medical and health sciences ,0302 clinical medicine ,Acquired immunodeficiency syndrome (AIDS) ,population size estimation ,Statistics ,medicine ,Prevalence ,Humans ,030212 general & internal medicine ,Substance Abuse, Intravenous ,Estimation ,Acquired Immunodeficiency Syndrome ,030505 public health ,biology ,business.industry ,Curitiba ,HIV ,medicine.disease ,biology.organism_classification ,3. Good health ,Epidemiologic Research Design ,network sampling ,0305 other medical science ,Risk assessment ,business ,Brazil - Abstract
One of the many challenges hindering the global response to the human immunodeficiency virus (HIV)/acquired immunodeficiency syndrome (AIDS) epidemic is the difficulty of collecting reliable information about the populations most at risk for the disease. Thus, the authors empirically assessed a promising new method for estimating the sizes of most at-risk populations: the network scale-up method. Using 4 different data sources, 2 of which were from other researchers, the authors produced 5 estimates of the number of heavy drug users in Curitiba, Brazil. The authors found that the network scale-up and generalized network scale-up estimators produced estimates 5-10 times higher than estimates made using standard methods (the multiplier method and the direct estimation method using data from 2004 and 2010). Given that equally plausible methods produced such a wide range of results, the authors recommend that additional studies be undertaken to compare estimates based on the scale-up method with those made using other methods. If scale-up-based methods routinely produce higher estimates, this would suggest that scale-up-based methods are inappropriate for populations most at risk of HIV/AIDS or that standard methods may tend to underestimate the sizes of these populations.
- Published
- 2011
34. Social Influence
- Author
-
Duncan J. Watts and Matthew J. Salganik
- Abstract
This article examines the role of social influence in the puzzling nature of success in cultural markets: successful cultural products, such as hit songs, best-selling books, and blockbuster movies, are considerably more successful than average; yet which particular songs, books, and movies will become the next ‘big thing’ appears impossible to predict. The article investigates this paradox empirically by constructing a website where more than 27,000 participants were allowed to listen to, rate, and download new music, and where the information that these participants had about the behavior of others could be controlled. In the first three experiments, the popularity of the songs were allowed to emerge naturally, without any intervention. In the fourth experiment, the problem of self-fulfilling prophecies in cultural markets was addressed. The results show that social influence gives rise to unanticipated consequences at the collective level, including inequality and unpredictability of success.
- Published
- 2011
- Full Text
- View/download PDF
35. Counting hard-to-count populations: the network scale-up method for public health
- Author
-
Matthew J. Salganik, Rob Lyerla, Otilia Scutelniciuc, Timothy B. Hallett, Alexandrina Iovita, Sharon S. Weir, Eugene C. Johnsen, Gene A. Shelley, Christopher McCarty, Donna F. Stroup, Mary Mahy, Petchsri Sirinirund, H. Russell Bernard, and Tetiana Saliuk
- Subjects
medicine.medical_specialty ,Population ,Dermatology ,Risk Assessment ,Personal network ,summation method ,Environmental health ,Statistics ,medicine ,Humans ,education ,known population method ,Estimation ,education.field_of_study ,Data collection ,NSU ,business.industry ,Public health ,Data Collection ,HIV ,AIDS ,personal network size ,Infectious Diseases ,Sample size determination ,Sample Size ,Population size ,surveillance ,Public Health ,Risk assessment ,business ,Strengths and weaknesses ,Supplement - Abstract
Estimating sizes of hidden or hard-to-reach populations is an important problem in public health. For example, estimates of the sizes of populations at highest risk for HIV and AIDS are needed for designing, evaluating and allocating funding for treatment and prevention programmes. A promising approach to size estimation, relatively new to public health, is the network scale-up method (NSUM), involving two steps: estimating the personal network size of the members of a random sample of a total population and, with this information, estimating the number of members of a hidden subpopulation of the total population. We describe the method, including two approaches to estimating personal network sizes (summation and known population). We discuss the strengths and weaknesses of each approach and provide examples of international applications of the NSUM in public health. We conclude with recommendations for future research and evaluation.
- Published
- 2010
36. Assessing respondent-driven sampling
- Author
-
Sharad Goel and Matthew J. Salganik
- Subjects
medicine.medical_specialty ,Computer science ,HIV Infections ,Public health surveillance ,Econometrics ,medicine ,Humans ,Substance Abuse, Intravenous ,Multidisciplinary ,Data collection ,Models, Statistical ,Public health ,Data Collection ,Sampling (statistics) ,Reproducibility of Results ,Variance (accounting) ,Confidence interval ,Sample size determination ,Research Design ,Data Interpretation, Statistical ,Population Surveillance ,Sample Size ,Respondent ,Communicable Disease Control ,Physical Sciences ,Public Health ,Algorithms - Abstract
Respondent-driven sampling (RDS) is a network-based technique for estimating traits in hard-to-reach populations, for example, the prevalence of HIV among drug injectors. In recent years RDS has been used in more than 120 studies in more than 20 countries and by leading public health organizations, including the Centers for Disease Control and Prevention in the United States. Despite the widespread use and growing popularity of RDS, there has been little empirical validation of the methodology. Here we investigate the performance of RDS by simulating sampling from 85 known, network populations. Across a variety of traits we find that RDS is substantially less accurate than generally acknowledged and that reported RDS confidence intervals are misleadingly narrow. Moreover, because we model a best-case scenario in which the theoretical RDS sampling assumptions hold exactly, it is unlikely that RDS performs any better in practice than in our simulations. Notably, the poor performance of RDS is driven not by the bias but by the high variance of estimates, a possibility that had been largely overlooked in the RDS literature. Given the consistency of our results across networks and our generous sampling conditions, we conclude that RDS as currently practiced may not be suitable for key aspects of public health surveillance where it is now extensively applied.
- Published
- 2010
37. How many people do you know?: Efficiently estimating personal network size
- Author
-
Tyler H. McCormick, Tian Zheng, and Matthew J. Salganik
- Subjects
Statistics and Probability ,education.field_of_study ,Social network ,Computer science ,business.industry ,Population size ,Social sciences--Research ,Population ,Statistics ,Survey sampling ,Statistical model ,Sample (statistics) ,Article ,Social group ,Personal network ,Econometrics ,FOS: Mathematics ,Statistics, Probability and Uncertainty ,education ,business - Abstract
In this paper we develop a method to estimate both individual social network size (i.e., degree) and the distribution of network sizes in a population by asking respondents how many people they know in specific subpopulations (e.g., people named Michael). Building on the scale-up method of Killworth et al. (1998b) and other previous attempts to estimate individual network size, we propose a latent non-random mixing model which resolves three known problems with previous approaches. As a byproduct, our method also provides estimates of the rate of social mixing between population groups. We demonstrate the model using a sample of 1,370 adults originally collected by McCarty et al. (2001). Based on insights developed during the statistical modeling, we conclude by offering practical guidelines for the design of future surveys to estimate social network size. Most importantly, we show that if the first names to be asked about are chosen properly, the simple scale-up degree estimates can enjoy the same bias-reduction as that from the our more complex latent non-random mixing model.
- Published
- 2010
- Full Text
- View/download PDF
38. Variance Estimation, Design Effects, and Sample Size Calculations for Respondent-Driven Sampling
- Author
-
Matthew J. Salganik
- Subjects
Health (social science) ,Respondent-driven sampling ,Computer science ,Sexually Transmitted Diseases ,Article ,Variance estimation ,Sampling Studies ,Health(social science) ,Nonprobability sampling ,Snowball sampling ,Statistics ,Econometrics ,Humans ,Design effects ,Analysis of Variance ,Sample size ,Hidden populations ,Data Collection ,Public Health, Environmental and Occupational Health ,Sampling (statistics) ,Simple random sample ,Confidence interval ,Stratified sampling ,Urban Studies ,Power analysis ,Sample size determination ,Epidemiologic Research Design ,Cluster sampling - Abstract
Hidden populations, such as injection drug users and sex workers, are central to a number of public health problems. However, because of the nature of these groups, it is difficult to collect accurate information about them, and this difficulty complicates disease prevention efforts. A recently developed statistical approach called respondent-driven sampling improves our ability to study hidden populations by allowing researchers to make unbiased estimates of the prevalence of certain traits in these populations. Yet, not enough is known about the sample-to-sample variability of these prevalence estimates. In this paper, we present a bootstrap method for constructing confidence intervals around respondent-driven sampling estimates and demonstrate in simulations that it outperforms the naive method currently in use. We also use simulations and real data to estimate the design effects for respondent-driven sampling in a number of situations. We conclude with practical advice about the power calculations that are needed to determine the appropriate sample size for a study using respondent-driven sampling. In general, we recommend a sample size twice as large as would be needed under simple random sampling.
- Published
- 2006
- Full Text
- View/download PDF
39. Consensus-based guidance for conducting and reporting multi-analyst studies
- Author
-
Casper J. Albers, Daniel J. Simons, Michael Kirchler, Brian A Nosek, Alexander T. Kindel, David R. Shanks, Morton Ann Gernsbacher, Tom Schonberg, Russell A. Poldrack, Nelson Cowan, Martin Schweinsberg, Juergen Huber, Barbara A. Spellman, Eric Luis Uhlmann, Dora Matzke, Yoram K. Kunkels, Udo Boehm, Laura F. Bringmann, D. Stephen Lindsay, Kai J. Jonas, Gilles Dutilh, Noah van Dongen, Daniel J. Benjamin, Johnny van Doorn, Magnus Johannesson, Andrea M Cataldo, Alexandra Sarafoglou, Don van Ravenzwaaij, Barnabas Szaszi, Matthew J. Salganik, Felix Holzmeister, Andrew Delios, Anna Dreber, Balazs Aczel, Olmo van den Akker, Raphael Silberzahn, Jean-François Mangin, Niko A. Busch, Jelte M. Wicherts, Chris Donkin, Jojanneke A. Bastiaansen, Marcel A.L.M. van Assen, Rink Hoekstra, Samuel St-Jean, Ben R. Newell, Jörg Rieskamp, Marcus R. Munafò, Gary F. Egan, Jeffrey J. Starns, Gustav Nilsonne, Eric-Jan Wagenmakers, Emmanuel Caruyer, Sabine Hoffmann, Rotem Botvinik-Nezer, Caruyer, Emmanuel, Eötvös Loránd University (ELTE), Karolinska Institutet [Stockholm], Tilburg University [Tilburg], Netspar, University of Groningen [Groningen], University of Southern California (USC), University of Amsterdam [Amsterdam] (UvA), Dartmouth College [Hanover], Westfälische Wilhelms-Universität Münster = University of Münster (WWU), Neuroimagerie: méthodes et applications (Empenn), Institut National de la Santé et de la Recherche Médicale (INSERM)-Inria Rennes – Bretagne Atlantique, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-SIGNAUX ET IMAGES NUMÉRIQUES, ROBOTIQUE (IRISA-D5), Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA), Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Université de Bretagne Sud (UBS)-École normale supérieure - Rennes (ENS Rennes)-Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Université de Bretagne Sud (UBS)-École normale supérieure - Rennes (ENS Rennes)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT), McLean Hospital [Belmont, Ma.], University of Missouri [Columbia] (Mizzou), University of Missouri System, National University of Singapore (NUS), University of New South Wales [Sydney] (UNSW), Stockholm School of Economics (SSE), University Hospital Basel [Basel], Monash University [Melbourne], University of Wisconsin-Madison, Ludwig-Maximilians-Universität München (LMU), Leopold Franzens Universität Innsbruck - University of Innsbruck, Maastricht University [Maastricht], Princeton University, University of Victoria [Canada] (UVIC), Université Paris-Saclay, University of Bristol [Bristol], University of Virginia, Stanford University, University of Basel (Unibas), Tel Aviv University (TAU), European School of Management and Technology [Berlin, Germany] (ESMT), University College of London [London] (UCL), University of Sussex, University of Illinois at Urbana-Champaign [Urbana], University of Illinois System, University of Alberta, University of Massachusetts [Amherst] (UMass Amherst), University of Massachusetts System (UMASS), Institut européen d'administration des affaires [Singapour, Singapour] (INSEAD), University of Münster, Université de Rennes (UNIV-RENNES), University of Innsbruck, University of Virginia [Charlottesville], Tel Aviv University [Tel Aviv], Department of Methodology and Statistics, Empenn, Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-Université de Bretagne Sud (UBS)-École normale supérieure - Rennes (ENS Rennes)-Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique Bretagne-Pays de la Loire (IMT Atlantique), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-Université de Rennes 1 (UR1), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-Université de Bretagne Sud (UBS)-École normale supérieure - Rennes (ENS Rennes)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique Bretagne-Pays de la Loire (IMT Atlantique), Psychometrics and Statistics, Interdisciplinary Centre Psychopathology and Emotion regulation (ICPE), and Research and Evaluation of Educational Effectiveness
- Subjects
Data Analysis ,Consensus ,medicine ,QH301-705.5 ,Computer science ,none ,Science ,Datasets as Topic ,statistical practice ,Machine learning ,computer.software_genre ,General Biochemistry, Genetics and Molecular Biology ,neuroscience ,03 medical and health sciences ,0302 clinical medicine ,Science Forum ,REPRODUCIBILITY ,multi-analyst ,Applied research ,Biology (General) ,Robustness (economics) ,030304 developmental biology ,DELPHI ,0303 health sciences ,metascience ,expert consensus ,General Immunology and Microbiology ,business.industry ,General Neuroscience ,Research ,[SCCO.NEUR]Cognitive science/Neuroscience ,Feature Article ,[SCCO.NEUR] Cognitive science/Neuroscience ,Expert consensus ,General Medicine ,science forum ,Artificial intelligence ,Biochemistry and Cell Biology ,analytical variability ,business ,computer ,030217 neurology & neurosurgery - Abstract
International audience; Any large dataset can be analyzed in a number of ways, and it is possible that the use of different analysis strategies will lead to different results and conclusions. One way to assess whether the results obtained depend on the analysis strategy chosen is to employ multiple analysts and leave each of them free to follow their own approach. Here, we present consensus-based guidance for conducting and reporting such multi-analyst studies, and we discuss how broader adoption of the multi-analyst approach has the potential to strengthen the robustness of results and conclusions obtained from analyses of datasets in basic and applied research.
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.