Investigating Samples Representativeness for an Online Experiment in Java Code Search

Authors :: Rafael M. de Mello
Kathryn T. Stolee
Guilherme Horta Travassos
Source :: ESEM
Publication Year :: 2015
Publisher :: IEEE, 2015.
Abstract: Context: The results of large-scale studies in software engineering can be significantly impacted by samples' representativeness. Diverse population sources can be used to support sampling for such studies. Goal: To compare two samples, one from the crowdsourcing platform Mechanical Turk and another from the professional social network LinkedIn, in an online experiment for evaluating the relevance of Java code snippets to programming tasks. Method: To compare the samples (subjects' experience, programming habits) and experimental results concerned with three experimental trials. Results: LinkedIn's subjects present significantly higher levels of experience in Java programming and programming in general than Mechanical Turk's subjects. The experimental results revealed a significant difference between samples and suggested that LinkedIn's subjects were more pessimistic than Mechanical Turk's subjects despite a high level consistency in the experimental results. Conclusion: The combined use of sources of sampling can bring benefits to large scale studies in software engineering, especially when heterogeneity is desired in the population. Thus, it can be useful to investigate and characterize alternative sources of sampling for performing large-scale studies in software engineering.

Subjects :: education.field_of_study
Information retrieval
Java
business.industry
Computer science
Population
Sampling (statistics)
Context (language use)
Crowdsourcing
Representativeness heuristic
Real time Java
Relevance (information retrieval)
education
business
Software engineering
computer
computer.programming_language

Database :: OpenAIRE
Journal :: 2015 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM)
Accession number :: edsair.doi...........099b393831ea23c735983243f255f380

Tools