Empir Software Eng
in the open within GitHub’s infrastructure and will continue to be an attractive source to
mine for research in software engineering.
Acknowledgments We would like to thank the authors of Padhye et al. (2014)andMatragkasetal.(2014)
for their valuable feedback regarding the evaluation of the impact of these perils on their research. We would
also like to thank Margaret-Anne Storey for her invaluable help in the development of this paper.
References
Aranda J, Venolia G (2009) The secret life of bugs: Going past the errors and omissions in software
repositories. In: Proceedings of the 31st international conference on software engineering, pp 298–
308
Bacchelli A, Bird C (2013) Expectations, outcomes, and challenges of m odern code review. In: Proceedings
international conference on soft engineering, ICSE ’13, pp 712–721
Bachmann A, Bird C, Rahman F, Devanbu P, Bernstein A (2010) The missing links: bugs and bug-fix com-
mits. In: Proceedings of the 18th ACM SIGSOFT international symposium on Foundations of software
engineering, pp 97–106
Baysal O, Gousios G (2014) The MSR’14 Mining Challenge., http://2014.msrconf.org/challenge.php
Begel A, Bosch J, Storey MA (2013) Social networking meets software development: perspectives from
github, msdn, stack exchange, and topcoder. Software, IEEE 30(1):52–66
Bird C, Bachmann A, Aune E, Duffy J, Bernstein A et al. (2009a) Fair and balanced?: bias in bug-fix datasets.
In: Proceedings of the the symposium on the foundations of software engineering, pp 121–130
Bird C, Rigby PC, Barr ET, Hamilton DJ, German DM, Devanbu P (2009b) The promises and perils of
mining git. In: Mining software repositories, (MSR’09). IEEE, pp 1–10
Bissyande TF, Lo D, Jiang L, Reveillere L, Klein J, Le Traon Y (2013) Got issues? who cares about it? a
large scale investigation of issue trackers from github. In: 2013 IEEE 24th international symposium on
software reliability engineering (ISSRE). IEEE, pp 188–197
Corbin J, Strauss A (2008) Basics of qualitative research: Techniques and procedures for developing
grounded theory. Sage
Dabbish L, Stuart C, Tsay J, Herbsleb J (2012) Social coding in GitHub: transparency and collaboration
in an open software repository. In: Proceedings conference on computer supported cooperative work,
pp 1277–1286
Finley K (2011) Github Has Surpassed Sourceforge and Google Code in Popularity., http://readwrite.com/
2011/06/02/github-has-passed-sourceforge
Gousios G (2013) The GHTorrent dataset and tool suite. In: Proceedings of the 10th Conference on mining
software repositories, MSR ’13, pp 233–236. http://dl.acm.org/citation.cfm?id=2487085.2487132
Gousios G, Spinellis D (2012) GHTorrent: GitHub’s data from a firehose. In: MSR ’12: proceedings of the
9th working conference on mining software repositories, pp 12–21
Gousios G, Zaidman A (2014a) A dataset for pull-based development research. In: Proceedings of the 11th
working conference on mining software repositories, MSR 2014, pp 368–371
Gousios G, Zaidman A (2014b) A dataset for pull-based development research. In: Proceedings of the 11th
working conference on mining software repositories, MSR 2014, pp 368–371
Gousios G, Pinzger M, Av D (2014) An exploratory study of the pull-based software development model.
In: Proceedings of the 36th international conference on software engineering, ICSE 2014, pp 345–
355
Gousios G, Zaidman A, Storey MA, Av D (2015) Work practices and challenges in pull-based develop-
ment: The integrator
˘
A
´
Zs perspective. In: Proceedings of the 37th international conference on software
engineering, ICSE 2015, to appear
Grigorik I (2012) The Github archive., http://www.githubarchive.org/
Howison J, Crowston K (2004) The perils and pitfalls of mining sourceforge. In: Proceedings of the
international workshop on mining software repositories, pp 7–11
Kalliamvakou E, Damian D, Singer L, German DM (2014a) The code-centric collaboration perspective:
evidence from GitHub. Technical Report DCS-352-IR, University of Victoria
Kalliamvakou E, Gousios G, Blincoe K, Singer L, German DM, Damian D (2014b) The promises and perils
of mining github. In: Proceedings of the 11th working conference on mining software repositories, MSR
2014, pp 92–101