research-article

An exploratory study of the pull-based software development model

Authors:
Georgios Gousios, Delft University of Technology, Netherlands
Martin Pinzger, University of Klagenfurt, Austria
Arie van Deursen, Delft University of Technology, Netherlands

ICSE 2014: Proceedings of the 36th International Conference on Software Engineering, May 2014, Pages 345–355
https://doi.org/10.1145/2568225.2568260

Published: 31 May 2014

ABSTRACT

The advent of distributed version control systems has led to a new paradigm for distributed software development: instead of pushing changes to a central repository, developers pull them from other repositories and merge them locally. Various code hosting sites, notably GitHub, have seized the opportunity to facilitate pull-based development by offering workflow support tools, such as code review systems and integrated issue trackers. In this work, we explore how pull-based software development works, first on the GHTorrent corpus and then on a carefully selected sample of 291 projects. We find that the pull request model offers fast turnaround, increased opportunities for community engagement, and decreased time to incorporate contributions. We show that a relatively small number of factors affect both the decision to merge a pull request and the time to process it. We also examine the reasons for pull request rejection and find that technical ones are only a small minority.

Index Terms

An exploratory study of the pull-based software development model
1. Social and professional topics
  1. Professional topics
    1. Management of computing and information systems
      1. Project and people management
      2. Software management
        Software maintenance
2. Software and its engineering
  1. Software creation and management
    1. Software development process management
    2. Software post-development issues
  2. Software notations and tools
    1. Software configuration management and version control systems

Published in
ICSE 2014: Proceedings of the 36th International Conference on Software Engineering
May 2014, 1139 pages
ISBN: 9781450327565
DOI: 10.1145/2568225
General Chair: Pankaj Jalote (IIIT-Delhi, India)
Program Chairs: Lionel Briand (University of Luxembourg, Luxembourg), André van der Hoek (University of California, Irvine, USA)
Copyright © 2014 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
Pull-based development
distributed software development
empirical software engineering
pull request
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate: 276 of 1,856 submissions, 15%

Article Metrics
- Total Citations: 392
- Total Downloads: 2,891
- Downloads (Last 12 months): 238
- Downloads (Last 6 weeks): 31