ACM Home Page
Please provide us with feedback. Feedback
Mining large software compilations over time: another perspective of software evolution
Full text PdfPdf (185 KB)
Source International Conference on Software Engineering archive
Proceedings of the 2006 international workshop on Mining software repositories table of contents
Shanghai, China
SESSION: Repositories table of contents
Pages: 3 - 9  
Year of Publication: 2006
ISBN:1-59593-397-2
Authors
Gregorio Robles  Universidad Rey Juan Carlos
Jesus M. Gonzalez-Barahona  Universidad Rey Juan Carlos
Martin Michlmayr  University of Cambridge
Juan Jose Amor  Universidad Rey Juan Carlos
Sponsors
ACM: Association for Computing Machinery
SIGSOFT: ACM Special Interest Group on Software Engineering
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 63,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1137983.1137986
What is a DOI?

ABSTRACT

With the success of libre (free, open source) software, a new type of software compilation has become increasingly common. Such compilations, often referred to as 'distributions', group hundreds, if not thousands, of software applications and libraries written by independent parties into an integrated system. Software compilations raise a number of questions that have not been targeted so far by software evolution, which usually focuses on the evolution of single applications. Undoubtedly, the challenges that software compilations face differ from those found in single software applications. Nevertheless, it can be assumed that both, the evolution of applications and that of software compilations, have similarities and dependencies.In this sense, we identify a dichotomy, common to that in economics, of software evolution in the small (micro-evolution) and in the large (macro-evolution). The goal of this paper is to study the evolution of a large software compilation, mining the publicly available repository of a well-known Linux distribution, Debian. We will therefore investigate changes related to hundreds of millions of lines of code over seven years. The aspects that will be covered in this paper are size (in terms of number of packages and of number of lines of code), use of programming languages, maintenance of packages and file sizes.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J. J. Amor, J. M. Gonzalez-Barahona, G. Robles, and I. Herraiz. Measuring libre software using Debian 3.1 (Sarge) as a case study: preliminary results. Upgrade Magazine, Aug. 2005.
 
2
 
3
 
4
J. M. Gonzalez-Barahona, M. A. Ortuno Perez, P. de las Heras, J. Centeno, and V. Matellan. Counting potatoes: the size of Debian 2.2. Upgrade Magazine, II(6):60--66, Dec. 2001.
 
5
 
6
M. M. Lehman and J. F. Ramil. Implications of laws of software evolution on continuing successful use of cots software. Technical report, Imperial College, 1998.
 
7
 
8
 
9
M. Michlmayr and B. M. Hill. Quality and the reliance on individuals in free software projects. In Proceedings 3rd Workshop on Open Source Software Engineering, pages 105--109, Portland, USA, 2003.
 
10
 
11
G. Succi, J. W. Paulson, and A. Eberlein. Preliminary results from an empirical study on the growth of open source and commercial software products. In EDSER-3 Workshop, Toronto, Canada, May 2001.
 
12
 
13
 
14
D. A. Wheeler. More than a gigabuck: Estimating GNU/Linux's size. Technical report, June 2001.


Collaborative Colleagues:
Gregorio Robles: colleagues
Jesus M. Gonzalez-Barahona: colleagues
Martin Michlmayr: colleagues
Juan Jose Amor: colleagues