TA-RE: an exchange language for mining software repositories
Source: International Conference on Software Engineering
Proceedings of the 2006 International Workshop on Mining Software Repositories
Shanghai, China
SESSION: Repositories
Pages: 22-25
Year of Publication: 2006
ISBN: 1-59593-397-2
Authors
Sunghun Kim (University of California, Santa Cruz, CA)
Thomas Zimmermann (Saarland University, Saarbrücken, Germany)
Miryung Kim (University of Washington)
Ahmed Hassan (University of Waterloo, Canada)
Audris Mockus (Avaya Labs)
Tudor Girba (University of Berne, Switzerland)
Martin Pinzger (University of Zurich, Switzerland)
E. James Whitehead, Jr. (University of California, Santa Cruz, CA)
Andreas Zeller (Saarland University, Saarbrücken, Germany)
Sponsors
ACM: Association for Computing Machinery
SIGSOFT: ACM Special Interest Group on Software Engineering
Publisher
ACM, New York, NY, USA
DOI: http://doi.acm.org/10.1145/1137983.1137990

ABSTRACT

Software repositories have received considerable attention from researchers in recent years. To analyze software repositories, it is first necessary to extract raw data from the version control and problem tracking systems. This poses two challenges: (1) extraction requires non-trivial effort, and (2) the results depend on the heuristics used during extraction. These challenges burden researchers who are new to the community and make it difficult to benchmark software repository mining, since it is almost impossible to reproduce experiments done by another team. In this paper we present the TA-RE corpus. TA-RE collects extracted data from software repositories in order to build a collection of projects that will simplify the extraction process. Additionally, the collection can be used for benchmarking. As a first step, we propose an exchange language that makes sharing and reusing data as simple as possible.

