Here you can find various data related to our research, including corpora, results as well as scripts necessary for reproduction. We hope for and very much appreciate contributions directed on the same topics aiming to extend the existing data set.
We currently offer data sets to the following publications of ours:
R. Koschke, S. Bazrafshan. A Large-Scale Statistical Analysis of Software Clone Rates and Localization in Open-Source Programs Written in C, C++, C#, or Java. International Workshop on Software Clones, IEEE Computer Society, 2016.
J. Harder, R. Tiarks. A Controlled Experiment on Software Clones. International Conference on Program Comprehension, IEEE Computer Society Press, 2012; 219-228.
Here you can download data used in the experiment. The downloads include the results of the original study, a replication set consisting of various materials, the Eclipse Plugin developed to observe user interaction, and the sources of the two subject games.
- Download Results
- Download Replication Set
- Download Eclipse Plugin
- Download Sources of FrozenBubble
- Download Sources of Pacman
S. Bazrafshan, R. Koschke, N. Göde. Approximate Code Search in Program Histories. Working Conference on Reverse Engineering, IEEE Computer Society Press, 2011; 109-118.
Here you can find the detailed results of the performance analysis of our tool approx.
S. Bellon, R. Koschke, G. Antoniol, J. Krinke, E. Merlo. Comparison and Evaluation of Clone Detection Tools. IEEE Computer Society Transactions on Software Engineering September 2007; 33(9):577-591.
Using the first link you can download the data of Stefan Bellon's clone detector benchmark in RCF format. For each system that was analyzed in the main experiment, a rcf file is present. To load the files please unpack the source tarball in the directory where the rcf file is located. The results of each clone detector, as well as the candiates are stored in seperate versions of the rcfs. Details on the sources of the systems as an ISO image can be found using the second link.