Evaluating Clone Detection Techniques from a Refactoring Perspective

TitleEvaluating Clone Detection Techniques from a Refactoring Perspective
Publication TypeConference Paper
Year of Publication2004
AuthorsVan Rysselberghe, F, Demeyer S
Conference NameProceedings {ASE}'04 (Automated Software Engineering)
PublisherIEEE Press
Abstract

In the last decade, several researchers have investigated techniques to automatically detect duplicated code in programs exceeding hundreds of thousands lines of code. All of these techniques have known merits and deficiencies, but as of today, little is known on how these techniques fit into the refactoring process of object-oriented systems. This paper compares three representative detection techniques (simple line matching, parameterized matching, and metric fingerprints) by means of five small to medium sized cases and analyses the differences between the reported matches. Based on this comparison, we conclude that (1) simple line matching is best suited for a partial, yet advanced restructuring with little effort; (2) metric fingerprints work best for refactoring a system with minimal effort; (3) parameterized matching demands more effort yet allows a more profound, less obvious restructuring of the code.