Unscrambling code clones for one-to-one matching of duplicated code

Pamela Green, Peter Lane, Sven-Bodo Scholz, Austen Rainer

Research output: Book/Report (Other report)

Abstract

Code clone detection tools find sections of code that are similar. Different tools use different representations of the code and different matching algorithms. This diversity makes clone detection tools attractive for other code matching tasks, particularly where code has been edited or rearranged. However, the tools report every match found. In some applications we are interested in one-to-one matching, meaning that each section of copied code in one file is matched to just one section of code in the other file. In this report we explore ways in which the clones reported by the detection tools can inflate the amount of matching code. We also explain, with the aid of a worked example, our method for unscrambling the output from clone detection tools to approximate one-to-one matching of the code in one file to that in another file.
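
To make the inflation problem concrete, the following is a minimal illustrative sketch, not the method described in the report. It assumes a hypothetical clone format of inclusive line ranges `(a_start, a_end, b_start, b_end)` and uses a simple longest-first greedy selection, which is a stand-in technique chosen only for illustration. The names `greedy_one_to_one` and `matched_lines_in_a` are likewise hypothetical.

```python
from typing import List, Tuple

# Hypothetical clone record: inclusive line ranges in file A and file B.
Clone = Tuple[int, int, int, int]  # (a_start, a_end, b_start, b_end)


def overlaps(s1: int, e1: int, s2: int, e2: int) -> bool:
    """True if the inclusive ranges [s1, e1] and [s2, e2] share a line."""
    return s1 <= e2 and s2 <= e1


def greedy_one_to_one(clones: List[Clone]) -> List[Clone]:
    """Keep clones longest-first, skipping any that reuse lines already
    matched in either file. This is an assumed heuristic, not the
    report's unscrambling method."""
    chosen: List[Clone] = []
    for c in sorted(clones, key=lambda c: c[1] - c[0], reverse=True):
        clash = any(
            overlaps(c[0], c[1], k[0], k[1]) or overlaps(c[2], c[3], k[2], k[3])
            for k in chosen
        )
        if not clash:
            chosen.append(c)
    return sorted(chosen)


def matched_lines_in_a(clones: List[Clone]) -> int:
    """Naive total of file-A lines over all clones, counting repeats."""
    return sum(a_end - a_start + 1 for a_start, a_end, _, _ in clones)


# Lines 1-10 of file A are reported as matching two places in file B.
reported = [(1, 10, 21, 30), (1, 10, 41, 50), (12, 15, 32, 35)]
selected = greedy_one_to_one(reported)

print(matched_lines_in_a(reported))  # raw total, with lines 1-10 counted twice
print(matched_lines_in_a(selected))  # total after one-to-one selection
```

In this made-up input, the raw report counts lines 1 to 10 of file A twice, so summing the reported clones overstates how much of file A is copied. The greedy pass keeps one match per section, which is only an approximation of one-to-one matching; a longest-first choice can discard a better overall pairing.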
Original language: English
Publisher: University of Hertfordshire
Publication status: Published - 2010

Publication series

Name: UH Computer Science Technical Report
Publisher: University of Hertfordshire
Volume: 502

Keywords

  • clone detection
