Scaling Up Software Birthmarks Using Fuzzy Hashing

Takehiro Tsuzaki, Teruaki Yamamoto, Haruaki Tamada, Akito Monden

Research output: Chapter in Book/Report/Conference proceeding > Chapter


Software birthmarks have been proposed as a means of detecting software theft. A birthmark system extracts birthmarks, which are native characteristics of software, from binary programs and compares them by computing their similarity. This paper proposes a new procedure for scaling up birthmark systems. Conventional birthmark systems consist of a birthmark extraction phase and a birthmark comparison phase; the proposed method adds two new phases between them: a compression phase, which employs fuzzy hashing, and a pre-comparison phase, which aims to increase the distinction property of birthmarks. The proposed method reduces the time required in the comparison phase, so it can be applied to detect software theft among many larger-scale software products. In an experimental evaluation, the authors found that the proposed method significantly reduces comparison time while maintaining distinction performance, one of the important properties of a birthmark. Preservation performance is also acceptable when the threshold value is set properly.
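The compression and comparison phases described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the birthmark is taken to be a sequence of strings, a toy piecewise hash stands in for a real fuzzy hash, `difflib.SequenceMatcher` stands in for the similarity measure, and the names `compress`, `similarity`, `is_suspected` and the threshold of 0.75 are hypothetical. The pre-comparison phase is left out because the abstract does not describe its mechanism.

```python
import hashlib
from difflib import SequenceMatcher

# 64-symbol alphabet used to encode one character per piece (illustrative choice).
ALPHABET = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/"


def _first_byte(data: bytes) -> int:
    """Deterministic small hash: first byte of the MD5 digest."""
    return hashlib.md5(data).digest()[0]


def compress(birthmark, trigger=4):
    """Toy piecewise hash (a stand-in, not the authors' algorithm).

    The element sequence is cut at content-defined boundaries, and each
    piece is reduced to a single character, so a local change in the
    birthmark alters only a few characters of the digest.
    """
    digest = []
    piece = []
    for element in birthmark:
        piece.append(element)
        # A boundary is triggered by the element's own content.
        if _first_byte(element.encode("utf-8")) % trigger == 0:
            digest.append(ALPHABET[_first_byte("\0".join(piece).encode("utf-8")) % 64])
            piece = []
    if piece:  # flush the trailing piece
        digest.append(ALPHABET[_first_byte("\0".join(piece).encode("utf-8")) % 64])
    return "".join(digest)


def similarity(digest_a, digest_b):
    """Similarity of two digests in [0, 1] (assumed measure)."""
    return SequenceMatcher(None, digest_a, digest_b).ratio()


def is_suspected(birthmark_a, birthmark_b, threshold=0.75):
    """Compare compressed digests instead of the full birthmarks."""
    return similarity(compress(birthmark_a), compress(birthmark_b)) >= threshold


if __name__ == "__main__":
    # Hypothetical birthmarks: sequences of referenced class names.
    original = ["java.lang.String", "java.util.List", "java.io.File", "java.util.Map"]
    suspect = ["java.lang.String", "java.util.List", "java.io.File", "java.util.Set"]
    print(compress(original), compress(suspect), is_suspected(original, suspect))
```

Because the digests are never longer than the birthmarks they summarise, comparing digests costs less than comparing the birthmarks themselves. The price is that the similarity is only approximate, which fits the abstract's remark that preservation performance depends on the threshold.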

Original language: English
Title of host publication: Research Anthology on Recent Trends, Tools, and Implications of Computer Programming
Publisher: IGI Global
Number of pages: 16
ISBN (Electronic): 9781799830177
ISBN (Print): 9781799830160
Publication status: Published - Jan 1 2020

ASJC Scopus subject areas

  • Computer Science (all)
