Partitional Clustering of Malware Using K-Means

Renato Cordeiro De Amorim, Peter Komisarczuk

    Research output: Chapter in Book/Report/Conference proceedingChapter (peer-reviewed)peer-review

    3 Citations (Scopus)


    This paper describes a novel method aiming to cluster datasets containing malware behavioural data. Our method transform the data into an standardised data matrix that can be used in any clustering algorithm, finds the number of clusters in the data set and includes an optional visualization step for high-dimensional data using principal component analysis. Our clustering method deals well with categorical data, and it is able to cluster the behavioural data of 17,000 websites, acquired with Capture-HPC, in less than 2 min
    Original languageEnglish
    Title of host publicationCyberpatterns
    Subtitle of host publicationUnifying Design Patterns with Security and Attack Patterns
    PublisherSpringer Nature
    ISBN (Electronic)978-3-319-04447-7
    ISBN (Print)978-3-319-04446-0
    Publication statusPublished - May 2014


    Dive into the research topics of 'Partitional Clustering of Malware Using K-Means'. Together they form a unique fingerprint.

    Cite this