Abstract
The identification of non-genuine or malicious messages poses a variety of challenges due to the continuous changes in the techniques utilised by cyber-criminals. In this article, we propose a hybrid detection method based on a combination of image and text spam recognition techniques. In particular, the former is based on sparse representation-based classification, which focuses on the global and local image features, and a dictionary learning technique to achieve a spam and a ham sub-dictionary. On the other hand, the textual analysis is based on semantic properties of documents to assess the level of maliciousness. More specifically, we are able to distinguish between meta-spam and real spam. Experimental results show the accuracy and potential of our approach.
Original language | English |
---|---|
Pages (from-to) | 233-243 |
Number of pages | 11 |
Journal | Soft Computing |
Volume | 21 |
Issue number | 1 |
Early online date | 21 Dec 2015 |
DOIs | |
Publication status | Published - 1 Jan 2017 |
Keywords
- Image spam, Text spam, Semantic networks, Classication, Subclass Discriminant Analysis, Feature Selection, Sparse Representation