A text annotation method based on semantic sequences

J. Bao, C. Lyon, P.C.R. Lane

    Research output: Contribution to journalArticlepeer-review

    76 Downloads (Pure)

    Abstract

    This paper presents a text annotation method based on semantic sequences to label a document and a cluster of documents. The basic idea underlying the semantic sequence approach is to find locally frequent meanings to act as the labels of a document, using an ontology such as WordNet. The ontology is also used to measure the semantic similarity of labels that indicate similarity between documents. Further, a text clustering method based upon four natural rules is introduced to cluster documents and label each cluster. This method does not need any pre-defined number of clusters, which is necessary for the partitioning clustering method, and avoids the need to set appropriate levels as in the hierarachical clustering method.
    Original languageEnglish
    JournalProceedings of the Seventh International Workshop on Computational Semantics
    Publication statusPublished - 2007

    Keywords

    • semantic sequences
    • text annotation
    • WordNet
    • clustering

    Fingerprint

    Dive into the research topics of 'A text annotation method based on semantic sequences'. Together they form a unique fingerprint.

    Cite this