University of Hertfordshire

Between Sound and Spelling: Combining Phonetics and Clustering Algorithms to Improve Target Word Recovery

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  • Marcos Zampieri
  • Renato Cordeiro De Amorim
Original language: English
Title of host publication: Advances in Natural Language Processing
Subtitle of host publication: 9th International Conference on NLP, PolTAL 2014
Publisher: Springer
Pages: 438-449
ISBN (Electronic): 978-3-319-10888-9
ISBN (Print): 978-3-319-10887-2
DOIs
Publication status: Published - 2014
Event: 9th International Conference on NLP, PolTAL 2014 - Warsaw, Poland
Duration: 17 Sep 2014 to 19 Sep 2014

Publication series

Name: Lecture Notes in Artificial Intelligence
Volume: 8686

Conference

Conference: 9th International Conference on NLP, PolTAL 2014
Country: Poland
City: Warsaw
Period: 17/09/14 to 19/09/14

Abstract

In this paper, we revisit the task of spell checking, focusing on target word recovery. We propose a new approach that relies on phonetic information to improve the accuracy of clustering algorithms in identifying misspellings and generating accurate suggestions. The use of phonetic information is not new to spell checking and has been used successfully in previous approaches. To our knowledge, however, the combination of phonetics and cluster-based methods for spell checking has not yet been explored, and it is the novel contribution of our work. We report an improvement of 8.16% in accuracy compared to a previously proposed spell checking approach.

ID: 9822846