Abstract
The decreasing cost and the increasing availability of new technologies is enabling people to create their own digital libraries. One of the main topic in personal digital libraries is allowing people to select interesting information among all the different digital formats available today (pdf, html, tiff, etc.). Moreover the increasing availability of these on-line libraries, as well as the advent of the so called Semantic Web [1], is raising the demand for converting paper documents into digital, possibly semantically annotated, documents. These motivations drove us to design a new system which could enable the user to interact and query documents independently from the digital formats in which they are represented. In order to achieve this independence from the format we consider all the digital documents contained in a digital library as images. Our system tries to automatically detect the layout of the digital documents and recognize the geometric regions of interest. All the extracted information is then encoded with respect to a reference ontology, so that the user can query his digital library by typing free text or browsing the ontology.
Original language | English |
---|---|
Title of host publication | Proceedings of the 17th International Conference on Pattern Recognition, ICPR 2004 |
Editors | J. Kittler, M. Petrou, M. Nixon |
Pages | 671-674 |
Number of pages | 4 |
Volume | 2 |
DOIs | |
Publication status | Published - 2004 |
Event | Proceedings of the 17th International Conference on Pattern Recognition, ICPR 2004 - Cambridge, United Kingdom Duration: 23 Aug 2004 → 26 Aug 2004 |
Conference
Conference | Proceedings of the 17th International Conference on Pattern Recognition, ICPR 2004 |
---|---|
Country/Territory | United Kingdom |
City | Cambridge |
Period | 23/08/04 → 26/08/04 |