Abstract
The decreasing cost and the increasing availability of new technologies is enabling people to create their own digital libraries. One of the main topics in personal digital libraries is allowing people to select interesting information among all the different digital formats available today (pdf, html, tif, etc.). Moreover the increasing availability of these on-line libraries, as well as the advent of the so called Semantic Web [I], is raising the demand for converting paper documents into digital, possibly semantically annotated, documents. These motivations drove us to design a new system which could enable the user to interact and query documents independently from the digital formats in which they are represented. In order to achieve this independence from the format we consider all the digital documents contained in a digital library as images. Our system tries to automatically detect the layout of the digital documents and recognize the geometric regions of interest. All the extracted information is then encoded with respect to a reference ontology, so that the user can query his digital library by typing free text or browsing the ontology. This approach could help users, because they don't need to know SQL since they could only recall visual hints about a document: like find me a document with a picture on the left side and a big title centered on top of the page.
Original language | English |
---|---|
Title of host publication | Proceedings - 2004 IEEE Symposium on Visual Languages and Human Centric Computing |
Editors | P. Bottoni, C. Hundhausen, S. Levialdi, G. Tortora |
Pages | 283-284 |
Number of pages | 2 |
DOIs | |
Publication status | Published - 2004 |
Event | Proceedings - 2004 IEEE Symposium on Visual Languages and Human Centric Computing - Rome, Italy Duration: 26 Sept 2004 → 29 Sept 2004 |
Conference
Conference | Proceedings - 2004 IEEE Symposium on Visual Languages and Human Centric Computing |
---|---|
Country/Territory | Italy |
City | Rome |
Period | 26/09/04 → 29/09/04 |