Paper #25

 

L. Arlotta, V. Crescenzi, G. Mecca, P. Merialdo "Automatic annotation of data extracted from large web sites"

Keywords: data extraction, semantic web

 

In the framework of our ongoing project RoadRunner, we have have developed a prototype, called Labeller, that automatically annotates data extracted by automatically generated wrappers. We have experimented the prototype over a large number of real-life Web site, and we have obtained encouraging results. The underlying approach of the system has a general validity and it can be applied together with other wrapper generator systems.