|
We study the XML Web, i.e., the subset of the Web made up of XML documents only. We consider a sample of about 200,000 XML documents publicly available on the Web to characterize (i) the pervasiveness of the XML Web, and (ii) the structure of XML documents. Our results show that, despite its short history, XML already permeates the Web, both across generic domains and geographically. We also extract statistics on XML documents that are useful for the design of algorithms, tools, and systems that use XML.
|