Finding Specification Pages According to Attributes
This paper presents a method for finding a specification page on the web for a given object (e.g., ``Titanic'') and its class label (e.g., ``film''). A specification page for an object is a web page which gives concise attribute-value information about the object (e.g., ``director''-``James Cameron'' for ``Titanic''). A simple unsupervised method using layout and symbolic decoration cues was applied to a large number of the web pages to acquire the class attributes. We used these acquired attributes to select a representative specification page for a given object from the web pages retrieved by a normal search engine. Experimental results revealed that our method greatly outperformed the normal search engine in terms of specification retrieval.
Yoshinag, N. and Torisaw, K. 2006. Finding specification pages according to attributes. In Proceedings of the 15th International Conference on World Wide Web (Edinburgh, Scotland, May 23 - 26, 2006). WWW '06. ACM Press, New York, NY, 1021-1022.
Sponsor of The CIO Dinner