Estimating Required Recall of Information Retrieval and Extraction for Successful Knowledge Acquisition from the Web
Information on the web is not only abundant but also redundant. This redundancy has an important consequence for the relation between the recall of an information gathering system and its capacity to harvest the core information of a given domain of knowledge. In this paper we present a new idea for estimating how much of the web a knowledge acquisition system must cover in order to achieve a desired coverage of the core information contained therein.
Gatterbauer, W. 2006. Estimating required recall for successful knowledge acquisition from the web. In Proceedings of the 15th International Conference on World Wide Web (Edinburgh, Scotland, May 23 - 26, 2006). WWW '06. ACM Press, New York, NY, 969-970.