Proximity Within Paragraph: A Measure to Enhance Document Retrieval Performance
We created a proximity measure, called Proximity Within Paragraph (PWP), which is based on the concept of value assignment to queried words, grouped by associated ideas within paragraphs. Based on the WT10G dataset, a test system comprising three test sets and fifty queries were constructed to evaluate the effectiveness of PWP by comparing it with the existing method: Minimum Distance Between Queried Pairs. A further experiment combines the scores obtained from both methods and the results suggest that the combination can significantly improve the effectiveness.
Palakvangsa-Na-Ayudhya, S. and Keane, J. A. 2006. Proximity within paragraph: a measure to enhance eocument retrieval performance. In Proceedings of the 15th International Conference on World Wide Web (Edinburgh, Scotland, May 23 - 26, 2006). WWW '06. ACM Press, New York, NY, 1033-1034.
Sponsor of The CIO Dinner