| Skip to main content | Skip to navigation |

Register Now!

Using Proportional Transportation Similarity with Learned Element Semantics for XML Document Clustering

  • Xiaojun Wan, Peking University, China
  • Jianwu Yang, Peking University, China

Full text:

Track: Posters

This paper proposes a novel approach to measuring XML document similarity by taking into account the semantics between XML elements. The motivation of the proposed approach is to overcome the problems of 'under-contribution' and 'over-contribution' existing in previous work. In the proposed approach, the element semantics are learned in an unsupervised way and the Proportional Transportation Similarity is proposed to evaluate XML document similarity by modeling the similarity calculation as a transportation problem. Experiments of clustering are performed on three ACM SIGMOD data sets and results show the improved performance of the proposed approach.

Citation

Wan, X. and Yang, J. 2006. Using proportional transportation similarity with learned element semantics for XML document clustering. In Proceedings of the 15th International Conference on World Wide Web (Edinburgh, Scotland, May 23 - 26, 2006). WWW '06. ACM Press, New York, NY, 961-962.
DOI= http://doi.acm.org/10.1145/1135777.1135965

Organised by

ECS Logo

in association with

BCS Logo ACM Logo

Platinum Sponsors

Sponsor of The CIO Dinner


Become a sponsor or exhibitor
Valid XHTML 1.0! IFIP logo WWW Conference Committee logo Web Consortium logo Valid CSS!