HTML2RSS: Automatic Generation of RSS Feed based on Structure Analysis of HTML Document
Track: Posters

We present a system that automatically generates RSS feeds from HTML documents consisting of time-series items with date expressions, e.g., archives of weblogs, BBSs, chats, mailing lists, site update descriptions, and event announcements. Our system extracts date expressions, performs structure analysis of an HTML document, and detects or generates titles from the document.

Citation
Nanno, T. and Okumura, M. 2006. HTML2RSS: automatic generation of RSS feed based on structure analysis of HTML document. In Proceedings of the 15th International Conference on World Wide Web (Edinburgh, Scotland, May 23 - 26, 2006). WWW '06. ACM Press, New York, NY, 1061-1062.
Nanno, T. and Okumura, M. 2006. HTML2RSS: automatic generation of RSS feed based on structure analysis of HTML document. In Proceedings of the 15th International Conference on World Wide Web (Edinburgh, Scotland, May 23 - 26, 2006). WWW '06. ACM Press, New York, NY, 1075-1076.
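
The abstract names three steps: extracting date expressions, analysing document structure, and detecting or generating titles. The Python sketch below is one possible illustration of that pipeline and is not the authors' implementation. It assumes that dates are written numerically (such as 2006-05-23), that each date expression opens a new item, and that the first text chunk after a date can serve as the title. The names `DATE_RE`, `TextChunks`, `parse_date`, `split_items`, and `to_rss` are hypothetical.

```python
import re
from datetime import datetime, timezone
from email.utils import format_datetime
from html.parser import HTMLParser
from xml.etree import ElementTree as ET

# Assumption: only numeric dates such as 2006-05-23 or 2006/5/23 are recognised.
DATE_RE = re.compile(r"\b(\d{4})[-/](\d{1,2})[-/](\d{1,2})\b")


class TextChunks(HTMLParser):
    """Collect non-empty text nodes in document order (script/style included)."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        text = " ".join(data.split())  # normalise whitespace
        if text:
            self.chunks.append(text)


def parse_date(chunk):
    """Return (date, remaining text), or (None, chunk) if no valid date is found."""
    match = DATE_RE.search(chunk)
    if not match:
        return None, chunk
    try:
        date = datetime(*map(int, match.groups()), tzinfo=timezone.utc)
    except ValueError:  # e.g. month 13: treat the chunk as ordinary text
        return None, chunk
    rest = (chunk[:match.start()] + chunk[match.end():]).strip(" -:|")
    return date, rest


def split_items(html):
    """Crude structure analysis: every dated chunk starts a new item."""
    parser = TextChunks()
    parser.feed(html)
    parser.close()
    items = []
    for chunk in parser.chunks:
        date, rest = parse_date(chunk)
        if date is not None:
            items.append({"date": date, "body": [rest] if rest else []})
        elif items:  # text before the first date is ignored
            items[-1]["body"].append(chunk)
    return items


def to_rss(html, feed_title, link):
    """Serialise the detected items as an RSS 2.0 document string."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = feed_title
    ET.SubElement(channel, "link").text = link
    ET.SubElement(channel, "description").text = "Generated from " + link
    for item in split_items(html):
        node = ET.SubElement(channel, "item")
        body = item["body"]
        # Title heuristic (assumption): first chunk after the date, else the date.
        title = body[0][:60] if body else item["date"].strftime("%Y-%m-%d")
        ET.SubElement(node, "title").text = title
        ET.SubElement(node, "description").text = " ".join(body)
        ET.SubElement(node, "pubDate").text = format_datetime(item["date"])
    return ET.tostring(rss, encoding="unicode")
```

Calling `to_rss(page_html, "Site updates", "http://example.org/")` on a page whose entries each begin with a date would return an RSS 2.0 string with one `item` per date. A real system would need a far richer date grammar and a genuine analysis of the HTML tree rather than this flat scan of text nodes.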