Visually Guided Bottom-Up Table Detection and Segmentation in Web Documents
Track: Posters In the AllRight project, we are developing an algorithm for unsupervised table detection and segmentation that uses the visual rendition of a Web page rather than the HTML code. Our algorithm works bottom-up by grouping word bounding boxes into larger groups and uses a set of heuristics. It has already been implemented and a preliminary evaluation on about 6000 Web documents has been carried out. Citation Krüpl, B. and Herzog, M. 2006. Visually guided bottom-up table detection and segmentation in web documents. In Proceedings of the 15th International Conference on World Wide Web (Edinburgh, Scotland, May 23 - 26, 2006). WWW '06. ACM Press, New York, NY, 933-934. |
Platinum SponsorsSponsor of The CIO Dinner |
![]() |