Estimating Web Site Readability Using Content Extraction

Presented at: 18th International World Wide Web Conference (WWW2009)

by Thomas Gottron, Ludger Martin


Nowadays, people search for information primarily on the WWW. From a user's perspective, readability is an important criterion for measuring the accessibility, and thereby the quality, of information. We show that modern content extraction algorithms help to estimate the readability of a web document quite accurately.
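
The idea can be illustrated with a minimal sketch: score only the main content of a page, so that navigation menus and footers do not distort the estimate. Everything in it is an assumption made for illustration and is not taken from the abstract: the Flesch Reading Ease formula stands in for whichever readability measure is used, the vowel-group syllable counter is a rough heuristic, and `extract_main_content` is a hypothetical, deliberately naive stand-in for a real content extraction algorithm.

```python
import re
from typing import List


def count_syllables(word: str) -> int:
    """Rough syllable estimate: count groups of consecutive vowels."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def flesch_reading_ease(text: str) -> float:
    """Flesch Reading Ease: higher scores indicate easier text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)
    return 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word


def extract_main_content(blocks: List[str], min_words: int = 8) -> str:
    """Hypothetical extractor: keep only text blocks with enough words.

    Real content extraction algorithms are far more elaborate; this
    heuristic only shows where such a step sits in the pipeline.
    """
    return " ".join(b for b in blocks if len(b.split()) >= min_words)


if __name__ == "__main__":
    # Assumed input: the text blocks of a page, already split out of the HTML.
    page_blocks = [
        "Home | About | Contact",
        "The cat sat on the mat. It was a warm day and the cat was happy.",
        "Copyright 2009",
    ]
    full_text = " ".join(page_blocks)
    main_text = extract_main_content(page_blocks)
    print("whole page:  ", round(flesch_reading_ease(full_text), 1))
    print("main content:", round(flesch_reading_ease(main_text), 1))
```

Comparing the two printed scores shows how much the boilerplate shifts the estimate for a given page; in this sketch the size of that shift depends entirely on the simple word-count threshold.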

