My BS thesis was a study of web page clustering and classification. I compared combinations of several classification methods and similarity metrics, and I proposed a new method that improves the accuracy of the centroid-based method.
Thesis (in Turkish): Aygun2006.pdf
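
For readers unfamiliar with the baseline mentioned above, here is a minimal sketch of a generic centroid-based text classifier using cosine similarity. It is not the method proposed in the thesis. The function names (`tf_vector`, `train_centroids`, `classify`), the plain term-frequency weighting, and the toy training data are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def tf_vector(text):
    """Raw term-frequency vector (assumption: whitespace tokenization)."""
    return Counter(text.lower().split())


def normalize(vec):
    """Scale a sparse vector to unit length; an all-zero vector becomes {}."""
    norm = math.sqrt(sum(v * v for v in vec.values()))
    return {t: v / norm for t, v in vec.items()} if norm else {}


def cosine(a, b):
    """Cosine similarity of two unit-length sparse vectors (a dot product)."""
    if len(a) > len(b):
        a, b = b, a
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def train_centroids(labeled_docs):
    """Sum the normalized document vectors per class, then normalize the sum."""
    sums = defaultdict(Counter)
    for text, label in labeled_docs:
        for term, weight in normalize(tf_vector(text)).items():
            sums[label][term] += weight
    return {label: normalize(vec) for label, vec in sums.items()}


def classify(text, centroids):
    """Assign the class whose centroid is most similar to the document."""
    doc = normalize(tf_vector(text))
    return max(centroids, key=lambda label: cosine(doc, centroids[label]))


# Hypothetical toy data standing in for labeled web pages.
TRAIN = [
    ("football match score goal team", "sports"),
    ("team wins league match", "sports"),
    ("stock market shares price", "finance"),
    ("bank interest rate market", "finance"),
]
CENTROIDS = train_centroids(TRAIN)
```

With this toy data, `classify("goal scored in the match", CENTROIDS)` returns `"sports"`, because the document shares terms only with the sports centroid. Swapping `cosine` for another similarity function is one way to set up the kind of method and metric comparison described above.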