Back to homepage

Web algorithms

This page describes threads of research performed by myself and my collaborators. The goal is to put our work into a meaningful sequence, instead of just a list of papers; the goal is not to give an overview of the field. At some later time, this page may turn into a survey, but at this point the many wonderful contributions of others are not represented here.

Iterative methods and linear algebra

This section covers a set of largely-unrelated papers that study different iterative methods.

Combinatorial extraction algorithms

This paper considers extractions of templates from web pages using combinatorial techniques. It is focused primarily on measurements of the prevalence and evolution of web templating behavior.

In some followup work, we considered algorithms for segmenting websites in order to determine the most topically cohesive regions of the site.

Algorithms for dense subgraph and community finding

See this description of dense subgraph extraction for details.

Back to Andrew Tomkins homepage.