Clustering
In this task systems receive as input a set of web search results obtained when
performing a query for an (ambiguous) person name. The expected output is a clustering
of the web pages, where each cluster is assumed to contain all (and only those) pages
that refer to the same individual.
Attribute Extraction
This subtask consists of extracting 18 kinds of "attribute values" for target individuals
whose names appear on each of the provided Web pages. The organizers will distribute
the target Web pages in their original format (i.e., html), and the participant systems
have to extract attribute values from each page.