WePS2 Task Guidelines

Clustering

In this task systems receive as input a set of web search results obtained when performing a query for an (ambiguous) person name. The expected output is a clustering of the web pages, where each cluster is assumed to contain all (and only those) pages that refer to the same individual.

WePS2 Clustering task guideline

Attribute Extraction

This subtask consists of extracting 18 kinds of "attribute values" for target individuals whose names appear on each of the provided Web pages. The organizers will distribute the target Web pages in their original format (i.e., html), and the participant systems have to extract attribute values from each page.

WePS2 Attribute Extraction task guideline

WePS: searching information about entities in the Web