Projects

LiMoSINe

Linguistically Motivated Semantic Aggregation Engines

Duration: 2011-2014
Financing institution: European Comission, FP7-ICT
Summary: The LiMoSINe vision is to transition access to online information from a document-centric search paradigm focused on returning disconnected atomic pieces to a truly semantic aggregation paradigm. In this new paradigm, machines will understand a user's intent, discover and organize facts, identify opinions, experiences and trends, all from inherently multilingual online sources and open knowledge repositories. LiMoSINe's aggregation engines will automatically organize search results in semantically meaningful ways.

ELIAS

Evaluating Information Access Systems

Duration: 2011-2016
Financing institution: European Science Foundation
Summary: ELIAS will define a new measurement paradigm for the evaluation of search engines based on so-called living laboratories. This paradigm involves (i) exploitation of novel market places and forums where large numbers of users are recruited into early stage evaluation experiments to test a particular aspect of an information access system; and (ii) using operational systems as experimental platforms on which to conduct user-based experiments at scale.

HOLOPEDIA

The automatic encyclopedia of people and organizations.

Duration: 2010-2012
Financing institution: MICINN (TIN2010-21128-C02)
Summary: The main goal of the project is to develop algorithms, techniques and systems able to mine and aggregate information relative to people and organizations from unstructured and structured web sources, such as social networks, blogs, news, semantic web data, and websites in general.

MA2VICMR

Mejorando el Acceso, el Análisis y la Visibilidad de la Información y los Contenidos Multilingüe y Multimedia en Red para la Comunidad de Madrid

Duration: 2010-2013
Financing institution: Regional Government of Madrid (S2009/TIC-1542)
Summary: Improving access, analysis and visibility of multilingual and multimedia Web contents.

Buscamedia

Duration: 2009-2011
Financing institution: CDTI (CEN-20091026)
Summary: Development of a true Multimedia Semantic Search Engine.

Existes/WebOpinion

Financing institution: Sub-contracts by Grupo ALMA
Summary: Online Reputation Managing

QEAVis

Quantitative Evaluation of Academic Websites Visibility

Duration: 2008-2010
Financing institution: CICYT (TIN 2007-67581-C02-01)
Summary: Automated Classification of academic websites by topic and language, in order to create ranks with them. The main goal of the project is to improve the accessibility and visibility of academic information on the World Wide Web.

TrebleCLEF

Evaluation Best Practice and Collaboration for Multilingual Information Access

Financing institution: European Commission
Summary: TrebleCLEF supports the development and consolidation of expertise in the multidisciplinary research area of multilingual information access (MLIA) and disseminates this knowhow to the application communities through a set of complementary activities.

Text-Mess (subproyecto INES)

Duration: 2007-2009
Financing institution: CICYT (TIN2006-15265-C06-02)

MultiMatch

Multilingual/Multimedia Access To Cultural Heritage

Duration: 2006-2009
Financing institution: European Commission, 6FP (STREP 033104)
Summary: MultiMatch plans to develop a multilingual search engine specifically designed for access, organisation and personalised presentation of cultural heritage information.

MAVIR

Mejorando el acceso y visibilidad de la información multilingüe en red para la Comunidad de Madrid

Duration: 2006-2009
Financing institution: Comunidad de Madrid, IV PRICIT, (S-0505/TID/0267)
Summary: MAVIR es una red de investigación formada por un equipo multidisciplinar de científicos, técnicos, lingüistas y documentalistas para desarrollar un esfuerzo integrador en las líneas de investigación, formación y transferencia de tecnología.

MedIEQ

Quality Labelling of Medical Web Content using Multilingual Information Extraction.

Duration: 2006-2008
Financing institution: European Commission (EC Programme: Public Health 61383)
Summary: Quality Labelling of Medical Web Content using Multilingual Information Extraction

SWIISA

Speech Web and Images Interactive Search Assitants

Duration: 2006-2007
Financing institution: UNED
Summary: Estudio de aplicación de asistentes interactivos a tres línas: búsqueda translingüe sobre imágenes, sobre la Web y sobre transcripciones automáticas de reconocedores de habla.

R2D2 (subproyecto Syembra)

Recuperación de Respuestas en Documentos Digitalizados

Duration: 2003-2006
Financing institution: CICYT (TIC2003-07158-C04)
Summary: Evaluation of cross-lingual answer retrieval systems.

RIBIDI

Recuperación de Información en Bibliotecas Digitales

Duration: 2001-2004
Financing institution: CYTED VII.19
Summary: Cooperación iberoamericana en investigación y desarrollo de tecnologías para recuperación de información y bibliotecas digitales.

CLEF

Cross-Language Evaluation Forum

Duration: 2001-2003
Financing institution: European Commission, 5FP (IST-2000-31002)
Summary: Evaluation of Cross-Language Information Retrieval Systems for European Languages

ETB

European Schools Treasury Browser

Duration: 2000-2002
Financing institution: European Commission, 5FP (IST Programme)
Summary: Access to meta-information about educational resources and new technologies in Europe.

DELOS

DELOS: a Network of Excellence on Digital Libraries

Duration: 2000-2002
Financing institution: European Commission, IST Programme
Summary: The main objective of DELOS is to coordinate a joint programme of activities of the major European teams working in digital library related areas.

NAMIC

News Agencies Multilingual Information Categorization

Duration: 1999-2002
Financing institution: European Commission, 5FP (IST-1999-12392)
Summary: NAMIC main objective is to develop and bring to marketable stage advanced NLP technologies for multilingual news customization and broadcasting throughout distributed services.

EuroWordnet

Duration: 1996-1999
Financing institution: European Commission, 4FP (Telematics, LE 4003)
Summary: The project aimed at building a multilingual lexical database with semantic relations between words in 8 european languages (Spanish, English, Italian, Dutch, French, German, Estonian and Czech). Every monolingual wordnet is linked to the others via an InterLingual Index derived from Wordnet 1.5.

ELSNET LE Training Showcase

Financing institution: ACO*HUM (Socrates), ELSENET, European Commission
Summary: A project under the auspices of ELSNET and ACO*HUM excellence networks to develop 6 specialization courses around Natural Language Processing and Speech Recognition and synthesis. Our task was to develop an open distance learning course on Natural Language Processing and Information Retrieval.

Hermes

Duration: 2001-2003
Financing institution: CICYT (TIC2000-0335-C03-01)
Summary: Multilingual named-entity recognition, hyperlinking, phrase extraction, summarization and semantic indexing for information access on a digital news archive.

RILE

Servidor de Recursos para el Desarrollo de la Ingeniería Lingüística en Español

Duration: 1999-2000
Financing institution: M.I.N.E.R.
Summary: The goal of RILE is to develop a pilot for a server with resources, tools and information related to the development of applications in the field of Language Engineering for Spanish.

ITEM

Recuperación de Información Textual en un Entorno Multilíngüe

Duration: 1996-1999
Financing institution: CICyT (TIC96-1243-C03-01)
Summary: Development and integration of Language Engineering resources and tools for Spanish, Catalan, Basque and English and demonstration of such tools in a multilingual search engine with NLP capabilities.

ACQUILEX II

Duration: 1993-1995
Financing institution: European Commission (Esprit BRA 7315)
Summary: The goal was to explore the utility of constructing a multilingual lexical knowledge base from machine-readable versions of conventional dictionaries by exploring the utility of machine readable textual corpora as a source of lexical information not coded in conventional dictionaries, and by adding dictionary publishing partners to exploit the lexical database and corpus extraction software developed by the projects for conventional lexicography.