CAp 2017 - Conférence sur l'Apprentissage Automatique

Named Entity Recognition in Tweets

Named-Entity Recognition (NER) concerns the classification of textual segments of data in a predefined set of categories, like persons, organizations and locations. State-of-the-art NER systems achieve very high performance for a narrow set of entities and for noise-free and grammatically well-structured documents. But, in applications like Twitter where text is short, using an informal style and with an unreliable use of capitalization the recognition of entities becomes a challenging task.

The competition consists of identifying 13 types of entities (person, musicartist, organisation, geoloc, product, media, sportsteam, event, tvshow, movie, facility, transport line, other) in tweets. For example, the following phrase contains two types of entities. Note that entities may span several words.

Data and Evaluation

Training data will consist of 3,000 annotated French tweets with 12 types of entities in CoNLL format. Test data will also comprise 3,000 French tweets. The participants are free to use any type of external data in order to improve their systems. Systems will be judged on F1-score.

Inscription

To register fill the form: here.
Attention, in order to obtain the data, you first have to fill the form.

Important Dates

The challenge will be as follows:

~~Release of training data: January 23, 2017~~
~~Release of test data:: April 19, 2017~~
~~Submission of predictions : May 3, 2017~~

Ranking

Rank	System	F-Measure	Precision	Recall
1	Synapse Développement	58.89	73.65	49.06
2	High Institute of Technology - Agadir	52.19	58.95	46.83
3	TanDam	51.99	60.67	45.48
4	NER_Quebec	51.26	67.65	41.26
5	Swiss Chocolate	50.05	56.42	44.97
6	AMU-LIF	46.21	53.59	40.63
7	Lattice	45.46	78.76	31.95
8	Geolsemantics	21.28	19.66	23.18

~~System description papers : May 10, 2017~~
~~Notification of acceptance: May 24,2017~~
~~Camera-ready : June 14, 2017~~

Prize

600â‚¬ will be awarded to the first ranked system.

Organization comitee

Ioannis Partalas, Viseo R&D
Nadia Derbas, Viseo R&D
Georgios Balikas, University Grenoble Alpes
Massih-Reza Amini, University Grenoble Alpes
Coralie Reutenauer, SNCF
Amélie Martin, SNCF

Program Committee

Laurent Besacier, LIG (France)
Christian Boitet, LIG (France)
Caroline Brun, Xerox (France)
Xavier Carreras, Xerox (France)
Luca Dini, Innoradiant (France)
Dino Ienco, LIRMM (France)
Diana Inkpen, University of Ottawa (Canada)
Christiane Fellbaum, Princeton University (USA)
FranÃ§ois Jacquenet, Laboratoire Hubert Curien (France)
Ioannis Katakis, National and Kapodistrian University of Athens (Greece)
Makis Malakasiotis, AUEB (Greece)
Roberto Navigli, Sapienza University of Rome (Italy)
Damien Nouvel, Inalco (France)
Mathieu Roche, CIRAD (France)
Patrick Watrin, CENTAL (Belgique)