The BSNLP 2017 shared task on multilingual named entity recognition, normalization, and cross-language matching in web documents in Slavic languages was jointly organized by the Competence Centre on Text Mining and Analysis of the Joint Research Centre of the European Commission, the University of West Bohemia, the University of Helsinki, and the University of Zagreb.

Data and code

Download Trump corpus
Download EU corpus
Download annotations

Download evaluation code

Please cite the shared task paper if you use these data or code.

Two datasets were prepared for evaluation, each consisting of documents extracted from the web and related to a given entity. One dataset contains documents related to Donald Trump, the recently elected President of the United States, and the second contains documents related to the European Commission.

The test datasets were created as follows. For each “focus” entity, we posed a separate search query to Google in each of the seven target languages. The query returned links to documents only in the language of interest. We extracted the first 100 links returned by the search engine, removed duplicate links, downloaded the corresponding HTML pages (mainly news articles or fragments thereof), and converted them into plain text using a hybrid HTML parser.
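The link-filtering step described above can be sketched as follows. This is only an illustration, not the code used to build the corpora: the function names and the notion of a "duplicate" (same URL after lowercasing the scheme and host and dropping the fragment) are assumptions made for this example.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize(url: str) -> str:
    """Lowercase scheme and host and drop the fragment.

    This definition of 'duplicate' is an assumption for illustration.
    """
    parts = urlsplit(url.strip())
    return urlunsplit(
        (parts.scheme.lower(), parts.netloc.lower(), parts.path, parts.query, "")
    )


def first_unique_links(links, limit=100):
    """Take the first `limit` links, then drop duplicates, keeping order."""
    seen = set()
    unique = []
    for url in links[:limit]:
        key = normalize(url)
        if key not in seen:
            seen.add(key)
            unique.append(url)
    return unique
```

Cutting to the first 100 links before removing duplicates mirrors the order of steps in the description, so the result may contain fewer than 100 links.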

The resulting set of partially “cleaned” documents was used to select circa 20–25 documents for each language and topic for the preparation of the final test datasets. Annotations for Croatian, Czech, Polish, Russian, and Slovene were made by native speakers; annotations for Slovak were made by native speakers of Czech capable of understanding Slovak. Annotations for Ukrainian were made partly by native speakers and partly by near-native speakers of Ukrainian. Cross-lingual alignment of the entity identifiers was performed by two annotators.

For more details, please consult the shared task paper:
Jakub Piskorski, Lidia Pivovarova, Jan Šnajder, Josef Steinberger, and Roman Yangarber. The First Cross-Lingual Challenge on Recognition, Normalization, and Matching of Named Entities in Slavic Languages. BSNLP, 2017 (bib)

System descriptions

System	Description
JHU jhu	JHU/APL only attempted the NER and Entity Matching subtasks. We employed a statistical tagger called SVMLattice [1], with NER labels inferred by projecting English tags across bitext. The Illinois tagger [2] was used for English. A rule-based entity clusterer called "kripke" was used for Entity Matching [3]. [1] James Mayfield, Paul McNamee, Christine Piatko, and Claudia Pearce, Lattice-based Tagging Using Support Vector Machines. Proceedings of the Twelfth International ACM Conference on Information and Knowledge Management (CIKM 2003), pp. 303-308, November 2003. [2] Lev Ratinov and Dan Roth. 2009. Design challenges and misconceptions in named entity recognition. In Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL '09). Association for Computational Linguistics, Stroudsburg, PA, USA, 147-155. [3] Paul McNamee, Tim Finin, Dawn Lawrie, and James Mayfield, HLTCOE Participation at TAC 2013. Proceedings of the Text Analysis Conference, Gaithersburg, Maryland, 18-19 November, 2013.
Liner2 pw	Liner2 is a generic framework that can be used to solve various tasks based on sequence labeling, such as recognition of named entities, temporal expressions, and event mentions. It provides a set of modules (based on statistical models, dictionaries, rules, and heuristics) that recognize and annotate certain types of phrases. The framework has already been used for recognition of named entities (at different levels of granularity), temporal expressions, and event mentions for Polish. It currently runs only for Polish.
LexiFlexi lf	LexiFlexi applies three lexico-semantic resources to the input text in the following order: (a) match names from the JRC Variant Names database [1] (circa 4.05 million entries) and use the cross-lingual entity IDs therefrom, (b) match names from a large collection (circa 6.82 million entries) of multi-word named entities semi-automatically derived from BabelNet on the unconsumed text using the method described in [2], and (c) match toponyms from the GeoNames gazetteer (circa 1.36 million entries, populated places only) in the unconsumed part of the text and exploit the cross-lingual IDs therefrom. Finally, some language-independent heuristics are applied to match variants (abbreviated forms) of the named mentions of entities that were recognised using the aforementioned lexical resources. This is an almost “out-of-the-box” baseline. [1] Maud Ehrmann, Guillaume Jacquet and Ralf Steinberger, JRC-Names: Multilingual entity name variants and titles as Linked Data. In Semantic Web Journal, Volume 8(2), pages 283-295, 2017. [2] Sophie Chesney, Guillaume Jacquet, Ralf Steinberger and Jakub Piskorski, Multi-word Entity Classification in a Highly Multilingual Environment. Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017). Held at EACL 2017, Valencia, Spain, 4 April 2017.
Sharoff shf	Serge Sharoff's system is an example of the Language Adaptation method [1] applied to the NER subtask. A multilingual word embedding space for all Slavonic languages in the task was created using the model by Dinu et al. [2] with the addition of Weighted-Levenshtein distance [3]. This space was used to train a neural network NER tagger based on the architecture presented in [4], using a Slovenian NER corpus [5], and the resulting model was applied to the other languages in the shared task. These experiments were presented as part of Serge Sharoff's keynote talk at BSNLP 2017. [1] Serge Sharoff, 2017. Toward Pan-Slavic NLP: Some Experiments with Language Adaptation. Proc. BSNLP 2017. [2] Georgiana Dinu, Angeliki Lazaridou and Marco Baroni, 2015. Improving Zero-shot Learning by Mitigating the Hubness Problem. Proc. ICLR 2015. [3] Miguel Rios, Serge Sharoff, 2015. Obtaining SMT dictionaries for related languages. Proc. BUCC 2015. [4] Guillaume Lample, Miguel Ballesteros, Sandeep Subramanian, Kazuya Kawakami, Chris Dyer, 2016. Neural Architectures for Named Entity Recognition. Proc. NAACL 2016. [5] Simon Krek, Tomaž Erjavec, Kaja Dobrovoljc, Nanika Holz, Nina Ledinek, Sara Može. 2012. Učni korpus ssj500k kot podatkovna zbirka.
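
The ordered lookup described for LexiFlexi, where each resource is matched only against text that earlier resources left unconsumed, can be illustrated with a minimal sketch. This is not the LexiFlexi implementation: the whitespace tokenization, the longest-match strategy, the `max_len` limit, and the toy lexicons and IDs in the usage example are all assumptions made for illustration.

```python
def cascade_match(tokens, lexicons, max_len=5):
    """Apply lexicons in order; later ones only see unconsumed tokens.

    `lexicons` is a list of (resource_name, dict) pairs, where each dict maps
    a lowercased, space-joined name to an entity ID (hypothetical format).
    """
    consumed = [False] * len(tokens)
    mentions = []
    for name, lexicon in lexicons:
        i = 0
        while i < len(tokens):
            matched = False
            # Prefer the longest entry starting at position i.
            for n in range(min(max_len, len(tokens) - i), 0, -1):
                span = range(i, i + n)
                if any(consumed[j] for j in span):
                    continue
                key = " ".join(tokens[i:i + n]).lower()
                if key in lexicon:
                    mentions.append((i, i + n, lexicon[key], name))
                    for j in span:
                        consumed[j] = True
                    i += n
                    matched = True
                    break
            if not matched:
                i += 1
    return sorted(mentions)


# Usage with toy data (entries and IDs are invented for this example).
tokens = "Donald Trump visited New York".split()
lexicons = [
    ("names", {"donald trump": "PER-1"}),
    ("geo", {"new york": "LOC-7", "trump": "LOC-9"}),
]
mentions = cascade_match(tokens, lexicons)
```

In the usage example, "Trump" is not matched as a toponym by the second lexicon because the first lexicon has already consumed it as part of "Donald Trump".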

Shared Task Results

Download complete results (split by entity type)
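
All scores in the tables below are F-measures expressed as percentages. As a reminder of how such a score is derived, here is a minimal sketch assuming the standard balanced F-measure (harmonic mean of precision and recall). What counts as a correct match differs per metric (relaxed partial, relaxed exact, strict) and is defined in the shared task paper and the evaluation code, not here; the function name and arguments are illustrative.

```python
def f_measure(true_positives, false_positives, false_negatives):
    """Balanced F-measure as a percentage, from match counts (assumed F1)."""
    predicted = true_positives + false_positives
    gold = true_positives + false_negatives
    precision = true_positives / predicted if predicted else 0.0
    recall = true_positives / gold if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 100 * 2 * precision * recall / (precision + recall)
```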

Average results for both corpora (f-measure)

Phase	Metric	Language
		cs		hr		pl		ru		sk		sl		ua
Recognition	Relaxed Partial	shf	49.66	jhu	49.48	pw	64.72	lf	63.17	lf	48.45	shf	57.08	lf	35.59
		lf	49.15	shf	47.84	shf	49.44	jhu	46.15	jhu	47.99	jhu	47.65	jhu	24.36
		jhu	46.92	lf	37.36	lf	47.84	shf	28.06			lf	45.19	shf	19.44
						jhu	45.92
	Relaxed Exact	lf	48.21	jhu	47.32	pw	64.07	lf	61.54	lf	47.11	shf	52.29	lf	35.59
		shf	46.36	shf	44.05	shf	46.46	jhu	43.70	jhu	46.35	jhu	44.96	jhu	21.31
		jhu	45.03	lf	36.33	lf	46.02	shf	27.48			lf	42.05	shf	18.72
						jhu	42.96
	Strict	shf	48.54	shf	48.60	pw	64.37	lf	54.57	jhu	46.62	shf	61.13	lf	27.54
		jhu	46.64	jhu	48.45	shf	50.82	jhu	44.55	lf	43.76	jhu	47.06	jhu	16.95
		lf	41.74	lf	34.05	lf	42.83	shf	28.55			lf	41.20	shf	15.36
						jhu	42.73
Normalization		shf	48.54	shf	48.60	pw	64.37	lf	54.58	jhu	46.63	shf	61.13	lf	27.55
		jhu	46.64	jhu	48.45	shf	50.82	jhu	44.55	lf	43.76	jhu	47.06	jhu	16.95
		lf	41.74	lf	34.05	lf	42.83	shf	28.55			lf	41.20	shf	15.26
						jhu	42.73
Entity matching	Document-level	jhu	15.65	lf	20.21	lf	20.52	lf	24.21	lf	20.10	lf	27.83	lf	4.79
		lf	9.97	jhu	11.68	pw	12.01	jhu	12.53	jhu	11.66	shf	24.48	shf	1.07
		shf	8.19	shf	7.88	jhu	9.76	shf	5.72			shf	11.38	jhu	0.51
						shf	6.9
	Single-language	jhu	23.90	jhu	19.90	lf	20.1	lf	43.78	jhu	27.29	jhu	30.84	lf	15.54
		lf	18.52	lf	15.52	jhu	17.94	jhu	22.03	lf	22.81	lf	22.62	jhu	6.36
		shf	4.47	shf	3.63	pw	6.85	shf	5.51	shf	0.0	shf	5.67	shf	2.53
						shf	3.6
		all langs
	Cross-lingual	lf	13.2
		jhu	10.0
		shf	2.3

Evaluation results for the Trump corpus (f-measure)

Phase	Metric	Language
		cs		hr		pl		ru		sk		sl		ua
Recognition	Relaxed Partial	shf	51.3	jhu	52.4	pw	66.7	lf	63.6	jhu	46.8	shf	55.2	lf	54.0
		lf	47.6	shf	51.3	shf	52.8	jhu	46.3	lf	46.8	jhu	47.3	jhu	38.8
		jhu	46.2	lf	37.0	lf	51.0	shf	21.9			lf	46.3	shf	24.02
						jhu	44.8
	Relaxed Exact	shf	49.2	jhu	50.8	pw	66.1	lf	62.6	jhu	46.2	shf	53.6	lf	53.3
		lf	46.6	shf	48.2	shf	49.9	jhu	43.1	lf	45.2	jhu	46.0	jhu	37.3
		jhu	46.1	lf	35.6	lf	48.8	shf	21.8			lf	44.2	shf	23.8
						jhu	43.4
	Strict	shf	52.6	shf	52.4	pw	66.6	lf	55.6	jhu	47.0	shf	62.6	lf	50.8
		jhu	46.1	jhu	50.4	shf	55.2	jhu	41.8	lf	44.8	jhu	46.2	jhu	33.2
		lf	42.2	lf	37.4	lf	48.0	shf	21.0			lf	44.2	shf	20.7
						jhu	41.0
Normalization		shf	52.6	shf	52.4	pw	66.6	lf	55.6	jhu	47.0	shf	62.6	lf	46.1
		jhu	46.1	jhu	50.4	shf	55.2	jhu	41.8	lf	44.8	jhu	46.2	jhu	33.3
		lf	42.1	lf	37.4	lf	48.0	shf	21.0			lf	44.2	shf	20.7
						jhu	41.1
Entity matching	Document-level	lf	16.0	lf	31.0	lf	30.0	lf	25.8	lf	26.4	lf	30.1	lf	14.7
		shf	9.2	shf	7.7	pw	10.8	jhu	11.2	jhu	10.2	shf	12.5	shf	3.0
		jhu	5.4	jhu	7.3	shf	8.2	shf	5.0			jhu	9.5
						jhu	6.3
	Single-language	jhu	19.3	lf	17.8	lf	24.0	lf	41.7	jhu	22.6	lf	29.4	lf	30.2
		lf	19.0	jhu	17.6	jhu	18.2	jhu	18.9	lf	21.4	jhu	28.7	jhu	10.7
		shf	5.0	shf	3.6	shf	3.7	shf	4.8			shf	6.8	shf	2.0
						shf	3.7
		all langs
	Cross-lingual	lf	14.3
		jhu	13.7
		shf	4.2

Evaluation results for the European Commission corpus (f-measure)

Phase	Metric	Language
		cs		hr		pl		ru		sk		sl		ua
Recognition	Relaxed Partial	lf	51.0	jhu	45.9	pw	61.8	lf	62.8	lf	50.3	shf	59.1	lf	28.4
		shf	47.6	shf	43.8	jhu	47.3	jhu	46.0	jhu	49.1	jhu	47.9	jhu	18.4
		jhu	47.6	lf	37.8	shf	44.5	shf	32.1			lf	43.8	shf	18.0
						lf	42.8
	Relaxed Exact	lf	50.0	jhu	43.1	pw	60.9	lf	60.7	lf	49.3	shf	57.1	lf	28.4
		jhu	44.4	shf	39.4	jhu	42.4	jhu	44.1	jhu	46.4	jhu	43.9	shf	17.2
		shf	43.1	lf	37.2	lf	41.5	shf	31.2			lf	39.3	jhu	14.7
						shf	41.3
	Strict	shf	47.7	jhu	46.2	pw	61.1	lf	53.7	jhu	46.1	shf	59.5	lf	20.8
		jhu	47.2	shf	44.3	jhu	44.8	jhu	46.5	lf	42.5	jhu	47.8	shf	13.7
		lf	41.2	lf	30.0	shf	44.2	shf	33.6			lf	37.5	jhu	10.8
						lf	34.6
Normalization		jhu	47.2	jhu	46.2	pw	61.1	lf	53.7	jhu	46.2	shf	59.5	lf	20.8
		shf	43.6	shf	44.3	jhu	44.9	jhu	46.6	lf	42.5	jhu	47.8	shf	13.7
		lf	41.2	lf	29.9	shf	44.2	shf	33.6			lf	37.5	jhu	10.9
						lf	34.6
Entity matching	Document-level	lf	25.0	jhu	16.1	jhu	13.8	lf	22.7	jhu	13.1	jhu	36.8	lf	1.6
		shf	7.0	shf	8.1	pw	13.4	jhu	13.7	lf	12.7	lf	25.4	jhu	0.6
		lf	3.0	lf	6.7	lf	6.7	shf	5.4			shf	10.2	shf	0.4
						shf	49.5
	Single-language	jhu	27.3	jhu	22.1	jhu	17.5	lf	45.8	jhu	30.6	jhu	32.2	lf	11.4
		lf	18.0	lf	12.8	lf	13.0	jhu	24.9	lf	23.9	lf	15.2	jhu	4.8
		shf	3.9	shf	3.6	pw	7.8	shf	1.5			shf	4.5	shf	0.8
						shf	3.5
		all langs
	Cross-lingual	lf	12.0
		jhu	5.3
		shf	1.5