Local models have recently attained astounding performance in Entity Disambiguation (ED), with generative and extractive formulations being the most promising research directions. However, previous works have so far limited their studies to using, as the textual representation of each candidate, only its Wikipedia title. Although certainly effective, this strategy presents a few critical issues, especially when titles are not sufficiently informative or distinguishable from one another. In this paper, we address this limitation and investigate the extent to which more expressive textual representations can mitigate it. We evaluate our approach thoroughly against standard benchmarks in ED and find extractive formulations to be particularly well-suited to such representations. We report a new state of the art on 2 out of the 6 benchmarks we consider and strongly improve the generalization capability over unseen patterns. We release our code, data and model checkpoints at https://github.com/SapienzaNLP/extend.
Publication details
2023, Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, Pages -
Entity Disambiguation with Entity Definitions (04b Conference paper in proceedings volume)
Procopio Luigi, Conia Simone, Barba Edoardo, Navigli Roberto
Research group: Artificial Intelligence and Knowledge Representation, Research group: Natural Language Processing
keywords