No Noun Phrase Left Behind: Detecting and Typing Unlinkable Entities
No Noun Phrase Left Behind: Detecting and Typing Unlinkable Entities
authors Thomas Lin, Mausam and Oren Etzioni
venue Conference on Empirical Methods in Natural Language Processing
year 2012
abstract Entity linking systems link noun-phrase mentions in text to their corresponding Wikipedia articles. However, NLP applications would gain from the ability to detect and type all entities mentioned in text, including the long tail of entities not prominent enough to have their own Wikipedia articles. In this paper we show that once the Wikipedia entities mentioned in a corpus of textual assertions are linked, this can further enable the detection and fine-grained typing of the unlinkable entities. Our proposed method for detecting unlinkable entities achieves 24% greater accuracy than a Named Entity Recognition baseline, and our method for fine-grained typing is able to propagate over 1,000 types from linked Wikipedia entities to unlinkable entities. Detection and typing of unlinkable entities can increase yield for NLP applications such as typed question answering.

download: pdf