Entity Linking at Web Scale
| authors | Thomas Lin, Mausam and Oren Etzioni |
| venue | Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction |
| year | 2012 |
| abstract | This paper investigates entity linking over millions of high-precision extractions from a corpus of 500 million Web documents, toward the goal of creating a useful knowledge base of general facts. This paper is the first to report on entity linking over this many extractions, and describes new opportunities (such as corpus-level features) and challenges we found when entity linking at Web scale. We present several techniques that we developed and also lessons that we learned. We envision a future where information extraction and entity linking are paired to automatically generate knowledge bases with billions of assertions over millions of linked entities. |
download:
pdf
