YooName Statistics
YooName’s self-maintenance routine was completed this morning, so we thought it was time to gather some statistics:
- YooName can recognize 175,552 unique named entities.
- 373,925 candidate entities were quarantined because of insufficient statistical and lexical evidence.
- When combining first names and last names, YooName recognizes 450 million personal names.
- YooName’s knowledge is gathered from 54,989 English-language Web pages.
- Our crawler examined 710,901 files (~50 GB) to find the knowledge-rich pages above.
- Disambiguation rules are created using ~300k textual passages out of 11,652 representative named entities on 1 TB of English text.
Compared to statistics compiled three months ago, YooName’s knowledge grew by 17%.