YooName Statistics

YooName’s self-maintenance routine was completed this morning, so we thought it was time to gather some statistics:

  • YooName can recognize 175,552 unique named entities.
  • 373,925 candidate entities were quarantined because of insufficient statistical and lexical evidence.
  • When combining first names and last names, YooName recognizes 450 million personal names.
  • YooName’s knowledge is gathered from 54,989 English-language Web pages.
  • Our crawler examined 710,901 files (~50 GB) to find the knowledge-rich pages above.
  • Disambiguation rules are created from ~300k textual passages covering 11,652 representative named entities, drawn from 1 TB of English text.
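
To put these figures in proportion, here is a small sketch that derives two ratios from the numbers above. The variable names and the `share` helper are ours, for illustration only. It also assumes that recognized and quarantined entities together make up the full candidate pool, which the list does not state explicitly.

```python
# Figures quoted in the list above.
recognized_entities = 175_552
quarantined_candidates = 373_925
knowledge_rich_pages = 54_989
files_examined = 710_901


def share(part: int, whole: int) -> float:
    """Return part / whole as a fraction between 0 and 1."""
    return part / whole


# Assumption: recognized + quarantined covers every candidate entity.
total_candidates = recognized_entities + quarantined_candidates
acceptance_rate = share(recognized_entities, total_candidates)

# Fraction of examined files that turned out to be knowledge-rich pages.
crawl_yield = share(knowledge_rich_pages, files_examined)

print(f"total candidates: {total_candidates:,}")
print(f"acceptance rate:  {acceptance_rate:.1%}")
print(f"crawl yield:      {crawl_yield:.1%}")
```

Under that assumption, roughly a third of candidate entities pass the evidence threshold, and fewer than one in ten examined files ends up as a knowledge-rich page.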

Compared to statistics compiled three months ago, YooName’s knowledge grew by 17%.
