wikipediaの新出単語を利用するZeitgeist

プログラムZeitgeistは、コンピュータに一般に使われているウェブ辞書wordnetにない新出単語をwikipediaから探し出し、単語の意味を理解させようとするもの。メールの内容理解やニュースのサマリー作成に使われる他、自社商品のブログ上での評価を気にする会社などがこのテクノロジーに興味を示している。

A program that works out the meaning of newly coined words using the online encyclopaedia Wikipedia could help machines understand the slang used in blogs and other informal texts, say researchers.
The program – called Zeitgeist – hunts through Wikipedia looking for entries about new words that do not appear in an online resource called WordNet, an official linguistics tool that is both a dictionary and a thesaurus. WordNet is used by researchers to help computers understand human language. New words, or neologisms, that do not appear in WordNet inevitably leave computers stumped.
"Zeitgeist is a neat tool," adds Carrol. But he points out that its limitations mean it can handle only 75% of the neologisms it finds in Wikipedia. Another technique is to use the context of a new word to guess at its meaning, he says. Adding that ability to Zeitgeist could make it much more powerful.

http://www.newscientisttech.com/article/dn9997-software-learns-new-words-from-wikipedia.html