Inventors
Srinivas Bangalore, Giuseppe Riccardi
Publication date
2001/11/13
Patent office
US
Patent number
6317707
Application number
09207326
Description
In a method of learning grammar from a corpus, context words are first identified in the corpus. For the remaining non-context words, the method counts occurrences of predetermined relationships with the context words and maps the counts to a multidimensional frequency space. Clusters are grown from the resulting frequency vectors. The clusters represent classes of words; words in the same cluster possess the same lexical significance and provide an indicator of grammatical structure.
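The steps in the description can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how context words are chosen, which relationships are counted, or how clusters are grown, so the sketch assumes (1) context words are the most frequent tokens, (2) the relationships are "context word immediately before" and "context word immediately after", and (3) clusters are grown greedily by attaching each word to the nearest centroid within a Euclidean distance threshold. The function name `learn_word_classes` and its parameters are hypothetical.

```python
from collections import Counter
from math import dist


def learn_word_classes(sentences, num_context=2, offsets=(-1, 1), threshold=0.5):
    """Return (context_words, word_classes) for a tokenized corpus."""
    # Assumption: the most frequent tokens serve as context words.
    counts = Counter(tok for sent in sentences for tok in sent)
    context = [w for w, _ in counts.most_common(num_context)]

    # One dimension per (context word, relative position) relationship.
    dims = [(c, o) for c in context for o in offsets]
    index = {d: i for i, d in enumerate(dims)}

    # Count how often each non-context word stands in each relationship.
    vectors = {}
    for sent in sentences:
        for i, word in enumerate(sent):
            if word in context:
                continue
            vec = vectors.setdefault(word, [0.0] * len(dims))
            for o in offsets:
                j = i + o
                if 0 <= j < len(sent) and (sent[j], o) in index:
                    vec[index[(sent[j], o)]] += 1.0

    # Normalize counts to relative frequencies so rare and common
    # words are comparable in the frequency space.
    for word, vec in vectors.items():
        total = sum(vec)
        if total:
            vectors[word] = [x / total for x in vec]

    # Assumption: grow clusters greedily around running centroids.
    clusters = []  # each entry: {"centroid": [...], "words": [...]}
    for word in sorted(vectors):
        vec = vectors[word]
        best, best_d = None, threshold
        for cluster in clusters:
            d = dist(vec, cluster["centroid"])
            if d <= best_d:
                best, best_d = cluster, d
        if best is None:
            clusters.append({"centroid": list(vec), "words": [word]})
        else:
            n = len(best["words"])
            best["centroid"] = [
                (c * n + v) / (n + 1) for c, v in zip(best["centroid"], vec)
            ]
            best["words"].append(word)

    return context, [sorted(c["words"]) for c in clusters]


corpus = [
    "the cat sat on the mat".split(),
    "the dog sat on the rug".split(),
    "the cat ran on the rug".split(),
    "the dog ran on the mat".split(),
]
context_words, word_classes = learn_word_classes(corpus)
```

On this toy corpus the sketch selects "the" and "on" as context words and groups the remaining words into two classes: the words that follow "the" (cat, dog, mat, rug) and the words that precede "on" (ran, sat). A real corpus would need more context words, more relationship types, and a tuned threshold.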
Total citations
[Chart: citations per year, 2003 to 2023; per-year counts not recoverable]