WMTrans LemmatizerThe WMTrans Lemmatizer is a program that returns the citation form of any valid word for a specified language, as used in POS taggers. A valid word is a lexicalized word. Please refer to the Unknown Word Lemmatizer if you need to recognize potentially va |
|
|||||||||
|
|
|
|||||||||
Analysis Example
The Lemmatizer analyzes any word form, delivering as results a list of all corresponding citation forms together with their corresponding category (the Java version also offers an API function which only delivers the citation forms). Here are some examples of possible analysis interactions using the WMTrans Lemmatizer. The formal output syntax is described in the WMTrans developer zone.
German Examples
query -> ging
result -> gehen
(Cat V)
query -> moegen
result -> mögen
(Cat V)(Flach ouml),
(Cat N)(Flach ouml)
query -> moegen Filter: (Cat N)
result -> mögen
(Cat N)(Flach ouml)
English Examples
query -> did
result -> do
(Cat V)
query -> cat's Filter: (Cat N)
result -> cat
(Cat N)(Contraction N+'s/Clitic),
(Cat N)(Contraction N+have/V),
(Cat N)(Contraction N+be/V)
Italian Examples
query -> andai
result -> andare
(Cat V)
query -> cacciandolo
result -> cacciare
(Cat V)(Contraction lo/Pron+V)
Customer Reviews:
There are yet no reviews for this product.Please log in to write a review.
CanooWMTrans (WordManager Transducers) is a smart text processing software for information retrieval and language processing applications. Typical uses include integration into search engines, software for text indexing, text mining, language learning, hyperlink generation, spell checking, grammar checking, word stemming and machine translation applications. |
![]()
|
||||
