• eccenca
  • Support
  • Wiki
  • Download
HomeShopPublishingGetting Started

Browse Components...

by Category

Crawlers & Connectors (6) Document Converter (1) Text Analytics (16)    Classification (3)    Entity Extraction (2)    Linguistic Analyzers (11) Imaging Technology (8) Knowledge Modeling (5) Data Stores (1) User Interface Enhancement (2) System Tools (2) Cool Tools (3) Search Query Optimization (2)

by Provider


Bitext
Canoo
Digital Sonata Pty Ltd
Open Source and Academic

by Search

All

Browse all components

User

Your Account






Lost Password?
Forgot your username?
No account yet? Register

Download Area

Your Cart

Your Cart is currently empty.

You are here: Home Shop Text Analytics Linguistic Analyzers WMTrans Lemmatizer

WMTrans Lemmatizer

WMTrans Lemmatizer

The WMTrans Lemmatizer is a program that returns the citation form of any valid word for a specified language, as used in POS taggers. A valid word is a lexicalized word. Please refer to the Unknown Word Lemmatizer if you need to recognize potentially va

Linguistic Analyzers
Canoo
supported by provider
Ask for Pricing
Print
Tell a friend
Ask a question
This component is listed in following categories:

This component is supported by provider.



  • Overview
  • Reviews
  • Support
  • Provider

Analysis Example
The Lemmatizer analyzes any word form, delivering as results a list of all corresponding citation forms together with their corresponding category (the Java version also offers an API function which only delivers the citation forms). Here are some examples of possible analysis interactions using the WMTrans Lemmatizer. The formal output syntax is described in the WMTrans developer zone.

German Examples
query -> ging
result -> gehen
(Cat V)


query -> moegen
result -> mögen
(Cat V)(Flach ouml),
(Cat N)(Flach ouml)


query -> moegen Filter: (Cat N)
result -> mögen
(Cat N)(Flach ouml)

English Examples
query -> did
result -> do
(Cat V)


query -> cat's Filter: (Cat N)
result -> cat
(Cat N)(Contraction N+'s/Clitic),
(Cat N)(Contraction N+have/V),
(Cat N)(Contraction N+be/V)

Italian Examples
query -> andai
result -> andare
(Cat V)


query -> cacciandolo
result -> cacciare
(Cat V)(Contraction lo/Pron+V)

Customer Reviews:

There are yet no reviews for this product.
Please log in to write a review.
This component is supported by provider.



Canoo

WMTrans (WordManager Transducers) is a smart text processing software for information retrieval and language processing applications. Typical uses include integration into search engines, software for text indexing, text mining, language learning, hyperlink generation, spell checking, grammar checking, word stemming and machine translation applications.

WMTrans is based on the Canoo morphological dictionary, containing more than 250'000 lexemes and generating 3 million fully categorized word forms for German, as well as 50'000 lexemes for English and Italian respectively. The dictionaries include information on word formation dependencies, all types of morphological irregularities and spelling variants.

Demos based on WMTrans can be tested at: www.canoo.com/wmtrans

Canoo

Canoo Engineering AG
Kirschgartenstrasse 5
CH 4051 Basel
Switzerland

 
http://www.canoo.com
JoomlaWatch Stats 1.2.9 by Matej Koval
Copyright by eccenca, 1998-2009.
  • Home
  • Sitemap
  • Imprint