Lemmatize
The part-of-speech tags allow the tokens to be
lemmatized. Lemmatization turns a word into its base form, or lemma. For example, the words am, are,
and is are all forms of the verb be. Mapping each form to its lemma allows variants of the same word to be recognised as one word.
Example request
Send the following set of tokens and associated part-of-speech tags:
POST /zoo-chatbot/lemmatize
{
  "tokens": [
    "any",
    "giraffes",
    "?"
  ],
  "posTags": [
    "DT",
    "NNS",
    "."
  ]
}
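As a rough illustration, the request above could be built and sent from Python as sketched below. The base URL `http://localhost:8080`, the `Content-Type` header, and the helper names `build_lemmatize_request` and `lemmatize` are assumptions for the sketch and are not part of this documentation; only the path and the `tokens` and `posTags` fields come from the example. The length check assumes the service expects exactly one tag per token.

```python
import json
import urllib.request

# Hypothetical base URL: the documentation only gives the path.
BASE_URL = "http://localhost:8080"


def build_lemmatize_request(tokens, pos_tags, base_url=BASE_URL):
    """Build (but do not send) a POST request for the lemmatize endpoint."""
    # Assumption: the service expects one part-of-speech tag per token.
    if len(tokens) != len(pos_tags):
        raise ValueError("tokens and posTags must have the same length")
    body = json.dumps({"tokens": tokens, "posTags": pos_tags}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/zoo-chatbot/lemmatize",
        data=body,
        headers={"Content-Type": "application/json"},  # assumed header
        method="POST",
    )


def lemmatize(tokens, pos_tags, base_url=BASE_URL):
    """Send the request and return the decoded JSON response."""
    request = build_lemmatize_request(tokens, pos_tags, base_url)
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `lemmatize(["any", "giraffes", "?"], ["DT", "NNS", "."])` against a running service would send the body shown above. Building the request separately from sending it makes the payload easy to inspect without a network connection.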
Response
Notice that the word any is already in its base form, and the word giraffes is converted to its base form,
giraffe:
{
  "lemmas": [
    "any",
    "giraffe",
    "?"
  ],
  "probabilities": [
    {
      "lemma": "any",
      "probability": 0.9965145882
    },
    {
      "lemma": "giraffe",
      "probability": 0.9917771516
    },
    {
      "lemma": "?",
      "probability": 0.4033399929
    }
  ]
}
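One way a client might consume this response is sketched below: it pairs each original token with its lemma and flags lemmas whose probability falls below a cutoff. The helper name `pair_tokens_with_lemmas` and the `0.5` cutoff are hypothetical choices for illustration. The sketch also assumes that `lemmas` and `probabilities` are returned in the same order as the submitted tokens, as they are in the example.

```python
import json

# The example response from this section, as a JSON string.
RESPONSE_TEXT = """
{
  "lemmas": ["any", "giraffe", "?"],
  "probabilities": [
    {"lemma": "any", "probability": 0.9965145882},
    {"lemma": "giraffe", "probability": 0.9917771516},
    {"lemma": "?", "probability": 0.4033399929}
  ]
}
"""


def pair_tokens_with_lemmas(tokens, response, min_probability=0.5):
    """Pair each token with its lemma and mark low-probability lemmas.

    min_probability is a hypothetical cutoff, not a value defined by the API.
    """
    rows = []
    for token, lemma, entry in zip(
        tokens, response["lemmas"], response["probabilities"]
    ):
        rows.append(
            {
                "token": token,
                "lemma": lemma,
                "probability": entry["probability"],
                "confident": entry["probability"] >= min_probability,
            }
        )
    return rows


rows = pair_tokens_with_lemmas(["any", "giraffes", "?"], json.loads(RESPONSE_TEXT))
for row in rows:
    print(row["token"], "->", row["lemma"], row["probability"], row["confident"])
```

With the example data, only the question mark falls below the assumed cutoff, since its probability is 0.4033399929.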