normalizeWords
Stem or lemmatize words
Syntax
Description
UsenormalizeWords
to reduce words to a root form. TolemmatizeEnglish words (reduce them to their dictionary forms), set the'Style'
option to'lemma'
.
The function supports English, Japanese, German, and Korean text.
reduces the words inupdatedDocuments
= normalizeWords(documents
)documents
to a root form. For English and German text, the function, by default, stems the words using the Porter stemmer for English and German text respectively. For Japanese and Korean text, the function, by default, lemmatizes the words using the MeCab tokenizer.
reduces each word in the string arrayupdatedWords
= normalizeWords(words
)words
to a root form.
reduces the words and also specifies the word language.updatedWords
= normalizeWords(words
,'Language',language
)