General natural language (tokenizing, stemming (English, Russian, Spanish), part-of-speech tagging, sentiment analysis, classification, inflection, phonetics, tfidf, WordNet, jaro-winkler, Levenshtein distance, Dice's Coefficient) facilities for node.
Determining the similarity of alphanumeric text based on trigram matching
Tokenizer for Vietnamese in Nodejs and Javascript
Text tokenization, transformation & analysis transducers, utilities, stop words, porter stemming, vector encodings, similarities