Important: This documentation covers Yarn 1 (Classic).
For Yarn 2+ docs and migration guide, see yarnpkg.com.

Package detail

eld-custom

nitotm36Apache-2.01.0.3TypeScript support: included

Fast and accurate natural language detection. Detector written in Javascript. Efficient language detector, Nito-ELD, ELD.

nlp, language, natural-language-processing, natural-language, language-detection, language-detector, language-identification

readme

forked from https://github.com/nitotm/efficient-language-detector-js

使い方

import { eld } from './src/languageDetector.js';

await eld.init("ngramsM60-subset-de-en-es-fr-it-ja-ko-pt-zh.js");
const result = eld.detect('Hello world');
console.log(result.language); // 'en'

サブセット生成

  1. generateSubset.js内の以下の部分を修正して、サブセットに含めたい言語を指定。精度を高めたい場合はbaseNgramFileNameをLにする
    const defaultLanguages = ["en", "ja", "zh", "es", "ko", "fr", "de", "pt", "it"];
node generateSubset.js

=> ngrams-main.jsが更新される

テスト

node test-detection.js