charset

node-modules | 4.7m | MIT | 1.0.1

Get the content charset from header and html content-type.

charset, content-type, ContentType, Content-Type, xml, encoding

Install

$ npm install charset --save

Usage

Detect charset from an HTTP client response and its content

var charset = require('charset');
var http = require('http');

http.get('http://nodejs.org', function (res) {
  res.on('data', function (chunk) {
    console.log(charset(res.headers, chunk));
    // or `console.log(charset(res, chunk));`
    res.destroy();
  });
});

Stdout should log: utf8.
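To make the lookup order more concrete, here is a minimal sketch of header-first, body-second detection. It is not the package's source: `guessCharset`, `CHARSET_RE` and the exact regular expression are hypothetical names and choices made for illustration. The 512-byte default mirrors the peek size mentioned in the changelog, and the `utf-8` to `utf8` normalization is an assumption based on the `utf8` output shown above.

```javascript
// Hypothetical helper, NOT the charset package's implementation.
// Matches charset=... (headers, <meta> tags) and encoding="..." (XML prolog).
const CHARSET_RE = /(?:charset|encoding)\s*=\s*['"]?\s*([\w-]+)/i;

function guessCharset(headers, body, peekSize) {
  // 1. Prefer the charset declared in the Content-Type header.
  const contentType = headers['content-type'] || headers['Content-Type'] || '';
  let match = CHARSET_RE.exec(contentType);

  // 2. Otherwise inspect only the first `peekSize` bytes of the body.
  if (!match && body && body.length > 0) {
    const head = body.subarray(0, peekSize || 512).toString('latin1');
    match = CHARSET_RE.exec(head);
  }
  if (!match) {
    return null;
  }
  const name = match[1].toLowerCase();
  // Assumed normalization so that "utf-8" is reported as "utf8".
  return name === 'utf-8' ? 'utf8' : name;
}

const html = Buffer.from('<html><head><meta charset="gbk"></head></html>');
console.log(guessCharset({ 'content-type': 'text/html; charset=UTF-8' }, html)); // utf8
console.log(guessCharset({ 'content-type': 'text/html' }, html));                // gbk
console.log(guessCharset({ 'content-type': 'text/html' }, Buffer.from('<p>hi</p>'))); // null
```

Scanning only a short prefix of the body keeps the check cheap on large responses, at the cost of missing a declaration that appears later in the document.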

Detect from String

charset(res.headers['content-type']);

Combine detection with jschardet

charset only detects the encoding from the HTTP response headers and the HTML content-type meta tag. You can combine it with jschardet to help determine the final charset when neither declares one.

This example code comes from stackoverflow#12326688:

var request = require('request');
var charset = require('charset');
var jschardet = require('jschardet');

request({
  url: 'http://www.example.com',
  encoding: null
}, function (err, res, body) {
  if (err) {
    throw err;
  }
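  // Prefer the charset declared in the headers or the meta tag.
  var enc = charset(res.headers, body);
  // Otherwise fall back to jschardet, guarding against a missing encoding.
  enc = enc || (jschardet.detect(body).encoding || '').toLowerCase();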
  console.log(enc);
});
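The fallback order can also be written as a small standalone function, which makes it easier to test without a network request. This is only a sketch: `resolveEncoding` and its parameters are hypothetical, and the `detect` argument stands in for a statistical detector such as `jschardet.detect`, stubbed here so the snippet runs without extra packages.

```javascript
// Hypothetical helper illustrating "declared charset first, detector second".
function resolveEncoding(declared, body, detect, fallback) {
  // A charset declared in the headers or meta tag wins outright.
  if (declared) {
    return declared;
  }
  // Guard against a detector that reports no encoding at all.
  const guess = detect(body) || {};
  return guess.encoding ? guess.encoding.toLowerCase() : fallback;
}

// Stub detectors used only for this illustration.
const confident = function () { return { encoding: 'GB2312', confidence: 0.99 }; };
const clueless = function () { return { encoding: null, confidence: 0 }; };

const body = Buffer.from('plain bytes');
console.log(resolveEncoding('utf8', body, confident, 'utf8')); // utf8
console.log(resolveEncoding(null, body, confident, 'utf8'));   // gb2312
console.log(resolveEncoding(null, body, clueless, 'utf8'));    // utf8
```

Passing an explicit fallback means the caller always gets a usable encoding name, even when neither the declaration nor the detector yields one.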

License

MIT

changelog

1.0.1 / 2017-09-07

fixes

[effda0c] - fix: limit match string (#11) (fengmk2 <fengmk2@gmail.com>)

others

[5ba8a49] - test: use npm scripts instead of Makefile (#9) (fengmk2 <m@fengmk2.com>)
[4787184] - fix example with jschardet (Xu Jingxin <sailxjx@gmail.com>)
[c2f94ef] - add combine example with jschardet. (fengmk2 <fengmk2@gmail.com>)

1.0.0 / 2014-09-17

add peek size, default is 512. fixed #4

0.1.0 / 2014-07-05

support charset(content-type-string)
update AUTHORS with new version of contributors

0.0.2 / 2014-01-19

add contributors
1 #2 read charset from encoding="utf8" for xml, handle spaces between = and inside of utf8 (@kof)

0.0.1 / 2012-10-08

first commit for charset.
Initial commit
