Package detail

dom-to-image

tsayen970.1kMIT2.6.0

Generates an image from a DOM node using HTML5 canvas and SVG

dom, image, raster, render, html, canvas, svg

readme

DOM to Image

What is it

dom-to-image is a library which can turn arbitrary DOM node into a vector (SVG) or raster (PNG or JPEG) image, written in JavaScript. It's based on domvas by Paul Bakaus and has been completely rewritten, with some bugs fixed and some new features (like web font and image support) added.

Installation

NPM

npm install dom-to-image

Then load

/* in ES 6 */
import domtoimage from 'dom-to-image';
/* in ES 5 */
var domtoimage = require('dom-to-image');

Bower

bower install dom-to-image

Include either src/dom-to-image.js or dist/dom-to-image.min.js in your page and it will make the domtoimage variable available in the global scope.

<script src="path/to/dom-to-image.min.js" />
<script>
  domtoimage.toPng(node)
  //...
</script>

Usage

All the top level functions accept DOM node and rendering options, and return promises, which are fulfilled with corresponding data URLs.
Get a PNG image base64-encoded data URL and display right away:

var node = document.getElementById('my-node');

domtoimage.toPng(node)
    .then(function (dataUrl) {
        var img = new Image();
        img.src = dataUrl;
        document.body.appendChild(img);
    })
    .catch(function (error) {
        console.error('oops, something went wrong!', error);
    });

Get a PNG image blob and download it (using FileSaver, for example):

domtoimage.toBlob(document.getElementById('my-node'))
    .then(function (blob) {
        window.saveAs(blob, 'my-node.png');
    });

Save and download a compressed JPEG image:

domtoimage.toJpeg(document.getElementById('my-node'), { quality: 0.95 })
    .then(function (dataUrl) {
        var link = document.createElement('a');
        link.download = 'my-image-name.jpeg';
        link.href = dataUrl;
        link.click();
    });

Get an SVG data URL, but filter out all the <i> elements:

function filter (node) {
    return (node.tagName !== 'i');
}

domtoimage.toSvg(document.getElementById('my-node'), {filter: filter})
    .then(function (dataUrl) {
        /* do something */
    });

Get the raw pixel data as a Uint8Array with every 4 array elements representing the RGBA data of a pixel:

var node = document.getElementById('my-node');

domtoimage.toPixelData(node)
    .then(function (pixels) {
        for (var y = 0; y < node.scrollHeight; ++y) {
          for (var x = 0; x < node.scrollWidth; ++x) {
            pixelAtXYOffset = (4 * y * node.scrollHeight) + (4 * x);
            /* pixelAtXY is a Uint8Array[4] containing RGBA values of the pixel at (x, y) in the range 0..255 */
            pixelAtXY = pixels.slice(pixelAtXYOffset, pixelAtXYOffset + 4);
          }
        }
    });

All the functions under impl are not public API and are exposed only for unit testing.

Rendering options

filter

A function taking DOM node as argument. Should return true if passed node should be included in the output (excluding node means excluding it's children as well). Not called on the root node.

bgcolor

A string value for the background color, any valid CSS color value.

height, width

Height and width in pixels to be applied to node before rendering.

style

An object whose properties to be copied to node's style before rendering. You might want to check this reference for JavaScript names of CSS properties.

quality

A number between 0 and 1 indicating image quality (e.g. 0.92 => 92%) of the JPEG image. Defaults to 1.0 (100%)

cacheBust

Set to true to append the current time as a query string to URL requests to enable cache busting. Defaults to false

imagePlaceholder

A data URL for a placeholder image that will be used when fetching an image fails. Defaults to undefined and will throw an error on failed images

Browsers

It's tested on latest Chrome and Firefox (49 and 45 respectively at the time of writing), with Chrome performing significantly better on big DOM trees, possibly due to it's more performant SVG support, and the fact that it supports CSSStyleDeclaration.cssText property.

Internet Explorer is not (and will not be) supported, as it does not support SVG <foreignObject> tag

Safari is not supported, as it uses a stricter security model on <foreignObject> tag. Suggested workaround is to use toSvg and render on the server.`

Dependencies

Source

Only standard lib is currently used, but make sure your browser supports:

Promise
SVG <foreignObject> tag

Tests

Most importantly, tests depend on:

js-imagediff, to compare rendered and control images
ocrad.js, for the parts when you can't compare images (due to the browser rendering differences) and just have to test whether the text is rendered

How it works

There might some day exist (or maybe already exists?) a simple and standard way of exporting parts of the HTML to image (and then this script can only serve as an evidence of all the hoops I had to jump through in order to get such obvious thing done) but I haven't found one so far.

This library uses a feature of SVG that allows having arbitrary HTML content inside of the <foreignObject> tag. So, in order to render that DOM node for you, following steps are taken:

Clone the original DOM node recursively
Compute the style for the node and each sub-node and copy it to corresponding clone
- and don't forget to recreate pseudo-elements, as they are not cloned in any way, of course
Embed web fonts
- find all the @font-face declarations that might represent web fonts
- parse file URLs, download corresponding files
- base64-encode and inline content as data: URLs
- concatenate all the processed CSS rules and put them into one <style> element, then attach it to the clone
Embed images
- embed image URLs in <img> elements
- inline images used in background CSS property, in a fashion similar to fonts
Serialize the cloned node to XML
Wrap XML into the <foreignObject> tag, then into the SVG, then make it a data URL
Optionally, to get PNG content or raw pixel data as a Uint8Array, create an Image element with the SVG as a source, and render it on an off-screen canvas, that you have also created, then read the content from the canvas
Done!

Things to watch out for

if the DOM node you want to render includes a <canvas> element with something drawn on it, it should be handled fine, unless the canvas is tainted - in this case rendering will rather not succeed.
at the time of writing, Firefox has a problem with some external stylesheets (see issue #13). In such case, the error will be caught and logged.

Authors

Anatolii Saienko, Paul Bakaus (original idea)

License

MIT

changelog

2013-09-04 Antonio Diaz Diaz antonio@gnu.org

* Version 0.23-pre1 released.
* Improvements in character recognition.

2013-07-09 Antonio Diaz Diaz antonio@gnu.org

* Version 0.22 released.
* Scaling and smoothing are now made before thresholding.
* Improvements in character recognition.
* ocradlib.h: Added new function OCRAD_set_utf8_format.
* Small improvements have been made in manual and man page.
* Changed quote characters in messages as advised by GNU Standards.
* configure: Options now accept a separate argument.
* configure: 'datadir' renamed to 'datarootdir'.
* Makefile.in: Added new target 'install-bin'.

2011-01-10 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.21 released.
* Fixed some internal errors triggered by noisy input.
* ocrad.texinfo: Added chapter 'OCR Results File'.
* main.cc: Set stdin/stdout in binary mode on MSVC and OS2.

2010-07-16 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.20 released.
* ocradlib.h: Added new function OCRAD_scale.
* ocradlib.h: Added new function OCRAD_result_chars_total.
* ocradlib.h: Added new function OCRAD_result_chars_block.
* ocradlib.h: Added new function OCRAD_result_chars_line.

2010-01-27 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.19 released.
* Added library interface (ocradlib.h).
* Option '--crop' replaced with similar but different option
  '--cut', which can accept coordinates taken from the ORF file.
* Recognition of files with a single character and without white
  space at the edges has been fixed.
* testsuite/check.sh: Added new tests for the library interface
  and for single character images.
* New files ocradlib.h, ocradlib.cc, ocrcheck.cc.
* Makefile.in: Added '--name' option to help2man invocation.

2009-05-08 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.18 released.
* Added a layout analyser able to process arbitrary pages.
* Added new option '--quiet'.
* The '--layout' option no more accepts an argument.
* The '--crop' option now accepts negative coordinates.
* New recognized letter; 'a' with ring above.
* Fixed recognition on files with a single big character.
* Fixed bug that didn't write maxval when saving pgm or ppm.
* Fixed some includes that prevented compilation with GCC 4.3.0.
* 'make install-info' should now work on Debian and OS X.
* Makefile.in: Man page is now installed by default.
* New file testsuite/check.sh.
* Arg_parser updated to 1.2.
* Verbosity control of messages has been modified.

2007-06-29 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.17 released.
* License updated to GPL version 3 or later.
* '--scale' no more suppresses ORF output.
* Improved removal of thick frames.
* Changed 'Textline' to accept more than one big initial.
* Class 'Block' renamed to 'Blob'.
* 'configure' and 'Makefile.in' have been modified to be more
  GNU-standards compliant.

2006-10-20 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.16 released.
* Added new option '--filter'.
* Better algorithm for vertical space detection (blank lines).
* Some fixes made to 'configure' script.
* Added two new debug levels.
* Improvements in character recognition.

2006-04-03 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.15 released.
* Added new argument parser that replaces 'getopt_long'.
* Fixed a bug that prevented compilation with GCC 4.1.

2006-02-15 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.14 released.
* Ocrad is now able to read ppm files.
* Added new class 'Page_image' (256-level greymap).
* Added automatic and adaptive binarization by Otsu's method.
* Added new option '--crop'.
* Added two new chapters 'Image Format Conversion' and
  'Algorithm' to the texinfo file.
* Target 'check' added to Makefile.
* Changed 'ocrad.png' icon to color, one line.

2005-10-10 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.13 released.
* Ocrad is now able to read pgm files.
* Added new rational number class.
* Use rationals instead of integers in space detection algorithm.
* Better algorithm for space detection in tables.
* 'vector<bool>' replaced by 'vector<char>' in bitmap (faster).
* Variable-size arrays replaced by vectors in block.cc.
* Fixed sizeof(size_t) != sizeof(int) on some 64 bit systems.
* Improved number recognition (mainly in textline_r2.cc).
* Overflow detection when loading or scaling file.
* Fixed a miscompilation with GCC 3.3.1.
* Class 'Vrhomboid' merged into files 'track.h' and 'track.cc'.

2005-06-07 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.12 released.
* Change in internal representation; Blockmap has been eliminated.
* Text inside tables of solid lines is now recognized.
* Improvements in character recognition.
* Fixed possible integer overflow when loading pbm file.

2005-02-12 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.11 released.
* Added new option '--scale'.
* Improvements in character recognition.
* Fixed bug in '--transform' (introduced in 0.10).

2004-12-09 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.10 released.
* Added new suboption '-D7X'.
* Change in internal representation; only 1 Blockmap per Textpage.
* Use of absolute coordinates in ORF file.
* Improved space detection algorithm.
* Improvements in character recognition and separation.

2004-10-23 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.9 released.
* Added new option '--transform'.
* 'DESTDIR' now works as expected.
* New class 'Textpage' is top of internal representation.

2004-05-23 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.8 released.
* Better algorithm for line detection.
* New feature '-x -' (export ORF file to stdout).
* Small improvements in picture elimination.

2004-02-09 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.7 released.
* Internal change to UCS instead of ISO 8859-1.
* Default charset is now ISO 8859-15 (latin9).
* Ocrad now recognizes Turkish characters (ISO 8859-9).
* Added new output format (UTF-8).
* Added new options '--charset' and '--format'.
* Added man page.

2003-12-18 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.6 released.
* 'configure' is now compatible with 'sh'.
* Better algorithm for lowercase-uppercase decision.
* Small changes to line detector.
* Fixed bug (output of char 0 when separating some merged chars).

2003-10-18 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.5 released.
* Corrected bug when creating ORF file from stdin.
* Added the ability to read multiple files from stdin.
* Use 'vector' instead of 'list' due to problem with GCC 3.3.1.
* Faster 'processing' of pictures.

2003-09-03 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.4 released.
* More standard configure and Makefile.
* Added info file.
* Small changes to layout detector.
* Character codes > 127 now in ISO_8859_1::<charname> format.
* Added new option '--invert'.

2003-07-19 Antonio Diaz Diaz ant_diaz@teleline.es

* Version 0.3 released.
* ORF file feature added.
* Recursive 'layout detector' added.

This file is a collection of facts, and thus it is not copyrightable, but just in case, you have unlimited permission to copy, distribute and modify it.