assemblyscript-regex

colineberhardt | 10.4k | MIT | 1.6.4 | TypeScript support: included

A regex engine built with AssemblyScript

A regex engine for AssemblyScript.

AssemblyScript is a new language, based on TypeScript, that compiles to WebAssembly. AssemblyScript has a lightweight standard library, but lacks support for regular expressions. This project fills that gap!

This project exposes an API that mirrors the JavaScript RegExp class:

const regex = new RegExp("fo*", "g");
const str = "table football, foul";

let match: Match | null = regex.exec(str);
while (match != null) {
  // first iteration
  //   match.index = 6
  //   match.matches[0] = "foo"

  // second iteration
  //   match.index = 16
  //   match.matches[0] = "fo"
  match = regex.exec(str);
}
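
For comparison, the same loop can be sketched against the built-in JavaScript RegExp in TypeScript. This uses only the standard JavaScript API, not this library, and the helper name collectMatches is hypothetical. The visible difference is that JavaScript exposes the matched text as match[0], whereas the example above reads it from match.matches[0].

```typescript
// Sketch using the built-in JavaScript RegExp (not assemblyscript-regex).
// `collectMatches` is a hypothetical helper name used only for illustration.
function collectMatches(
  pattern: string,
  input: string
): Array<{ index: number; text: string }> {
  // The "g" flag makes exec() resume from regex.lastIndex on each call.
  const regex = new RegExp(pattern, "g");
  const found: Array<{ index: number; text: string }> = [];

  let match: RegExpExecArray | null = regex.exec(input);
  while (match !== null) {
    found.push({ index: match.index, text: match[0] });
    // Guard against zero-length matches, which would otherwise loop forever.
    if (match[0].length === 0) regex.lastIndex++;
    match = regex.exec(input);
  }
  return found;
}

const results = collectMatches("fo*", "table football, foul");
// results holds two entries: "foo" at index 6 and "fo" at index 16
```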

Project status

The initial focus of this implementation has been feature support and functionality over performance. It currently supports a sufficient number of regex features to be considered useful, including most character classes, common assertions, groups, alternations, capturing groups and quantifiers.

The next phase of development will focus on more extensive testing and performance. The project currently has reasonable unit test coverage, focussed on positive and negative test cases on a per-feature basis. It also includes a more exhaustive test suite with test cases borrowed from another regex library.

Feature support

Based on the classification within the MDN cheatsheet:

Character sets

  • [x] .
  • [x] \d
  • [x] \D
  • [x] \w
  • [x] \W
  • [x] \s
  • [x] \S
  • [x] \t
  • [x] \r
  • [x] \n
  • [x] \v
  • [x] \f
  • [ ] [\b]
  • [ ] \0
  • [ ] \cX
  • [x] \xhh
  • [x] \uhhhh
  • [ ] \u{hhhh} or \u{hhhhh}
  • [x] \

Assertions

  • [x] ^
  • [x] $
  • [ ] \b
  • [ ] \B

Other assertions

  • [ ] x(?=y) Lookahead assertion
  • [ ] x(?!y) Negative lookahead assertion
  • [ ] (?<=y)x Lookbehind assertion
  • [ ] (?<!y)x Negative lookbehind assertion

Groups and ranges

  • [x] x|y
  • [x] [xyz], [a-c]
  • [x] [^xyz], [^a-c]
  • [x] (x) Capturing group
  • [ ] \n Back reference
  • [ ] (?<Name>x) Named capturing group
  • [x] (?:x) Non-capturing group

Quantifiers

  • [x] x*
  • [x] x+
  • [x] x?
  • [x] x{n}
  • [x] x{n,}
  • [x] x{n,m}
  • [ ] x*? / x+? / ...
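
Lazy quantifiers (x*?, x+?) are unchecked above. A common workaround in engines without them is a greedy quantifier over a negated character class, both of which are ticked in the lists above. The sketch below illustrates the idea with the built-in JavaScript RegExp in TypeScript. It has not been verified against this library, and the helper name firstMatch is hypothetical.

```typescript
// Sketch using the built-in JavaScript RegExp (not assemblyscript-regex).
// `firstMatch` is a hypothetical helper name used only for illustration.
function firstMatch(pattern: string, input: string): string | null {
  const match = new RegExp(pattern).exec(input);
  return match === null ? null : match[0];
}

const sample = "<a><b>";

// Greedy: ".*" runs to the last ">" in the input.
const greedy = firstMatch("<.*>", sample);

// Lazy: ".*?" stops at the first ">" (listed as unsupported above).
const lazy = firstMatch("<.*?>", sample);

// Workaround: a greedy quantifier over "[^>]" cannot run past the first ">".
const negated = firstMatch("<[^>]*>", sample);
```

In JavaScript the lazy and negated-class patterns return the same text here, which is why the rewrite is often a usable substitute when the delimiter is a single character.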

RegExp

  • [x] global
  • [ ] sticky
  • [x] case insensitive
  • [x] multiline
  • [x] dotAll
  • [ ] unicode

Development

This project is open source and MIT licensed, and your contributions are very welcome.

To get started, check out the repository and install dependencies:

$ npm install

A few general points about the tools and processes this project uses:

  • This project uses prettier for code formatting and eslint to provide additional syntactic checks. Both run on npm test and as part of the CI build.
  • The unit tests are executed using as-pect, a native AssemblyScript test runner.
  • The specification tests are within the spec folder. The npm run test:generate target transforms these tests into as-pect tests, which execute as part of the standard build / test cycle.
  • To make debugging easier, you can execute this library as TypeScript (rather than WebAssembly) via the npm run tsrun target.