Scribe.js

Scribe.js is a JavaScript library that performs OCR and extracts text from images and PDFs.

Common use cases:

Recognize text from images.
Extract text from user-uploaded .pdf files.
1. If the .pdf file is already text-native, scribe.js can extract the existing text.
2. If the .pdf file is image-native, scribe.js can recognize text using OCR.
Write .pdf files that include a high-quality invisible text layer.
1. scribe.js can insert text into an existing .pdf file, making it searchable.

Scribe.js is a library intended for developers. End users who want to scan documents should see the officially-supported GUI at scribeocr.com (repo here).

Setup

Install from npm by running the following:

npm i scribe.js-ocr

Scribe.js is written in JavaScript using ESM, so can be imported directly from browser or Node.js JavaScript code.

// Import statement in browser:
import scribe from 'node_modules/scribe.js-ocr/scribe.js';
// Import statement for Node.js:
import scribe from 'scribe.js-ocr';

// Basic usage
scribe.extractText(['https://tesseract.projectnaptha.com/img/eng_bw.png'])
	.then((res) => console.log(res))

When using Scribe.js in the browser, all files must be served from the same origin as the file importing Scribe.js. This means that importing Scribe.js from a CDN will not work. There is no UMD version.

Scribe.js vs. Tesseract.js

Considering whether Scribe.js or Tesseract.js is better for your project? Read this article.

Documentation

Contributing

To work on a local copy, simply clone with --recurse-submodules and install. Please run the automated tests before making a PR.

## Clone the repo, including recursively cloning submodules
git clone --recurse-submodules git@github.com:scribeocr/scribe.js.git
cd scribe.js

## Install dependencies
npm i

## Make changes
## [...]

## Run automated tests before making PR
npm run test

Name		Name	Last commit message	Last commit date
Latest commit History 684 Commits
.github/workflows		.github/workflows
cli		cli
dev		dev
docs		docs
examples		examples
fonts		fonts
fonts_raw		fonts_raw
js		js
lib		lib
mupdf		mupdf
scrollview-web @ 95dfbf6		scrollview-web @ 95dfbf6
tess		tess
tests		tests
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.gitmodules		.gitmodules
.npmignore		.npmignore
LICENSE		LICENSE
README.md		README.md
jsconfig.json		jsconfig.json
karma.conf.cjs		karma.conf.cjs
package-lock.json		package-lock.json
package.json		package.json
scribe.js		scribe.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scribe.js

Setup

Scribe.js vs. Tesseract.js

Documentation

Contributing

About

Releases

Packages

Languages

License

neolev/scribe.js

Folders and files

Latest commit

History

Repository files navigation

Scribe.js

Setup

Scribe.js vs. Tesseract.js

Documentation

Contributing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages