Skip to content

Commit 0c39d34

Browse files
committed
Update tesseract man page about both OCR engines in tesseract 4
1 parent 4237987 commit 0c39d34

File tree

1 file changed

+6
-0
lines changed

1 file changed

+6
-0
lines changed

doc/tesseract.1.asc

+6
Original file line numberDiff line numberDiff line change
@@ -17,6 +17,12 @@ between 1985 and 1995. In 1995, this engine was among the top 3 evaluated by
1717
UNLV. It was open-sourced by HP and UNLV in 2005, and has been developed
1818
at Google since then.
1919

20+
Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused
21+
on line recognition, but also still supports the legacy Tesseract OCR engine of
22+
Tesseract 3 which works by recognizing character patterns. Compatibility with
23+
Tesseract 3 is enabled by --oem 0. It also needs traineddata files which support
24+
the legacy engine, for example those from the tessdata repository.
25+
2026

2127
IN/OUT ARGUMENTS
2228
----------------

0 commit comments

Comments
 (0)