Skip to content

Commit bdf6629

Browse files
stweilzdenop
authored andcommitted
Update version in README and manpages (#1381)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
1 parent 8fb6874 commit bdf6629

File tree

3 files changed

+22
-22
lines changed

3 files changed

+22
-22
lines changed

README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -33,7 +33,7 @@ In 2005 Tesseract was open sourced by HP. Since 2006 it is developed by Google.
3333

3434
The latest stable version is **[3.05.01](https://github.com/tesseract-ocr/tesseract/releases/tag/3.05.01)**, released on June 1, 2017. Latest source code for 3.05 is available from [3.05 branch on GitHub](https://github.com/tesseract-ocr/tesseract/tree/3.05).
3535

36-
Source code for the new **[LSTM based 4.00.00alpha version](https://github.com/tesseract-ocr/tesseract)** is available from the master branch on GitHub. Please note this branch is under active development.
36+
Source code for the new **[LSTM based 4.0 version](https://github.com/tesseract-ocr/tesseract)** is available from the master branch on GitHub. Please note this branch is under active development.
3737

3838
See **[Release Notes](https://github.com/tesseract-ocr/tesseract/wiki/ReleaseNotes)** and **[Change Log](https://github.com/tesseract-ocr/tesseract/blob/master/ChangeLog)** for more details of the releases.
3939

doc/combine_tessdata.1.asc

+19-19
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@ SYNOPSIS
1111

1212
DESCRIPTION
1313
-----------
14-
combine_tessdata(1) is the main program to combine/extract/overwrite/list/compact
14+
combine_tessdata(1) is the main program to combine/extract/overwrite/list/compact
1515
tessdata components in [lang].traineddata files.
1616

1717
To combine all the individual tessdata components (unicharset, DAWGs,
@@ -59,10 +59,10 @@ OPTIONS
5959
6060
*-c* '.traineddata' 'FILE'...:
6161
Compacts the LSTM component in the .traineddata file to int.
62-
62+
6363
*-d* '.traineddata' 'FILE'...:
6464
Lists directory of components from the .traineddata file.
65-
65+
6666
*-e* '.traineddata' 'FILE'...:
6767
Extracts the specified components from the .traineddata file
6868
@@ -81,15 +81,15 @@ CAVEATS
8181
COMPONENTS
8282
----------
8383
The components in a Tesseract lang.traineddata file as of
84-
Tesseract 4.00alpha are briefly described below; For more information on
84+
Tesseract 4.0 are briefly described below; For more information on
8585
many of these files, see
8686
<https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract>
8787
and
8888
<https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00>
8989
9090
lang.config::
9191
(Optional) Language-specific overrides to default config variables.
92-
For 4.00alpha traineddata files, lang.config provides control parameters which
92+
For 4.0 traineddata files, lang.config provides control parameters which
9393
can affect layout analysis, and sub-languages.
9494
9595
lang.unicharset::
@@ -148,36 +148,36 @@ lang.params-model::
148148
(Optional - 3.0x legacy tesseract) .
149149
150150
lang.lstm::
151-
(Required - 4.00alpha LSTM) Neural net trained recognition model generated by lstmtraining.
151+
(Required - 4.0 LSTM) Neural net trained recognition model generated by lstmtraining.
152152
153153
lang.lstm-punc-dawg::
154-
(Optional - 4.00alpha LSTM) A dawg made from punctuation patterns found around words.
154+
(Optional - 4.0 LSTM) A dawg made from punctuation patterns found around words.
155155
The "word" part is replaced by a single space. Uses lang.lstm-unicharset.
156-
156+
157157
lang.lstm-word-dawg::
158-
(Optional - 4.00alpha LSTM) A dawg made from dictionary words from the language.
158+
(Optional - 4.0 LSTM) A dawg made from dictionary words from the language.
159159
Uses lang.lstm-unicharset.
160160
161161
lang.lstm-number-dawg::
162-
(Optional - 4.00alpha LSTM) A dawg made from tokens which originally contained digits.
162+
(Optional - 4.0 LSTM) A dawg made from tokens which originally contained digits.
163163
Each digit is replaced by a space character. Uses lang.lstm-unicharset.
164-
164+
165165
lang.lstm-unicharset::
166-
(Required - 4.00alpha LSTM) The unicode character set that Tesseract recognizes, with properties.
166+
(Required - 4.0 LSTM) The unicode character set that Tesseract recognizes, with properties.
167167
Same unicharset must be used to train the LSTM and build the lstm-*-dawgs files.
168168
169169
lang.lstm-recoder::
170-
(Required - 4.00alpha LSTM) Unicharcompress, aka the recoder, which maps the unicharset
170+
(Required - 4.0 LSTM) Unicharcompress, aka the recoder, which maps the unicharset
171171
further to the codes actually used by the neural network recognizer. This is created as
172172
part of the starter traineddata by combine_lang_model.
173-
173+
174174
lang.version::
175-
(Optional) Version string for the traineddata file.
176-
First appeared in version 4.00alpha of Tesseract.
177-
Old version of traineddata files will report Version string:Pre-4.0.0.
178-
4.00alpha version of traineddata files may include the network spec
175+
(Optional) Version string for the traineddata file.
176+
First appeared in version 4.0 of Tesseract.
177+
Old version of traineddata files will report Version string:Pre-4.0.0.
178+
4.0 version of traineddata files may include the network spec
179179
used for LSTM training as part of version string.
180-
180+
181181
HISTORY
182182
-------
183183
combine_tessdata(1) first appeared in version 3.00 of Tesseract

doc/tesseract.1.asc

+2-2
Original file line numberDiff line numberDiff line change
@@ -115,7 +115,7 @@ SINGLE OPTIONS
115115
LANGUAGES
116116
---------
117117

118-
The currently available traineddata files for tesseract 4.00
118+
The currently available traineddata files for tesseract 4.0
119119
for the following languages are in
120120
(in https://github.com/tesseract-ocr/tessdata_fast):
121121

@@ -244,7 +244,7 @@ argument '-l foo'.
244244
SCRIPTS
245245
-------
246246
247-
The traineddata files for the following scripts for tesseract 4.00
247+
The traineddata files for the following scripts for tesseract 4.0
248248
are also in https://github.com/tesseract-ocr/tessdata_fast.
249249
250250
In most cases, each of these contains all the languages that use that script PLUS English.

0 commit comments

Comments
 (0)