1 | 1 | 2014-02-04 v3.03 |
2 | | -* Added new training tool text2image to generate box/tif file pairs from |
3 | | - text and truetype fonts. |
4 | | -* Added support for PDF output with searchable text. |
5 | | -* Removed entire IMAGE class and all code in image directory. |
6 | | -* Tesseract executable: support for output to stdout; limited support for one |
7 | | - page images from stdin (especially on Windows) |
8 | | -* Added Renderer to API to allow document-level processing and output |
9 | | - of document formats, like hOCR, PDF. |
10 | | -* Major refactor of word-level recognition, beam search, eliminating dead code. |
11 | | -* Refactored classifier to make it easier to add new ones. |
12 | | -* Generalized feature extractor to allow feature extraction from greyscale. |
13 | | -* Improved sub/superscript treatment. |
14 | | -* Improved baseline fit. |
15 | | -* Added set_unicharset_properties to training tools. |
16 | | -* Many bug fixes. |
17 | | -* More training source data included. |
18 | | - |
| 2 | + * Added new training tool text2image to generate box/tif file pairs from |
| 3 | + text and truetype fonts. |
| 4 | + * Added support for PDF output with searchable text. |
| 5 | + * Removed entire IMAGE class and all code in image directory. |
| 6 | + * Tesseract executable: support for output to stdout; limited support for one |
| 7 | + page images from stdin (especially on Windows) |
| 8 | + * Added Renderer to API to allow document-level processing and output |
| 9 | + of document formats, like hOCR, PDF. |
| 10 | + * Major refactor of word-level recognition, beam search, eliminating dead code. |
| 11 | + * Refactored classifier to make it easier to add new ones. |
| 12 | + * Generalized feature extractor to allow feature extraction from greyscale. |
| 13 | + * Improved sub/superscript treatment. |
| 14 | + * Improved baseline fit. |
| 15 | + * Added set_unicharset_properties to training tools. |
| 16 | + * Many bug fixes. |
| 17 | + * More training source data included. |
19 | 18 | |
20 | 19 | 2012-02-01 - v3.02 |
21 | 20 | * Moved ResultIterator/PageIterator to ccmain. |

107 | 106 | 301, among others) |
108 | 107 | |
109 | 108 | June 2006 - V1.0 of open source Tesseract checked-in. |
| 109 | + |
110 | 110 | Sep 7 2006 - V1.01. |
111 | | - Added mfcpch.cpp and getopt.cpp for VC++. |
112 | | - Fixed problem with greyscale images and no libtiff. |
113 | | - Stopped debug window from being used for the usage output. |
114 | | - Fixed load of inttemp for big-endian architectures. |
115 | | - Fixed some Mac compilation issues. |
| 111 | + * Added mfcpch.cpp and getopt.cpp for VC++. |
| 112 | + * Fixed problem with greyscale images and no libtiff. |
| 113 | + * Stopped debug window from being used for the usage output. |
| 114 | + * Fixed load of inttemp for big-endian architectures. |
| 115 | + * Fixed some Mac compilation issues. |
| 116 | + |
116 | 117 | Oct 4 2006 - V1.02 |
117 | | - Removed dependency on Aspirin. |
118 | | - Fixed a few missing Apache license headers. |
119 | | - Removed $log. |
| 118 | + * Removed dependency on Aspirin. |
| 119 | + * Fixed a few missing Apache license headers. |
| 120 | + * Removed $log. |
| 121 | + |
120 | 122 | Feb 2 2007 - V1.03 |
121 | | - Added mftraining and cntraining. |
122 | | - Added baseapi with adaptive thresholding for grey and color. |
123 | | - Fixed many memory leaks. |
124 | | - Fixed several bugs including lack of use of adaptive classifier. |
125 | | - Added ifdefs to eliminate graphics code and add embedded platform support. |
126 | | - Incorporated several patches, including 64-bit builds, Mac builds. |
127 | | - Minor accuracy improvements. |
| 123 | + * Added mftraining and cntraining. |
| 124 | + * Added baseapi with adaptive thresholding for grey and color. |
| 125 | + * Fixed many memory leaks. |
| 126 | + * Fixed several bugs including lack of use of adaptive classifier. |
| 127 | + * Added ifdefs to eliminate graphics code and add embedded platform support. |
| 128 | + * Incorporated several patches, including 64-bit builds, Mac builds. |
| 129 | + * Minor accuracy improvements. |
| 130 | + |
128 | 131 | May 15 2007 - V1.04 |
129 | | - Added dll exports for Windows. |
130 | | - Fixed name collisions with stl etc. |
131 | | - Made some preliminary changes ready for unicodeization. |
132 | | - Several bug fixes discovered during unicodeization. |
| 132 | + * Added dll exports for Windows. |
| 133 | + * Fixed name collisions with stl etc. |
| 134 | + * Made some preliminary changes ready for unicodeization. |
| 135 | + * Several bug fixes discovered during unicodeization. |
| 136 | + |
133 | 137 | July 02 2007 - V2.00 |
134 | | - Converted internal character handling to UTF8. |
135 | | - Trained with 6 languages. |
136 | | - Added unicharset_extractor, wordlist2dawg. |
137 | | - Added boxfile creation mode. |
138 | | - Added UNLV regression test capability. |
139 | | - Fixed problems with copyright and registered symbols. |
140 | | - Fixed extern "C" declarations problem. |
| 138 | + * Converted internal character handling to UTF8. |
| 139 | + * Trained with 6 languages. |
| 140 | + * Added unicharset_extractor, wordlist2dawg. |
| 141 | + * Added boxfile creation mode. |
| 142 | + * Added UNLV regression test capability. |
| 143 | + * Fixed problems with copyright and registered symbols. |
| 144 | + * Fixed extern "C" declarations problem. |
| 145 | + |
141 | 146 | August 27 2007 - V2.01 |
142 | | - Fixed UTF8 input problems with box file reader. |
143 | | - Fixed various infinite loops and crashes in dawg code. |
144 | | - Removed include of config_auto.h from host.h. |
145 | | - Added automatic wctype encoding to unicharset_extractor. |
146 | | - Fixed dawg table too full error. |
147 | | - Removed svn files from tarball. |
148 | | - Added new functions to tessdll. |
149 | | - Increased maximum utf8 string in a classification result to 8. |
| 147 | + * Fixed UTF8 input problems with box file reader. |
| 148 | + * Fixed various infinite loops and crashes in dawg code. |
| 149 | + * Removed include of config_auto.h from host.h. |
| 150 | + * Added automatic wctype encoding to unicharset_extractor. |
| 151 | + * Fixed dawg table too full error. |
| 152 | + * Removed svn files from tarball. |
| 153 | + * Added new functions to tessdll. |
| 154 | + * Increased maximum utf8 string in a classification result to 8. |
150 | 155 | |
151 | 156 | January 23 2008 - V2.02 |
152 | | - Improvements to clustering, training and classifier. |
153 | | - Major internationalization improvements for large-character-set |
154 | | - languages, eg Kannada. |
155 | | - Removed some compiler warnings. |
156 | | - Added multipage tiff support for training and running. |
157 | | - Updated graphics output to talk to new java-based viewer. |
158 | | - Added ability to save n-best lists. |
159 | | - Added leptonica support for more file types. |
160 | | - Improved Init/End to make them safe. |
161 | | - Reduced memory use of dictionaries. |
162 | | - Added some new APIs to TessBaseAPI. |
| 157 | + * Improvements to clustering, training and classifier. |
| 158 | + * Major internationalization improvements for large-character-set |
| 159 | + * languages, eg Kannada. |
| 160 | + * Removed some compiler warnings. |
| 161 | + * Added multipage tiff support for training and running. |
| 162 | + * Updated graphics output to talk to new java-based viewer. |
| 163 | + * Added ability to save n-best lists. |
| 164 | + * Added leptonica support for more file types. |
| 165 | + * Improved Init/End to make them safe. |
| 166 | + * Reduced memory use of dictionaries. |
| 167 | + * Added some new APIs to TessBaseAPI. |
| 168 | + |
163 | 169 | April 21 2008 - V2.02 (again) |
164 | | - Fixed namespace collisions with jpeg library (INT32). |
165 | | - Portability fixes for Windows for new code. |
166 | | - Updates to autoconf system for new code. |
| 170 | + * Fixed namespace collisions with jpeg library (INT32). |
| 171 | + * Portability fixes for Windows for new code. |
| 172 | + * Updates to autoconf system for new code. |
| 173 | + |
167 | 174 | April 22 2008 - V2.03 |
168 | | - Fixed crash introduced in 2.02. |
169 | | - Fixed lack of tessembedded.cpp in distribution. |
170 | | - Added test for leptonica header files and conditional test for lib. |
| 175 | + * Fixed crash introduced in 2.02. |
| 176 | + * Fixed lack of tessembedded.cpp in distribution. |
| 177 | + * Added test for leptonica header files and conditional test for lib. |
| 178 | + |
171 | 179 | June 30 2009 - V2.04 |
172 | | - Integrated bug fixes and patches and misc changes for portability. |
173 | | - Integrated a patch to remove some of the "access" macros. |
174 | | - Removed dependence on lua from the viewer, speeding it up |
175 | | - dramatically. |
176 | | - Fixed the viewer so it compiles and runs properly! |
177 | | - Specifically fixing issues: 1, 63, 67, 71, 76, 81, 82, 106, 111, |
178 | | - 112, 128, 129, 130, 133, 135, 142, 143, 145, 147, 153, 154, 160, |
179 | | - 165, 170, 175, 177, 187, 192, 195, 199, 201, 205, 209, 108, 169 |
| 180 | + * Integrated bug fixes and patches and misc changes for portability. |
| 181 | + * Integrated a patch to remove some of the "access" macros. |
| 182 | + * Removed dependence on lua from the viewer, speeding it up |
| 183 | + dramatically. |
| 184 | + * Fixed the viewer so it compiles and runs properly! |
| 185 | + * Specifically fixing issues: 1, 63, 67, 71, 76, 81, 82, 106, 111, |
| 186 | + 112, 128, 129, 130, 133, 135, 142, 143, 145, 147, 153, 154, 160, |
| 187 | + 165, 170, 175, 177, 187, 192, 195, 199, 201, 205, 209, 108, 169 |