This page provides you the information about development history of VintaSoft OCR .NET Plug-in.
The information about the Plug-in API history can be obtained from documentation.
Added support for .NET 9 in Windows, Linux and macOS.
The used Tesseract OCR engine has been updated to version 5.5.0.
The Vintasoft.Imaging.Ocr.ML assembly now uses Microsoft.ML 4.0.0.
Demo applications:
Web demo-application "OCR Demo" now allows to recognize text in all loaded images, save text recognition result to a searchable PDF document, open created searchable PDF document with ability to select and search text.
Added support for .NET 8.0 in Windows, Linux and macOS.
The used Tesseract OCR engine has been updated to version 5.3.3.
Now all text blocks received from the image segmentation command are marked as blocks of RecognizeSingleColumn type. Previously, these blocks were marked as blocks of RecognizeSingleBlocks type. This change increased the quality of text recognition for complex text and did not reduce the overall performance of text recognition.
Added the compatibility support for Visual Studio 2022.
Supported operation systems:
Added the compatibility support for OS Windows 11.
Discontinued the compatibility support for OS Windows Server 2003.
The used Tesseract OCR engine has been updated to version 5.0. Our tests have shown that Tesseract OCR 5 and Tesseract OCR 4 provide similar text recognition results but Tesseract OCR 5 is up to 2 times faster than Tesseract OCR 4.
Added the ability to convert OcrPage object to a TextRegion object (OcrDocument.Create and OcrPage.Create methods).
Demo applications:
Added new functionality to the OCR Demo application:
Added the ability to load OCR results as text from PDF document.
Fixed several minor bugs.
Web development:
Demo applications:
Improved code of ASP.NET OCR Demo (ASP.NET Core Angular OCR Demo, ASP.NET MVC OCR Demo, ASP.NET WebForms OCR Demo) and now the demo allows to:
view document before text recognition
preprocess document pages before text recognition
recognize text from the whole document, separate page or page region.
The used Tesseract OCR engine has been updated to version 4.0:
improved the quality and performance of text recognition
extended the list of supported languages for recognition
Added the ability to recognize text in several languages using the functionality of Tesseract OCR engine. Previous versions allowed to recognize text in several languages using the SDK functionality.
OCR demo application has been added the ability to select several languages for text recognition.
Added the ability to specify the ortogonal rotation of text region before the text recognition. In previous versions all text was recognized as non-rotated.