Added support for .NET 8.0 in Windows, Linux and macOS.
The used Tesseract OCR engine has been updated to version 5.3.3.
Now all text blocks received from the image segmentation command are marked as blocks of RecognizeSingleColumn type. Previously, these blocks were marked as blocks of RecognizeSingleBlocks type. This change increased the quality of text recognition for complex text and did not reduce the overall performance of text recognition.
Added the compatibility support for Visual Studio 2022.
Supported operation systems:
Added the compatibility support for OS Windows 11.
Discontinued the compatibility support for OS Windows Server 2003.
The used Tesseract OCR engine has been updated to version 5.0. Our tests have shown that Tesseract OCR 5 and Tesseract OCR 4 provide similar text recognition results but Tesseract OCR 5 is up to 2 times faster than Tesseract OCR 4.
Added the ability to convert OcrPage object to a TextRegion object (OcrDocument.Create and OcrPage.Create methods).
Added new functionality to the OCR Demo application:
Added the ability to load OCR results as text from PDF document.
Fixed several minor bugs.
Improved code of ASP.NET OCR Demo (ASP.NET Core Angular OCR Demo, ASP.NET MVC OCR Demo, ASP.NET WebForms OCR Demo) and now the demo allows to:
view document before text recognition
preprocess document pages before text recognition
recognize text from the whole document, separate page or page region.