Detecting the orientation of a text document image in .NET

VintaSoft Imaging .NET SDK and VintaSoft Document Cleanup .NET Plug-in provide commands for processing and cleaning up document images.
The GetTextOrientationCommand command detects the orientation of images that contain Latin text rotated by 90, 180 or 270 degrees.
GetTextOrientationCommand is not suitable for processing:

- images that contain no text
- images that contain non-Latin text
- images that contain Latin text in uppercase only
- images rotated by an angle other than 0, 90, 180 or 270 degrees (use the DeskewCommand command to straighten the image before detecting its orientation)
- images that contain only a few words (not enough to determine the image orientation)

VintaSoft Imaging .NET SDK and VintaSoft OCR .NET Plug-in provide the GetTesseractOcrTextOrientationCommand command, which detects the orientation of a document image using the Tesseract OCR engine. GetTesseractOcrTextOrientationCommand can detect the orientation of any image that contains text, without the limitations of GetTextOrientationCommand, but it runs up to 5 times slower than GetTextOrientationCommand.

Choose GetTextOrientationCommand or GetTesseractOcrTextOrientationCommand according to the type of input document images.

The two commands can also be used together, one after the other:

1. Detect the image orientation using GetTextOrientationCommand (faster, and correct for most documents that contain Latin text).
2. If GetTextOrientationCommand fails to detect the orientation of the document image, detect it using GetTesseractOcrTextOrientationCommand (slower, but correct for almost all documents).

Combining the two commands in this way keeps detection fast for most document images while preserving detection quality for the rest.
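To make the speed argument concrete, here is a rough cost estimate, offered as a sketch rather than a measurement. It assumes the slow command costs 5 times as much as the fast one (the upper bound quoted above) and that the fast command succeeds on some fraction of pages. The class name, method name and the 90% success rate are hypothetical and are not part of the VintaSoft API.

```csharp
using System;

public static class OrientationCostEstimate
{
    // Expected per-page cost of the combined strategy, expressed in units of
    // one GetTextOrientationCommand run. The fast command always runs; the
    // slow command runs only on the fraction of pages where the fast one fails.
    public static double CombinedCost(double fastSuccessRate, double slowToFastRatio)
    {
        if (fastSuccessRate < 0.0 || fastSuccessRate > 1.0)
            throw new ArgumentOutOfRangeException(nameof(fastSuccessRate));
        if (slowToFastRatio < 0.0)
            throw new ArgumentOutOfRangeException(nameof(slowToFastRatio));

        return 1.0 + (1.0 - fastSuccessRate) * slowToFastRatio;
    }

    public static void Main()
    {
        // Assumption: the slow command is 5 times slower (upper bound from the text).
        double slowToFastRatio = 5.0;
        // Assumption: hypothetical success rate of the fast command.
        double fastSuccessRate = 0.9;

        double combined = CombinedCost(fastSuccessRate, slowToFastRatio);
        Console.WriteLine("Combined cost per page: {0:F2} fast-command units", combined);
        Console.WriteLine("Tesseract-only cost per page: {0:F2} fast-command units", slowToFastRatio);
    }
}
```

Under these assumptions the combined strategy costs about 1.5 units per page, against 5 units when only Tesseract is used. The advantage shrinks as the fast command fails more often, and disappears if it fails on at least 80% of pages (for example, for a collection of mostly non-Latin documents).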

The following C# code shows how to detect the orientation of a document image using GetTextOrientationCommand:

/// <summary>
/// Detects the orientation of each page of a document image using statistics for Latin symbols
/// and writes the result to the console.
/// </summary>
/// <param name="filename">The path to a file with document image.</param>
public static void GetDocumentImageOrientationUsingLatinSymbolStat(string filename)
{
    // create an image collection
    using (Vintasoft.Imaging.ImageCollection images = new Vintasoft.Imaging.ImageCollection())
    {
        // add images from file to the image collection
        images.Add(filename);

        // create an instance of GetTextOrientationCommand class
        Vintasoft.Imaging.ImageProcessing.Info.GetTextOrientationCommand getTextOrientationCommand1 =
            new Vintasoft.Imaging.ImageProcessing.Info.GetTextOrientationCommand();

        // for each image in image collection
        for (int i = 0; i < images.Count; i++)
        {
            // get image
            Vintasoft.Imaging.VintasoftImage image = images[i];

            // determine orientation of document image using statistics for Latin symbols
            getTextOrientationCommand1.ExecuteInPlace(image);

            // write result to the console
            System.Console.WriteLine(string.Format("Filename: {0}, page: {1}, page orientation: {2}, confidence: {3}",
                System.IO.Path.GetFileName(filename),
                i,
                getTextOrientationCommand1.Orientation,
                getTextOrientationCommand1.Confidence));
        }

        // free images
        images.ClearAndDisposeItems();
    }
}

The following C# code shows how to detect the orientation of a document image using GetTesseractOcrTextOrientationCommand:

/// <summary>
/// Detects the orientation of each page of a document image using Tesseract OCR
/// and writes the result to the console.
/// </summary>
/// <param name="filename">The path to a file with document image.</param>
/// <param name="tesseractOcrDllDirectory">A path to a directory, where Tesseract5.Vintasoft.xXX.dll files are located.</param>
public static void GetDocumentImageOrientationUsingTesseractOCR(string filename, string tesseractOcrDllDirectory)
{
    // create an image collection
    using (Vintasoft.Imaging.ImageCollection images = new Vintasoft.Imaging.ImageCollection())
    {
        // add images from file to the image collection
        images.Add(filename);

        // create an instance of GetTesseractOcrTextOrientationCommand class
        using (Vintasoft.Imaging.ImageProcessing.Ocr.Tesseract.GetTesseractOcrTextOrientationCommand getTextOrientationCommand =
            new Vintasoft.Imaging.ImageProcessing.Ocr.Tesseract.GetTesseractOcrTextOrientationCommand())
        {
            // specify path to a directory, where Tesseract5.Vintasoft.xXX.dll files are located
            getTextOrientationCommand.TesseractOcrDllDirectory = tesseractOcrDllDirectory;

            // for each image in image collection
            for (int i = 0; i < images.Count; i++)
            {
                // get image
                Vintasoft.Imaging.VintasoftImage image = images[i];

                // determine orientation of document image using Tesseract OCR
                getTextOrientationCommand.ExecuteInPlace(image);

                // write result to the console
                System.Console.WriteLine(string.Format("Filename: {0}, page: {1}, page orientation: {2}",
                    System.IO.Path.GetFileName(filename),
                    i,
                    getTextOrientationCommand.Orientation));
            }
        }

        // free images
        images.ClearAndDisposeItems();
    }
}

The following C# code shows how to detect the orientation of a document image using both GetTextOrientationCommand and GetTesseractOcrTextOrientationCommand:

/// <summary>
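/// Detects the orientation of each page of a document image using statistics for Latin symbols,
/// falls back to Tesseract OCR when the result is undefined or has low confidence,
/// and writes the result to the console.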
/// </summary>
/// <param name="filename">The path to a file with document image.</param>
/// <param name="tesseractOcrDllDirectory">A path to a directory, where Tesseract5.Vintasoft.xXX.dll files are located.</param>
public static void GetDocumentImageOrientationUsingLatinSymbolStatAndOcrTesseract(string filename, string tesseractOcrDllDirectory)
{
    // create an image collection
    using (Vintasoft.Imaging.ImageCollection images = new Vintasoft.Imaging.ImageCollection())
    {
        // add images from file to the image collection
        images.Add(filename);

        // create an instance of GetTextOrientationCommand class
        Vintasoft.Imaging.ImageProcessing.Info.GetTextOrientationCommand getTextOrientationCommand1 =
            new Vintasoft.Imaging.ImageProcessing.Info.GetTextOrientationCommand();

        // create an instance of GetTesseractOcrTextOrientationCommand class
        using (Vintasoft.Imaging.ImageProcessing.Ocr.Tesseract.GetTesseractOcrTextOrientationCommand getTextOrientationCommand2 =
            new Vintasoft.Imaging.ImageProcessing.Ocr.Tesseract.GetTesseractOcrTextOrientationCommand())
        {
            // specify path to a directory, where Tesseract5.Vintasoft.xXX.dll files are located
            getTextOrientationCommand2.TesseractOcrDllDirectory = tesseractOcrDllDirectory;

            // for each image in image collection
            for (int i = 0; i < images.Count; i++)
            {
                // get image
                Vintasoft.Imaging.VintasoftImage image = images[i];

                // determine orientation of document image using statistics for Latin symbols (works for Latin text only)
                getTextOrientationCommand1.ExecuteInPlace(image);
                // if orientation is detected and orientation result has high confidence
                if (getTextOrientationCommand1.Orientation != Vintasoft.Imaging.ImageProcessing.Info.ImageOrthogonalOrientation.Undefined &&
                    getTextOrientationCommand1.Confidence > 0.3)
                {
                    // write result to the console
                    System.Console.WriteLine(string.Format("Filename: {0}, page: {1}, page orientation: {2}, confidence: {3}",
                        System.IO.Path.GetFileName(filename),
                        i,
                        getTextOrientationCommand1.Orientation,
                        getTextOrientationCommand1.Confidence));
                }
                // if orientation is not detected or orientation result has low confidence
                else
                {
                    // determine orientation of document image using Tesseract OCR (works for any text)
                    getTextOrientationCommand2.ExecuteInPlace(image);

                    // write result to the console
                    System.Console.WriteLine(string.Format("Filename: {0}, page: {1}, page orientation: {2}, confidence: {3}",
                        System.IO.Path.GetFileName(filename),
                        i,
                        getTextOrientationCommand2.Orientation,
                        getTextOrientationCommand2.Confidence));
                }
            }
        }

        // free images
        images.ClearAndDisposeItems();
    }
}