Search found 11 matches

by BenjaminA
Mon Oct 12, 2020 2:35 pm
Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
Topic: PDF to HTML
Replies: 1
Views: 1814

PDF to HTML

Hi!
Is it possible to convert a PDF to HTML?
by BenjaminA
Mon Sep 02, 2019 1:46 pm
Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
Topic: Replace objects text of OcrObject
Replies: 3
Views: 2184

Re: Replace objects text of OcrObject

I couldn't find the solution in the documentation, so I am writing it down for anyone who reads this in the future.

To replace the text, you must use SetObjectText and cast the OcrObject to OcrTextObject.

ocrResultsEditor.SetObjectText((OcrTextObject)ocrObject, "newText");
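
As a rough illustration of how the replacement string could be computed before it is passed to SetObjectText, here is a minimal sketch. The class OcrTextFixer and its Fix method are hypothetical helpers and not part of the VintaSoft SDK. The two rules only cover the misreads mentioned in the original question of this topic ("(@" instead of "@", and "|" instead of "i") and would need tuning for real documents.

```csharp
using System;
using System.Text.RegularExpressions;

// Hypothetical helper, NOT part of the VintaSoft SDK.
// It only cleans up a recognized string; it does not touch OCR results itself.
public static class OcrTextFixer
{
    public static string Fix(string text)
    {
        if (text == null)
            throw new ArgumentNullException(nameof(text));

        // Rule 1: a stray "(" recognized directly before "@" is dropped.
        string result = text.Replace("(@", "@");

        // Rule 2: a vertical bar that touches a letter on at least one side
        // is treated as a misread "i". A free-standing bar is left alone.
        result = Regex.Replace(result, @"(?<=\p{L})\||\|(?=\p{L})", "i");

        return result;
    }
}
```

The corrected string can then be used in place of the literal, for example ocrResultsEditor.SetObjectText((OcrTextObject)ocrObject, OcrTextFixer.Fix(recognizedText)), where recognizedText stands for the text that was recognized for that object.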
by BenjaminA
Mon Sep 02, 2019 11:57 am
Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
Topic: Replace objects text of OcrObject
Replies: 3
Views: 2184

Replace objects text of OcrObject

Problem description: Sometimes when we run OCR, the recognized text is not very accurate. For example, "xxxx@xxx.se" became "xxxx(@xxx.se", or "i" became "|". Solution: There is OcrResultsEditor.RemoveObjects, which is used to remove objects. We also need to be able to repl...
by BenjaminA
Fri Aug 23, 2019 5:51 pm
Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
Topic: Run OCR only on non searchable pages
Replies: 10
Views: 3901

Re: Run OCR only on non searchable pages

The PdfPage.RemoveText method removes only text and does not remove images. Yes, I think you are right. In some cases we do not have a text + image layer; we have only a text layer. When we run OCR on such PDFs, we get the text twice in the PDF. So I must first convert the page to an image and then run OCR. How can I conv...
by BenjaminA
Fri Aug 23, 2019 3:55 pm
Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
Topic: Run OCR only on non searchable pages
Replies: 10
Views: 3901

Re: Run OCR only on non searchable pages

Hi!
I used PdfPage.RemoveText() to remove the text. The problem is that the image is also removed for some PDFs.

We have two kinds of PDFs: ones that are already searchable, and ones that have already been OCRed (image + text layer).
by BenjaminA
Wed Jul 31, 2019 10:29 am
Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
Topic: Add VintasoftPDF.Net Demo after OCR is done
Replies: 3
Views: 1889

Re: Add VintasoftPDF.Net Demo after OCR is done

Hi!
I do not want to disable the VintaSoft watermarks. I am only asking whether it is possible to add them after OCR is done. I am testing different OCR settings to get the best possible results, but the watermarks make this very difficult.
by BenjaminA
Tue Jul 30, 2019 6:05 pm
Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
Topic: Add VintasoftPDF.Net Demo after OCR is done
Replies: 3
Views: 1889

Add VintasoftPDF.Net Demo after OCR is done

Hi! We are evaluating VintaSoft OCR. When we run OCR on a PDF, we get weird symbols/spaces in the PDF. I think this is because VintaSoft adds the VintasoftPDF.Net Demo logo to the PDF and the OCR API tries to interpret the logo as text. In some PDFs the VintasoftPDF.Net Demo logo is placed vertically, and then the OCR result is really bad. Is...
by BenjaminA
Tue Jul 30, 2019 11:11 am
Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
Topic: Run OCR only on non searchable pages
Replies: 10
Views: 3901

Re: Run OCR only on non searchable pages

Hi!
Thank you!
Is it possible to remove the previous text layer from a PDF?
by BenjaminA
Thu Jul 25, 2019 3:22 pm
Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
Topic: Run OCR only on non searchable pages
Replies: 10
Views: 3901

Re: Run OCR only on non searchable pages

I searched a lot but could not find any sample code that shows how to create a new PDF and run OCR only on the non-searchable pages.
Can you please provide some sample code?
Thank you.
by BenjaminA
Thu Jul 25, 2019 3:08 pm
Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
Topic: Run OCR only on non searchable pages
Replies: 10
Views: 3901

Re: Run OCR only on non searchable pages

I have this PDF: https://srv-file2.gofile.io/download/xcUz0M/1.pdf
The PDF contains only an image, but PdfPage.IsImageOnly = false.
Why is that?