Hi!
Is it possible to convert PDF to HTML?
Search found 11 matches
- Mon Oct 12, 2020 2:35 pm
- Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
- Topic: PDF to HTML
- Replies: 1
- Views: 1903
- Mon Sep 02, 2019 1:46 pm
- Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
- Topic: Replace objects text of OcrObject
- Replies: 3
- Views: 2203
Re: Replace objects text of OcrObject
I couldn't find the solution in the documentation, so I am writing this for anyone who reads this in the future.
To replace the text, you must call SetObjectText and cast the OcrObject to OcrTextObject:
ocrResultsEditor.SetObjectText((OcrTextObject)ocrObject, "newText");
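For the two misreads mentioned in this topic ("(@" instead of "@", and "|" instead of "i"), the replacement string could be computed before it is passed to SetObjectText. Below is only a minimal sketch under my own assumptions: the OcrTextCleanup class and the FixCommonOcrErrors helper are hypothetical names, they are not part of the VintaSoft SDK, and the rules are guesses that may not suit other documents. How the current text is read from the OCR object is not shown in this topic, so the sketch works on plain strings only.

```csharp
using System;
using System.Text.RegularExpressions;

// Hypothetical helper, NOT part of the VintaSoft SDK.
static class OcrTextCleanup
{
    public static string FixCommonOcrErrors(string text)
    {
        // Assumption: a "(" directly before "@" is an OCR artifact,
        // e.g. "xxxx(@xxx.se" should be "xxxx@xxx.se".
        string result = text.Replace("(@", "@");

        // Assumption: a "|" that touches a letter on either side is a misread "i".
        // A "|" surrounded by spaces or digits is left unchanged.
        result = Regex.Replace(result, @"(?<=\p{L})\||\|(?=\p{L})", "i");

        return result;
    }

    static void Main()
    {
        Console.WriteLine(FixCommonOcrErrors("xxxx(@xxx.se")); // prints "xxxx@xxx.se"
        Console.WriteLine(FixCommonOcrErrors("f|le"));         // prints "file"
    }
}
```

The corrected string would then take the place of "newText" in the SetObjectText call above. The rules are deliberately narrow, because a broad replacement of "|" would also damage text where the character is legitimate.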
- Mon Sep 02, 2019 11:57 am
- Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
- Topic: Replace objects text of OcrObject
- Replies: 3
- Views: 2203
Replace objects text of OcrObject
Problem description: Sometimes when we run OCR, the text is not recognized correctly. For example, "xxxx@xxx.se" became "xxxx(@xxx.se", or "i" became "|". Solution: There is OcrResultsEditor.RemoveObjects, which is used to remove objects. We also need to be able to repl...
- Fri Aug 23, 2019 5:51 pm
- Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
- Topic: Run OCR only on non searchable pages
- Replies: 10
- Views: 3920
Re: Run OCR only on non searchable pages
The PdfPage.RemoveText method removes only text and does not remove images. Yes, I think you are right. In some cases we do not have a text + image layer; we have only a text layer. When we run OCR on such PDFs, we get two text layers in the PDF. So I must first convert the page to an image and then run OCR. How can I conv...
- Fri Aug 23, 2019 3:55 pm
- Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
- Topic: Run OCR only on non searchable pages
- Replies: 10
- Views: 3920
Re: Run OCR only on non searchable pages
Hi!
I used PdfPage.RemoveText() to remove the text. The problem is that the image is also removed for some PDFs.
We have two kinds of PDFs: some are already searchable, and the others have been through OCR (image + text layer).
- Wed Jul 31, 2019 10:29 am
- Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
- Topic: Add VintasoftPDF.Net Demo after OCR is done
- Replies: 3
- Views: 1896
Re: Add VintasoftPDF.Net Demo after OCR is done
Hi!
I do not want to disable the VintaSoft watermarks. I am only asking whether it is possible to add them after OCR is done. I am testing different OCR settings to get the best possible results, but the watermarks make that very difficult.
- Tue Jul 30, 2019 6:05 pm
- Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
- Topic: Add VintasoftPDF.Net Demo after OCR is done
- Replies: 3
- Views: 1896
Add VintasoftPDF.Net Demo after OCR is done
Hi! We are evaluating VintaSoft OCR. When we run OCR on a PDF, we get weird symbols/spaces in the PDF. I think it's because VintaSoft adds the "VintasoftPDF.Net Demo" logo to the PDF and the OCR API tries to interpret the logo as text. On some PDFs the "VintasoftPDF.Net Demo" logo is placed vertically, and then the OCR result is really bad. Is...
- Tue Jul 30, 2019 11:11 am
- Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
- Topic: Run OCR only on non searchable pages
- Replies: 10
- Views: 3920
Re: Run OCR only on non searchable pages
Hi!
Thank you!
Is it possible to remove the previous text layer from a PDF?
- Thu Jul 25, 2019 3:22 pm
- Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
- Topic: Run OCR only on non searchable pages
- Replies: 10
- Views: 3920
Re: Run OCR only on non searchable pages
I searched a lot but could not find any sample code that shows how to create a new PDF and run OCR only on the non-searchable pages.
Can you please provide some sample code?
Thank you.
- Thu Jul 25, 2019 3:08 pm
- Forum: VintaSoft Imaging .NET SDK and Plug-ins Discussions
- Topic: Run OCR only on non searchable pages
- Replies: 10
- Views: 3920
Re: Run OCR only on non searchable pages
I have this PDF: https://srv-file2.gofile.io/download/xcUz0M/1.pdf
The PDF contains only an image, but PdfPage.IsImageOnly = false.
Why is that?