Hi,
we're using Aspose.PDF [.NET] (10.4.0.0) to convert different kind of files to PDF-A.
Currently we're facing problems with some scanned PDF documents.
After the conversion to PDF-A the OCR information is somehow clinched.
Please find below a simplified code sample to reproduce the problem:
using (Stream inputStream = new FileStream("SampleDocument.pdf", FileMode.Open))
{
using (Document document = new Document(inputStream))
{
using (Stream outputLogStream = new MemoryStream())
{
document.Convert(outputLogStream, PdfFormat.PDF_A_3B, ConvertErrorAction.Delete);
}
document.Save("SampleDocumentAsPdfA.pdf");
}
}
I've attached a sample document, it's outcoming PDF-A version and some screenshots of the problem to this post.
Could you provide us a solution for our problem?
Best regards,
Julian