简体繁体 English

如何使用 ITextSharp 验证 pdf 是基于文本的？

[英]How to verify that pdf is text based using ITextSharp?

原文 2011-06-11 17:56:25 0 1 c#/ pdf/ itextsharp/ itext

I need to verify that the pdf report is text based (and not bitmap based; however it could contain some images).我需要验证 pdf 报告是基于文本的（而不是基于 bitmap；但是它可能包含一些图像）。 I do not need to extract the text, just to verify that it is text based.我不需要提取文本，只是为了验证它是基于文本的。

Is there a way how to perform such a verification using ITextSharp library?有没有办法如何使用 ITextSharp 库执行这样的验证？

Thanks in advance,提前致谢，

Stefan斯特凡

1 个解决方案

You can look for text drawing commands easily enough.您可以很容易地查找文本绘图命令。 The least work on your part would be to try to extract the text and see if anything is there.您要做的最少的工作是尝试提取文本并查看是否有任何内容。 Ideally you'd know some of the text it should contain and search for it.理想情况下，您应该知道它应该包含的一些文本并搜索它。 A single sentence or phrase would be plenty for this sort of testing.对于这种测试，一个句子或短语就足够了。

Text extraction with iText is pretty trivial these days.如今，使用 iText 进行文本提取非常简单。 Lots of examples floating around SO, and the web. SO和web周围有很多例子。

如何使用itextsharp库从pdf复制仅带阴影的文本？ - How to copy only hilighted text from pdf using itextsharp library?

如何使用iTextSharp从PDF中提取“标记为编辑”的文本？ - How to extract text 'marked for redaction' from a PDF using iTextSharp?

如何使用ITEXTSHARP将动态文本添加到PDF工具栏 - How to add dynamic text to PDF toolbar using ITEXTSHARP

如何使用 itextsharp 在 pdf 文件中的文本框控件中底部对齐文本 - How to bottom align text in a textbox control in pdf file using itextsharp

我们如何使用带空格的itextsharp从pdf中提取文本？ - how can we extract text from pdf using itextsharp with spaces?

iTextSharp - 使用 C# 如何将文本放置到 PDF - iTextSharp - Using C# How to Place Text Onto PDF

如何使用 iTextSharp 在 PDF 中显示 ✔？ - How to display ✔ in PDF using iTextSharp?

如何使用ITextSharp保存PDF？ - How to save PDF using ITextSharp?

使用iTextSharp验证PDF是否受到保护/保护 - Verify if a PDF is secured/protected with iTextSharp

使用 iTextsharp 从 PDF 中提取乌尔都语文本 - Extracting Urdu Text from PDF using iTextsharp

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 如何使用itextsharp库从pdf复制仅带阴影的文本？ - How to copy only hilighted text from pdf using itextsharp library? 如何使用iTextSharp从PDF中提取“标记为编辑”的文本？ - How to extract text 'marked for redaction' from a PDF using iTextSharp? 如何使用ITEXTSHARP将动态文本添加到PDF工具栏 - How to add dynamic text to PDF toolbar using ITEXTSHARP 如何使用 itextsharp 在 pdf 文件中的文本框控件中底部对齐文本 - How to bottom align text in a textbox control in pdf file using itextsharp 我们如何使用带空格的itextsharp从pdf中提取文本？ - how can we extract text from pdf using itextsharp with spaces? iTextSharp - 使用 C# 如何将文本放置到 PDF - iTextSharp - Using C# How to Place Text Onto PDF 如何使用 iTextSharp 在 PDF 中显示 ✔？ - How to display ✔ in PDF using iTextSharp? 如何使用ITextSharp保存PDF？ - How to save PDF using ITextSharp? 使用iTextSharp验证PDF是否受到保护/保护 - Verify if a PDF is secured/protected with iTextSharp 使用 iTextsharp 从 PDF 中提取乌尔都语文本 - Extracting Urdu Text from PDF using iTextsharp

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM