[英]Deploy Python app with textract module to Google Cloud Platform
I want to create a Python script that will parse 40.000 PDF files(text and images). 我想创建一个Python脚本,可以解析40.000个PDF文件(文本和图像)。 Since I saw that there is no easy method to check if a page contains images I think I should use textract module.
因为我看到没有简单的方法可以检查页面是否包含图像,所以我认为我应该使用textract模块。
Ideally I would deploy to Google App Engine. 理想情况下,我将部署到Google App Engine。
My question is, for textract I've also installed other packages beside Python to my system. 我的问题是,对于textract,我还安装了Python以外的其他软件包到系统中。 Can I deploy the script(with proper requirements.txt file) on Google Cloud App Engine without problem?
我可以毫无问题地在Google Cloud App Engine上部署脚本(带有正确的requirements.txt文件)吗? or I will to use something else?
还是我会使用其他东西?
It is possible to use App Engine, but only with the Flexible environment and using a custom runtime , which allows you to add non-python dependencies (and also python dependencies not installable via pip
): 可以使用App Engine,但只能在Flexible环境和自定义运行时中使用 ,这允许您添加非python依赖项(以及无法通过
pip
安装的python依赖项):
Custom runtimes allow you to define new runtime environments, which might include additional components like language interpreters or application servers.
自定义运行时允许您定义新的运行时环境,其中可能包括其他组件,例如语言解释器或应用程序服务器。
See also Building Custom Runtimes . 另请参阅构建自定义运行时 。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.