
AWS Glue python shell - Using multiple libraries

I am using an AWS Glue Python shell job. The program uses multiple Python libraries which are not natively available in AWS Glue. Glue can take .egg or .whl files as external library references. All we need to do is put these .egg or .whl files in some S3 location and point to them using their full path. I tried with one external library (for instance, openpyxl) and it worked. Now the problem is that I have multiple external libraries to reference, such as pandas, numpy, openpyxl and pytz, and I can't give the full path of all these packages, as only one path can be specified as the external Python library reference. I tried giving the S3 folder name where I placed all these packages, but it does not work.

How can I specify these multiple .egg or .whl files so that my Glue job can use them?

  • Save your libraries in .whl format in an S3 location.
  • Add the libraries to "Python library path" as a comma-separated list of full paths, for example: s3://libs/xyz.whl, s3://libs/abc.whl
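If the job is created from code rather than the console, the same comma-separated value can be built programmatically. The sketch below is only an illustration, assuming that the console's "Python library path" field corresponds to the `--extra-py-files` job parameter. The helper names, the job name, the IAM role and the script location are hypothetical, and the `s3://libs/...` paths are the placeholder paths from the answer above. The boto3 call is wrapped in a function and is never invoked here, so the snippet runs without AWS credentials.

```python
def build_extra_py_files(paths):
    """Join full S3 paths to .whl/.egg files into one comma-separated string."""
    cleaned = [p.strip() for p in paths if p.strip()]
    for p in cleaned:
        # A folder such as s3://libs/ is rejected: each entry must name a file.
        if not p.startswith("s3://") or not p.endswith((".whl", ".egg")):
            raise ValueError(f"expected a full s3:// path to a .whl or .egg file, got {p!r}")
    return ",".join(cleaned)


def build_default_arguments(paths):
    """Build the job arguments dict (assumes --extra-py-files is the right key)."""
    return {"--extra-py-files": build_extra_py_files(paths)}


def create_python_shell_job(paths):
    """Hypothetical job creation; name, role and script location are placeholders."""
    import boto3  # imported lazily so the helpers above work without boto3

    glue = boto3.client("glue")
    return glue.create_job(
        Name="my-python-shell-job",            # hypothetical
        Role="MyGlueServiceRole",              # hypothetical
        Command={
            "Name": "pythonshell",
            "ScriptLocation": "s3://my-bucket/scripts/job.py",  # hypothetical
            "PythonVersion": "3",
        },
        DefaultArguments=build_default_arguments(paths),
    )


libs = ["s3://libs/xyz.whl", "s3://libs/abc.whl"]
print(build_default_arguments(libs))
```

The helper strips surrounding whitespace from each path, so the joined value contains no spaces after the commas, and it raises an error for a bare folder path, which is the case the question reports as not working.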


This question was already answered by gbeaven, but for some reason I am unable to mark it as the answer. The problem was fixed by comma-separating the file paths in the additional Python modules.
