
How to set up a local development environment for PySpark ETL to run in AWS Glue?

PyCharm Professional supports connecting to, deploying to, and remotely debugging an AWS Glue development endpoint ( https://docs.aws.amazon.com/glue/latest/dg/dev-endpoint-tutorial-pycharm.html ), but I can't figure out how to use VS Code (my code editor of choice) for this purpose. Does VS Code support any of these features? Or is there another free alternative to PyCharm Professional with the same capabilities?

I have not used PyCharm, but I have set up a local development endpoint with Zeppelin for developing and testing my Glue jobs. Please see my related posts and references for setting up a local development endpoint. You could try that setup and, if it is useful, try using PyCharm in place of Zeppelin.

References: "Is it possible to use Jupyter Notebook for AWS Glue instead of Zeppelin" and the Stack Overflow discussions on setting up a local Zeppelin development endpoint.
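For readers who want to try the local development endpoint approach from any editor, here is a minimal sketch of the SSH port-forwarding step it relies on. It only builds the command and does not run it. The forwarding target `169.254.76.1:9007` is what I recall from the AWS tutorial on local Zeppelin notebooks, so treat it as an assumption and check it against the current documentation. The helper name `build_tunnel_command`, the key path, and the endpoint hostname are hypothetical placeholders.

```python
import os
import shlex


def build_tunnel_command(key_path, endpoint_dns, local_port=9007,
                         remote_host="169.254.76.1", remote_port=9007,
                         user="glue"):
    """Build (but do not run) an ssh command that forwards a local port
    to a Glue development endpoint.

    The default remote host and port are assumptions taken from memory of
    the AWS local Zeppelin tutorial; verify them before relying on this.
    """
    forward = f"{local_port}:{remote_host}:{remote_port}"
    return [
        "ssh",
        "-i", os.path.expanduser(key_path),  # private key registered with the endpoint
        "-NTL", forward,                      # no remote command, no TTY, local forward
        f"{user}@{endpoint_dns}",
    ]


if __name__ == "__main__":
    # Hypothetical values; substitute your own key file and endpoint address.
    cmd = build_tunnel_command("/path/to/private-key.pem",
                               "dev-endpoint-public-dns.example")
    print(shlex.join(cmd))
```

Once the tunnel is up, a local notebook or interpreter can be pointed at the forwarded port. Whether VS Code can attach a debugger through such a tunnel is not something the answer above confirms.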


 