[英]Migrating A Java Application to Hadoop : Architecture/Design Roadblocks?
Alrite.. so.. here's a situation: I am responsible for architect-ing the migration of an ETL software (EAI, rather) that is java-based. Alrite.. 所以.. 这是一种情况:我负责构建基于 Java 的 ETL 软件(更确切地说是 EAI)的迁移。 I'll have to migrate this to Hadoop (the apache version).我必须将它迁移到 Hadoop(apache 版本)。 Now, technically this is more like a reboot and not a migration - coz I've got no database to migrate.现在,从技术上讲,这更像是重新启动而不是迁移 - 因为我没有要迁移的数据库。 This is about leveraging Hadoop, such that, the Transformation phase (of 'ETL') is parallel-iz-ed.这是关于利用 Hadoop,这样,转换阶段('ETL')是并行化的。 This would make my ETL software,这将使我的 ETL 软件,
I've tested this configuration out - changed my transformation algos into a mapreduce model, tested it out on a high end Hadoop cluster and bench-marked the performance.我已经对此配置进行了测试 - 将我的转换算法更改为 mapreduce model,在高端 Hadoop 集群上对其进行了测试,并对性能进行了基准测试。 Now, I'm trying to understand and document all those things that could stand in the way of this application redesign/ rearch / migration.现在,我正在尝试理解和记录所有可能阻碍此应用程序重新设计/研究/迁移的事情。 Here's a few I could think of:这里有几个我能想到的:
I look forward to hearing from you with possible answers to above questions and more questions/facts I'd need to consider, based on your experiences with Hadoop / problem analysis.根据您对 Hadoop/问题分析的经验,我期待收到您对上述问题的可能答案以及我需要考虑的更多问题/事实。 Like always, I appreciate your help and thank ya all in advance.像往常一样,我感谢您的帮助,并提前感谢大家。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.