I wrote a script using RevoScaleR and dplyrXdf, to my surprise when using HDInsights (Microsoft Azure managed Spark cluster service) I get an installation of R 3.3.3 and I can't install dplyrXdf, neither is the package in the repository nor can I install from git using devtools, I managed to get it installed once updating every single dependence from it's respective github repository but this is madness, took me hours... The biggest issue seems to be dplyr 0.5 which is the latest avaiable package for this service (current CRAN package is 0.7.4) Am I doing something wrong? maybe something in provisioning (like selecting the wrong type of cluster)? I can not believe MS would put so much work in R and not update it's cluster service, I must be missing something here.
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.