
Removing whitespace: cleaning oddly formatted web data in R

I scraped some data from the web, and when I write it out with write.csv I get huge blank areas in Excel. Here is an example of two values from my data frame:

dat <- data.frame(one="\r\n Something", two="\n\n\n another one")

Would anyone happen to know how to get rid of the whitespace?

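You have two semi-complicated questions here. The first, "would anyone happen to know how to get rid of the whitespace?", is too vague and too broad for me to really help with, beyond suggesting the functions in the stringr package. ¯\_(ツ)_/¯ idk if that helps?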
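For the small `dat` example in the question, a minimal sketch of the whitespace cleanup could look like the following. It uses base R so that it runs without extra packages; `squish` is a hypothetical helper name, and `str_squish()` from stringr does much the same job in a single call.

```r
# The two-column example from the question
dat <- data.frame(one = "\r\n Something", two = "\n\n\n another one",
                  stringsAsFactors = FALSE)

# Hypothetical helper: collapse every run of whitespace (spaces, tabs, \r, \n)
# into a single space, then strip whatever is left at either end
squish <- function(x) trimws(gsub("\\s+", " ", x))

# Apply the helper to every character column of the data frame
is_chr <- vapply(dat, is.character, logical(1))
dat[is_chr] <- lapply(dat[is_chr], squish)

dat$one  # "Something"
dat$two  # "another one"
```

Cleaning the columns before calling write.csv should keep the embedded line breaks out of the cells that Excel displays.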

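The second, "Secondary: can someone help me by showing how to clean the "referee.report" text? It is the column I am most interested in. I specifically want to remove the "\r\n", among other things.", is more tractable.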

referee.report = structure(c("\r\n                                    \r\n                                        DOI: 10.5256/f1000research.6599.r7859\r\n                                    \r\n                                                                                                                                                                                                                        \r\n                                        I have read the revised article by Horrell and D'Orazio. They have responded appropriately to\r\n                                                                                    ... Continue reading\r\n                                                                            \r\n                                    \r\n                                        I have read the revised article by Horrell and D'Orazio. They have responded appropriately to the concerns/questions raised by all 3 reviewers. Accordingly, I recommend indexing the submitted revised article.\r\n                                        \r\n                                                                                            \r\n                                                                                                                I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.                                                                                                     
\r\n                                                                                    \r\n                                        Competing Interests:\r\n                                        No competing interests were disclosed.\r\n                                                                                Close\r\n                                    \r\n                                    \r\n                                        REPORT A CONCERN\r\n                                    \r\n                                ", 
                             "\r\n                                    \r\n                                        DOI: 10.5256/f1000research.6601.r7701\r\n                                    \r\n                                                                                                                                                                                                                        \r\n                                        The revision\r\n                                                                                    ... Continue reading\r\n                                                                            \r\n                                    \r\n                                        The revision is approved\r\n                                        \r\n                                                                                            \r\n                                                                                                                I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.                                                                                                     \r\n                                                                                    \r\n                                        Competing Interests:\r\n                                        No competing interests were disclosed.\r\n                                                                                Close\r\n                                    \r\n                                    \r\n                                        REPORT A CONCERN\r\n                                    \r\n                                "
), .Names = c("http://f1000research.com/articles/3-288/v2", "http://f1000research.com/articles/4-34/v2"
))

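library(stringr)  # provides str_split()

cleanOutput <- function(listObject){
  # Split each element on line breaks; the result is a list with one character vector per element
  pieces = str_split(listObject, "\\r?\\n")
  names(pieces) = names(listObject)
  # Trim leading and trailing whitespace (including \t) from every piece
  pieces = lapply(pieces, trimws)
  # Eliminate empty values and NAs
  pieces = lapply(pieces, function(x) x[!is.na(x) & x != ""])
  return(pieces)
}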

cleanOutput(referee.report)

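Try this function?

Edit:

This version removes \t from the start of each line.

Edit: it turns out str_trim already removes a "\t" at the start of a line, so no edit was needed.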

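Update: Polka's code does part of the job. The lapply removes the \r\n, but because the result is a list I need to convert it to character, and when I do that the \r\n comes back.

Update: using paste() to concatenate all the strings and return a single value gives the same result.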
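One possible way around the list-versus-character problem in the updates is to collapse each cleaned report back into a single string with `vapply()`, which returns a plain character vector rather than a list. This is only a sketch: `report` is a shortened, made-up stand-in for the scraped text, and the names `pieces` and `flat` are illustrative.

```r
# Shortened stand-in for one scraped report (not the real data)
report <- c(a = "\r\n      \r\n    DOI: 10.5256/example\r\n   \r\n    The revision is approved\r\n   ")

# Split on line breaks and trim: a list with one character vector per report
pieces <- lapply(strsplit(report, "\\r?\\n"), trimws)

# Drop the pieces that are empty after trimming
pieces <- lapply(pieces, function(x) x[nzchar(x)])

# Collapse each vector into one string; vapply() guarantees a character vector
flat <- vapply(pieces, paste, character(1), collapse = " ")

flat[["a"]]  # "DOI: 10.5256/example The revision is approved"
```

Passing `collapse = " "` matters here: without it, paste() returns one string per piece instead of a single value per report.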
