Removing white space: cleaning oddly formatted web data in R
I scraped some data from the web, and when I use write.csv I get huge blank gaps in Excel. Here is a sample of two entries from my data frame:
dat <- data.frame(one="\r\n Something", two="\n\n\n another one")
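Does anyone happen to know how to remove the white space?
You have two semi-complicated questions here. The first, "does anyone happen to know how to remove the white space?", is too vague and too broad for me to really help with, beyond suggesting the functions in the stringr package. ¯\_(ツ)_/¯ I don't know if that helps.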
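For the general case, one possible approach is to collapse every run of whitespace into a single space and trim both ends of each character column before calling write.csv. The sketch below uses base R so that it runs without extra packages; `squish` is a hypothetical helper name, and stringr's `str_squish()` does the equivalent job if stringr is already loaded. The output path is a temporary file chosen only for illustration.

```r
# Sample data from the question; stringsAsFactors = FALSE keeps the
# columns as character on older versions of R
dat <- data.frame(one = "\r\n Something", two = "\n\n\n another one",
                  stringsAsFactors = FALSE)

# Hypothetical helper: replace each run of whitespace (spaces, tabs,
# \r, \n) with one space, then trim leading and trailing whitespace
squish <- function(x) trimws(gsub("[[:space:]]+", " ", x))

# Clean only the character columns and leave other columns untouched
is_chr <- vapply(dat, is.character, logical(1))
dat[is_chr] <- lapply(dat[is_chr], squish)

# Write the cleaned data; the file name is an assumption for illustration
out_file <- tempfile(fileext = ".csv")
write.csv(dat, out_file, row.names = FALSE)
```

Because embedded line breaks are what make Excel show large blank areas inside a cell, removing them before export is usually enough; whether a single space is the right replacement depends on the data.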
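The second, "Secondary: can someone help by showing me how to clean the 'referee.report' text? This is the column I am most interested in. In particular I would like to remove the "\r\n" among other things.", gives more to work with.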
referee.report = structure(c("\r\n \r\n DOI: 10.5256/f1000research.6599.r7859\r\n \r\n \r\n I have read the revised article by Horrell and D'Orazio. They have responded appropriately to\r\n ... Continue reading\r\n \r\n \r\n I have read the revised article by Horrell and D'Orazio. They have responded appropriately to the concerns/questions raised by all 3 reviewers. Accordingly, I recommend indexing the submitted revised article.\r\n \r\n \r\n I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. \r\n \r\n Competing Interests:\r\n No competing interests were disclosed.\r\n Close\r\n \r\n \r\n REPORT A CONCERN\r\n \r\n ",
"\r\n \r\n DOI: 10.5256/f1000research.6601.r7701\r\n \r\n \r\n The revision\r\n ... Continue reading\r\n \r\n \r\n The revision is approved\r\n \r\n \r\n I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. \r\n \r\n Competing Interests:\r\n No competing interests were disclosed.\r\n Close\r\n \r\n \r\n REPORT A CONCERN\r\n \r\n "
), .Names = c("http://f1000research.com/articles/3-288/v2", "http://f1000research.com/articles/4-34/v2"
))
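library(stringr)

cleanOutput <- function(listObject) {
  # Split each report into lines on Windows line breaks (\r\n)
  pieces <- str_split(listObject, "\\r\\n")
  # Trim leading and trailing whitespace (spaces, tabs) from every line
  pieces <- lapply(pieces, trimws)
  # Drop empty strings and NAs within each report
  pieces <- lapply(pieces, function(x) x[!is.na(x) & x != ""])
  # Keep the original names (the article URLs)
  setNames(pieces, names(listObject))
}
cleanOutput(referee.report)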
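Try this function?
Edit:
This version removes \t from the start of each line.
Edit: It turns out str_trim already removes "\t" at the start of a line. No edit was needed.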
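As a quick check of the point about tabs, here is a minimal sketch in base R. It uses `trimws()` rather than `str_trim()` so that it runs without stringr; both treat a leading tab as whitespace. The sample string is made up for illustration.

```r
# Made-up sample line with a leading tab, as might remain after
# splitting a scraped report on "\r\n"
line <- "\t  I have read the revised article. "

# trimws() strips spaces, tabs, and line breaks from both ends by default
trimmed <- trimws(line)
print(trimmed)
```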
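Update: Polka's code works to some extent. The lapply removes the \\, but because the variable is in list form I need to convert it to character, and when I do, the \\ return:
Update: using paste() to concatenate all the strings and return a single value produces the same result.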
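One likely explanation for the behaviour described in the updates is that calling `as.character()` or `paste()` directly on a list deparses any element holding more than one string, producing text that looks like `c("...", "...")` with escape sequences written out. A possible workaround, sketched below in base R, is to collapse each report's lines into a single string per report with `vapply()`, which returns a plain character vector. The `reports` vector is a shortened, hypothetical stand-in for `referee.report`, and joining lines with a single space is an assumption about the desired output.

```r
# Shortened, hypothetical stand-in for the scraped referee reports
reports <- c(
  a = "\r\n \r\n DOI: 10.5256/example\r\n \r\n \t The revision is approved\r\n ",
  b = "\r\n The revision\r\n ... Continue reading\r\n "
)

# Split on line breaks, trim each line, and drop empty lines
lines <- strsplit(reports, "\r\n", fixed = TRUE)
lines <- lapply(lines, trimws)
lines <- lapply(lines, function(x) x[!is.na(x) & nzchar(x)])

# Converting the list directly deparses multi-element entries,
# which is where the escaped characters reappear
deparsed <- as.character(lines)

# Collapsing inside each element avoids that: vapply() returns a
# character vector with exactly one string per report
clean <- vapply(lines, paste, character(1), collapse = " ")
print(clean)
```

The resulting vector can be assigned straight into a data frame column before the data are written out with write.csv.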