如何从字符串中删除字符并只保留 R 中的数字？

Question

a<- "\n\t\t\t\n\t\t\t\New\n\t\t\t\n\t\t\t\n\t\t\t\t\n\t\t\t\t\t - \n\t\t\t\t\n\t\t\t\t95\n\t\t\t\tdays\n\t\t\t\n\t\t"

How to isolate only the number 95 from this string?如何从该字符串中仅隔离数字 95？ I tried the gsub and str_replace but it removes the 95 too I removed this string from a site through the rvest package我尝试了gsub和str_replace但它也删除了 95 我通过rvest包从站点中删除了这个字符串

Answer 1

We can use gsub from base R to remove all characters that are not digits我们可以使用base R gsub删除所有不是数字的字符

gsub("\\D+", "", a)
#[1] "95"

Or as commented by @G Grothendieck或者正如@G Grothendieck 所评论的那样

gsub("\\D", "", a)

Or with str_remove_all或者使用str_remove_all

library(stringr)
str_remove_all(a, "\\D+")
#[1] "95"

Answer 2

The previous answers have approached the desired output negatively, by defining patterns for what is to be removed, namely anything that is not a number (hence \\\\D with uppercase D).通过定义要删除的内容的模式，即任何不是数字的内容（因此\\\\D带有大写 D），先前的答案已经否定了所需的输出。 Here's a positive solution defining what is to be kept, and extracting it via a self-defined function extract :这是一个定义要保留的内容并通过自定义函数extract的肯定解决方案：

Define function, including the pattern to be matched \\\\d{2} (ie, two contiguous numbers):定义函数，包括要匹配的模式\\\\d{2} （即两个连续的数字）：

extract <- function(x) unlist(regmatches(x, gregexpr("\\d{2}", x, perl = T)))

Apply function to data a :将函数应用于数据a ：

extract(a)
[1] "95"

Answer 3

我打算建议使用readr::parse_number但后来我了解到它会在- readr::parse_number失败，然后需要额外的工作，如解释here 。

如何从字符串中删除字符并只保留 R 中的数字？

问题描述

3 个解决方案

解决方案1
2 2019-12-25 20:24:53

解决方案2
0 2019-12-25 21:38:41

解决方案3
-1 2019-12-25 21:27:54

如何从字符串中删除字符并只保留 R 中的数字？

问题描述

3 个解决方案

解决方案1 2 2019-12-25 20:24:53

解决方案2 0 2019-12-25 21:38:41

解决方案3 -1 2019-12-25 21:27:54

解决方案1
2 2019-12-25 20:24:53

解决方案2
0 2019-12-25 21:38:41

解决方案3
-1 2019-12-25 21:27:54