How can I count words in an observation in R

Question

I need to count the words in each observation. Some observations contain a single word, while others contain several words separated by "/".

Here is a data set for a reproducible example.

data.1 <- read.csv(text = "
obs, a
2, date
3, Ari
4, MO
5, date/Ari
6, date/MO
7, Ari/MO
8, date/Ari/MO
")

This is the solution I came up with, but it does not count observations with only one word correctly.

library(dplyr)
data.2 <- data.1 %>% mutate(n.words = lengths(gregexpr("/", a)) + 1)

Any help will be really appreciated.

Thanks.

Answer 1

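This worked for me: split each string on "/" and count the resulting pieces (as.character() covers the case where the column was read in as a factor):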

lengths(strsplit(as.character(data.1$a),"/"))
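To see why the attempt in the question overcounts, here is a minimal sketch in base R. The vector `a` is a stand-in I made up to mirror the column from the question, not the asker's actual data frame. As far as I can tell, the issue is that gregexpr() returns -1 (a vector of length 1) when nothing matches, so lengths() gives the same result for "no slash" and "one slash".

```r
# Hypothetical sample vector mirroring column `a` from the question
a <- c("date", "Ari", "MO", "date/Ari", "date/MO", "Ari/MO", "date/Ari/MO")

# Original attempt: a string without "/" yields a single -1,
# so it is counted the same way as a string with one "/"
lengths(gregexpr("/", a)) + 1
# 2 2 2 2 2 2 3

# Splitting on the literal "/" and counting the pieces
n_words <- lengths(strsplit(a, "/", fixed = TRUE))
n_words
# 1 1 1 2 2 2 3
```

Passing fixed = TRUE treats "/" as a literal string rather than a regular expression, which makes no difference for this separator but is a safer habit for separators such as "." or "|".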

Answer 2

We could use str_count from stringr to count the runs of word characters in each string:

library(stringr)
library(dplyr)
data.1 %>% 
    mutate(n.words = str_count(a, "\\w+"))
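One caveat, sketched below under the assumption that a "word" means anything between "/" separators: the pattern "\\w+" also splits on spaces and hyphens, so counting the separator itself may be closer to the intent. The vector `x` is a made-up example, not data from the question.

```r
library(stringr)

# Hypothetical values containing a space and a hyphen
x <- c("New York/Ari", "e-mail")

# "\\w+" counts every run of letters, digits, or underscores
str_count(x, "\\w+")
# 3 2

# Counting the literal separator and adding one counts "/"-delimited items
n_items <- str_count(x, fixed("/")) + 1L
n_items
# 2 1
```

For the data in the question both approaches give the same counts, since every entry there is a single run of letters.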

solution1
1 ACCEPTED 2020-06-10 16:12:20

solution2
1 2020-06-10 18:37:16
