Count the number of rows where all columns have identical values

Question

I have a dataframe and I want to count the number of rows which have the same value for all the columns, within each row.

For example, I have this data:

cmp <- read.table(text = "
A B C D
1 1 1 0
1 1 1 1
2 2 2 2
3 3 3 0", header = TRUE)

Here, the count is 2, because the second row and third row have only one unique value each, only 1 s, and only 2 s, respectively.

Thanks in advance.

Answer 1

This, which uses apply() to count the number of distinct elements in each row, should do the trick:

sum(apply(cmp, 1, function(x) length(unique(x))==1))
## [1] 2

Answer 2

Count the number of values per row which are equal to the first value. If this count is equal to the number of columns, then all values in the row are identical.

sum(rowSums(cmp == cmp[ , 1]) == ncol(cmp))
#[1] 2

Answer 3

You could check if maximum value and minimum value across the rows are same

sum(do.call(pmax, cmp) == do.call(pmin, cmp))
#[1] 2

To obtain the rows where identical values are present

which(do.call(pmax, cmp) == do.call(pmin, cmp))
#[1] 2 3

Answer 4

The tidyverse way:

df %>% 
  rowwise() %>% 
  mutate(unique_vals = length(unique(c_across(everything()))))

This gives you the number of unique values for the selected columns -- feel free to change everything() to whatever you need. You can then filter/sum this variable as you please.

Count the number of rows where all columns have identical values

Question

4 answers

solution1
6 ACCPTED 2017-08-29 21:53:23

solution2
4 2017-08-29 22:17:34

solution3
3 2017-08-29 22:16:38

solution4
0 2021-04-09 08:01:57

Count the number of rows where all columns have identical values

Question

4 answers

solution1 6 ACCPTED 2017-08-29 21:53:23

solution2 4 2017-08-29 22:17:34

solution3 3 2017-08-29 22:16:38

solution4 0 2021-04-09 08:01:57

solution1
6 ACCPTED 2017-08-29 21:53:23

solution2
4 2017-08-29 22:17:34

solution3
3 2017-08-29 22:16:38

solution4
0 2021-04-09 08:01:57