如何从数据集中获取一列，并根据另一列的最大值对其进行测试？

Question

I am currently working in the package nyclflights13, data set flights.我目前在 package nyclflights13 工作，数据集航班。 There is a column for the name of a plane, and a column for how many times that plane flew.有一列是飞机的名称，一列是那架飞机飞行了多少次。 I want to know which plane flew the most amount of times.我想知道哪架飞机飞行次数最多。 Also I would like to omit any missing values, ie any NA;s.此外，我想省略任何缺失值，即任何 NA;s。

I know that I am going to have to use the summarise () function and the select function with a - to omit the missing values.我知道我将不得不使用带有 - 的 summarise () function 和 select function 来省略缺失值。 I'm just not sure how to do that exactly.我只是不确定如何做到这一点。

Answer 1

I used this code to tally the number of rows in flights with each value of tailnum .我使用此代码来计算flights中每个tailnum值的行数。

library(magrittr)
library(nycflights13)
data(flights)

flights %>% 
  dplyr::group_by(tailnum) %>% 
  dplyr::tally() %>% 
  dplyr::arrange(desc(n))

You can then ignore the top row when examining the results, as it is for NA values of tailnum .然后，您可以在检查结果时忽略第一行，因为它适用于tailnum的NA值。

如何从数据集中获取一列，并根据另一列的最大值对其进行测试？

问题描述

1 个解决方案

解决方案1
0 已采纳 2020-07-30 18:14:42

如何从数据集中获取一列，并根据另一列的最大值对其进行测试？

问题描述

1 个解决方案

解决方案1 0 已采纳 2020-07-30 18:14:42

解决方案1
0 已采纳 2020-07-30 18:14:42