分组后对data.table进行排序

Question

I have a data like this 我有这样的数据

z = data.table(name =sample(letters[1:6],10,replace=T),
Var1=rnorm(10,1,2),
Var2 =runif(10,1,10))

I want sum the var1by name and sort the sum like the fake code 我想要对var1by名称求和，并像伪代码那样对总和进行排序

z[order(sum(Var1)),sum(Var1),by =name]

But the code above doesn't work,any insights? 但上面的代码不起作用，任何见解？

Answer 1

This can be done in a two step process - ie 1) create a sum column grouped by 'name' then order based on the new column ('V1') and assign that column to NULL (if the intention is to order the whole dataset) 这可以在两个步骤的过程来完成-即1）创建sum由“名称”则分组列order基于新的柱（“V1”）和指定该列进行NULL（如果目的是订购整个数据集）

z[, V1 := sum(Var1) , by = name][order(V1)][, V1 := NULL][]

If we are only ordering the summarized output 如果我们只是订购汇总输出

z[, sum(Var1) , by = name][order(V1)]

Or as @Frank suggested set functions will be more efficient 或者@Frank建议的set功能将更有效

z[, V1 := sum(Var1) , by = name]
setorder(z, V1)[, V1 := NULL]

分组后对data.table进行排序

问题描述

1 个解决方案

解决方案1
0 已采纳 2016-12-09 05:50:06

分组后对data.table进行排序

问题描述

1 个解决方案

解决方案1 0 已采纳 2016-12-09 05:50:06

解决方案1
0 已采纳 2016-12-09 05:50:06