Replacing NAs with a new factor level in one column based on the factor level in another column using data.table

Original post 2018-10-26 19:42:02 3 1 r / data.table

library(data.table)

DATA = data.table(col_1 = factor(c("A", "B", "C", "C", "B", "A", "C")),
                  col_2 = factor(c("stuff", NA, NA, "stuff", NA, "different_stuff", NA)))

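I have a large data set in which I want to replace the NAs in col_2 that correspond to level C in col_1 with a new factor level, for example yet_another_stuff. NAs also occur in rows outside level C, and I do not want to replace NAs that belong to other levels such as B.

After the data set is loaded, both columns are already of class factor.

Because of the size of the data set, I would strongly prefer to do this with the data.table package.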

1 Answer

We can specify the logical condition in i and assign 'yet_another_stuff' to the values of 'col_2' in the rows that meet that condition:

DATA[is.na(col_2) & col_1 == "C", col_2 := "yet_another_stuff"]
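If you would rather not depend on how := treats a value that is not yet a level of the factor, one option is to add the level explicitly before assigning. The following is a minimal sketch on the sample data from the question, not part of the original answer; the variable name new_level is introduced here purely for illustration.

```r
library(data.table)

# Sample data from the question.
DATA <- data.table(
  col_1 = factor(c("A", "B", "C", "C", "B", "A", "C")),
  col_2 = factor(c("stuff", NA, NA, "stuff", NA, "different_stuff", NA))
)

# Hypothetical helper variable holding the replacement level.
new_level <- "yet_another_stuff"

# Step 1: extend the levels of col_2 so the new value is a valid level.
# Existing values and NAs are left unchanged.
DATA[, col_2 := factor(col_2, levels = c(levels(col_2), new_level))]

# Step 2: update by reference, only where col_2 is NA and col_1 is "C".
# In the sample data these are rows 3 and 7; the NAs in the "B" rows stay NA.
DATA[is.na(col_2) & col_1 == "C", col_2 := new_level]

print(DATA)
print(levels(DATA$col_2))
```

Both steps modify DATA by reference, so no copy of the full table is assigned back, which matters for a large data set.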
