簡體 English 中英

將NA和/或因子水平隨機更改為R中的其他因子水平

[英]Randomly changing NAs and/or factor level to other factor levels in R

原文 2019-09-17 14:57:58 8 2 r/ dplyr

我有一個數據框，其中一列是具有3個級別的類別變量“組”：“ A”，“ B”，“未知”，並且它還具有NA。

我想獲取所有“未知”和NA，並隨機將一半分配給“ A”，將一半分配給“ B”。 我試過在dplyr中使用mutate()和replace()函數，但是想不出如何將它們均等地分配給任一組。

2 個解決方案

像這樣的東西...

replacements = sample ( c ( 'A', 'B' ), number_wanted, replace = TRUE )

...應該可以

有一個可重現的示例（reprex）會很有用。

data.table包提供了一個簡潔的解決方案。

library(data.table)

setDT(df) # make your data.frame into a data.table

# filter for rows where your grouping variable is NA or equals "Unknown" then randomly select A or B. .N is a special data.table character representing the number of rows in the selection

df[is.na(group_var) | group_var == "Unknown", group_var := sample(c("A", "B"), .N)]

R更改一個因子水平的變量值以表示每日因子水平的值平均值

[英]R changing variable value of one factor level to represent value mean of factor levels by day

如何使用 R 配方處理因新因子水平而導致的 NA？

[英]How to handle NAs due to novel factor levels using R recipes?

R：因子水平，重新編碼為'其他'

[英]R: factor levels, recode rest to 'other'

在 R 中繪制因子水平

[英]Plot factor levels in R

R（工作室）因子與水平

[英]R (studio) factor with levels

訪問R中的因子水平

[英]Access the levels of a factor in R

R-獲得一個因子的水平

[英]R - obtaining the levels in a factor

R中的因子水平

[英]Factor Levels in R

比較R中的因子水平

[英]Compare factor levels in R

重新分類 R 中的因子水平

[英]Reclassifying factor levels in R

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 R更改一個因子水平的變量值以表示每日因子水平的值平均值如何使用 R 配方處理因新因子水平而導致的 NA？ R：因子水平，重新編碼為'其他' 在 R 中繪制因子水平 R（工作室）因子與水平訪問R中的因子水平 R-獲得一個因子的水平 R中的因子水平比較R中的因子水平重新分類 R 中的因子水平

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM