[英]Plot() function in R disorganized categorical x-values from .csv
我正在嘗試創建家庭收入的圖表,但是x值的順序錯誤。
這是我從工作的API中獲得的家庭收入類別:“ 0-15k”,“ 15k-25k”,“ 25k-35k”,“ 35k-50k”,“ 50k-75k”,“ 75k-100k”,“ 100k -125k”,“ 125k-150k”,“ 150k-175k”,“ 175k-200k”,“ 200k-250k”,“ 250k +”。 為了保密起見,我拿出了電子郵件的名稱,僅使用了前25行。
我的劇本:
#Load Data
Store2 <- read.csv("/Users/Documents/Work/Data/Client2.csv",
na.strings = "", head = TRUE)
#First 25 rows
Email Age Gender HouseholdIncome MaritalStatus
1 @aol.com <NA> Male <NA> <NA>
2 @yahoo.com 45-54 Female <NA> <NA>
3 @stratatec.com <NA> <NA> <NA> <NA>
4 @gmail.com <NA> <NA> <NA> <NA>
5 5@yahoo.com 45-54 Female 75k-100k Married
6 @aol.com 25-34 Male 75k-100k Married
7 @yahoo.com 35-44 Female 125k-150k Married
8 d@sbcglobal.net 55-64 Male 75k-100k Married
9 @yahoo.com 65+ Female 25k-35k Married
10 @me.com <NA> Female <NA> <NA>
11 @sunupcorp.com <NA> Female <NA> <NA>
12 @yahoo.com 45-54 Male 75k-100k Married
13 @att.net <NA> <NA> <NA> <NA>
14 @verizon.net <NA> Male <NA> <NA>
15 @yahoo.com 45-54 Male 50k-75k <NA>
16 @gmail.com 45-54 Male 50k-75k <NA>
17 @roadrunner.com 45-54 Female 15k-25k Single
18 @aol.com 35-44 Male 50k-75k Single
19 @yahoo.com 45-54 Male 125k-150k Single
20 @aol.com <NA> <NA> <NA> <NA>
21 @gmail.com 25-34 Male <NA> <NA>
22 @yahoo.com 25-34 Male 50k-75k Single
23 @gmail.com 55-64 Male 150k-175k Married
24 @trellnjoyce.com <NA> Female 35k-50k Married
25 @aol.com 65+ Male 50k-75k Married
編輯:我對繪圖進行了更改,但X軸標簽在路上。
#Plot of Household Income
res <- ordered(Store2$HouseholdIncome, levels=c("0-15k", "15k-25k", "25k-35k",
"35k-50k", "50k-75k", "75k-100k",
"100k-125k", "125k-150k", "150k-175k",
"175k-200k", "200k-250k", "250k+"))
#Set dimensions
par(mar=c(8,4,4,3))
#Create plot
plot(res, main = "Distribution of Household Income", xlab = "",
ylab = "Density", las=2, ylim = c(0,2000))
mtext(text="HouseholdIncome", side=1, line=6)
確保正確地訂購了HouseholdIncome因子的水平,例如,如下所示:
res <- ordered(Store2$HouseholdIncome, levels=c("0-15k", "15k-25k", "25k-35k", "35k-50k", "50k-75k", "75k-100k", "100k-125k", "125k-150k", "150k-175k", "175k-200k", "200k-250k", "250k+"))
par(mar=c(10,3,3,3))
plot(res, main = "Distribution of Household Income",
xlab = "", ylab = "Density", las=2)
mtext(text="HouseholdIncome", side=1, line=3)
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.