简体   繁体   中英

connect points in a time series with NA fields in R

This is the data frame i am working with:

Month   HSI GSI K
Dec-12  1.703   0.516   0.315
Jan-13  1.841   0.441   0.316
Feb-13  NA  NA  NA
Mar-13  NA  NA  NA
Apr-13  NA  NA  NA
May-13  3.365   0.627   0.324
Jun-13  NA  NA  NA
Jul-13  NA  NA  NA
Aug-13  4.097   0.456   0.317
Sep-13  NA  NA  NA
Oct-13  2.582   0.977   0.336
Nov-13  3.728   1.178   0.352
Dec-13  2.211   3.937   0.352
Jan-14  1.617   1.163   0.336

I am trying to plot HSI, GSI and K on the same graph. I have no problem with that. However, my problem is that my curves are discontinuous due to the NA fields, and i am only getting the year on the x axis. This is what u an using:

library(zoo)     #to plot multiple lines in the graph
library(timeDate)     #to create a time series by month
tS = timeSequence(from = "2012-12-01", to = "2014-01-01", by = "month")
plot(tS,HSI, type="l",ann="False",ylim=c(1,5), pch=22, lty=1,lwd=2, col="red")
lines(tS,GSI,type="l",pch=22,lty=1,lwd=2, col="green")     #to add other lines
lines(tS,K, type="l",pch=22,lty=1,lwd=2, col="blue")

Any help please? Thank you

You may convert your data frame to zoo object and use the plot.zoo facilities:

library(zoo)
z <- zoo(df[ , c("HSI", "GSI", "K")], order.by = as.yearmon(df$Month, "%b-%y"))

plot(na.omit(z), plot.type = "single", col = c("red", "green", "blue"))

plot(na.omit(z), plot.type = "single", col = c("red", "green", "blue"), xaxt = "n")
axis(side = 1, at = index(z), labels = format(index(z), "%b-%y"))

在此处输入图片说明

You may also try ggplot alternatives:

library(scales)
library(ggplot2)

z2 <- fortify(z, melt = TRUE)

ggplot(data = na.omit(z2), aes(x = Index, y = Value, colour = Series)) +
  geom_line() +
  scale_x_yearmon(breaks = z2$Index, format = "%b %Y") +
  theme(axis.text.x  = element_text(angle = 45, vjust = 1, hjust = 1))

在此处输入图片说明

Another ggplot possibility:

# convert year-month dates to as.POSIXct
df$Month <- as.POSIXct(as.Date(paste0(df$Month, "-01"), "%b-%y-%d"))

# reshape data from wide to long
library(reshape2)
df2 <- melt(df, id.var = "Month")

ggplot(data = na.omit(df2), aes(x = Month, y = value, colour = variable)) +
  geom_line()

在此处输入图片说明

When you have dates in as.POSIXct format you can easily format labels using scale_x_datetime : breaks = date_breaks and labels = date_format , eg

ggplot(data = na.omit(df2), aes(x = Month, y = value, colour = variable)) +
  geom_line() +
  scale_x_datetime(breaks = date_breaks("2 months"),
                   labels = date_format("%Y-%m"))

在此处输入图片说明

You can use complete.cases as follows. Let t be your data.frame:

x <- complete.cases(t)
plot(tS[x],t$HSI[x], type="l",ann="False",ylim=c(0,5), pch=22, lty=1,lwd=2, col="red")
lines(tS[x],t$GSI[x],type="l",pch=22,lty=1,lwd=2, col="green")     #to add other lines
lines(tS[x],t$K[x], type="l",pch=22,lty=1,lwd=2, col="blue")

Note that I changed the ylim in the call to plot because the blue line was outside the plot limits.

在此处输入图片说明

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM