且构网

分享程序员开发的那些事...
且构网 - 分享程序员编程开发的那些事

使用ggplot绘制大量时间序列.有可能加快速度吗?

更新时间:2023-01-07 08:58:59

部分问题要求一种更好的方式绘制这些数据".

Part of your question asks for a "better way to plot these data".

本着这种精神,您似乎有两个问题,首先,您希望沿x轴绘制> 35,000个点,正如一些评论所指出的那样,这将导致除非常大的像素以外的任何像素重叠,高分辨率显示器.其次,也是更重要的IMO,您正在尝试在同一图上绘制69个时间序列(站).在这种情况下,热图可能是更好的方法.

In that spirit, you seem to have two problems, First, you expect to plot >35,000 points along the x-axis, which, as some of the comments point out, will result in pixel overlap on anything but an extremely large, high resolution monitor. Second, and more important IMO, you are trying to plot 69 time series (stations) on the same plot. In this type of situation a heatmap might be a better approach.

library(data.table)
library(ggplot2)
library(reshape2)          # for melt(...)
library(RColorBrewer)      # for brewer.pal(...)
url <-  "http://dl.dropboxusercontent.com/s/bxioonfzqa4np6y/timeSeries.txt"
dt  <- fread(url)
dt[,Year:=year(as.Date(date))]

dt.melt  <- melt(dt[,-1,with=F],id="Year",variable.name="Station")
dt.agg   <- dt.melt[,list(y=sum(value)),by=list(Year,Station)]
dt.agg[,Station:=factor(Station,levels=rev(levels(Station)))]
ggplot(dt.agg,aes(x=Year,y=Station)) + 
  geom_tile(aes(fill=y)) +
  scale_fill_gradientn("Annual\nPrecip. [mm]",
                       colours=rev(brewer.pal(9,"Spectral")))+
  scale_x_continuous(expand=c(0,0))+
  coord_fixed()

请注意使用data.tables.您的数据集非常大(因为所有列; 35,000行并不那么大).在这种情况下,data.tables将大大加快处理速度,尤其是fread(...),它比基本R中的文本导入功能要快得多.

Note the use of data.tables. Your dataset is fairly large (because of all the columns; 35,000 rows is not all that large). In this situation data.tables will speed up processing substantially, especially fread(...) which is much faster than the text import functions in base R.