2 つのデータ テーブルがありweights
ますvalues
。
テーブルには、次のweights
5 つの列があります。
first POSIXct
late POSIXct
nodeid integer
aggid integer
weight numeric
values
テーブルにはこれらの列があります
nodeid integer
Date POSIXct
hour integer
value decimal
アイデアは、重み付けに基づいてノードの加重平均を集計ノードにする新しいテーブルを生成することです。ただし、重み付けは時間の経過とともに変化するため、最初の日付と遅い日付に基づいて一致させる必要があります。これを行う SQL 構文は次のようになります。
select v.Date, v.hour, w.aggid, sum(v.value*w.weight) as aggvalue
from values v inner join weights w
on v.nodeid=w.nodeid and v.date between w.first and w.late
group by aggid, date, hour
between
SQL構文のロジックを考えると、これをどこから始めるべきかよくわかりません。これは data.table 構文で可能ですか、またはweights
範囲を使用するのではなく、毎日の行を持つようにテーブルを変換する必要がありますか?
これがいくつかのサンプルデータです(長くてすみません)...
values<-data.table(nodeid = c(1L, 1L, 1L, 1L, 1L, 2L, 2L, 2L, 2L,
2L, 3L, 3L, 3L, 3L, 3L, 4L, 4L, 4L, 4L, 4L, 5L, 5L, 5L, 5L, 5L,
6L, 6L, 6L, 6L, 6L), Date = c("2013-07-02", "2013-07-02", "2013-07-05",
"2013-07-08", "2013-07-10", "2013-07-02", "2013-07-02", "2013-07-05",
"2013-07-08", "2013-07-10", "2013-07-02", "2013-07-02", "2013-07-05",
"2013-07-08", "2013-07-10", "2013-07-02", "2013-07-02", "2013-07-05",
"2013-07-08", "2013-07-10", "2013-07-02", "2013-07-02", "2013-07-05",
"2013-07-08", "2013-07-10", "2013-07-02", "2013-07-02", "2013-07-05",
"2013-07-08", "2013-07-10"), hour = c(1L, 2L, 23L, 2L, 2L, 1L,
2L, 23L, 2L, 2L, 1L, 2L, 23L, 2L, 2L, 1L, 2L, 23L, 2L, 2L, 1L,
2L, 23L, 2L, 2L, 1L, 2L, 23L, 2L, 2L), value = c(8.234, 3.218,
0.787, 8.689, 6.218, 6.89, 1.914, 2.459, 6.683, 8.122, 0.281,
1.136, 1.993, 7.27, 9.582, 5.777, 1.375, 9.204, 7.862, 0.633,
2.433, 1.842, 7.178, 10.692, 1.417, 1.259, 2.619, 0.031, 6.744,
5.941))
weights<-data.table(first = c("2013-07-01", "2013-07-01", "2013-07-01",
"2013-07-01", "2013-07-01", "2013-07-01", "2013-07-08", "2013-07-08",
"2013-07-08", "2013-07-08", "2013-07-08", "2013-07-08"), late = c("2013-07-07",
"2013-07-07", "2013-07-07", "2013-07-07", "2013-07-07", "2013-07-07",
"2013-07-20", "2013-07-20", "2013-07-20", "2013-07-20", "2013-07-20",
"2013-07-20"), nodeid = c(1L, 2L, 3L, 4L, 5L, 6L, 1L, 2L, 3L,
4L, 5L, 6L), aggid = c(1L, 1L, 1L, 2L, 2L, 2L, 1L, 1L, 1L, 2L,
2L, 2L), weight = c(0.5, 0.25, 0.25, 0.3, 0.5, 0.2, 0.6, 0.2,
0.2, 0.4, 0.45, 0.15))
exresults<-data.table(aggid = c(1L, 2L, 1L, 2L, 1L, 2L, 1L, 2L, 1L,
2L), Date = c("2013-07-02", "2013-07-02", "2013-07-02", "2013-07-02",
"2013-07-05", "2013-07-05", "2013-07-08", "2013-07-08", "2013-07-10",
"2013-07-10"), hour = c(1L, 1L, 2L, 2L, 23L, 23L, 2L, 2L, 2L,
2L), aggvalue = c(5.90975, 3.2014, 2.3715, 1.8573, 1.5065, 6.3564,
8.004, 8.9678, 7.2716, 1.782))