The most commonly used way: spliting time series feature into year-month-day-hour-minute-second
有時單純切分時間特徵所得到的欄位與目標值關係並不直接,此時我們可以思考目標值是否與一些時間週期有關係(找尋目標值與時間有沒有循環的特性)。這樣的特性可能發生於:
Sometimes the columns after splitting up the time series feature still don't have stong relationship with the target values. We can then check if there's some sort of time cycle that make the target value repeat itself. Some possible time cycle could be:
前述的週期所需數值都可由時間欄位搭配正弦函數或餘弦函數加以組合:
Time cycle mentioned above could be achieved by using time series splitted columns with cos or sin functions.
也可能從時段區間的事件計數推估事件發生的機率,如:網站銷售預測、累計點擊量等。
It is also possible to predict the possibility of observing countings of an event in a certain time frame.
本篇文章請參考Github。The code is available on Github.
文中若有錯誤還望不吝指正,感激不盡。
Please let me know if there’s any mistake in this article. Thanks for reading.
Reference 參考資料:
[1] 第二屆機器學習百日馬拉松內容
[2] Time Series Anomaly Detection Algorithms