[Day12] Technical Indicator Calculation - ES + Pandas

12th iThome Ironman Contest

DAY 12

Elastic Stack on Cloud

Elastic 戰台股 series, part 12

Author: 華叔

Team: 搭著 ESTC 飛上天

2020-09-27 19:06:45

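For numerical data analysis in Python, Pandas is a tool you cannot do without. Let's try it out today!

Installing packages

The Dockerfile from Day08 already covered the packages needed for today's topic: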

pip install numpy
pip install pandas

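Importing packages

To analyze Elasticsearch documents with the libraries above, we first import them.

import numpy as np
import pandas as pd
from elasticsearch import Elasticsearch
from elasticsearch_dsl import Search  # needed for the Search query below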

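Getting data from ES

Using the query techniques learned yesterday, fetch 30 end-of-day records for stock 2317 from ES Cloud:

# end_point and the credentials are not shown in the original
es = Elasticsearch(end_point, http_auth=(...))

# Match stock_id 2317 and sort by date, newest first
s = Search(using=es, index="history-prices-python") \
    .query("match", stock_id="2317") \
    .sort({"date": {"order": "desc"}})
s = s[0:30]  # keep only the first 30 hits
response = s.execute()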
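
For readers who want to see what the chained calls amount to, here is a minimal sketch of the request body that the Search object above is expected to build. It is written as a plain dict so it runs without a cluster. The field names come from the query above, while the exact layout is an assumption and not copied from the original post.

```python
# Plain-dict sketch of the request body the Search chain is expected to produce.
# Field names (stock_id, date) come from the article; the layout is an assumption.
query_body = {
    "query": {"match": {"stock_id": "2317"}},
    "sort": [{"date": {"order": "desc"}}],
    "from": 0,   # slice start from s[0:30]
    "size": 30,  # slice length from s[0:30]
}

print(query_body["size"])
```

With a live connection, comparing this dict against `s.to_dict()` is a quick way to check that the query is what you intended before executing it.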

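Building NumPy arrays

The plan is to build one ndarray per field (stock_id, date, open, high, low, close, volume) and assemble them into a DataFrame afterwards.

# Assumption: the original does not show where elastic_docs comes from;
# here it is taken from the raw hits of the response above.
elastic_docs = response.to_dict()["hits"]["hits"]

doc_fields = {}
for doc in elastic_docs:
    source_data = doc["_source"]
    for key in source_data:
        try:
            # Append to the existing array for this field
            doc_fields[key] = np.append(doc_fields[key], source_data[key])
        except KeyError:
            # First time this field is seen: start a new array
            doc_fields[key] = np.array([source_data[key]])

Check the result:

for key, val in doc_fields.items():
    print(key, ":", val)

Looks good!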
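
As a side note, np.append returns a new array on every call, so the loop above copies each array once per document. That is fine for 30 rows. A possible alternative, sketched here with made-up sample documents (the values are not real prices), is to collect plain lists first and convert each one a single time.

```python
import numpy as np

# Made-up sample hits, shaped like the "_source" documents in the article.
sample_docs = [
    {"_source": {"stock_id": "2317", "date": "2020-09-25", "close": 78.0}},
    {"_source": {"stock_id": "2317", "date": "2020-09-24", "close": 77.5}},
    {"_source": {"stock_id": "2317", "date": "2020-09-23", "close": 78.5}},
]

# Collect the values of each field in a plain Python list first...
columns = {}
for doc in sample_docs:
    for key, value in doc["_source"].items():
        columns.setdefault(key, []).append(value)

# ...then convert each list to an ndarray exactly once.
doc_fields = {key: np.array(values) for key, values in columns.items()}

for key, val in doc_fields.items():
    print(key, ":", val)
```

The resulting dictionary has the same shape as the one built with np.append, so it can be passed to a DataFrame in the same way.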

Building the DataFrame

Earlier I converted each field into a NumPy ndarray and collected them in a dictionary (doc_fields). To turn that into a DataFrame, just pass it to pandas.DataFrame. It is almost too easy!

elastic_df = pd.DataFrame(doc_fields)

print('elastic_df:', type(elastic_df), "\n")
print(elastic_df)  # print out the DF object's contents

Tomorrow we can finally start on real technical indicator analysis... it is a long road.
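
Because the query sorts newest first, the rows arrive in descending date order, while most time-series calculations expect oldest first. The sketch below shows one way to prepare the frame. It uses made-up values and assumes the date field holds ISO-style strings, which the original does not confirm. The 3-day moving average is only a small illustration, not the method used later in the series.

```python
import numpy as np
import pandas as pd

# Made-up values, newest first, mimicking the descending sort of the ES query.
doc_fields = {
    "date": np.array(["2020-09-25", "2020-09-24", "2020-09-23", "2020-09-22"]),
    "close": np.array([78.0, 77.5, 78.5, 79.0]),
}
elastic_df = pd.DataFrame(doc_fields)

# Parse the dates (ISO strings assumed) and reorder oldest to newest.
elastic_df["date"] = pd.to_datetime(elastic_df["date"])
elastic_df = elastic_df.sort_values("date").set_index("date")

# 3-day simple moving average of the close; the first two rows have
# too few observations and come out as NaN.
elastic_df["sma_3"] = elastic_df["close"].rolling(window=3).mean()

print(elastic_df)
```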
