系列文章 : [gem5] 從零開始的 gem5 學習筆記
在上一篇提到,假如硬體參數不同,在相同的軟體( 演算法 )下,會有什麼樣的效能差異。
而這篇則是來實驗看看 假如硬體參數相同,在不同的軟體( 演算法 )下,會有什麼樣的效能差異。
sort 的實作在這裡 https://github.com/TommyWu-fdgkhdkgh/simple-riscv-vp/blob/main/firmware/sort.c,但必須誠實的說,這是 gemini-cli 寫的,終於搭上了現在最夯的 vibe-coding 熱潮。
花六百多訂閱的 gemini AI pro 沒有白費 QQ 感謝 gemini!
# Number of instructions simulated (Count)
# Number of cpu cycles simulated (Cycle)
# CPI: cycles per instruction (core level) ((Cycle/Count))
# how many mtime ticks
# Number of instructions simulated (Count)
# Number of cpu cycles simulated (Cycle)
# CPI: cycles per instruction (core level) ((Cycle/Count))
# how many mtime ticks
# Number of instructions simulated (Count)
# Number of cpu cycles simulated (Cycle)
# CPI: cycles per instruction (core level) ((Cycle/Count))
# how many mtime ticks
# Number of instructions simulated (Count)
# Number of cpu cycles simulated (Cycle)
# CPI: cycles per instruction (core level) ((Cycle/Count))
# how many mtime ticks
總結一下 bubble-sort v.s. merge-sort 數據的結果
硬體參數完全相同,所以可以看到 cycle per instruction (cpi) 幾乎一模一樣。numCycles 這項數據來說, merge-sort 明顯少於 bubble sort。