阿，又是一個RAG :: 2025 iThome 鐵人賽

poyuanchih (poyuanchih)

iT邦新手 5 級 ‧ 點數 267

數學系

1154

累計瀏覽數

站內簡訊追蹤

鐵人檔案

2025 iThome 鐵人賽

回列表

生成式 AI

阿，又是一個RAG 系列

這是一個以 RAG為名的系列，但其實大部分篇幅應該都在處理資料，預計內容包含：
(1) 取得(context, question, answer)對的三種方法
(2) 以Label Studio建立ground-truth
(3) 搭建 RAG baseline
(4) 探索驗證框架
(5) 各式方法論的實測
我會實測檢索的recall 、答題的忠實度以及 LLM as a judge的表現
看看現代 LLM／RAG／Agent 在自製題目集的能力與限制

鐵人鍊成｜共 30 篇文章｜ 1 人訂閱訂閱系列文 RSS系列文

0 Like 0 留言 177 瀏覽

DAY 11

Day10: CitationQueryEngine 與 Workflow

Situation 先前我們在 Day7: perplexity api 初探與 fact_ckecher 功能測試調用 perplexity api 時，...

2025-09-25 ‧ 由 poyuanchih 分享

0 Like 0 留言 155 瀏覽

DAY 12

Day11: SubQuestionQueryEngine(上): SubQuestion 與 Workflow

Situation 先前我們在 Day6: pdf2txt 使用 llama-parse 與 mistral-ocr 時，我們已經把包含一系列單選題的 pdf...

2025-09-26 ‧ 由 poyuanchih 分享

0 Like 0 留言 166 瀏覽

DAY 13

Day12: SubQuestionQueryEngine(中): Streaming events 與 ReActAgent

Situation 我們昨天架構了 SubQuestionQueryEngine as workflow 的整體工作流程並且測試了 prompt llm 來...

2025-09-27 ‧ 由 poyuanchih 分享

0 Like 0 留言 157 瀏覽

DAY 14

Day13: SubQuestionQueryEngine(下): combine_answer 與 update_prompts

Situation 我們正在嘗試用 llama-index 的 workflow 自行架構 SubQuestionQueryEngine 應用情境為：給定一...

2025-09-28 ‧ 由 poyuanchih 分享

0 Like 0 留言 211 瀏覽

DAY 15

Day14: 用 llama-index 的 workflow 來把 FunctionCallingAgent 寫出來

Situation 我們前面大多是關注在怎麼使用 Agent 包含 Day8: Tavily 與 FunctionAgent 的 FunctionAgent...

2025-09-29 ‧ 由 poyuanchih 分享

0 Like 0 留言 193 瀏覽

DAY 16

Day15: 用 llama-index 的 workflow 來把 ReActAgent 兜出來

TL;DR 今天的完整程式碼在這裡有一個 ipynb 用來釐清內部的細節 ReAct_parser_and_formatter 具體來說是: ReA...

2025-09-30 ‧ 由 poyuanchih 分享

0 Like 0 留言 187 瀏覽

DAY 17

Day16: Pydantic 與 Structured Output

Situation 今天是個全新的篇章，我們來探索一下新的主題: llm 的 Structured Output 這個系列預計會包含：本篇 txt2jso...

2025-10-01 ‧ 由 poyuanchih 分享

0 Like 0 留言 166 瀏覽

DAY 18

Day17: exam_and_structured_output_dataset

TL;DR 今天的交付物是兩個 Dataset: 一個是structured_output_dataset 👉 點我看長相一個是exam_data...

2025-10-02 ‧ 由 poyuanchih 分享

0 Like 0 留言 160 瀏覽

DAY 19

Day18: structured output challenge

Intro 我們昨天產出了一份 toy problem:= structured_output_dataset.json 👉 點我看長相今天要來跑實驗啦...

2025-10-03 ‧ 由 poyuanchih 分享

0 Like 0 留言 159 瀏覽

DAY 20

Day19: evaluator in llama-index

Intro 我們今天先來熟悉一下 llama-index 世界的 Evaluating 具體來說，我們會在範例資料上使用幾種不同的 evaluator 來實...

2025-10-04 ‧ 由 poyuanchih 分享

poyuanchih的鐵人檔案

poyuanchih的收藏

poyuanchih的追蹤

poyuanchih的Like

poyuanchih的紀錄

poyuanchih的訂閱列表