從「這次不會壞吧」到自動化的未來：17 模型版本控制 - Weight&Biases - iT 邦幫忙::一起幫忙解決難題，拯救 IT 人的一天

2024 iThome 鐵人賽

AI/ ML & Data

從「這次不會壞吧」到自動化的未來：從 DevOps 到 MLOps，改造模型佈署之旅系列第 17 篇

從「這次不會壞吧」到自動化的未來：17 模型版本控制 - Weight&Biases

16th鐵人賽 mlops devops machine learning 自動化

aif_tw

2024-10-25 08:46:10

1421 瀏覽

分享至

大致功能與 Tensorboard 相同，但是整合了更多的深度學習(Llamaindex、Langchain、HuggingFace…)與機器學習(Scikit-Learn、XGBoost….)開發工具，目前觀察下來相較於Tensorboard有更精美的UI，及更好的視覺體驗。
支援上傳Artifact的檔案(訓練完成的模型，資料前處理的pickle檔….等等)，以方便後續部署與版本控制
針對模型部署提供極為方便的功能，能將實驗最好的模型進行註冊，以利後續開發鎖定模型版本，已有許多文章分享使用方式。
注意：需要申請Weight&Biases 帳號，實驗紀錄會上傳至該帳號底下，需要確保網路順暢，因為需要將檔案上傳至Weight&Biases雲端儲存空間，每個人有免費的100G可以使用

圖片來源:https://wandb.ai/site

如何在Python開發環境使用，以及程式碼需要如何撰寫：

工具安裝：

pip install wandb -qU
申請帳號 https://wandb.ai/home

登入後，點選https://wandb.ai/authorize 取得 API key

程式碼撰寫：(以一個簡易的資料做示範)

當程式執行到 wandb.login()時，會出現以下選擇，因為上一步已經先申請好帳號，所以這邊選 2
接下來會要求輸入API key，API key的取得方式可以參考以上第二步

import wandb
wandb.login()
import random

# Launch 5 simulated experiments
total_runs = 5
for run in range(total_runs):
  # Start a new run to track this script
  wandb.init(
      # Set the project where this run will be logged
      project="basic-intro", 
      # We pass a run name (otherwise it’ll be randomly assigned, like sunshine-lollypop-10)
      name=f"experiment_{run}", 
      # Track hyperparameters and run metadata
      config={
      "learning_rate": 0.02,
      "architecture": "CNN",
      "dataset": "CIFAR-100",
      "epochs": 10,
      })

  # This simple block simulates a training loop logging metrics
  epochs = 10
  offset = random.random() / 5
  for epoch in range(2, epochs):
      acc = 1 - 2 ** -epoch - random.random() / epoch - offset
      loss = 2 ** -epoch + random.random() / epoch + offset

      # Log metrics from your script to W&B
      wandb.log({"acc": acc, "loss": loss})

  # Mark the run as finished
  wandb.finish()