Day 27 - NER Model Training (2) - iT 邦幫忙

2023 iThome Ironman Contest

DAY 27

Self-Challenge Group

Learning NLP (Natural Language Processing) from Scratch in 30 Days, part 27 of the series


15th Ironman Contest, training, fine-tuning, ner, parameter setup

肉彈

2023-10-12 10:36:40


Continuing from the previous day.

5. Define the model

from transformers import BertForTokenClassification

model = BertForTokenClassification.from_pretrained(model_checkpoint, num_labels=len(label_list))

Import the BertForTokenClassification class and use its from_pretrained method to load bert-base-chinese.
The num_labels argument specifies how many labels the model has to predict; here there are 9.
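To make the link between `label_list` and `num_labels` concrete, here is a minimal sketch. The `label_list` below is a hypothetical 9-tag BIO scheme used only for illustration (the real list comes from the dataset prepared on the earlier days), and `id2label` / `label2id` are helper names introduced here.

```python
# Hypothetical 9-tag BIO label list; the real one comes from the dataset.
label_list = [
    "O",
    "B-PER", "I-PER",
    "B-ORG", "I-ORG",
    "B-LOC", "I-LOC",
    "B-MISC", "I-MISC",
]

# Mappings between integer class ids and tag names.
id2label = {i: label for i, label in enumerate(label_list)}
label2id = {label: i for i, label in enumerate(label_list)}

# This is the value passed as num_labels: one output class per tag.
num_labels = len(label_list)

print(num_labels)
print(id2label[1], label2id["B-ORG"])
```

If desired, `id2label=id2label, label2id=label2id` can also be passed to `from_pretrained`, which stores the tag names in the model config so predictions can later be read as tag names rather than integers.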

6. Set up the parameters

Training arguments

from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="outputs/bert-base-chinese",
    evaluation_strategy="steps",
    save_strategy="steps",
    save_steps=500,
    eval_steps=500,
    learning_rate=2e-5,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=16,
    num_train_epochs=3,
    weight_decay=0.01,
    fp16=True,
    no_cuda=False,
)

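output_dir sets where the trained model and its checkpoints are stored
evaluation_strategy and save_strategy are set to "steps", so the model is evaluated and checkpoints are saved at step intervals
eval_steps and save_steps are set to 500, so evaluation and saving happen every 500 steps
learning_rate is the learning rate, set to 2e-5 here
per_device_train_batch_size and per_device_eval_batch_size are set to 16, so each training and evaluation batch on a device contains 16 samples
num_train_epochs is set to 3, the total number of training epochs
weight_decay is set to 0.01; weight decay regularizes the model to help prevent overfitting
fp16 controls whether mixed-precision training is used; it can speed up training, but it only helps on hardware that supports it
no_cuda is set to False so that CUDA is not disabled; when a CUDA device is available, using it to speed up training is generally recommended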
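As a rough worked example of how these settings interact, the arithmetic below estimates how many optimization steps the run takes and how many times evaluation is triggered. The training-set size is a made-up number, and the calculation assumes a single device and no gradient accumulation.

```python
import math

# Hypothetical training-set size; in practice use len(tokenized_datasets["train"]).
num_train_examples = 20_000

# Values taken from the TrainingArguments above.
per_device_train_batch_size = 16
num_train_epochs = 3
eval_steps = 500

# Assumes one device and no gradient accumulation; the last partial batch is kept.
steps_per_epoch = math.ceil(num_train_examples / per_device_train_batch_size)
total_steps = steps_per_epoch * num_train_epochs

# Evaluation (and checkpoint saving) fires once every eval_steps steps.
num_evaluations = total_steps // eval_steps

print(steps_per_epoch, total_steps, num_evaluations)
```

With a much smaller dataset, the total number of steps can fall below 500, in which case no step-based evaluation or checkpoint would be triggered during training, so it is worth checking this number before choosing eval_steps and save_steps.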

Create the data collator

from transformers import DataCollatorForTokenClassification

data_collator = DataCollatorForTokenClassification(tokenizer)

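The data collator groups the processed examples into batches and pads them to a common length, so that they are in the right shape for training and evaluating the model.

Define compute_metrics, a function that computes the evaluation metrics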
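To show what this padding amounts to, here is a simplified pure-Python sketch. It is not the library's implementation: `pad_batch` is a hypothetical helper, the token ids are made up, and it assumes right-side padding with pad token id 0 and label padding value -100.

```python
def pad_batch(features, pad_token_id=0, label_pad_id=-100):
    """Pad every example in the batch to the length of the longest one."""
    max_len = max(len(f["input_ids"]) for f in features)
    batch = {"input_ids": [], "attention_mask": [], "labels": []}
    for f in features:
        n_real = len(f["input_ids"])
        n_pad = max_len - n_real
        batch["input_ids"].append(f["input_ids"] + [pad_token_id] * n_pad)
        # 1 marks real tokens, 0 marks padding.
        batch["attention_mask"].append([1] * n_real + [0] * n_pad)
        # Padded label positions get -100 so they are skipped later on.
        batch["labels"].append(f["labels"] + [label_pad_id] * n_pad)
    return batch


# Two made-up examples of different lengths.
features = [
    {"input_ids": [101, 2769, 102], "labels": [-100, 0, -100]},
    {"input_ids": [101, 1266, 776, 2356, 102], "labels": [-100, 5, 6, 6, -100]},
]

batch = pad_batch(features)
print(batch["input_ids"][0])
print(batch["attention_mask"][0])
print(batch["labels"][0])
```

Padding the labels with -100 matters because the same value is filtered out again in compute_metrics, so padded positions never count toward the scores.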

import numpy as np
from datasets import load_metric

metric = load_metric("seqeval")

def compute_metrics(p):
    predictions, labels = p
    predictions = np.argmax(predictions, axis=2)

    true_predictions = [
        [label_list[p] for (p, l) in zip(prediction, label) if l != -100]
        for prediction, label in zip(predictions, labels)
    ]
    true_labels = [
        [label_list[l] for (p, l) in zip(prediction, label) if l != -100]
        for prediction, label in zip(predictions, labels)
    ]
    results = metric.compute(predictions=true_predictions, references=true_labels)
    return {
        "precision": results["overall_precision"],
        "recall": results["overall_recall"],
        "f1": results["overall_f1"],
        "accuracy": results["overall_accuracy"],
    }

```
metric = load_metric("seqeval")
```
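- The seqeval library provides convenient ways to compute these metrics and is especially suited to sequence data in token classification; this line loads the seqeval metric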
```
predictions, labels = p
predictions = np.argmax(predictions, axis=2)
```
- For each token, compare the scores of all labels and take the highest-scoring class as the predicted label
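A small illustration of the argmax step, using made-up scores for one sentence with 3 tokens and 4 candidate labels. The array values and the name `logits` are assumptions for the example only.

```python
import numpy as np

# Hypothetical scores with shape (batch, tokens, labels) = (1, 3, 4).
logits = np.array([[
    [2.0, 0.1, 0.3, 0.2],
    [0.1, 3.5, 0.2, 0.4],
    [0.3, 0.2, 0.1, 1.9],
]])

# axis=2 is the label dimension, so argmax picks one label id per token.
predictions = np.argmax(logits, axis=2)

print(logits.shape)
print(predictions.shape)
print(predictions.tolist())
```

The label dimension disappears after argmax, leaving one integer label id per token, which is the shape the rest of compute_metrics expects.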

true_predictions = [
   [label_list[p] for (p, l) in zip(prediction, label) if l != -100]
   for prediction, label in zip(predictions, labels)
]
true_labels = [
   [label_list[l] for (p, l) in zip(prediction, label) if l != -100]
   for prediction, label in zip(predictions, labels)
]

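These two lists hold the processed predictions and true labels: positions whose label is -100 are dropped, and the remaining label ids are converted to label names through label_list.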
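The filtering can be tried on a tiny made-up input. The 3-tag `label_list` and the id sequences below are hypothetical and only serve to show which positions survive.

```python
# Hypothetical label list and one short sentence, for illustration only.
label_list = ["O", "B-LOC", "I-LOC"]

predictions = [[0, 1, 2, 0, 0]]    # predicted label ids per token
labels = [[-100, 1, 2, 0, -100]]   # -100 marks positions to ignore

# Same comprehensions as in compute_metrics: skip -100, map ids to names.
true_predictions = [
    [label_list[p] for (p, l) in zip(prediction, label) if l != -100]
    for prediction, label in zip(predictions, labels)
]
true_labels = [
    [label_list[l] for (p, l) in zip(prediction, label) if l != -100]
    for prediction, label in zip(predictions, labels)
]

print(true_predictions)
print(true_labels)
```

The first and last positions are removed from both lists, so the predictions made at ignored positions have no effect on the metrics.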

results = metric.compute(predictions=true_predictions, references=true_labels)
return {
    "precision": results["overall_precision"],
    "recall": results["overall_recall"],
    "f1": results["overall_f1"],
    "accuracy": results["overall_accuracy"],
}

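Compute the performance metrics: precision, recall, F1 score, and accuracy (these will be covered in detail next time, when we look at the evaluation results).

Create the Trainer object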
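For intuition about these numbers, the sketch below computes precision, recall and F1 over whole entity spans from BIO tags, which is the level at which seqeval scores them. It is a simplified illustration, not seqeval's implementation: `extract_entities` and `prf1` are hypothetical helpers, and an I- tag with no preceding B- tag is simply ignored here.

```python
def extract_entities(tags):
    """Return a set of (type, start, end) spans from a BIO tag sequence."""
    entities, start, ent_type = set(), None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel closes a trailing span
        continues = tag.startswith("I-") and tag[2:] == ent_type
        if start is not None and not continues:
            entities.add((ent_type, start, i - 1))
            start, ent_type = None, None
        if tag.startswith("B-"):
            start, ent_type = i, tag[2:]
    return entities


def prf1(true_tags, pred_tags):
    """Entity-level precision, recall and F1 for one sentence."""
    true_ents = extract_entities(true_tags)
    pred_ents = extract_entities(pred_tags)
    correct = len(true_ents & pred_ents)  # exact span and type matches
    precision = correct / len(pred_ents) if pred_ents else 0.0
    recall = correct / len(true_ents) if true_ents else 0.0
    f1 = 2 * precision * recall / (precision + recall) if correct else 0.0
    return precision, recall, f1


# Made-up example: the PER span matches, the LOC span is off by one token.
true_tags = ["B-PER", "I-PER", "O", "B-LOC", "O"]
pred_tags = ["B-PER", "I-PER", "O", "O", "B-LOC"]

precision, recall, f1 = prf1(true_tags, pred_tags)
print(precision, recall, f1)
```

In this example three of the five tokens are tagged correctly, yet only one of the two entities counts as correct, which is why entity-level scores can be noticeably lower than token-level accuracy.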

from transformers import Trainer

trainer = Trainer(
    model,
    args,
    train_dataset=tokenized_datasets["train"],
    eval_dataset=tokenized_datasets["validation"],
    data_collator=data_collator,
    tokenizer=tokenizer,
    compute_metrics=compute_metrics
)