簡體   English   中英

如何將帶有 Pytorch 的二進制遷移學習 model 擴展到多個圖像類別?

[英]How to extend a binary transfer learning model with Pytorch to multiple image categories?

我正在處理一些代碼,這些代碼使用 ResNet-18 model 對臭名昭著的狗與貓圖像分類進行分類,我想將其擴展為能夠對兩個以上的圖像類別進行分類(如狗、貓、倉鼠和倉鼠)。 ...)。 特別是我有5個類別。 我是遷移學習的新手,我不確定我必須對代碼進行哪些更改才能使其正常工作。

import torch
import numpy as np
import torch.nn.functional as F
from torch.nn import Linear
from torch.utils.data import DataLoader, random_split
from torch.optim import Adam
from torchvision.transforms import Compose, Resize, ToTensor
from torchvision.datasets import ImageFolder
from torchvision.models import resnet18
from matplotlib import pyplot as plt
import random

transform = Compose([Resize((128,128)), ToTensor()])

ds = ImageFolder("*Image_Folder*", transform=transform)
ds_train, ds_val = random_split(ds, [3250, 1073])
dl_train = DataLoader(ds_train, batch_size= 32, shuffle=True)
dl_val = DataLoader(ds_val, batch_size= len(ds_val), shuffle= True)

model = resnet18(pretrained=True)
model.requires_grad_(False)
model.fc = Linear(model.fc.in_features, 5)
X_val, y_val = next(dl_val.__iter__())

opt = torch.optim.Adam(model.parameters(), lr=0.001)
def accuracy(yy, y):
  return torch.mean(1.0*(yy == y))
X_val.shape, y_val.shape
y_val = y_val.reshape(-1, 1).float()

for epoch in range(10):
  losses = []
  accs = []
  losses_val = []
  accs_val = []
  model.train()
  for X, y in dl_train:
    y = y.reshape(-1, 1).float()
    yy = torch.sigmoid(model(X))
    loss = F.binary_cross_entropy(yy, y)
    losses.append(loss.item()) 
    loss.backward()
    opt.step()
    opt.zero_grad()
    acc = accuracy(torch.round(yy), y)
    accs.append(acc.item())
  model.eval()
  with torch.no_grad():
    yy_val = torch.sigmoid(model(X_val))
    loss_val = F.binary_cross_entropy(yy_val, y_val)
    losses_val.append(loss_val.item()) 
    acc_val = accuracy(torch.round(yy_val), y_val)
    accs_val.append(acc_val.item())
  print(f"Epoch {epoch}: t-loss = {np.mean(losses):.4f}, t-acc = {np.mean(accs):.4f}, v-loss = {loss_val:.4f}, v-acc = {acc_val:.4f}")

我相信代碼對於 for 循環來說很好,但是它可能是我需要添加或更改的東西。 目前線loss = F.binary_cross_entropy(yy, y)是什么給了我一個錯誤ValueError: Using a target size (torch.Size([32, 1])) that is different to the input size (torch.Size([32, 5])) is deprecated. Please ensure they have the same size. ValueError: Using a target size (torch.Size([32, 1])) that is different to the input size (torch.Size([32, 5])) is deprecated. Please ensure they have the same size.

這是我正在使用的數據: https://www.kaggle.com/alxmamaev/flowers-recognition

Binary Cross Entropy 是為二進制分類任務設計的損失 function。

為了將此 model 轉換為能夠進行 5 類分類的層,除了將最后一層的寬度更改為 5 之外,您還需要將損失 function 更改為多項評分器,例如CrossEntropyLoss()

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM