Variational Autoencoder
Posted on March 20, 2024 at 06:34:30 · Author: 재형이
- I can do this.
- These posts have been mostly hands-on labs lately, so it's been pretty much all code... haha, oh well~
Lab Objective
- Design and train a VAE to build a model that generates images. (Dataset: MNIST)
Problem Definition
- VAE
VAE
[Step1] Load libraries & Datasets
import numpy as np
import matplotlib.pyplot as plt
import torch
import torch.nn as nn
import torch.nn.functional as F
from torch.utils.data import DataLoader
from torchvision import datasets
from torchvision.transforms import ToTensor

train_images = datasets.MNIST(
    root='data',
    train=True,
    download=True,
    transform=ToTensor()
)
test_images = datasets.MNIST(
    root='data',
    train=False,   # test split (train=True here would just reload the training set)
    download=True,
    transform=ToTensor()
)
Downloading http://yann.lecun.com/exdb/mnist/train-images-idx3-ubyte.gz
Downloading http://yann.lecun.com/exdb/mnist/train-images-idx3-ubyte.gz to data/MNIST/raw/train-images-idx3-ubyte.gz
100%|██████████| 9912422/9912422 [00:00<00:00, 117782222.75it/s]
Extracting data/MNIST/raw/train-images-idx3-ubyte.gz to data/MNIST/raw

Downloading http://yann.lecun.com/exdb/mnist/train-labels-idx1-ubyte.gz
Downloading http://yann.lecun.com/exdb/mnist/train-labels-idx1-ubyte.gz to data/MNIST/raw/train-labels-idx1-ubyte.gz
100%|██████████| 28881/28881 [00:00<00:00, 23498679.69it/s]
Extracting data/MNIST/raw/train-labels-idx1-ubyte.gz to data/MNIST/raw

Downloading http://yann.lecun.com/exdb/mnist/t10k-images-idx3-ubyte.gz
Downloading http://yann.lecun.com/exdb/mnist/t10k-images-idx3-ubyte.gz to data/MNIST/raw/t10k-images-idx3-ubyte.gz
100%|██████████| 1648877/1648877 [00:00<00:00, 30957852.60it/s]
Extracting data/MNIST/raw/t10k-images-idx3-ubyte.gz to data/MNIST/raw

Downloading http://yann.lecun.com/exdb/mnist/t10k-labels-idx1-ubyte.gz
Downloading http://yann.lecun.com/exdb/mnist/t10k-labels-idx1-ubyte.gz to data/MNIST/raw/t10k-labels-idx1-ubyte.gz
100%|██████████| 4542/4542 [00:00<00:00, 19539003.86it/s]
Extracting data/MNIST/raw/t10k-labels-idx1-ubyte.gz to data/MNIST/raw
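As a quick sanity check (my addition, not in the original post), it is worth confirming what the datasets actually contain; the 60,000/10,000 split and the 1×28×28 tensor shape are standard for MNIST:

# Sanity check on the loaded datasets (a minimal sketch, not in the original post)
print(len(train_images), len(test_images))   # expected: 60000 10000
img, label = train_images[0]
print(img.shape, img.min().item(), img.max().item(), label)
# ToTensor() yields a 1x28x28 float tensor already scaled to [0, 1]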
[Step2] Set hyperparameters
# Hyperparameters
EPOCH = 10
BATCH_SIZE = 64
LEARNING_RATE = 1e-3
DEVICE = torch.device("cuda" if torch.cuda.is_available() else "cpu")
print("Using Device:", DEVICE)
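Since the VAE samples random noise in every forward pass, fixing the seeds makes runs repeatable. This is an optional addition of mine, not part of the original post:

# Optional: fix seeds so the sampled noise in the reparameterization step is repeatable
import random
random.seed(42)
np.random.seed(42)
torch.manual_seed(42)
if torch.cuda.is_available():
    torch.cuda.manual_seed_all(42)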
[Step3] Create DataLoader
train_loader = DataLoader(train_images, batch_size=BATCH_SIZE, shuffle=True)
test_loader = DataLoader(test_images, batch_size=BATCH_SIZE, shuffle=True)
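A one-batch peek (my addition) confirms what the loaders emit before we wire up the model:

# Peek at one batch to confirm shapes: images are (B, 1, 28, 28), labels are (B,)
X, y = next(iter(train_loader))
print(X.shape, y.shape)   # expected: torch.Size([64, 1, 28, 28]) torch.Size([64])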
[Step4] Set Network Structure
class VAE(nn.Module):
    def __init__(self, latent_dim):
        super(VAE, self).__init__()
        self.flatten = nn.Flatten()
        self.encoder = nn.Sequential(
            nn.Linear(28 * 28, 512),
            nn.ReLU(),
            nn.Linear(512, 256),
            nn.ReLU(),
        )
        self.fc_mu = nn.Linear(256, latent_dim)
        # Despite the name, fc_var predicts the *log*-variance:
        # reparameterize() recovers the standard deviation via exp(var / 2).
        self.fc_var = nn.Linear(256, latent_dim)
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim, 256),
            nn.ReLU(),
            nn.Linear(256, 512),
            nn.ReLU(),
            nn.Linear(512, 28 * 28),
            nn.Sigmoid(),
        )

    def encode(self, x):
        result = self.encoder(x)
        mu = self.fc_mu(result)
        var = self.fc_var(result)      # log-variance
        return mu, var

    def decode(self, z):
        result = self.decoder(z)
        return result

    def reparameterize(self, mu, var):
        std = torch.exp(var / 2)       # std = exp(log_var / 2)
        eps = torch.randn_like(std)    # eps ~ N(0, I)
        return mu + (eps * std)

    def forward(self, x):
        x = self.flatten(x)
        mu, var = self.encode(x)
        z = self.reparameterize(mu, var)
        out = self.decode(z)
        return out, mu, var
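The reparameterization trick is what makes the sampling step above differentiable. With var standing for the log-variance, reparameterize() implements

$$ z = \mu + \sigma \odot \epsilon, \qquad \sigma = \exp(\mathrm{var}/2), \qquad \epsilon \sim \mathcal{N}(0, I) $$

so gradients can flow through mu and var while all randomness is isolated in eps.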
[Step5] Create Model instance
model = VAE(10).to(DEVICE)
print(model)
VAE(
(flatten): Flatten(start_dim=1, end_dim=-1)
(encoder): Sequential(
(0): Linear(in_features=784, out_features=512, bias=True)
(1): ReLU()
(2): Linear(in_features=512, out_features=256, bias=True)
(3): ReLU()
)
(fc_mu): Linear(in_features=256, out_features=10, bias=True)
(fc_var): Linear(in_features=256, out_features=10, bias=True)
(decoder): Sequential(
(0): Linear(in_features=10, out_features=256, bias=True)
(1): ReLU()
(2): Linear(in_features=256, out_features=512, bias=True)
(3): ReLU()
(4): Linear(in_features=512, out_features=784, bias=True)
(5): Sigmoid()
)
)
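Before training, a throwaway forward pass (my addition) verifies the plumbing; with latent_dim=10 and a batch of 64, these are the expected shapes:

# Sanity-check the untrained model: feed one batch and inspect output shapes
X, _ = next(iter(train_loader))
with torch.no_grad():
    out, mu, var = model(X.to(DEVICE))
print(out.shape, mu.shape, var.shape)
# expected: torch.Size([64, 784]) torch.Size([64, 10]) torch.Size([64, 10])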
[Step6] Model compile
def loss_function(recon_x, x, mu, var):
    # Reconstruction term: pixel-wise binary cross-entropy, summed over the batch
    recon_loss = F.binary_cross_entropy(recon_x, x.view(-1, 28 * 28), reduction='sum')
    # KL term: closed form for KL(N(mu, sigma^2) || N(0, I)), with var = log sigma^2
    kl_loss = -0.5 * torch.sum(1 + var - mu.pow(2) - var.exp())
    return recon_loss + kl_loss

optimizer = torch.optim.Adam(model.parameters(), lr=LEARNING_RATE)
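The hand-written KL term can be cross-checked against torch.distributions; this verification sketch is my addition, not part of the original post:

# Verify the closed-form KL term against torch.distributions
from torch.distributions import Normal, kl_divergence

mu = torch.randn(4, 10)
var = torch.randn(4, 10)            # log-variance, as in the model
std = torch.exp(var / 2)

kl_manual = -0.5 * torch.sum(1 + var - mu.pow(2) - var.exp())
kl_library = kl_divergence(Normal(mu, std), Normal(0.0, 1.0)).sum()
print(torch.allclose(kl_manual, kl_library, atol=1e-5))   # True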
[Step7] Set train loop
def train(train_loader, model, loss_fn, optimizer):
    model.train()
    for batch, (X, y) in enumerate(train_loader):
        X, y = X.to(DEVICE), y.to(DEVICE)
        decoded, mu, var = model(X)
        # compute the loss
        loss = loss_fn(decoded, X, mu, var)
        # backpropagation
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
    # Visualize the last batch: originals (top row) vs. reconstructions (bottom row).
    # ToTensor() already scales pixels to [0, 1], so no further /255 rescaling is needed.
    origin_data = X[:5].view(-1, 28 * 28).to("cpu")
    decoded_data = decoded[:5].view(-1, 28 * 28).detach().to("cpu")
    f, axs = plt.subplots(2, 5, figsize=(5, 2))
    for i in range(5):
        img = np.reshape(origin_data.numpy()[i], (28, 28))
        axs[0][i].imshow(img, cmap='gray')
        axs[0][i].set_xticks(())
        axs[0][i].set_yticks(())
    for i in range(5):
        img = np.reshape(decoded_data.numpy()[i], (28, 28))
        axs[1][i].imshow(img, cmap='gray')
        axs[1][i].set_xticks(())
        axs[1][i].set_yticks(())
    plt.show()
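test_loader is built in Step 3 but never used afterwards; a minimal evaluation pass (my addition, not in the original post) could look like this:

# Minimal evaluation sketch over the otherwise-unused test_loader
def evaluate(test_loader, model, loss_fn):
    model.eval()
    total_loss = 0.0
    with torch.no_grad():
        for X, _ in test_loader:
            X = X.to(DEVICE)
            decoded, mu, var = model(X)
            total_loss += loss_fn(decoded, X, mu, var).item()
    # loss_function uses reduction='sum', so divide by the dataset size
    print(f"Test loss per sample: {total_loss / len(test_loader.dataset):.4f}")

# usage: evaluate(test_loader, model, loss_function)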
[Step8] Run Model
for i in range(EPOCH):
    print(f"Epoch {i+1} \n------------------------")
    train(train_loader, model, loss_function, optimizer)
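Since the stated goal is image *generation*, the trained decoder can also be sampled directly from the prior. This sketch is my addition; the latent dimension 10 matches the model instance from Step 5:

# Generate new digits by decoding samples from the prior z ~ N(0, I)
model.eval()
with torch.no_grad():
    z = torch.randn(5, 10).to(DEVICE)   # 5 latent vectors, latent_dim = 10
    samples = model.decode(z).to("cpu")

f, axs = plt.subplots(1, 5, figsize=(5, 1))
for i in range(5):
    axs[i].imshow(samples[i].view(28, 28).numpy(), cmap='gray')
    axs[i].set_xticks(())
    axs[i].set_yticks(())
plt.show()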
- References
https://github.com/lyeoni/pytorch-mnist-VAE/blob/master/pytorch-mnist-VAE.ipynb
https://subinium.github.io/VAE-AE/
https://github.com/AntixK/PyTorch-VAE/blob/a6896b944c918dd7030e7d795a8c13e5c6345ec7/models/vanilla_vae.py#L2
https://www.slideshare.net/NaverEngineering/ss-96581209
https://github.com/pytorch/examples/blob/main/vae/main.py