Python中的自動編碼器是什麼？-Python教學-PHP中文網

Python中的自動編碼器是什麼？

王林

發布： 2023-06-03 20:10:38

原創

2384 人瀏覽過

自動編碼器是一種無監督學習演算法，能夠學習到輸入資料的特徵表達，其在深度學習中已廣泛應用。本篇將介紹Python中的自動編碼器。

一、自動編碼器簡介

自動編碼器（Autoencoder）是一種神經網絡，它包含一個編碼器和一個解碼器。編碼器將輸入資料（如圖像、文字）壓縮為一個小的向量，解碼器根據這個向量重建原始輸入資料。透過這樣壓縮-重構的過程，自動編碼器可以學習輸入資料的低維表示，即特徵表達。

自動編碼器的訓練過程是無監督的，不需要標註資料。其原理是最小化輸入與輸出之間的重構誤差，使編碼器和解碼器共同學習輸入資料的特徵表示。自動編碼器的結構可以多樣化，如一般自動編碼器、卷積自動編碼器、循環自動編碼器等。

二、Python實作自動編碼器

Python中實作自動編碼器通常使用深度學習框架，如TensorFlow、Keras、PyTorch等。以下是一個基本的自動編碼器範例，使用Keras實作：

from keras.layers import Input, Dense
from keras.models import Model

# 定义编码器
input_img = Input(shape=(784,))
encoded = Dense(128, activation='relu')(input_img)
encoded = Dense(64, activation='relu')(encoded)
encoded_output = Dense(32, activation='relu')(encoded)

# 定义解码器
decoded = Dense(64, activation='relu')(encoded_output)
decoded = Dense(128, activation='relu')(decoded)
decoded_output = Dense(784, activation='sigmoid')(decoded)

# 定义自动编码器模型
autoencoder = Model(inputs=input_img, outputs=decoded_output)

# 编译模型
autoencoder.compile(optimizer='adam', loss='binary_crossentropy')

# 加载数据
from keras.datasets import mnist
import numpy as np

(x_train, _), (x_test, _) = mnist.load_data()
x_train = x_train.astype('float32') / 255.
x_test = x_test.astype('float32') / 255.
x_train = x_train.reshape((len(x_train), np.prod(x_train.shape[1:])))
x_test = x_test.reshape((len(x_test), np.prod(x_test.shape[1:])))

# 训练模型
autoencoder.fit(x_train, x_train,
                epochs=50,
                batch_size=256,
                shuffle=True,
                validation_data=(x_test, x_test))

登入後複製

在此範例中，使用Dense層定義編碼器和解碼器，啟動函數為relu和sigmoid。以MNIST手寫數字資料集為例，訓練模型50個epochs。透過訓練建立的模型，可以透過編碼器獲得資料的低維特徵表示。

三、自動編碼器的應用

自動編碼器廣泛應用於特徵學習、資料降維、影像壓縮等領域。以下是自動編碼器在影像壓縮中的應用範例：

# 压缩图像
encoded_imgs = encoder.predict(x_test)
# 解压缩图像
decoded_imgs = decoder.predict(encoded_imgs)

# 可视化图像
import matplotlib.pyplot as plt
n = 10  # 选择要可视化的图像数量
plt.figure(figsize=(20, 4))
for i in range(n):
    # 原始图像
    ax = plt.subplot(2, n, i + 1)
    plt.imshow(x_test[i].reshape(28, 28))
    plt.gray()
    ax.get_xaxis().set_visible(False)
    ax.get_yaxis().set_visible(False)

    # 压缩后的图像
    ax = plt.subplot(2, n, i + 1 + n)
    plt.imshow(decoded_imgs[i].reshape(28, 28))
    plt.gray()
    ax.get_xaxis().set_visible(False)
    ax.get_yaxis().set_visible(False)
plt.show()

登入後複製

以上範例中，使用訓練好的自動編碼器對MNIST手寫數位資料集進行影像壓縮，在壓縮和解壓縮過程中去除了噪聲，並且透過視覺化，我們可以看到壓縮後的圖像還原得相當不錯。

四、結語

自動編碼器是深度學習中非常基礎的模型之一，是了解深度學習不可或缺的一步。 Python中實作自動編碼器非常方便，只需要選擇合適的深度學習框架即可，例如Keras、PyTorch等。透過自動編碼器，我們可以學習輸入資料的重要特徵，實現影像壓縮、特徵學習等應用。

以上是Python中的自動編碼器是什麼？的詳細內容。更多資訊請關注PHP中文網其他相關文章！