超強力なピトーチ作戦！！-AI-php.cn

超強力なピトーチ作戦！！

PHPz

リリース： 2024-01-06 21:02:14

転載

1149 人が閲覧しました

こんにちは、暁荘！はじめまして！何かお手伝いできることはありますか？

過去数日間、ディープラーニングに関するコンテンツをいくつか共有してきました。

さらに、Pytorch には numpy や pandas に似た一般的なデータ処理関数がいくつかあり、これらも同様に重要で興味深いものです。

同様に、PyTorch もデータ処理と変換のための多くの関数を提供します。

次に、最も重要で必要な機能を見てみましょう。

一个超强 Pytorch 操作！！

torch.Tensor

PyTorch では、torch.Tensor はテンソルを表すために使用される基本的なデータ構造です。テンソルは、数値やブール値などのさまざまなタイプのデータを含めることができる多次元配列です。 torch.Tensor のコンストラクターを使用してテンソルを作成することも、他の関数を使用してテンソルを作成することもできます。

import torch# 创建一个空的张量empty_tensor = torch.Tensor()# 从列表创建张量data = [1, 2, 3, 4]tensor_from_list = torch.Tensor(data)

ログイン後にコピー

torch.from_numpy

は、NumPy 配列を PyTorch テンソルに変換するために使用されます。

import numpy as npnumpy_array = np.array([1, 2, 3, 4])torch_tensor = torch.from_numpy(numpy_array)

ログイン後にコピー

torch.Tensor.item

要素を 1 つだけ含むテンソルから Python 値を抽出するために使用されます。スカラーテンソルに適用されます。

scalar_tensor = torch.tensor(5)scalar_value = scalar_tensor.item()

ログイン後にコピー

torch.Tensor.view

は、テンソルの形状を変更するために使用されます。

original_tensor = torch.randn(2, 3)# 2x3的随机张量reshaped_tensor = original_tensor.view(3, 2)# 将形状改变为3x2

ログイン後にコピー

torch.Tensor.to

は、テンソルを指定したデバイス (CPU や GPU など) に変換するために使用されます。

cpu_tensor = torch.randn(3)gpu_tensor = cpu_tensor.to("cuda")# 将张量移动到GPU

ログイン後にコピー

torch.Tensor.numpy

テンソルを NumPy 配列に変換します。

pytorch_tensor = torch.tensor([1, 2, 3])numpy_array = pytorch_tensor.numpy()

ログイン後にコピー

torch.nn.function.one_hot

整数テンソルのワンホットエンコーディングに使用されます。

import torch.nn.functional as Finteger_tensor = torch.tensor([0, 2, 1])one_hot_encoded = F.one_hot(integer_tensor)

ログイン後にコピー

torch.utils.data.Dataset および torch.utils.data.DataLoader

は、データセットのロードと処理に使用されます。これら 2 つのクラスは通常、カスタムデータセットクラスと一緒に使用されます。

from torch.utils.data import Dataset, DataLoaderclass CustomDataset(Dataset):def __init__(self, data):self.data = datadef __len__(self):return len(self.data)def __getitem__(self, index):return self.data[index]dataset = CustomDataset([1, 2, 3, 4, 5])dataloader = DataLoader(dataset, batch_size=2, shuffle=True)

ログイン後にコピー

上記は、単純に使用される PyTorch の重要なデータ変換関数の一部です。

これらは、深層学習タスクでのデータの処理と準備に非常に役立ちます。

ケース

次に、画像セグメンテーションのケースを作成します。

この場合、事前トレーニングされた DeepLabV3 モデルと PASCAL VOC データセットを使用して、画像セグメンテーションに PyTorch と torchvision ライブラリを使用します。

コード全体には、上記で学習した内容、サイズ変更、トリミング、標準化などが含まれます。

import torchimport torchvision.transforms as transformsfrom torchvision import modelsfrom PIL import Imageimport matplotlib.pyplot as plt# 下载示例图像!wget -O example_image.jpg https://pytorch.org/assets/deeplab/deeplab1.jpg# 定义图像转换transform = transforms.Compose([transforms.Resize((256, 256)),# 调整大小transforms.ToTensor(), # 转换为张量transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225])# 标准化])# 加载并转换图像image_path = 'example_image.jpg'image = Image.open(image_path).convert("RGB")input_tensor = transform(image).unsqueeze(0)# 添加批次维度# 加载预训练的DeepLabV3模型model = models.segmentation.deeplabv3_resnet101(pretrained=True)model.eval()# 进行图像分割with torch.no_grad():output = model(input_tensor)['out'][0]output_predictions = output.argmax(0)# 将预测结果转换为彩色图像def decode_segmap(image, nc=21):label_colors = np.array([(0, 0, 0),# 0: 背景 (128, 0, 0), (0, 128, 0), (128, 128, 0), (0, 0, 128), (128, 0, 128),# 1-5: 物体 (0, 128, 128), (128, 128, 128), (64, 0, 0), (192, 0, 0),# 6-9: 道路 (64, 128, 0), (192, 128, 0), (64, 0, 128), (192, 0, 128),# 10-13: 面部 (64, 128, 128), (192, 128, 128), (0, 64, 0), (128, 64, 0),# 14-17: 植物 (0, 192, 0), (128, 192, 0), (0, 64, 128)])# 18-20: 建筑r = np.zeros_like(image).astype(np.uint8)g = np.zeros_like(image).astype(np.uint8)b = np.zeros_like(image).astype(np.uint8)for l in range(0, nc):idx = image == lr[idx] = label_colors[l, 0]g[idx] = label_colors[l, 1]b[idx] = label_colors[l, 2]rgb = np.stack([r, g, b], axis=2)return rgb# 将预测结果转换为彩色图像output_rgb = decode_segmap(output_predictions.numpy())# 可视化原始图像和分割结果plt.figure(figsize=(12, 6))plt.subplot(1, 2, 1)plt.imshow(image)plt.title('Original Image')plt.subplot(1, 2, 2)plt.imshow(output_rgb)plt.title('Segmentation Result')plt.show()

ログイン後にコピー

この場合、最初に、サイズ変更、テンソルへの変換、正規化などの一連の画像変換関数を定義します。これらの変換により、入力画像がモデルのニーズを満たすことが保証されます。

次に、サンプル画像がロードされ、これらの変換が適用されました。

次に、torchvision で事前トレーニングされた DeepLabV3 モデルを画像セグメンテーションに使用しました。出力では、予測結果の最大インデックスを抽出して、各ピクセルの予測クラスを取得しました。

最後に、予測結果をカラー画像に変換し、元の画像とセグメンテーション結果を視覚化します。

一个超强 Pytorch 操作！！