本文摘要：这篇文章详细解读了基于自监督去噪算法Neighbor2Neighbor的代码实现，该算法利用UNet模型对任意尺寸图像进行去噪处理。首先介绍如何通过AugmentNoise类添加高斯噪声或泊松噪声，并在训练和验证阶段应用不同的噪声生成策略。接着加载预训练的UNet模型，并读取、归一化输入图像数据。为了适应网络输入，采用numpy.pad函数进行“reflect”填充，以保持图像边缘信息并确保卷积后尺寸不变。最终将添加噪声后的图像输入到UNet模型中预测出去噪结果，裁剪恢复至原始尺寸并保存输出图像。

转载文章

本篇文章为转载内容。原文链接：https://blog.csdn.net/qq_42948594/article/details/124712116。

该文由互联网用户投稿提供，文中观点代表作者本人意见，并不代表本站的立场。

作为信息平台，本站仅提供文章转载服务，并不拥有其所有权，也不对文章内容的真实性、准确性和合法性承担责任。

如发现本文存在侵权、违法、违规或事实不符的情况，请及时联系我们，我们将第一时间进行核实并删除相应内容。

前言

Neighbor2Neighbor属于自监督去噪中算法，通过训练后可以对任意尺寸的图像进行去噪，现在对去噪代码中如何实现任意尺寸图像去噪进行解读。

代码

先贴源码

import torch
from PIL import Image
from torchvision import transformsfrom arch_unet import UNet
import numpy as npdef get_generator():global operation_seed_counter  # 全局变量  在局部变量可以引用全局变量并修改operation_seed_counter += 1g_cuda_generator = torch.Generator(device="cuda")g_cuda_generator.manual_seed(operation_seed_counter)return g_cuda_generatorclass AugmentNoise(object):  # 添加噪声的类def __init__(self, style):print(style)if style.startswith('gauss'):self.params = [float(p) / 255.0 for p in style.replace('gauss', '').split('_')]if len(self.params) == 1:self.style = "gauss_fix"elif len(self.params) == 2:self.style = "gauss_range"elif style.startswith('poisson'):self.params = [float(p) for p in style.replace('poisson', '').split('_')]if len(self.params) == 1:self.style = "poisson_fix"elif len(self.params) == 2:self.style = "poisson_range"def add_train_noise(self, x):shape = x.shapeif self.style == "gauss_fix":std = self.params[0]std = std * torch.ones((shape[0], 1, 1, 1), device=x.device)noise = torch.cuda.FloatTensor(shape, device=x.device)torch.normal(mean=0.0,std=std,generator=get_generator(),out=noise)return x + noiseelif self.style == "gauss_range":min_std, max_std = self.paramsstd = torch.rand(size=(shape[0], 1, 1, 1),device=x.device) * (max_std - min_std) + min_stdnoise = torch.cuda.FloatTensor(shape, device=x.device)torch.normal(mean=0, std=std, generator=get_generator(), out=noise)return x + noiseelif self.style == "poisson_fix":lam = self.params[0]lam = lam * torch.ones((shape[0], 1, 1, 1), device=x.device)noised = torch.poisson(lam * x, generator=get_generator()) / lamreturn noisedelif self.style == "poisson_range":min_lam, max_lam = self.paramslam = torch.rand(size=(shape[0], 1, 1, 1),device=x.device) * (max_lam - min_lam) + min_lamnoised = torch.poisson(lam * x, generator=get_generator()) / lamreturn noiseddef add_valid_noise(self, x):shape = x.shapeif self.style == "gauss_fix":std = self.params[0]return np.array(x + np.random.normal(size=shape) * std,dtype=np.float32)elif self.style == "gauss_range":min_std, max_std = self.paramsstd = np.random.uniform(low=min_std, high=max_std, size=(1, 1, 1))return np.array(x + np.random.normal(size=shape) * std,dtype=np.float32)elif self.style == "poisson_fix":lam = self.params[0]return np.array(np.random.poisson(lam * x) / lam, dtype=np.float32)elif self.style == "poisson_range":min_lam, max_lam = self.paramslam = np.random.uniform(low=min_lam, high=max_lam, size=(1, 1, 1))return np.array(np.random.poisson(lam * x) / lam, dtype=np.float32)model_path = 'test_dir/unet_gauss25_b4e100r02/2022-03-02-22-24/epoch_model_040.pth'  # 导入训练的模型文件
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
net = UNet().to(device)
net.load_state_dict(torch.load(model_path, map_location=device))
net.eval()noise_adder = AugmentNoise(style='gauss25')img = Image.open('validation/Kodak/000014.jpg')im = np.array(img, dtype=np.float32) / 255.0
origin255 = im.copy()
origin255 = origin255.astype(np.uint8)noisy_im = noise_adder.add_valid_noise(im)
H = noisy_im.shape[0]
W = noisy_im.shape[1]
val_size = (max(H, W) + 31) // 32 * 32
noisy_im = np.pad(noisy_im,[[0, val_size - H], [0, val_size - W], [0, 0]],'reflect')transformer = transforms.Compose([transforms.ToTensor()])
noisy_im = transformer(noisy_im)
noisy_im = torch.unsqueeze(noisy_im, 0)
noisy_im = noisy_im.cuda()with torch.no_grad():prediction = net(noisy_im)prediction = prediction[:, :, :H, :W]
prediction = prediction.permute(0, 2, 3, 1)
prediction = prediction.cpu().data.clamp(0, 1).numpy()
prediction = prediction.squeeze()
pred255 = np.clip(prediction * 255.0 + 0.5, 0, 255).astype(np.uint8)Image.fromarray(pred255).convert('RGB').save('test1.png')

输入图像

请添加图片描述
尺寸大小为(408, 310)，PIL读入后进行归一化处理。

img = Image.open('validation/Kodak/00001.jpg')
print('img', img.size)  # img (408, 310)
im = np.array(img, dtype=np.float32) / 255.0
print('im', im.shape)  # im (310, 408, 3)

先对不规则图像进行填充，要求填充的尺寸是32的倍数，否则输入到网络中会报错。在训练的时候是随机裁剪256*256的切片的。

    b = torch.rand(1, 3, 255, 255).to('cuda')a = net(b)print(a.shape)

在这里插入图片描述
在卷积神经网络中，为了避免因为卷积运算导致输出图像缩小和图像边缘信息丢失，常常采用图像边缘填充技术，即在图像四周边缘填充0，使得卷积运算后图像大小不会缩小，同时也不会丢失边缘和角落的信息。在Python的numpy库中，常常采用numpy.pad()进行填充操作。

val_size = (max(H, W) + 31) // 32 * 32
noisy_im = np.pad(noisy_im,[[0, val_size - H], [0, val_size - W], [0, 0]],'reflect')

‘reflect’，表示对称填充。
在这里插入图片描述

上图转自 http://t.zoukankan.com/shuaishuaidefeizhu-p-14179038.html

>>> a = [1, 2, 3, 4, 5]
>>> np.pad(a, (2, 3), 'reflect')
array([3, 2, 1, 2, 3, 4, 5, 4, 3, 2])

个人感觉使用reflect操作，而不是之间的填充0是为了在边缘去噪的时候更平滑一些。镜像填充后的图如下：
请添加图片描述
输入网络后，得到预测结果。最后进行裁剪，得到去噪后的图像。

prediction = prediction[:, :, :H, :W]

请添加图片描述

本篇文章为转载内容。原文链接：https://blog.csdn.net/qq_42948594/article/details/124712116。

该文由互联网用户投稿提供，文中观点代表作者本人意见，并不代表本站的立场。

作为信息平台，本站仅提供文章转载服务，并不拥有其所有权，也不对文章内容的真实性、准确性和合法性承担责任。

如发现本文存在侵权、违法、违规或事实不符的情况，请及时联系我们，我们将第一时间进行核实并删除相应内容。

名词解释

作为当前文章的名词解释，仅对当前文章有效。

自监督去噪：自监督去噪是一种机器学习和深度学习领域的技术，它允许模型在没有标签数据（即没有对应的噪声-free图像）的情况下进行训练以去除图像中的噪声。通过构造特定任务或利用输入数据的内在结构，算法可以自我监督地学习如何从含有噪声的图像中提取并恢复干净的内容。在 Neighbor2Neighbor算法中，通过在网络训练过程中模拟添加和去除噪声的过程，实现对未知噪声分布图像的去噪能力。

UNet模型：UNet（U-Net）是一种用于图像分割的卷积神经网络架构，由Olaf Ronneberger等人于2015年提出。该模型因其独特的“U”形结构而得名，其特点是包含一个收缩路径（编码器）和一个扩展路径（解码器），两者之间通过跳层连接传递上下文信息。在本文所讨论的场景下，UNet被应用于自监督去噪任务，通过学习从含噪图像到原始无噪声图像的映射关系，从而实现任意尺寸图像的高效去噪处理。

高斯噪声与泊松噪声：高斯噪声来源于高斯分布，是自然界中最常见的一种噪声类型，具有均值为0、方差固定的特性，在图像处理中表现为像素值随机变化，每个像素的噪声独立且符合正态分布规律。而泊松噪声则源于泊松分布，常出现在成像系统中，如光学或放射学领域，其特性是像素值的随机变化率与当前像素强度成正比。在图像去噪的背景下，AugmentNoise类根据用户指定的参数分别生成不同类型的高斯噪声或泊松噪声，以模拟真实情况下的噪声干扰，并通过训练后的UNet模型去除这些噪声，恢复图像原本清晰的内容。