修复图像添加高斯噪声的代码bug和可读性 #254

zhedahe · 2025-09-01T11:47:23Z

why:
原仓库训练代码里没有使用添加高斯噪声的数据增强方式，而是椒盐噪声；若使用添加高斯噪声去数据增强，则训练代码会报错

tofix:
image_proc.py 中的random_gaussian函数里
1）第171行img = img.reshape([width, height]) 应该加入图像的通道数判断，然后修改为：img = img.reshape([height, width, channels])
2) 虽然random_gaussian函数里，img 的尺寸为1024x1024，width 和 height 相同；但是代码里的 width, height = img.shape 表述有歧义，如果config.py 里的size设置了宽和高不一样的尺寸，容易导致bug，应该写为 height, width = img.shape[:2]

test code:

-- coding: utf-8 --

import cv2
import random
import numpy as np
from PIL import Image

def random_gaussian(image, mean=0.1, sigma=0.35):
def gaussianNoisy(im, mean=mean, sigma=sigma):
for _i in range(len(im)):
im[_i] += random.gauss(mean, sigma)
return im

img = np.asarray(image)
height, width = img.shape[:2]
channels = 1 if img.ndim == 2 else img.shape[2]
img = gaussianNoisy(img[:].flatten(), mean, sigma)
img = img.reshape([height, width, channels])
return Image.fromarray(np.uint8(img))

if name == 'main':
image = cv2.imread('./images/test.jpg')
size = (1024, 1024)
image = cv2.resize(image, size, interpolation=cv2.INTER_LINEAR)
image = Image.fromarray(cv2.cvtColor(image, cv2.COLOR_BGR2RGB)).convert('RGB')
gaussian_image = random_gaussian(image)
gaussian_image.save("./images/test_gaussian.png")

何海明 added 2 commits September 1, 2025 19:18

修复图像添加高斯噪声的代码bug和可读性

607b89a

修复图像增强模块里，添加高斯噪声的代码bug和常规顺序

e265606

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

修复图像添加高斯噪声的代码bug和可读性 #254

修复图像添加高斯噪声的代码bug和可读性 #254

Uh oh!

zhedahe commented Sep 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

修复 图像添加高斯噪声的代码bug和可读性 #254

Are you sure you want to change the base?

修复 图像添加高斯噪声的代码bug和可读性 #254

Uh oh!

Conversation

zhedahe commented Sep 1, 2025

-- coding: utf-8 --

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

修复图像添加高斯噪声的代码bug和可读性 #254

修复图像添加高斯噪声的代码bug和可读性 #254