DynamicMix:一种动态的像素级混合的图像数据增强方法

DYNAMICMIX: A DYNAMIC PIXEL-LEVEL MIXING METHOD FOR IMAGE DATA AUGMENTATION

  • 摘要: 近年来多个图像进行混合的数据增强方法取得不错的效果。然而,完全接受补丁图像像素区域的方法在一些情况下可能会生成一些质量较低的新样本,需要进一步改进。针对这些问题,提出一种动态的像素级混合算法DynamicMix。提出局部像素混合的策略,选择合适的原图裁剪区域保留图像的部分像素值,从而实现局部像素级混合。为了降低裁剪面积大小对新生成样本的影响,又提出像素级动态混合方法,将裁剪区域图像块与混合比例进行关联,使得原图裁剪区域的像素保留比例随裁剪区域面积的大小而动态改变。通过该方法可以避免在裁剪面积较大的时候,原图特征显著区域丢失过多而导致标签值与对应内容差别过大。在4个数据集上的实验表明:提出的数据增强方法可以让训练出的模型拥有更好的分类性能和鲁棒性。将该方法应用于CIFAR-100和Mini-ImageNet数据集中,使用ResNet-34网络情况下比CutMix方法的Top-1准确率分别提升了1.00百分点和1.14百分点。

     

    Abstract: In recent years, image data augmentation methods based on multi-image mixing have achieved promising results. However, approaches that fully adopt patched pixel regions from source images may generate low-quality synthetic samples in certain scenarios, requiring further improvement. To address these issues, we propose DynamicMix, a dynamic pixel-level mixing algorithm. A local pixel mixing strategy was introduced to select appropriate cropping regions from source images while preserving partial pixel values, achieving localized pixel-level blending. To mitigate the impact of varying cropping sizes on synthetic samples, a dynamic pixel-level mixing mechanism was proposed by associating the cropped image patches with adaptive mixing ratios, which ensured that the preserved pixel ratio from the source image dynamically adjusted according to the size of the cropped region. This approach prevented scenarios where excessive loss of critical features from source images-due to large cropping areas-lead to significant discrepancies between label assignments and actual content. Experiments on four datasets demonstrate that the proposed data augmentation method enhances both classification performance and model robustness. Notably, when applied to CIFAR-100 and Mini-ImageNet datasets with ResNet-34, the method achieves Top-1 accuracy improvements of 1.00 and 1.14 percentage points, respectively, compared with the CutMix baseline.

     

/

返回文章
返回