Abstract

Defect inspection is paramount within the closed-loop manufacturing system. However, existing datasets for defect inspection often lack the precision and semantic granularity required for practical applications. In this paper, we introduce the Defect Spectrum, a comprehensive benchmark that offers precise, semantically rich, and large-scale annotations for a wide range of industrial defects. Building on four key industrial benchmarks, our dataset refines existing annotations and introduces rich semantic details, distinguishing multiple defect types within a single image. Furthermore, we introduce Defect-Gen, a two-stage diffusion-based generator designed to create high-quality and diverse defective images, even when working with limited datasets. The synthetic images generated by Defect-Gen significantly enhance the efficacy of defect inspection models. Overall, the Defect Spectrum dataset demonstrates its potential in defect inspection research, offering a solid platform for testing and refining advanced models.

[Figure: overview]

Dataset Reannotation

Current industrial datasets often lack the granularity needed for detailed defect inspection, with many offering only binary masks or occasionally misclassifying defects. We introduce the Defect Spectrum to provide detailed, large-scale annotations for a wide range of industrial defects, enhancing the precision of defect inspection systems. Drawing from four industrial benchmarks, Defect Spectrum refines annotations, ensuring accurate representation of subtle defects and filling in missed ones. Our dataset uniquely offers rich semantic annotations, identifying multiple defect types in one image. Additionally, we include descriptive captions for each sample, paving the way for future Vision Language Model studies.
Annotation comparison on the MVTec dataset. The first row shows the defect images. Rows 2 and 3 show the original annotations and our improved annotations, respectively.


Annotation comparison on the VISION dataset. The first row shows the defect images. Rows 2 and 3 show the original annotations and our improved annotations, respectively.
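
To make the difference between binary masks and the multi-class annotations described above concrete, here is a minimal sketch. It assumes masks are stored as integer arrays with one class ID per pixel and 0 for background; the class names and IDs in `CLASS_NAMES` are hypothetical and are not taken from the dataset's actual label map.

```python
import numpy as np

# Hypothetical label map for illustration only; the real dataset may differ.
CLASS_NAMES = {0: "background", 1: "scratch", 2: "dent", 3: "stain"}


def to_binary_mask(semantic_mask: np.ndarray) -> np.ndarray:
    """Collapse a multi-class mask into defect (1) vs. no defect (0)."""
    return (semantic_mask > 0).astype(np.uint8)


def defect_pixel_counts(semantic_mask: np.ndarray) -> dict:
    """Count pixels for each defect type present, excluding background."""
    ids, counts = np.unique(semantic_mask, return_counts=True)
    return {CLASS_NAMES[int(i)]: int(c) for i, c in zip(ids, counts) if i != 0}


# A toy 4x6 mask containing two different defect types in one image.
mask = np.zeros((4, 6), dtype=np.uint8)
mask[0, 0:3] = 1    # a 3-pixel "scratch"
mask[2:4, 4:6] = 3  # a 4-pixel "stain"

binary = to_binary_mask(mask)       # loses the scratch/stain distinction
counts = defect_pixel_counts(mask)  # keeps it
print(counts)
```

The binary mask still localizes the defects, but only the semantic mask lets a model or an evaluator tell the two defect types apart.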

Defect-Click

Pixel-level mask annotation, especially under Defect Spectrum standards, is a demanding task. We introduce "Defect-Click," an interactive tool that automatically segments defects based on user clicks, leveraging pretrained knowledge of industrial defects. Defect-Click reduced annotation time by 60%, though the project still took 580 working hours. The tool significantly streamlines defect annotation in the industrial domain.
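
As a rough illustration of click-based segmentation, the sketch below grows a region outward from a clicked pixel. This is a classical region-growing stand-in, not Defect-Click itself, which relies on a pretrained model; the function name `segment_from_click` and the intensity tolerance `tol` are assumptions made for this example.

```python
from collections import deque

import numpy as np


def segment_from_click(image: np.ndarray, click: tuple, tol: int = 10) -> np.ndarray:
    """Grow a mask from the clicked pixel over 4-connected neighbours whose
    intensity is within `tol` of the clicked pixel's intensity."""
    height, width = image.shape
    seed_value = int(image[click])
    mask = np.zeros((height, width), dtype=bool)
    mask[click] = True
    queue = deque([click])
    while queue:
        r, c = queue.popleft()
        for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            inside = 0 <= nr < height and 0 <= nc < width
            if inside and not mask[nr, nc]:
                if abs(int(image[nr, nc]) - seed_value) <= tol:
                    mask[nr, nc] = True
                    queue.append((nr, nc))
    return mask


# Toy grayscale image: bright surface with a dark 2x3 "defect".
image = np.full((5, 5), 200, dtype=np.uint8)
image[1:3, 1:4] = 50

clicked = segment_from_click(image, click=(1, 2))
print(int(clicked.sum()))
```

A single click yields a full pixel mask that the annotator only needs to verify or correct, which is the kind of interaction that saves time over tracing each defect by hand.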

Defect-Gen

Many industrial datasets contain only a limited number of defective samples, even the most extensive ones such as VISION. To address this deficiency, we introduce "Defect-Gen," a two-stage diffusion-based generator. This generative model produces diverse, high-quality images even when trained on limited data.
The following figure demonstrates the performance boost achieved with our synthetic data. It illustrates how varying quantities of synthetic data affect the performance of DeepLabV3+ (a) and MiT-B0 (b). Notably, transformer-based models such as MiT-B0 benefit more from synthetic data than CNN-based models.
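
The kind of experiment described above, training with different amounts of synthetic data, can be set up along the following lines. This is a minimal sketch under stated assumptions: samples are represented by file paths, the helper `build_training_set` and the file names are hypothetical, and the quantities tried here are illustrative rather than those used in the paper.

```python
import random


def build_training_set(real_samples, synthetic_samples, num_synthetic, seed=0):
    """Return all real samples plus `num_synthetic` randomly chosen
    synthetic samples, shuffled reproducibly."""
    if num_synthetic > len(synthetic_samples):
        raise ValueError("not enough synthetic samples available")
    rng = random.Random(seed)
    chosen = rng.sample(synthetic_samples, num_synthetic)
    mixed = list(real_samples) + chosen
    rng.shuffle(mixed)
    return mixed


# Hypothetical file names standing in for real and generated images.
real = [f"real_{i}.png" for i in range(10)]
synthetic = [f"synth_{i}.png" for i in range(50)]

# One training set per synthetic-data quantity; each would be used to
# train and evaluate a segmentation model separately.
sizes = {n: len(build_training_set(real, synthetic, n)) for n in (0, 10, 25, 50)}
print(sizes)
```

Keeping the real samples fixed and varying only the synthetic count isolates the effect of the generated data on each model.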