Deep Learning-Enhanced Industrial Visual Inspection System with Multi-Agent Collaboration for Manufacturing Quality Assurance
PDF

Keywords

Deep Learning
Visual Inspection
Defect Detection
Industrial AI

Abstract

Quality assurance is one of the most critical stages in modern manufacturing, directly affecting product reliability and safety. Traditional visual inspection relies heavily on human inspectors, which is labor-intensive, subjective, and difficult to scale in high-throughput production lines. Recent advances in deep learning and multi-agent systems offer new possibilities for automating and enhancing visual inspection accuracy. This study proposes a Multi-Agent Deep Learning Visual Inspection System (MADL-VIS) for industrial manufacturing quality control. The system employs a two-stage detection pipeline: a deep learning-based defect detection module that identifies surface defects from visual inputs, and a multi-agent collaboration module that performs defect classification, root cause analysis, and inspection report generation. Specifically, the detection module leverages a convolutional neural network architecture with attention mechanisms to extract fine-grained defect features from product surface images. The multi-agent module decomposes the post-detection workflow into specialized tasks—defect categorization, severity scoring, cause inference, and documentation—each handled by a dedicated LLM-powered agent. Experiments conducted on three publicly available industrial inspection datasets demonstrate that the proposed system achieves an average defect detection accuracy of 91.3% and a classification accuracy of 88.7%. The multi-agent post-processing module reduces the average inspection cycle time by 38% compared with manual analysis, while the root cause inference agent achieves a consistency rate of 79% with domain expert assessments. This study validates the effectiveness of integrating deep learning-based visual detection with multi-agent collaborative analysis for automated industrial quality assurance.

PDF

References

1. Zhu, Y., Liu, Q. Hybrid graph attention network-LSTM models for causal-aware supply chain forecasting. J Intell Manuf (2026). https://doi.org/10.1007/s10845-025-02782-3

2. Zhu, Y., & Liu, Q. (2025). Toward transparent groundwater contamination risk forecasting: Integrating causal discovery and Bayesian graph neural networks. Science of the Total Environment, 998, 180233. https://doi.org/10.1016/j.scitotenv.2025.180233

3. Zhu, Y. Application of a QPSO-optimized CNN-LSTM model in water quality prediction. Discov Water 4, 100 (2024). https://doi.org/10.1007/s43832-024-00161-2

4. Zhou, K., Zhong, L., Liu, J. et al. Unveiling the Role of Western Pacific Subtropical High in Urban Heat Islands Using Local Climate Zones Coupled WRF-BEP/BEM. Earth Syst Environ 10, 363–390 (2026). https://doi.org/10.1007/s41748-025-00589-z

5. Zhao, Y., Zhong, L., Zhou, K., Liu, B., & Shu, W. (2024). Responses of the urban atmospheric thermal environment to two distinct heat waves and their changes with future urban expansion in a Chinese megacity. Geophysical Research Letters, 51(11), Article e2024GL109018. https://doi.org/10.1029/2024GL109018

6. Huang, H., Tang, J., Liu, T., & Huang, M. (2026). Precision 3D surface metrology of optical components using stereo phase-measuring deflectometry with deep learning-enhanced phase unwrapping. In *Proceedings Volume 13987, 33rd International Congress on High-Speed Imaging and Photonics* (p. 1398704). SPIE. https://doi.org/10.1117/12.3093993

7. Huang, H., Yang, Y., & Zhu, Y. (2023). Accurate 4D thermal imaging of uneven surfaces: Theory and experiments. *International Journal of Heat and Mass Transfer*, 216, 124580. https://doi.org/10.1016/j.ijheatmasstransfer.2023.124580

8. Wang, S., Yu, Y., Feldt, R., & Parthasarathy, D. (2025). Automating a complete software test process using LLMs: An automotive case study. 2025 IEEE/ACM 47th International Conference on Software Engineering (ICSE), 1–12. https://doi.org/10.1109/ICSE55347.2025.00211