Synthetic data generation in manufacturing: a review of methods, domains, and emerging trends

Research output: Contribution to journalConference articleResearchpeer-review

9 Downloads (Orbit)

Abstract

Data scarcity remains a major barrier to the effective deployment of AI in manufacturing, where labeled data is often limited, costly, or difficult to obtain. This review investigates how synthetic data generation techniques are being applied to address this challenge in manufacturing AI applications. Eighteen recent papers (Jan 2024- May 2025) were analyzed and categorized based on generation methods, application domains,
and data modalities. Techniques covered include GAN (Generative Adversarial Networks), VAEs (Variational Autoencoders), diffusion models, simulation-based approaches, SMOTE (Synthetic Minority Oversampling Technique), and hybrid combinations. Their use spans tasks such as defect detection, predictive maintenance, process modeling, material design, and human–robot collaboration. The review highlights emerging
trends, methodological trade-offs, and practical challenges shaping the future of synthetic data in intelligent manufacturing systems. In addition to consolidating recent work, the review identifies underexplored research gaps and methodological challenges that shape future directions in synthetic data use for manufacturing AI.
Original languageEnglish
JournalProcedia CIRP
Volume139
Pages (from-to)440-445
ISSN2212-8271
DOIs
Publication statusAccepted/In press - 2026
Event13th CIRP Global Web Conference - Hong Kong, China
Duration: 16 Oct 202517 Oct 2025

Conference

Conference13th CIRP Global Web Conference
Country/TerritoryChina
CityHong Kong
Period16/10/202517/10/2025

Keywords

  • Synthetic data
  • Manufacturing AI
  • Generative models
  • Data augmentation

Fingerprint

Dive into the research topics of 'Synthetic data generation in manufacturing: a review of methods, domains, and emerging trends'. Together they form a unique fingerprint.

Cite this