Recognizing and Integrating Legacy Assembly Diagrams into Industry 4.0
Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review
Contributors
Abstract
Legacy assembly diagrams, often provided as machine-unreadable images, hinder integration into the Industry 4.0 (I4.0) ecosystem. We propose a multi-step pipeline leveraging Large Language Models as a single simple-to-operate tool to recognize parts, relationships, and global identifiers, segment parts in images, and structure the data into Asset Administration Shell as I4.0 digital twins. Targeted prompts are designed for each step and evaluated on 15 diverse real-world diagrams. Results show that while the LLM reliably recognizes parts and link identifiers, there are still some open challenges with relationship extraction and semantic segmentation. Despite these limitations, LLMs provide a viable tool for semi-automated digitalization. Our end-to-end pipeline thus enables seamless integration of legacy diagrams into I4.0 systems.
Details
| Original language | English |
|---|---|
| Title of host publication | IECON 2025 – 51st Annual Conference of the IEEE Industrial Electronics Society |
| Publisher | IEEE Industrial Electronics Society |
| Pages | 1-6 |
| Number of pages | 6 |
| ISBN (electronic) | 9798331596811 |
| ISBN (print) | 979-8-3315-9682-8 |
| Publication status | Published - 17 Oct 2025 |
| Peer-reviewed | Yes |
Conference
| Title | 51st Annual Conference of the IEEE Industrial Electronics Society |
|---|---|
| Abbreviated title | IECON 2025 |
| Conference number | 51 |
| Duration | 14 - 17 October 2025 |
| Website | |
| Location | Hotel Meliá Castilla |
| City | Madrid |
| Country | Spain |
External IDs
| ORCID | /0000-0002-4646-4455/work/199961442 |
|---|---|
| ORCID | /0009-0000-2432-5529/work/199961653 |
Keywords
Keywords
- Assembly, Explosions, Fourth Industrial Revolution, Image recognition, Industrial electronics, Large language models, Periodic structures, Pipelines, Reliability, Semantic segmentation