Optimizing 2D Bridge Engineering Drawing Digitization: A Comparative Study of Text Recognition Tools and Development of Lightweight Post-Recognition Structured Information Extraction Methods

Mengyan Peng; Han Qian; Steffen Marx; Chongjie Kang

doi:10.1016/j.rineng.2026.110186

Optimizing 2D Bridge Engineering Drawing Digitization: A Comparative Study of Text Recognition Tools and Development of Lightweight Post-Recognition Structured Information Extraction Methods

Research output: Contribution to journal › Research article › Contributed › peer-review

Contributors

Mengyan Peng - , DB InfraGO AG - Endowed Chair of Civil Engineering, Chair of Concrete Structures (Author)
Han Qian - , Chair of Concrete Structures (Author)
Steffen Marx - , Chair of Concrete Structures (Author)
Chongjie Kang - , DB InfraGO AG - Endowed Chair of Civil Engineering (Author)

Abstract

This paper investigates the application of text recognition technologies, spanning conventional Optical Character Recognition (OCR) and vision-language models (VLMs) in civil engineering, with a focus on the structured information extraction of textual information in two-dimensional (2D) bridge engineering drawings. Pre-trained OCR tools are evaluated for recognizing printed and handwritten text fragments extracted from bridge engineering drawings, and four VLMs are assessed as OCR-like recognizers. The evaluation employs domain-specific test datasets under controlled degradations (motion blur, partial occlusion, salt-and-pepper noise) and rotations (±45°, ±90°), and adopts four metrics: Levenshtein-based similarity, Jaccard similarity, Term Frequency–Inverse Document Frequency (TF–IDF) similarity, and WordNet-based similarity. The results demonstrate that while all tools perform well on printed text, VLMs achieve the strongest overall robustness across perturbations, and Handprint consistently achieves the highest accuracy among OCR tools, particularly for handwritten text and under rotation. To further enhance the interpretability and usability of raw recognition outputs, two lightweight, rule-based post-recognition structured information extraction methods are developed: a region-based method for extracting fixed-layout information such as title blocks, and a keyword-driven method for identifying domain-specific annotations like material specifications. These methods require no prior segmentation or additional trainable models and are integrated into a graphical user interface, TextBridge. The proposed workflow offers an effective solution for the semi-automated extraction of structured textual content from engineering drawings, with the potential to facilitate the digitization and integration of legacy infrastructure data into modern applications such as Building Information Modeling (BIM) and digital twin systems.

Details

Original language	English
Article number	110186
Journal	Results in Engineering
Volume	30
Early online date	20 Mar 2026
Publication status	E-pub ahead of print - 20 Mar 2026
Peer-reviewed	Yes

External IDs

ORCID	/0000-0002-3578-3098/work/209581762
ORCID	/0000-0001-8735-1345/work/209582896
Scopus	105035014911

Keywords

ASJC Scopus subject areas

General Engineering

Keywords

Lightweight post-recognition structured information extraction, Graphical user interface, Bridge engineering drawings, Optical character recognition tools, Vision–language models

Research Portal of the TU Dresden

Contributors

Abstract

Details

External IDs

Keywords

ASJC Scopus subject areas

Keywords