Optimizing 2D Bridge Engineering Drawing Digitization: A Comparative Study of Text Recognition Tools and Development of Lightweight Post-Recognition Structured Information Extraction Methods
Research output: Contribution to journal › Research article › Contributed › peer-review
Contributors
Abstract
This paper investigates the application of text recognition technologies, spanning conventional Optical Character Recognition (OCR) and vision-language models (VLMs) in civil engineering, with a focus on the structured information extraction of textual information in two-dimensional (2D) bridge engineering drawings. Pre-trained OCR tools are evaluated for recognizing printed and handwritten text fragments extracted from bridge engineering drawings, and four VLMs are assessed as OCR-like recognizers. The evaluation employs domain-specific test datasets under controlled degradations (motion blur, partial occlusion, salt-and-pepper noise) and rotations (±45°, ±90°), and adopts four metrics: Levenshtein-based similarity, Jaccard similarity, Term Frequency–Inverse Document Frequency (TF–IDF) similarity, and WordNet-based similarity. The results demonstrate that while all tools perform well on printed text, VLMs achieve the strongest overall robustness across perturbations, and Handprint consistently achieves the highest accuracy among OCR tools, particularly for handwritten text and under rotation. To further enhance the interpretability and usability of raw recognition outputs, two lightweight, rule-based post-recognition structured information extraction methods are developed: a region-based method for extracting fixed-layout information such as title blocks, and a keyword-driven method for identifying domain-specific annotations like material specifications. These methods require no prior segmentation or additional trainable models and are integrated into a graphical user interface, TextBridge. The proposed workflow offers an effective solution for the semi-automated extraction of structured textual content from engineering drawings, with the potential to facilitate the digitization and integration of legacy infrastructure data into modern applications such as Building Information Modeling (BIM) and digital twin systems.
Details
| Original language | English |
|---|---|
| Article number | 110186 |
| Journal | Results in Engineering |
| Volume | 30 |
| Early online date | 20 Mar 2026 |
| Publication status | E-pub ahead of print - 20 Mar 2026 |
| Peer-reviewed | Yes |
External IDs
| ORCID | /0000-0002-3578-3098/work/209581762 |
|---|---|
| ORCID | /0000-0001-8735-1345/work/209582896 |
| Scopus | 105035014911 |
Keywords
ASJC Scopus subject areas
Keywords
- Lightweight post-recognition structured information extraction, Graphical user interface, Bridge engineering drawings, Optical character recognition tools, Vision–language models