Evaluating Tabular Data Generation Techniques on the DaFne Platform: Insights from a Predictive Maintenance Case Study on Bridges

Research output: Contribution to book/Conference proceedings/Anthology/ReportConference contributionContributedpeer-review

Contributors

Abstract

In the realm of artificial intelligence (AI) and machine learning (ML), the scarcity of robust and diverse datasets often poses a significant challenge, prompting the need for effective data generation methods. This paper presents an evaluation of tabular data generation techniques on the DaFne platform, centered around a predictive maintenance case study for bridges. The DaFne platform offers a variety of tabular data generation functionalities, including rule-based creation, data fusion (with weather data), and data reproduction. We investigate the utility of these functionalities across different machine learning models for the prediction of bridge conditions. Our analysis includes a descriptive statistical comparison of real and synthetic data. Additionally, we explore the utility of original, weather, and synthetic datasets. We do this through the lens of ML models like MLR, XGBoost, CNN, and GRU, performing a predictive maintenance algorithm on these datasets. Our results indicate that while the inclusion of weather data did not significantly enhance predictive performance, the synthetic dataset shows satisfactory quality. However, the synthetic data’s performance is lower than the original data in predictive maintenance tasks, with differences observed in models heavily reliant on sequential data. This research underscores the potential of the DaFne platform in generating high-quality synthetic data. It also highlights areas for future improvement and offers valuable insights for advancing data generation and analysis techniques in predictive maintenance and other AI applications.

Details

Original languageEnglish
Title of host publicationProceedings of 9th International Congress on Information and Communication Technology
EditorsXin-She Yang, Simon Sherratt, Nilanjan Dey, Amit Joshi
PublisherSpringer Science and Business Media B.V.
Pages611-628
Number of pages18
ISBN (electronic)978-981-97-3289-0
ISBN (print)978-981-97-3288-3
Publication statusE-pub ahead of print - 2 Aug 2024
Peer-reviewedYes

Publication series

SeriesLecture Notes in Networks and Systems
Volume1000 LNNS
ISSN2367-3370

Conference

Title9th International Congress on Information and Communication Technology
Abbreviated titleICICT 2024
Conference number9
Duration19 - 22 February 2024
LocationAmerica Square Conference Centre & Online
CityLondon
CountryUnited Kingdom

External IDs

ORCID /0000-0002-9732-9405/work/192043498

Keywords

Keywords

  • Bridge maintenance, Evaluation, Machine learning, Prediction, Synthetic data, Tabular data generation