Managing large datasets with iRODS- A performance analysis

Research output: Contribution to book/conference proceedings/anthology/reportConference contributionContributedpeer-review

Abstract

The integrated Rule Orientated Data System (iRODS) is a Grid data management system that organizes distributed data and their metadata. A Rule Engine allows a flexible definition of data storage, data access and data processing. This paper presents scenarios implemented in a benchmark tool to measure the performance of an iRODS environment as well as results of measurements with large datasets. The scenarios concentrate on data transfers, metadata transfers and stress tests. The user has the possibility to influence the scenarios to adapt them to his own use case. The results show the possibility to find bottlenecks and potential to optimize the settings of an iRODS environment.

Details

Original languageEnglish
Title of host publicationProceedings of the International Multiconference on Computer Science and Information Technology, IMCSIT 2010
PublisherIEEE Computer Society, Washington
Pages647 - 654
Number of pages8
Volume5
ISBN (print)978-1-4244-6432-6
Publication statusPublished - 2010
Peer-reviewedYes

External IDs

Scopus 79551558487

Keywords

Keywords

  • data access, data processing, data storage, data transfer, database management systems, distributed data, distributed databases, electronic data interchange, grid data management system, Grid-Computing, integrated rule orientated data system, large dataset management, metadata transfer, performance analysis, rule engine, stress test