Assessing Anonymized System Logs Usefulness for Behavioral Analysis in RNN Models
Research output: Contribution to journal › Conference article › Contributed › peer-review
Contributors
Abstract
System logs are a common source of monitoring data for analyzing computing systems' behavior. Due to the complexity of modern computing systems and the large size of collected monitoring data, automated analysis mechanisms are required. Numerous machine learning and deep learning methods are proposed to address this challenge. However, due to the existence of sensitive data in system logs their analysis and storage raise serious privacy concerns. Anonymization methods could be used to clean the monitoring data before analysis. However, anonymized system logs, in general, do not provide adequate usefulness for the majority of behavioral analysis. Content-aware anonymization mechanisms such as PaRS preserve the correlation of system logs even after anonymization. This work evaluates the usefulness of anonymized system logs taken from the Taurus HPC cluster anonymized using PaRS, for behavioral analysis via recurrent neural network models.
Details
Original language | English |
---|---|
Number of pages | 12 |
Journal | CEUR Workshop Proceedings |
Volume | 3376 |
Publication status | Published - 2 Dec 2022 |
Peer-reviewed | Yes |
Workshop
Title | 2022 International Workshop on Data-Driven Resilience Research |
---|---|
Abbreviated title | D2R2 2022 |
Duration | 6 July 2022 |
Location | Neues Rathaus & online |
City | Leipzig |
Country | Germany |
External IDs
ArXiv | http://arxiv.org/abs/2212.01101v1 |
---|
Keywords
ASJC Scopus subject areas
Keywords
- Data usefulness, System log analysis, Time series analysis