Toward a Better Understanding of IoT Domain Names: A Study of IoT Backend

Publikation: Beitrag in FachzeitschriftForschungsartikelBeigetragenBegutachtung

Beitragende

  • Ibrahim Ayoub - , Association française pour le nommage Internet en coopération (Autor:in)
  • Martine S. Lenders - , Professur für Distributed and Networked Systems (Autor:in)
  • Benoît Ampeau - , Association française pour le nommage Internet en coopération (Autor:in)
  • Sandoche Balakrichenan - , Association française pour le nommage Internet en coopération (Autor:in)
  • Kinda Khawam - , Université de Versailles Saint-Quentin-en-Yvelines (Autor:in)
  • Thomas C. Schmidt - , Hochschule fur Angewandte Wissenschaften Hamburg (HAW) (Autor:in)
  • Matthias Wählisch - , Professur für Distributed and Networked Systems, Barkhausen Institut gGmbH (Autor:in)

Abstract

In this paper, we study Internet of Things (IoT) domain names, the domain names of backend servers on the Internet that are accessed by IoT devices. We investigate how they compare to non-IoT domain names based on their statistical and DNS properties and the feasibility of classifying these two classes of domain names using machine learning (ML). We construct a dataset of IoT domain names by surveying past studies that used testbeds with real IoT devices. For the non-IoT dataset, we use two lists of top-visited websites. We study the statistical and DNS properties of the domain names. We also leverage machine learning and train six models to perform the classification between the two classes of domain names. The word embedding technique we use to get the real-valued vector representation of the domain names is Word2vec. Our statistical analysis highlights significant differences in domain name length, label frequency, and compliance with typical domain name construction guidelines, while our DNS analysis reveals notable variations in resource record availability and configuration between IoT and non-IoT DNS zones. As for classifying IoT and non-IoT domain names using machine learning, Random Forest achieves the highest performance among the models we train, yielding the highest accuracy, precision, recall, and F1 score. Our work offers novel insights to IoT, potentially informing protocol design and aiding in network security and performance monitoring.

Details

OriginalspracheEnglisch
Seiten (von - bis)68871-68890
Seitenumfang20
FachzeitschriftIEEE Access
Jahrgang13 (2025)
Ausgabenummer13
Frühes Online-Datum16 Apr. 2025
PublikationsstatusVeröffentlicht - 25 Apr. 2025
Peer-Review-StatusJa

Externe IDs

ORCID /0000-0002-3825-2807/work/182729640
Scopus 105003815605
Mendeley 1b9cd894-7076-3758-b089-412d9c79c0fa

Schlagworte

DFG-Fachsystematik nach Fachkollegium

Fächergruppen, Lehr- und Forschungsbereiche, Fachgebiete nach Destatis

Ziele für nachhaltige Entwicklung

Schlagwörter

  • Domain names, security, machine learning, IoT