TY - JOUR
T1 - Mobile network traffic pattern classification with incomplete a priori information
AU - Jin, Zhiping
AU - Liang, Zhibiao
AU - Wang, Yu
AU - Meng, Weizhi
PY - 2021
Y1 - 2021
N2 - In complex networks systems like mobile edge infrastructures, real-time traffic classification according to application types is an enabling technique for network resource optimization and advanced security management. State-of-the-art schemes take advantage of machine learning techniques to train classification models based on behavioral characteristics of network traffic flows. Nonetheless, most existing studies assume complete a priori information of the application classes and formulate the task as a standalone multi-class classification problem. Such classification models cannot properly handle the unknown applications that are absent from the training set during the time of training. In this work, we propose a practical mobile network traffic classification scheme that builds robust classifiers based on incomplete a priori information. Specifically, the core idea is to extract the unknown patterns emerging in the network periodically to complement the initial labeled data set that only consists of a limited number of known applications. We propose two algorithms for the unknown pattern extraction step. One is based on iterative asymmetric binary classification and the other is based on constrained clustering. Empirical results based on a public data set show that the proposed scheme can effectively detect both known and unknown applications.
AB - In complex networks systems like mobile edge infrastructures, real-time traffic classification according to application types is an enabling technique for network resource optimization and advanced security management. State-of-the-art schemes take advantage of machine learning techniques to train classification models based on behavioral characteristics of network traffic flows. Nonetheless, most existing studies assume complete a priori information of the application classes and formulate the task as a standalone multi-class classification problem. Such classification models cannot properly handle the unknown applications that are absent from the training set during the time of training. In this work, we propose a practical mobile network traffic classification scheme that builds robust classifiers based on incomplete a priori information. Specifically, the core idea is to extract the unknown patterns emerging in the network periodically to complement the initial labeled data set that only consists of a limited number of known applications. We propose two algorithms for the unknown pattern extraction step. One is based on iterative asymmetric binary classification and the other is based on constrained clustering. Empirical results based on a public data set show that the proposed scheme can effectively detect both known and unknown applications.
KW - Constrained clustering
KW - Machine learning
KW - Traffic classification
U2 - 10.1016/j.comcom.2020.11.003
DO - 10.1016/j.comcom.2020.11.003
M3 - Journal article
AN - SCOPUS:85096838030
SN - 0140-3664
VL - 166
SP - 262
EP - 270
JO - Computer Communications
JF - Computer Communications
ER -