TY - GEN
T1 - A Few Shots Traffic Classification with mini-FlowPic Augmentations
AU - Horowicz, Eyal
AU - Shapira, Tal
AU - Shavitt, Yuval
N1 - Publisher Copyright: © 2022 Association for Computing Machinery.
PY - 2022/10/25
Y1 - 2022/10/25
N2 - Internet traffic classification has been intensively studied over the past decade due to its importance for traffic engineering and cyber security. One of the best solutions to several traffic classification problems is the FlowPic approach, where histograms of packet sizes in consecutive time slices are transformed into a picture that is fed into a Convolution Neural Network (CNN) model for classification. However, CNNs (and the FlowPic approach included) require a relatively large labeled flow dataset, which is not always easy to obtain. In this paper, we show that we can overcome this obstacle by replacing the large labeled dataset with a few samples of each class and by using augmentations in order to inflate the number of training samples. We show that common picture augmentation techniques can help, but accuracy improves further when introducing augmentation techniques that mimic network behavior such as changes in the RTT. Finally, we show that we can replace the large FlowPics suggested in the past with much smaller mini-FlowPics and achieve two advantages: improved model performance and easier engineering. Interestingly, this even improves accuracy in some cases.
AB - Internet traffic classification has been intensively studied over the past decade due to its importance for traffic engineering and cyber security. One of the best solutions to several traffic classification problems is the FlowPic approach, where histograms of packet sizes in consecutive time slices are transformed into a picture that is fed into a Convolution Neural Network (CNN) model for classification. However, CNNs (and the FlowPic approach included) require a relatively large labeled flow dataset, which is not always easy to obtain. In this paper, we show that we can overcome this obstacle by replacing the large labeled dataset with a few samples of each class and by using augmentations in order to inflate the number of training samples. We show that common picture augmentation techniques can help, but accuracy improves further when introducing augmentation techniques that mimic network behavior such as changes in the RTT. Finally, we show that we can replace the large FlowPics suggested in the past with much smaller mini-FlowPics and achieve two advantages: improved model performance and easier engineering. Interestingly, this even improves accuracy in some cases.
UR - http://www.scopus.com/inward/record.url?scp=85141411148&partnerID=8YFLogxK
U2 - https://doi.org/10.1145/3517745.3561436
DO - https://doi.org/10.1145/3517745.3561436
M3 - منشور من مؤتمر
T3 - Proceedings of the ACM SIGCOMM Internet Measurement Conference, IMC
SP - 647
EP - 654
BT - IMC 2022 - Proceedings of the 2022 ACM Internet Measurement Conference
T2 - 22nd ACM Internet Measurement Conference, IMC 2022
Y2 - 25 October 2022 through 27 October 2022
ER -