Preprint,
In this paper, we develop a method to create a large, labeled dataset of visible network device vendors across the Internet by mapping network-visible IP addresses to device vendors. We use Internet-wide scanning, banner grabs of network-visible devices across the IPv4 address space, and clustering techniques to assign labels to more than 160,000 devices. We subsequently probe these devices and use features extracted from the responses to train a classifier that can accurately classify device vendors. Finally, we demonstrate how this method can be used to understand broader trends across the Internet by predicting device vendors in traceroutes from CAIDA’s Archipelago measurement system and subsequently examining vendor distributions across these traceroutes.
@unpublished{arxiv-2006.13086, title = {{Classifying Network Vendors at Internet Scale}}, author = {Holland, Jordan and Teixera, Ross and Schmitt, Paul and Borgolte, Kevin and Rexford, Jennifer and Feamster, Nick and Mayer, Jonathan}, date = {2020-06-23}, eprint = {2006.13086}, eprintclass = {cs.NI}, eprinttype = {arxiv}, url = {https://arxiv.org/abs/2006.13086} }