Census and Survey of the Visible Internet (extended)

Census and Survey of the Visible Internet (extended)

Heidemann, John and Pradkin, Yuri and Govindan, Ramesh and Papadopoulos, Christos and Bartlett, Genevieve and Bannister, Joseph
USC/Information Sciences Institute

John Heidemann, Yuri Pradkin, Ramesh Govindan, Christos Papadopoulos, Genevieve Bartlett and Joseph Bannister 2008. Census and Survey of the Visible Internet (extended). Technical Report ISI-TR-2008-649b. USC/Information Sciences Institute.

Abstract

Prior measurement studies of the Internet have explored traffic and topology, but have largely ignored edge hosts. While the number of Internet hosts is very large, and many are hidden behind firewalls or in private address space, there is much to be learned from examining the population of \emphvisible hosts, those with public unicast addresses that respond to messages. In this paper we introduce two new approaches to explore the visible Internet. Applying statistical population sampling, we use \emphcensuses to walk the entire Internet address space, and \emphsurveys to probe frequently a fraction of that space. We then use these tools to evaluate address usage, where we find that only 3.6% of allocated addresses are actually occupied by visible hosts, and that occupancy is unevenly distributed, with a quarter of responsive /24 address blocks (subnets) less than 5% full, and only 9% of blocks more than half full. We show about 34 million addresses are very stable and visible to our probes (about 16% of responsive addresses), and we project from this up to 60 million stable Internet-accessible computers. The remainder of allocated addresses are used intermittently, with a median occupancy of 81 minutes. Finally, we show that many firewalls are visible, measuring significant diversity in the distribution of firewalled block size. To our knowledge, we are the first to take a census of edge hosts in the visible Internet since 1982, to evaluate the accuracy of active probing for address census and survey, and to quantify these aspects of the Internet.

Reference

@techreport{Heidemann08a,
  author = {Heidemann, John and Pradkin, Yuri and Govindan, Ramesh and Papadopoulos, Christos and Bartlett, Genevieve and Bannister, Joseph},
  title = {Census and Survey of the Visible Internet (extended)},
  institution = {USC/Information Sciences Institute},
  year = {2008},
  sortdate = {2008-02-01},
  project = {ant, lander, madcat},
  jsubject = {topology_modeling},
  number = {ISI-TR-2008-649b},
  month = feb,
  note = {Updated August, 2008},
  location = {johnh: pafile},
  keywords = {census, survey, internet address space},
  url = {http://www.isi.edu/%7ejohnh/PAPERS/Heidemann08a.html},
  pdfurl = {http://www.isi.edu/%7ejohnh/PAPERS/Heidemann08a.pdf},
  myorganization = {USC/Information Sciences Institute},
  copyrightholder = {authors}
}