{"id":1151,"date":"2018-02-02T23:15:12","date_gmt":"2018-02-03T07:15:12","guid":{"rendered":"https:\/\/ant.isi.edu\/blog\/?p=1151"},"modified":"2020-10-14T15:39:39","modified_gmt":"2020-10-14T22:39:39","slug":"new-technical-report-back-out-end-to-end-inference-of-common-points-of-failure-in-the-internet-extended","status":"publish","type":"post","link":"https:\/\/ant.isi.edu\/blog\/?p=1151","title":{"rendered":"new technical report &#8220;Back Out: End-to-end Inference of Common Points-of-Failure in the Internet (extended)&#8221;"},"content":{"rendered":"<p>We released a new technical report \u201cBack Out: End-to-end Inference of Common Points-of-Failure in the Internet (extended)\u201d, ISI-TR-724, available at&nbsp;<a href=\"https:\/\/www.isi.edu\/~johnh\/PAPERS\/Heidemann18b.pdf\">https:\/\/www.isi.edu\/~johnh\/PAPERS\/Heidemann18b.pdf<\/a>.<\/p>\n<p>From the abstract:<\/p>\n<blockquote>\n<figure id=\"attachment_1154\" aria-describedby=\"caption-attachment-1154\" style=\"width: 300px\" class=\"wp-caption alignright\"><a href=\"https:\/\/ant.isi.edu\/blog\/wp-content\/uploads\/2018\/02\/half.png\"><img loading=\"lazy\" decoding=\"async\" class=\"size-medium wp-image-1154\" src=\"https:\/\/ant.isi.edu\/blog\/wp-content\/uploads\/2018\/02\/half-300x157.png\" alt=\"\" width=\"300\" height=\"157\" srcset=\"https:\/\/ant.isi.edu\/blog\/wp-content\/uploads\/2018\/02\/half-300x157.png 300w, https:\/\/ant.isi.edu\/blog\/wp-content\/uploads\/2018\/02\/half-1024x537.png 1024w, https:\/\/ant.isi.edu\/blog\/wp-content\/uploads\/2018\/02\/half-768x403.png 768w, https:\/\/ant.isi.edu\/blog\/wp-content\/uploads\/2018\/02\/half-1536x806.png 1536w, https:\/\/ant.isi.edu\/blog\/wp-content\/uploads\/2018\/02\/half-2048x1075.png 2048w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><figcaption id=\"caption-attachment-1154\" class=\"wp-caption-text\">Clustering (from our event clustering algorithm) of 2014q3 outages from 172\/8, showing 7 weeks including the 2014-08-27 Time Warner outage.<\/figcaption><\/figure>\n<p>Internet reliability has many potential weaknesses: fiber rights-of-way at the physical layer, exchange-point congestion from DDOS at the network layer, settlement disputes between organizations at the financial layer, and government intervention the political layer. This paper shows that we can <em>discover common points-of-failure<\/em>&nbsp;at <em>any<\/em>&nbsp;of these layers by observing correlated failures. We use <em>end-to-end<\/em>&nbsp;observations from data-plane-level connectivity of edge hosts in the Internet. We identify <em>correlations in connectivity<\/em>: networks that usually fail and recover at the same time suggest common point-of-failure. We define two new algorithms to meet these goals. First, we define a computationally-efficient algorithm to create a <em>linear ordering<\/em>&nbsp;of blocks to make correlated failures apparent to a human analyst. Second, we develop an <em>event-based clustering<\/em>&nbsp;algorithm that directly networks with correlated failures, suggesting common points-of-failure. Our algorithms scale to real-world datasets of millions of networks and observations: linear ordering is O(n log n) time and event-based clustering parallelizes with Map\/Reduce. We demonstrate them on three months of outages for 4 million \/24 network prefixes, showing high recall (0.83 to 0.98) and precision (0.72 to 1.0) for blocks that respond. We also show that our algorithms generalize to identify correlations in anycast catchments and routing.<\/p><\/blockquote>\n<p>Datasets from this paper are available at no cost and are listed at https:\/\/ant.isi.edu\/datasets\/outage\/, and we expect to release the software for this paper in the coming months (contact us if you are interested).<\/p>\n","protected":false},"excerpt":{"rendered":"<p>We released a new technical report \u201cBack Out: End-to-end Inference of Common Points-of-Failure in the Internet (extended)\u201d, ISI-TR-724, available at&nbsp;https:\/\/www.isi.edu\/~johnh\/PAPERS\/Heidemann18b.pdf. From the abstract: Internet reliability has many potential weaknesses: fiber rights-of-way at the physical layer, exchange-point congestion from DDOS at the network layer, settlement disputes between organizations at the financial layer, and government intervention the [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[282,283],"tags":[34,43,195,71,63,157,32,58,191,22,74,10,44,5,170,16,26,57,36],"class_list":["post-1151","post","type-post","status-publish","format-standard","hentry","category-publications","category-technical-report","tag-algorithms","tag-anycast","tag-clustering","tag-datasets","tag-dns","tag-impact","tag-internet-topology","tag-isi","tag-lacanic","tag-measurement-systems","tag-modeling","tag-network-datasets","tag-outage-detection","tag-papers","tag-retrofuturebridge","tag-software","tag-tech-report","tag-usc","tag-visualization"],"_links":{"self":[{"href":"https:\/\/ant.isi.edu\/blog\/index.php?rest_route=\/wp\/v2\/posts\/1151","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ant.isi.edu\/blog\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ant.isi.edu\/blog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ant.isi.edu\/blog\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/ant.isi.edu\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=1151"}],"version-history":[{"count":5,"href":"https:\/\/ant.isi.edu\/blog\/index.php?rest_route=\/wp\/v2\/posts\/1151\/revisions"}],"predecessor-version":[{"id":1567,"href":"https:\/\/ant.isi.edu\/blog\/index.php?rest_route=\/wp\/v2\/posts\/1151\/revisions\/1567"}],"wp:attachment":[{"href":"https:\/\/ant.isi.edu\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=1151"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ant.isi.edu\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=1151"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ant.isi.edu\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=1151"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}