Author: johnh

New conference paper: “Understanding Partial Reachability in the Internet Core” at NINeS 2026

Post author By johnh
Post date 2026-02-04

Our new paper “Understanding Partial Reachability in the Internet Core” will appear at the 2026 New Ideas in Networked Systems (NINeS), a virtual meeting on February 10, 2026.

Durations of peninsulas (regions with partial Internet reachability) as seen in 2017q4, showing that most peninsulas are brief, but some persist for days or months (Figure 4 from [Baltra26a]). We see similar results in 2020.

From the abstract:

Routing strives to connect all the Internet, but compete: political pressure threatens routing fragmentation; architectural changes such as private clouds, carrier-grade NAT, and firewalls make connectivity conditional; and commercial disputes create partial reachability for days or years. This paper suggests persistent, partial reachability is fundamental to the Internet and an underexplored problem. We first derive a conceptual definition of the Internet core based on connectivity, not authority. We identify peninsulas: persistent, partial connectivity; and islands: when computers are partitioned from the Internet core. Second, we develop algorithms to observe each across the Internet, and apply them to two existing measurement systems: Trinocular, where 6 locations observe 5M networks frequently, and RIPE Atlas, where 13k locations scan the DNS roots frequently. Cross-validation shows our findings are stable over three years of data, and consistent with as few as 3 geographically-distributed observers. We validate peninsulas and islands against CAIDA Ark, showing good recall (0.94) and bounding precision between 0.42 and 0.82. Finally, our work has broad practical impact: we show that peninsulas are more common than Internet outages. Factoring out peninsulas and islands as noise can improve existing measurement systems; their “noise” is 5x to 9.7x larger than the operational events in RIPE’s DNSmon. We show that most peninsula events are routing transients (45%), but most peninsula-time (90%) is due to a few (7%) long-lived events. Our work helps inform Internet policy and governance, with our neutral definition showing no single country or organization can unilaterally control the Internet core.

A technical report with additional appendices is available from our website and arXiv.

This paper is joint work of Guillermo Baltra, Tarak Saluja, Yuri Pradkin, and John Heidemann, building on work begun when Guillermo was a PhD student at USC and Tarak was a summer undergraduate researcher visiting USC from Swarthmore College.

The work is supported by NSF via the EIEIO, MINCEQ, Internet Map, and BRIPOD projects, and by DARPA via AQUARIUS.

Data created from the work is available at ANT, and the input and validation data is available from ANT, RIPE Atlas, and CAIDA.

Tags aquarius, bripod, conference, eieio, internet map, isi, islands, minceq, nines, outage, outage detection, papers, partial reachability, peninsulas, usc

Uncategorized

large Internet outage across Pakistan

Post author By johnh
Post date 2025-08-19

Starting shortly before 2025-08-19t17:05Z (10:05pm August 19 local time in Pakistan), a very large Internet outage occurred across all of Pakistan. Although not affecting all networks, in many areas 50% or more fo the networks are down, as shown in the following map:

Internet outages across Pakistan, shown here at 2025-08-19t17:38Z.

We saw the first outages at 16:30Z, and they quickly ramped up to about half of the networks in most of the country. Since the network outages closely follow the country’s borders, it seems unlikely that this is a weather-related event. As of the time of this post (t20:00Z), the outage appears to be ongoing. We’ll post an update here when we learn more.

Some reports are suggesting it’s a backbone outage caused due to flooding.

These outage were detected using Trinocular, our Internet outage detection system and the large change resulted in an alert.

Update 2025-08-20t00:20Z: It looks like Pakistan’s Internet began recovering around 20:45Z, and is basically back to normal by 2025-08-19t22:30Z (3am local time in Pakistan).

Here’s an “after” picture at 22:24Z:

Tags internet, Internet outage, isi, Trinocular, usc

Uncategorized

Recent Graduate ASM Rizvi Featured in ISI News

Post author By johnh
Post date 2025-02-27

Recent PhD graduate ASM Rizvi was featured in an ISI News story, sharing his thoughts about joining USC and ISI and what he plans to do after graduation.

Congratulations again, Rizvi!

Tags akami, dissertaiton, isi, news, phd, usc

Uncategorized

Huge Power Outage in Chile as Seen in Internet Outages

Post author By johnh
Post date 2025-02-25

Today around 3:30pm local time (around 2025-02-25 T18:30Z), Chile suffered a major power outage. News reports suggest 8 million or more are without power.

We can see the effects of this power outage on Internet access as measured by Trinocular, our internet outage detection system. Outages start around 18:30Z and increase steadily to 20:30Z, the most recent data we have.

We wish Chile the best at a rapid recovery!

Update on Wed 2025-02-26: Our observations show the Chilean Internet starting coming back online around 2025-02-26t02:00Z (which is 2025-02-25t23:00 Chilean time), with most if it back around t06:00Z (2025-02-26t03:00 Chilean time).

Tags chile, isi, outage, Trinocular, usc

Uncategorized

Adam Russell Interviews John Heidemann about Network Research

Post author By johnh
Post date 2025-01-06

As part of the ISI/nsiders podcast, Adam Russell, anthropologist and director of ISI’s AI division is interviewing a number of researchers at ISI.

He recently interviewed John Heidemann about John’s work in networking research about measuring the Internet.

See https://www.isi.edu/isi-insiders-podcast/ for the series, and https://rss.com/podcasts/isi-nsiders/1804707/ for Season 1, Episode 3 (about 20 minutes) for his interview of John Heidemann.

Tags census, internet measurement, interview, isi, outage detection, podcast, usc

Uncategorized

new technical report “Auditing for Bias in Ad Delivery Using Inferred Demographic Attributes”

Post author By johnh
Post date 2024-11-04

We have released a new technical report: “Auditing for Bias in Ad Delivery Using Inferred Demographic Attributes”, available at https://arxiv.org/abs/2410.23394.

From the abstract:

Auditing social-media algorithms has become a focus of public-interest research and policymaking to ensure their fairness across demographic groups such as race, age, and gender in consequential domains such as the presentation of employment opportunities. However, such demographic attributes are often unavailable to auditors and platforms. When demographics data is unavailable, auditors commonly infer them from other available information. In this work, we study the effects of inference error on auditing for bias in one prominent application: black-box audit of ad delivery using paired ads. We show that inference error, if not accounted for, causes auditing to falsely miss skew that exists. We then propose a way to mitigate the inference error when evaluating skew in ad delivery algorithms. Our method works by adjusting for expected error due to demographic inference, and it makes skew detection more sensitive when attributes must be inferred. Because inference is increasingly used for auditing, our results provide an important addition to the auditing toolbox to promote correct audits of ad delivery algorithms for bias. While the impact of attribute inference on accuracy has been studied in other domains, our work is the first to consider it for black-box evaluation of ad delivery bias, when only aggregate data is available to the auditor.

This technical report is joint work of Basilial Imana and Aleksandra Korolova (both of Princeton) and John Heidemann (USC/ISI). This work was supported by the NSF via CNS-1956435, CNS-2344925, and CNS-2319409 (the InternetMap project).

Tags algor, algorithmic fairness, BISG, Discrimination, inference, isi, princeton, usc

Uncategorized

Hurricane Helene Croses the Southeast U.S. as Seen in Internet Outages

Post author By johnh
Post date 2024-09-28

Hurricane Helene made landfall in the U.S. at 11:10pm EDT Sept. 26 (2024-09-27t03:10Z) near Tallahassee, Florida, and we’ve been watching it in the Trinocular Internet Outage system.

Flordia Internet infrasructure appears to have done quite well, with relatively few Internet outages. Here is the view 4.5 hours after landfall, at 3:40am EDT Sept. 27 (2024-09-27t07:40Z), when the eye was already over southern Georgia:

However, storm damange resulted in many outages across Georgia at daybreak. Here is 11 hours after landfall, at 6am EDT Sept 27 (2024-09-27t10:00):

The Carolinas seem particularly strongly effected. Here is a zoom from Georgia to Kentucky as of 9am EDT Sat. Sept. 28 (2024-09-28t13:41Z):

Fortunately the Internet infrastructure in Georgia was quick to recover, suggesting most Internet outages were power loss. We wish the best for those in Kentucky, and for those with physical storm damage and coping with flooding.

The most recent outage data is always visible on our outage website.

Tags hurricane, Internet outage, isi, Trinocular

Uncategorized

brief Internet outage in Bangladesh

Post author By johnh
Post date 2024-08-05

This morning, from about 2024-08-05t04:50Z (10:50am local time) to t07:40Z, Bangladesh had another very large Internet outage. Fortunately, unlike the outage that began on 2024-07-18, this one cleared up after about three hours. I presume this outage corresponds to the resignation of the prime minster.

We hope for calm for the people of Bangladesh.

Tags Bangladesh, events, Internet outage, isi, outage, outage detection, Trinocular, usc

Uncategorized

the June 19 Internet outage (not CrowdStrike)

Post author By johnh
Post date 2024-07-27

There was a huge Internet outage on June 19, 2024. It affected millions of people, interfering with their ability to travel, interact with friends and family, and with businesses to communicate with their customers and place orders. It cost the global economy millions of dollars.

And it had nothing to do with CrowdStrike.

I’m talking about the the 5-day near-total shutdown of the Internet in Bangladesh, from 2024-07-18t15:00Z (9pm July 18 local time in Bangladesh) until about 2024-07-23t13:00Z (7pm July 23 local time). For most of that period, pretty much all Bangladeshi networks were down. People could not communicate with each other. Here are the start, middle, recovery pictures from our blog entry:

These figures show Bangladesh, with circles whose size indicates the number of networks that are out in each part of the country. Circle color indicates the percentage of networks that are out–red is near 100% networks unreachable. My research group measures Internet outages, and you can look at what happened in our website. Red basically never happens for big countries, at least since the 2011 Egyptian revolution.

Bangladesh had civil unrest, protests, and riots due to an unpopular employment law (as reported by many organizations, including the New York Times). The government chose to shut down their Internet (as reported by AP, and others). They restored services on July 24, but I am told they are still blocking several social media services.

What does this have to do with CloudStrike?

Well, nothing. But you may have heard that CloudStrike had a software-update that went wrong, also on July 19. It also interfered with millions of people’s ability to travel, interact with friends and family, and with businesses to communicate with their customers and place orders, as it crashed millions of computers running Microsoft Windows and left them difficult to recover.

But the CloudStrike software glitch was not an Internet outage.

Yes, millions of computers failed. But the Internet was never affected by the failure of CloudStrike computers. Anyone could use the Internet just fine last week, provided they were using services that did not depend on Microsoft Windows. And lots of the computers that failed (like flight status kiosks in airports) were not on the public Internet.

CloudStrike was a massive software failure, but not an Internet outage.

I mention this because I heard multiple media sources discuss the CloudStrike-caused Internet outage. Most prominent was this article by Barath Raghavan and Bruce Schneier on Lawfare (and then reposted on Schneier’s blog), that starts “Friday’s massive internet outage, caused by a mid-sized tech company called CrowdStrike, disrupted major airlines, hospitals, and banks.” They point to “brittleness of infrastructure” as a risk. The article is true, except for the word “Internet”. The New York times called it a “tech outage“, and us in the field should be as careful about our terms.

By analogy, when two Boeing 373 MAX airliners crashed in 2019 and 2020, we did not call out the “massive air traffic control crash”, we correctly pointed at aircraft failures, and eventually at software and design problems in that specific aircraft.

We should not call all computer failures an Internet outage, when the problem is not about network communication. To improve our computing world, we must identify problems correctly.

Because when a nation of 170 million people goes offline, that’s a big deal, too. And that’s not fixable by rebooting.

Tags Bangaldesh, CrowdStrike, editorial, Internet outage, isi, tech outage, usc

Uncategorized

new technical report “Reasoning about Internet Connectivity”

Post author By johnh
Post date 2024-07-26

We have released a new technical report: “Reasoning about Internet Connectivity”, available at https://arxiv.org/abs/2407.14427.

From the abstract:

Innovation in the Internet requires a global Internet core to enable
communication between users in ISPs and services in the cloud. Today, this Internet core is challenged by partial reachability: political pressure
threatens fragmentation by nationality, architectural changes such as
carrier-grade NAT make connectivity conditional, and operational problems and commercial disputes make reachability incomplete for months. We assert that partial reachability is a fundamental part of the Internet core. While some systems paper over partial reachability, this paper is the first to provide a conceptual definition of the Internet core
so we can reason about reachability from first principles. Following
the Internet design, our definition is guided by reachability, not
authority. Its corollaries are peninsulas: persistent regions of
partial connectivity; and islands: when networks are partitioned
from the Internet core. We show that the concept of peninsulas and islands can improve existing measurement systems. In one example,
they show that RIPE’s DNSmon suffers misconfiguration and persistent
network problems that are important, but risk obscuring operationally
important connectivity changes because they are 5x to 9.7x larger. Our evaluation also informs policy questions, showing no single
country or organization can unilaterally control the Internet core.

This technical report is joint work of Guillermo Baltra, Tarang Saluja, Yuri Pradkin, John Heidemann done at USC/ISI. This work was supported by the NSF via the EIEIO and InternetMap projects.

Tags Internet topology, isi, islands, measurement systems, outage detection, papers, partial connectivity, peninsulas, technical report, usc