{"id":33816,"date":"2021-10-05T14:45:47","date_gmt":"2021-10-05T18:45:47","guid":{"rendered":"https:\/\/mjtsai.com\/blog\/?p=33816"},"modified":"2022-04-14T14:43:01","modified_gmt":"2022-04-14T18:43:01","slug":"facebook-bgp-outage","status":"publish","type":"post","link":"https:\/\/mjtsai.com\/blog\/2021\/10\/05\/facebook-bgp-outage\/","title":{"rendered":"Facebook BGP Outage"},"content":{"rendered":"<p><a href=\"https:\/\/blog.cloudflare.com\/october-2021-facebook-outage\/\">Celso Martinho and Tom Strickx<\/a> (<a href=\"https:\/\/news.ycombinator.com\/item?id=28751520\">Hacker News<\/a>):<\/p>\n<blockquote cite=\"https:\/\/blog.cloudflare.com\/october-2021-facebook-outage\/\"><p>Social media quickly burst into flames, reporting what our engineers rapidly confirmed too. Facebook and its affiliated services WhatsApp and Instagram were, in fact, all down. Their DNS names stopped resolving, and their infrastructure IPs were unreachable. It was as if someone had &ldquo;pulled the cables&rdquo; from their data centers all at once and disconnected them from the Internet.<\/p><p>This wasn&rsquo;t a DNS issue itself, but failing DNS was the first symptom we&rsquo;d seen of a larger Facebook outage.<\/p><p>[&#8230;]<\/p><p><a href=\"https:\/\/www.cloudflare.com\/learning\/security\/glossary\/what-is-bgp\/\">BGP<\/a> stands for Border Gateway Protocol. It&rsquo;s a mechanism to exchange routing information between autonomous systems (AS) on the Internet. The big routers that make the Internet work have huge, constantly updated lists of the possible routes that can be used to deliver every network packet to their final destinations. Without BGP, the Internet routers wouldn&rsquo;t know what to do, and the Internet wouldn&rsquo;t work.<\/p><p>The Internet is literally a network of networks, and it&rsquo;s bound together by BGP. BGP allows one network (say Facebook) to advertise its presence to other networks that form the Internet. As we write Facebook is not advertising its presence, ISPs and other networks can&rsquo;t find Facebook&rsquo;s network and so it is unavailable.<\/p><\/blockquote>\n\n<p><a href=\"https:\/\/engineering.fb.com\/2021\/10\/04\/networking-traffic\/outage\/\">Santosh Janardhan<\/a>:<\/p>\n<blockquote cite=\"https:\/\/engineering.fb.com\/2021\/10\/04\/networking-traffic\/outage\/\">\n<p>To all the people and businesses around the world who depend on us, we are sorry for the inconvenience caused by today&rsquo;s outage across our platforms. We&rsquo;ve been working as hard as we can to restore access, and our systems are now back up and running. The underlying cause of this outage also impacted many of the internal tools and systems we use in our day-to-day operations, complicating our attempts to quickly diagnose and resolve the problem.<\/p>\n<\/blockquote>\n\n<p>See also: <a href=\"https:\/\/krebsonsecurity.com\/2021\/10\/what-happened-to-facebook-instagram-whatsapp\/\">Brian Krebs<\/a> (<a href=\"https:\/\/news.ycombinator.com\/item?id=28750930\">Hacker News<\/a>), <a href=\"https:\/\/www.schneier.com\/blog\/archives\/2021\/10\/facebook-is-down.html\">Bruce Schneier<\/a>, <a href=\"https:\/\/news.ycombinator.com\/item?id=28748203\">Hacker<\/a> <a href=\"https:\/\/news.ycombinator.com\/item?id=28750894\">News<\/a>.<\/p>\n\n<p id=\"facebook-bgp-outage-update-2021-10-20\">Update (2021-10-20): <a href=\"https:\/\/engineering.fb.com\/2021\/10\/04\/networking-traffic\/outage\/\">Santosh Janardhan<\/a>:<\/p>\n<blockquote cite=\"https:\/\/engineering.fb.com\/2021\/10\/04\/networking-traffic\/outage\/\"><p>Our engineering teams have learned that configuration changes on the backbone routers that coordinate network traffic between our data centers caused issues that interrupted this communication. This disruption to network traffic had a cascading effect on the way our data centers communicate, bringing our services to a halt.<\/p><\/blockquote>\n\n<p>Via <a href=\"https:\/\/pxlnv.com\/linklog\/facebook-outage-details\/\">Nick Heer<\/a>:<\/p>\n<blockquote cite=\"https:\/\/pxlnv.com\/linklog\/facebook-outage-details\/\">\n<p>For comparison, as I write this, Apple&rsquo;s <a href=\"https:\/\/www.apple.com\/support\/systemstatus\/\">System Status page<\/a> shows a resolved outage in Apple Pay and Wallet. For over seven hours yesterday, &ldquo;users were not able to add, suspend, or remove existing cards to Apple Pay&rdquo;, and this issue has simply been marked as &ldquo;Resolved&rdquo; but there are no more details. This explanation-free status update has been the standard for every iCloud-related outage, <a href=\"https:\/\/www.theverge.com\/2021\/2\/3\/22264919\/apple-icloud-cloud-service-outage-photos-contacts-notes\">including serious incidents<\/a>. It does not build confidence.<\/p>\n<\/blockquote>\n\n<p><a href=\"https:\/\/www.reuters.com\/technology\/telegram-founder-says-over-70-mln-new-users-joined-during-facebook-outage-2021-10-05\/\">Reuters<\/a> (via <a href=\"https:\/\/news.ycombinator.com\/item?id=28764539\">Hacker News<\/a>):<\/p>\n<blockquote cite=\"https:\/\/www.reuters.com\/technology\/telegram-founder-says-over-70-mln-new-users-joined-during-facebook-outage-2021-10-05\/\"><p>Messaging app Telegram gained over 70 million new users during Monday&rsquo;s Facebook outage, its founder Pavel Durov said on Tuesday, as people worldwide were left without key messaging services for nearly six hours.<\/p><\/blockquote>\n\n<p><a href=\"https:\/\/www.facebook.com\/4\/posts\/10113961365418581\/?d=n\">Mark Zuckerberg<\/a> (via <a href=\"https:\/\/news.ycombinator.com\/item?id=28767700\">Hacker News<\/a>):<\/p>\n<blockquote cite=\"https:\/\/www.facebook.com\/4\/posts\/10113961365418581\/?d=n\">\n<p>First, the SEV that took down all our services yesterday was the worst outage we&rsquo;ve had in years. We&rsquo;ve spent the past 24 hours debriefing how we can strengthen our systems against this kind of failure. This was also a reminder of how much our work matters to people. The deeper concern with an outage like this isn&rsquo;t how many people switch to competitive services or how much money we lose, but what it means for the people who rely on our services to communicate with loved ones, run their businesses, or support their communities.<\/p>\n<\/blockquote>","protected":false},"excerpt":{"rendered":"<p>Celso Martinho and Tom Strickx (Hacker News): Social media quickly burst into flames, reporting what our engineers rapidly confirmed too. Facebook and its affiliated services WhatsApp and Instagram were, in fact, all down. Their DNS names stopped resolving, and their infrastructure IPs were unreachable. It was as if someone had &ldquo;pulled the cables&rdquo; from their [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"apple_news_api_created_at":"2021-10-05T18:45:51Z","apple_news_api_id":"7b11ea75-633f-4de6-8a4e-dd94cd6e47ed","apple_news_api_modified_at":"2022-04-14T18:43:04Z","apple_news_api_revision":"AAAAAAAAAAAAAAAAAAAAAQ==","apple_news_api_share_url":"https:\/\/apple.news\/AexHqdWM_TeaKTt2UzW5H7Q","apple_news_coverimage":0,"apple_news_coverimage_caption":"","apple_news_is_hidden":false,"apple_news_is_paid":false,"apple_news_is_preview":false,"apple_news_is_sponsored":false,"apple_news_maturity_rating":"","apple_news_metadata":"\"\"","apple_news_pullquote":"","apple_news_pullquote_position":"","apple_news_slug":"","apple_news_sections":"\"\"","apple_news_suppress_video_url":false,"apple_news_use_image_component":false,"footnotes":""},"categories":[2],"tags":[728,25,1286,2190,96,1363],"class_list":["post-33816","post","type-post","status-publish","format-standard","hentry","category-technology","tag-domain-name-system-dns","tag-facebook","tag-instagram","tag-outage","tag-web","tag-whatsapp"],"apple_news_notices":[],"_links":{"self":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/33816","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/comments?post=33816"}],"version-history":[{"count":2,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/33816\/revisions"}],"predecessor-version":[{"id":33971,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/33816\/revisions\/33971"}],"wp:attachment":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/media?parent=33816"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/categories?post=33816"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/tags?post=33816"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}