{"id":8861,"date":"2014-05-22T15:20:19","date_gmt":"2014-05-22T19:20:19","guid":{"rendered":"http:\/\/mjtsai.com\/blog\/?p=8861"},"modified":"2025-01-07T11:50:42","modified_gmt":"2025-01-07T16:50:42","slug":"what-backblaze-doesnt-back-up","status":"publish","type":"post","link":"https:\/\/mjtsai.com\/blog\/2014\/05\/22\/what-backblaze-doesnt-back-up\/","title":{"rendered":"What Backblaze Doesn&rsquo;t Back Up"},"content":{"rendered":"<p><a href=\"http:\/\/www.haystacksoftware.com\/arq\/\">Arq<\/a> recently reported hundreds of GB of missing files, across multiple backup targets. This is so at odds with Amazon Glacier&rsquo;s reputed <a href=\"http:\/\/aws.amazon.com\/glacier\/\">11-nines durability<\/a> that I&rsquo;m guessing it&rsquo;s due to an application bug. It would not surprise me if the files are still there; Arq just isn&rsquo;t seeing them. In any event, my <a href=\"http:\/\/mjtsai.com\/blog\/2014\/03\/04\/arq-4-0\/\">strategy<\/a> is to have multiple cloud backups&mdash;Arq and <a href=\"http:\/\/www.code42.com\/crashplan\/\">CrashPlan<\/a> (which has been working very well recently)&mdash;so this got me thinking about possibly adding a third.<\/p>\n\n<p>The obvious choice is <a href=\"http:\/\/www.backblaze.com\">Backblaze<\/a>. It has a native Mac app, is developed by ex-Apple engineers, and sponsors many fine podcasts.<\/p>\n\n<p>I&rsquo;d previously been hesitant about Backblaze because of the way it <a href=\"http:\/\/mjtsai.com\/blog\/2013\/08\/14\/backblaze-arq-and-external-drives\/\">handles<\/a> <a href=\"http:\/\/www.backblaze.com\/edrive.html\">external drives<\/a>. I&rsquo;ve read about problems with <a href=\"http:\/\/www.apertureexpert.com\/comment\/22620#comment-22620\">large<\/a> <a href=\"https:\/\/www.donationcoder.com\/forum\/index.php?topic=34797\">bzfileids.dat<\/a> <a href=\"http:\/\/harijan.wordpress.com\/2009\/04\/27\/the-real-backblaze-backup-speed\/\">files<\/a> sucking RAM and preventing backups entirely once they get too large. It&rsquo;s also worrisome that it only <a href=\"http:\/\/www.backblaze.com\/remote-backup-everything.html\">retains deleted files for 30 days<\/a>&mdash;meaning that a file is truly lost if I don&rsquo;t <a href=\"http:\/\/www.reddit.com\/r\/IAmA\/comments\/rhrt4\/we_are_the_team_that_runs_online_backup_service\/c45xtev\">notice<\/a> that it&rsquo;s missing right away. And if, for some reason, my Mac doesn&rsquo;t back up for 6 months, Backblaze will <a href=\"https:\/\/help.backblaze.com\/entries\/21809372-What-happens-to-my-backups-when-I-m-away-or-on-vacation-\">expunge all my data<\/a>, even if my subscription is still paid-up. The situations in which my Mac is not able to back up for a while are <em>exactly<\/em> the ones in which I (or my survivors) would want to be able to depend on a cloud backup!<\/p>\n\n<p>My other concern is that Backblaze doesn&rsquo;t actually <a href=\"https:\/\/help.backblaze.com\/entries\/20200388-what-does-backblaze-backup\">back up<\/a> <a href=\"http:\/\/blog.plasticsfuture.org\/2006\/04\/23\/mac-backup-software-harmful\/\">everything<\/a>. It <a href=\"http:\/\/www.haystacksoftware.com\/arq\/backblaze-backup-bouncer-test.txt\">fails<\/a> all but one of the <a href=\"http:\/\/www.n8gray.org\/code\/backup-bouncer\/\">Backup Bouncer<\/a> tests, discarding file permissions, symlinks, Finder flags and locks, creation dates (<a href=\"https:\/\/twitter.com\/backblaze\/status\/3392132183\">despite<\/a> <a href=\"http:\/\/www.apertureexpert.com\/tips\/2013\/2\/13\/backblaze-and-backup-bouncer-test\">claims<\/a>), modification date (<a href=\"https:\/\/twitter.com\/backblaze\/status\/86166191323889665\">timezone-shifted<\/a>), extended attributes (which include Finder tags and the &ldquo;where from&rdquo; URL), and Finder comments. <a href=\"http:\/\/www.haystacksoftware.com\/blog\/2010\/02\/arqs-backup-bouncer-results\/\">Arq<\/a>, <a href=\"http:\/\/www.haystacksoftware.com\/arq\/crashplan-backup-bouncer-test.txt\">CrashPlan<\/a> (as of <a href=\"http:\/\/mjtsai.com\/blog\/2010\/12\/07\/crashplan-3\/\">version 3<\/a>), <a href=\"http:\/\/www.n8gray.org\/blog\/2007\/04\/27\/introducing-backup-bouncer\/\">SuperDuper<\/a>, and <a href=\"http:\/\/www.n8gray.org\/blog\/2008\/06\/09\/backup-bouncer-and-time-machine\/\">Time Machine<\/a> all support all of these. <a href=\"http:\/\/mjtsai.com\/blog\/2010\/12\/17\/dropbox-1-0\/\">Dropbox<\/a> supports all of them except creation dates, locks, and <a href=\"http:\/\/www.paulingraham.com\/dropbox-and-symlinks.html\">symlinks<\/a>.<\/p>\n\n<p>As a <a href=\"http:\/\/jasongarber.com\/blog\/2012\/03\/06\/i-quit-backblaze\">programmer<\/a>, I especially care about metadata. But I think most users would as well, if they knew to think about it. For example, losing dates can make it harder to find your files (i.e. they disappear from smart folders or sort incorrectly), even leading to errors (i.e. not finding the correct set of invoices for a time period). You would never use a backup app that didn&rsquo;t remember which folders your files were in, so I don&rsquo;t know why people consider it acceptable to lose their Finder tags. (If you use <a href=\"http:\/\/c-command.com\/eaglefiler\/\">EagleFiler<\/a>, it can <a href=\"http:\/\/c-command.com\/eaglefiler\/manual-ah\/how-do-finder-tags-work\">restore the tags<\/a> for you.)<\/p>\n\n<p>Some people don&rsquo;t care much about metadata. <a href=\"http:\/\/www.macworld.com\/article\/1156511\/whatonlinebackupservice.html\">Macworld&rsquo;s<\/a> survey of online backup services didn&rsquo;t mention it. Neither did <a href=\"http:\/\/recode.net\/2014\/03\/11\/triple-coverage-back-up-your-computer-in-three-ways\/\">Walt Mossberg<\/a>. (He also told readers that Backblaze automatically includes &ldquo;every user-created file&rdquo;; in fact, it skips files over <a href=\"https:\/\/help.backblaze.com\/entries\/20200388-what-does-backblaze-backup\">4 GB<\/a> by default.) [Update (2014-05-25): <a href=\"http:\/\/thesweetsetup.com\/apps\/best-cloud-backup-service\/\">The Sweet Setup<\/a> doesn&rsquo;t mention metadata, either (via <a href=\"http:\/\/pxlnv.com\/linklog\/what-backblaze-doesnt-back-up\/\">Nick Heer<\/a>).]<\/p>\n\n<p>Backblaze <a href=\"https:\/\/twitter.com\/backblaze\/status\/35103538589736960\">has<\/a> <a href=\"https:\/\/twitter.com\/backblaze\/status\/281337938908966912\">a<\/a> <a href=\"https:\/\/twitter.com\/backblaze\/status\/232911221265932288\">stock<\/a> <a href=\"https:\/\/twitter.com\/backblaze\/status\/469024368346271744\">answer<\/a> when <a href=\"https:\/\/twitter.com\/mjtsai\/status\/468936848388390912\">asked<\/a> about Backup Bouncer:<\/p>\n\n<blockquote cite=\"https:\/\/twitter.com\/backblaze\/status\/469024368346271744\"><p>This actually tests disk imaging products, a bad test for backup as items we fail on shouldn&rsquo;t be backed up by data backup service.<\/p><\/blockquote>\n\n<p><a href=\"http:\/\/www.apertureexpert.com\/tips\/2013\/2\/13\/backblaze-and-backup-bouncer-test\">Some people<\/a> accept this explanation. I think it&rsquo;s misguided and borderline nonsensical. True, Backup Bouncer tests some rather <a href=\"http:\/\/www.shirt-pocket.com\/forums\/showthread.php?t=3503\">esoteric<\/a> features, but Backblaze fails the basic tests, too. It would be one thing to say that there&rsquo;s a <em>limitation<\/em> whereby dates, tags, comments, etc. aren&rsquo;t backed up, but they&rsquo;re actually saying that these <em>shouldn&rsquo;t<\/em> be backed up. As if products that <em>do<\/em> back them up are in error. So presumably Backblaze doesn&rsquo;t consider this a bug and won&rsquo;t be fixing it.<\/p>\n\n<p>Lastly, it&rsquo;s a shame that Backblaze isn&rsquo;t <a href=\"https:\/\/twitter.com\/mjtsai\/statuses\/86256779142496257\">up<\/a> <a href=\"https:\/\/twitter.com\/fabricetdc\/status\/432960765772177408\">front<\/a> about what metadata it supports. Some users are technical enough to investigate these things themselves. Others will have read the excellent <a href=\"http:\/\/www.takecontrolbooks.com\/backing-up\">Take Control of Backing Up Your Mac<\/a> and seen its <a href=\"http:\/\/www.takecontrolbooks.com\/resources\/0014\/online.html\">appendixes<\/a>, which give Backblaze a C for metadata support. But most Backblaze users won&rsquo;t know that a poor choice has been made for them until they need to restore from their backup.<\/p>\n\n<p>Update (2017-08-23): A Backblaze employee <a href=\"https:\/\/www.reddit.com\/r\/Crashplan\/comments\/6vbas5\/crashplan_for_home_is_being_shut_down\/dlzjllo\/\">responded<\/a> to this post:<\/p>\n<blockquote cite=\"https:\/\/www.reddit.com\/r\/Crashplan\/comments\/6vbas5\/crashplan_for_home_is_being_shut_down\/dlzjllo\/\"><p>Backblaze absolutely backs up and restores the &ldquo;file creation date&rdquo; and &ldquo;file last modified date&rdquo;. With these two caveats: Backblaze is only accurate down to Milliseconds (1\/1,000ths of a single second) if you restore by USB hard drive restore, and only accurate to the second if you prepare a ZIP file restore. The latter is because that is a limitation of the ZIP file format.<\/p><p>The tool &ldquo;Backup Bouncer&rdquo; fails Backblaze on this test, and it irritates me. I feel &ldquo;wronged&rdquo; by this. The new APFS Macintosh file system has the ability to set the file creation date down to one BILLIONTH of a second, and I assume that just to be totally difficult Backup Bouncer gleefully sets every last bit.<\/p><\/blockquote>\n<p>I&rsquo;ve <a href=\"https:\/\/www.reddit.com\/r\/Crashplan\/comments\/6vbas5\/crashplan_for_home_is_being_shut_down\/dm0mwcr\/\">asked for clarification<\/a>, but as far I can tell the response is spreading incorrect information and seems to misunderstand various of the issues involved.<\/p>\n<p>I started a Backblaze trial in order to verify the claim that the creation date is preserved, but I was unable to get an answer because 4 hours after Backblaze says that it backed up my test files, they were still not showing up in the restore interface, even though it purports to show the latest files as of this minute. After 5 hours, the files were available, I restored them, and the file creation dates were lost and changed to the modification date. The Backblaze restore also messed up the files&rsquo; modes, making them executable when they had not been.<\/p>\n\n<p id=\"what-backblaze-doesnt-back-up-update-2017-08-24\">Update (2017-08-24): Backblaze support explained to me that it&rsquo;s normal for there to be a delay, which can be from <a href=\"https:\/\/help.backblaze.com\/hc\/en-us\/articles\/217665498-Why-hasn-t-Backblaze-backed-up-my-new-files-yet-\">1&#x2013;8 hours<\/a>, before the files are actually available for restore. This is because, although the file data has been sent to the server, the server can&rsquo;t access the files until the client has sent the <em>index<\/em> that describes the changes. It typically waits a few hours before doing this. What this means is that, during those hours, the Backblaze client reports that the backup is complete (&ldquo;You are backed up as of: Today, 7:28 AM&rdquo;), but it&rsquo;s actually not. If your Mac breaks or goes offline (i.e. you pack up your MacBook for a trip) before the index has been uploaded, it&rsquo;s as if the backup never happened. I assume the delay before sending the index is some sort of optimization, so perhaps it&rsquo;s justified, but I consider it a major bug that the client reports the files as backed up when you can&rsquo;t actually restore them (no matter how long you wait).<\/p>\n\n<p>The Backblaze employee <a href=\"https:\/\/www.reddit.com\/r\/Crashplan\/comments\/6vbas5\/crashplan_for_home_is_being_shut_down\/dm1pdd3\/\">replied<\/a> about the file creation date issue. The gist of it is that the dates are not preserved when restoring via the network. However, you can pay $99 (flash drive) or $189 (hard drive) for them to mail you your data, and in this format the dates will be preserved. If you <a href=\"https:\/\/help.backblaze.com\/hc\/en-us\/articles\/217665948-Restore-Return-Refund-program-How-to-return-a-USB-restore-drive-for-refund?_ga=2.64687592.654907358.1503435814-913609715.1427749241\">mail the drive back<\/a> (sounds like you have to pay shipping) they will refund the cost. I have not verified that this method works, however, I can confirm that the index file that&rsquo;s sent to the Backblaze server contains the correct information for the creation dates.<\/p>\n\n<p>Update (2017-09-03): See also: <a href=\"http:\/\/atp.fm\/237\">Accidental Tech Podcast<\/a>.<\/p>\n\n<p>Update (2018-02-05): <a href=\"https:\/\/news.ycombinator.com\/item?id=16300224\">aikinai<\/a>:<\/p>\n<blockquote cite=\"https:\/\/news.ycombinator.com\/item?id=16300224\">\n<p>I started getting emails warning that all of my external drives were offline and my data would be soon deleted. Instead of &ldquo;Very sorry about that, here&rsquo;s how to fix the issue,&rdquo; I got this long response about the ways their system looks for new files in serial and it can get jammed and start ignoring everything, with no apology, no acknowledgement this was their issue, and no solution. I had to go fishing for solutions and drag the information out of them to finally figure out what I needed to do. Which it turns out is to get back an internal drive (totally unrelated to the other drives Backblaze abandoned) I had physically removed and repurposed, put it back the computer, wait a long time for Backblaze to see it, then uncheck that drive in Backblaze and remove it again.<\/p>\n<p>[&#8230;]<\/p>\n<p>The client will lie to you and you never know what&rsquo;s really backed up. Even if you use the secret alt-click to force a full drive scan, it can still miss files and tell you fully backed up when files from days ago are still nowhere to be found. Luckily I&rsquo;ve never actually needed to do a restore, but I almost thought I did one time and would have been furious at all the missing files I noticed.<\/p>\n<\/blockquote>\n\n<p id=\"what-backblaze-doesnt-back-up-update-2022-03-07\">Update (2022-03-07): <a href=\"https:\/\/www.backblaze.com\/blog\/backblaze-7-0-version-history-and-beyond\/\">Backblaze<\/a> (via <a href=\"https:\/\/tidbits.com\/2019\/10\/08\/backblaze-7-0-extends-version-history-supports-catalina\/\">Adam Engst<\/a>, <a href=\"https:\/\/news.ycombinator.com\/item?id=21193497\">Hacker News<\/a>):<\/p>\n<blockquote cite=\"https:\/\/www.backblaze.com\/blog\/backblaze-7-0-version-history-and-beyond\/\">\n<p>Backblaze has always kept a 30-day version history of your backed up files to help in situations like these, but today we&rsquo;re giving you the option to extend your version history to one year or forever.<\/p>\n<\/blockquote>\n\n<p><a href=\"https:\/\/twitter.com\/danielpunkass\/status\/1425147197801345028\">Daniel Jalkut<\/a>:<\/p>\n<blockquote cite=\"https:\/\/twitter.com\/danielpunkass\/status\/1425147197801345028\"><p>I recently learned that \n@Backblaze&rsquo;s 1 year extended backup doesn&rsquo;t work the way I (or my brother, who ran into this) expected. I thought it was &ldquo;everything just like 30 days plan - but for a year&rdquo;. Instead, if you haven&rsquo;t attached a drive for 30+ days you have to RE-UPLOAD it.<\/p><p>So the day is RETAINED for 365 days, but if you have a slow or expensive bandwidth connection, you have to make sure to re-attach drives &lt;30 days or else you have to re-upload. Even though they have the files? Disappointing. My laziness was counting on it behaving otherwise.<\/p><\/blockquote>\n\n<p><a href=\"https:\/\/twitter.com\/tjw\/status\/1497452574261145605\">Tim Wood<\/a>:<\/p>\n<blockquote cite=\"https:\/\/twitter.com\/tjw\/status\/1497452574261145605\">\n<p>I love* it when \n@backblaze\n says I&rsquo;m fully backed up and I double-check a (non-excluded) file and see it doesn&rsquo;t show up in the restore interface. Cool.<\/p>\n<\/blockquote>\n\n<p id=\"what-backblaze-doesnt-back-up-update-2025-01-07\">Update (2025-01-07): <a href=\"https:\/\/wadetregaskis.com\/backblaze-seemingly-does-not-support-files-greater-than-1-tb\/\">Wade Tregaskis<\/a>:<\/p>\n<blockquote cite=\"https:\/\/wadetregaskis.com\/backblaze-seemingly-does-not-support-files-greater-than-1-tb\/\"><p>For nearly a month now, Backblaze has been fixated on a particular file of mine, that happens to be over 1 TB in size. <\/p><p>[&#8230;]<\/p><p>Admittedly I&rsquo;m guessing somewhat, since that&rsquo;s a rather reader-hostile log message, but the combination of the <code><strong>Z_B_TOO_MANY_CHUNKS<\/strong><\/code> error mnemonic and <code><strong>chunkSeq=100001<\/strong><\/code> (because of its proximity to the arbitrary round number 100,000) strongly suggests that Backblaze is imposing a 100,000 chunk limit.  Since chunks are 10 MB each, that&rsquo;s exactly 1 TB.<\/p><p>This is unequivocally at odds with what they claim repeatedly on their website, on pages like <a href=\"https:\/\/www.backblaze.com\/cloud-backup\/features\/what-gets-backed-up\">What Backblaze Backs Up<\/a> and <a href=\"https:\/\/www.backblaze.com\/computer-backup\/docs\/file-sizes\">File Sizes<\/a>.<\/p><\/blockquote>","protected":false},"excerpt":{"rendered":"<p>Arq recently reported hundreds of GB of missing files, across multiple backup targets. This is so at odds with Amazon Glacier&rsquo;s reputed 11-nines durability that I&rsquo;m guessing it&rsquo;s due to an application bug. It would not surprise me if the files are still there; Arq just isn&rsquo;t seeing them. In any event, my strategy is [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"apple_news_api_created_at":"2019-12-19T19:59:28Z","apple_news_api_id":"7a5b4eff-58f1-4c15-82b0-5d7a4a619798","apple_news_api_modified_at":"2025-01-07T16:50:45Z","apple_news_api_revision":"AAAAAAAAAAAAAAAAAAAAAg==","apple_news_api_share_url":"https:\/\/apple.news\/AeltO_1jxTBWCsF16SmGXmA","apple_news_coverimage":0,"apple_news_coverimage_caption":"","apple_news_is_hidden":false,"apple_news_is_paid":false,"apple_news_is_preview":false,"apple_news_is_sponsored":false,"apple_news_maturity_rating":"","apple_news_metadata":"\"\"","apple_news_pullquote":"","apple_news_pullquote_position":"","apple_news_slug":"","apple_news_sections":"\"\"","apple_news_suppress_video_url":false,"apple_news_use_image_component":false,"footnotes":""},"categories":[2],"tags":[20,203,534,146,35,692,1902,30,32,608,369,1227],"class_list":["post-8861","post","type-post","status-publish","format-standard","hentry","category-technology","tag-glacier","tag-arq","tag-backblaze","tag-backup","tag-cloud","tag-crashplan","tag-extended-attributes","tag-mac","tag-macapp","tag-metadata","tag-superduper","tag-top-posts"],"apple_news_notices":[],"_links":{"self":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/8861","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/comments?post=8861"}],"version-history":[{"count":15,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/8861\/revisions"}],"predecessor-version":[{"id":46338,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/8861\/revisions\/46338"}],"wp:attachment":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/media?parent=8861"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/categories?post=8861"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/tags?post=8861"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}