Google Is Shutting Down Album Archive

mholt · on June 17, 2023

This appears to be NEITHER your "Archive" NOR your albums in Google Photos. This is basically a collection of media you uploaded to Blogger, Hangouts, and Picasa Web Albums.

Still, doesn't hurt to Takeout your Google Photos every ~6 months.

(I am working on an app that will help you organize and view them, together with your text messages, location history, and other online-only data.)

citelao · on June 17, 2023

I would be extremely interested in a Google Takeout viewer if you ever end up releasing one.

I dealt with Google Takeout, trying to export my photos to Apple Photos (when Google was planning to charge money for old Google Workspace accounts), and I found it extremely difficult to deal with the file format. The script I wrote (https://github.com/citelao/google_photos_takeout_to_apple_ph...) ended up being decently reliable, but there were a ton of weird mismatches between the EXIF data in Google Photos metadata and the EXIF data in the photos themselves. Although some of that wonkiness was Apple Photos, not Google.

I'd love to see software that could wrangle the mess :)

mholt · on June 17, 2023

It's called Timelinize (might rename it?), and you can follow it here: https://twitter.com/timelinize (Click "Media" on the Twitter account to view a few screenshots for a preview. More to come!) (There's no website or project page yet because I've been busy developing.)

If you want an invite to try out an early dev preview today, follow @timelinize on Twitter and tweet at it, I'll see about getting you into the Discord.

Some background:

Saving a local copy of my Google Photos has been a passion project of mine since ~2014 (before Google Photos even!). For years it was only focused on downloading the data using APIs -- but then we found out that Google strips location data (from your own photos!) if using the API, so I added Takeout support.

The problem is there was no viewer. Well in 2019 I finally started working on a viewer. It has evolved a lot since it's a very ambitious project and there's nothing quite like it.

It's not just Google Photos: it's any photos and videos. It's also for your text messages and emails. And your location history. And contact list. And chat apps. And really, any files you have. It also supports Facebook, Twitter, and Instagram account exports too. Oh, and iPhone backups.

Timelinize is entity-aware, and it can map identities across data sources (with enough info, or with a manual mapping, or some optional heuristics). It's just not a photo gallery.

It's basically a really detailed view of your life and online history. It's neat because I have my family pictures, my text messages between me and my wife when we were dating (and after of course), and there's different views to explore: map, timeline, conversations, gallery, and more to come (calendar, etc).

We can even place non-geolocated data on a map since we can correlate timestamp and entity. So when we went on our honeymoon, I can see text messages received from friends while we were driving to a beach.

It's really quite immersive and magical and I haven't seen anything quite like it.

And everything is stored on your own computer, it's a GUI app and you have to have enough space to store your stuff. The data is just organized as files within a folder on disk, with a SQLite DB holding the index and the small textual items.

chaxor · on June 17, 2023

What is the correct tool to properly merge a large set of tar.gz files for which may have an enormous overlap of similar files, and some that have been altered just slightly?

Git plus some parsing seems close in that space, as analyzing the files to create a dendrogram like tree of potential alterations to files over time by levenstein distance may be useful to approximate commit history. However, this doesn't seem to exist or be popular as a tool. There's vimdiff or meld, but they are extremely manual and tedious to the extent of being pointless to try for something like a large history of takeout tar.gz's.

Throwing in the towel completely, borgfs can be helpful to reduce the amount of space they take by de-duplication on the block level, but this is a terrible solution as it doesn't really track file changes in a reasonable way, etc. It is useful to extract the files into a directory without the tar or gz, but this can also cause issues with how to appropriately organize the directory structure over the history.

Any thoughts or projects that do a better job of this?

mholt · on June 17, 2023

> What is the correct tool to properly merge a large set of tar.gz files for which may have an enormous overlap of similar files, and some that have been altered just slightly?

Can you elaborate on this? My understanding is that they should all extract into the same target folder without issues because each archive's set of files is distinct. But maybe I'm just assuming wrongly?

What exactly is your goal, too? It sounds like you are trying to find and de-duplicate visually similar images? Like what do you mean by "enormous overlap" or "altered just slightly"?

chaxor · on June 22, 2023

The problem isn't one takeout overlapping (multiple zips from one date) it's many takeouts over the years (full history).

So for example in 2001, you make a takeout with 30 zips, and then delete half of your photos off of Google. Then in 2007, you have another 20 zips, and delete 25% of your emails and photos to make more room, 2008 again, on up to now.

So now you have a big folder with many zips, and maybe some extracted folders, because things happen over the years, etc.

What's the best tool to merge all of this into one directory?

Got can help for the notes from Google keep that may have had things appended to or removed, photos can be overlapping a bit so really a set union is all that's required for many files, but some will be slightly different like the Google keep notes.

My best thought is to make some git repo and add things in, but to do a levenstein distance on the bits of each file to check if there is overlap in content and to estimate the 'lineage' of a file if there is significant overlap with another. Effectively you reconstruct the git commit tree with the set of all files over all histories. Then you build the git repo history from all of the files.

This would likely just be a local git repo since it would likely be several terabytes of info, but that would be the general idea I guess.

I just haven't found a good tool to actually do this easily unfortunately, but it seems like it would be a very basic , or commonly used scenario (especially for those 'should-be-a-git-repo' directories that everyone made before knowing about git. You know the ones: 'myfile.v1.doc', myfile.v2.doc', 'myfile.final.doc', myfile.reallyfinal.doc', myfile.finalfinal.doc')

mholt · on June 22, 2023

_> So now you have a big folder with many zips, and maybe some extracted folders, because things happen over the years, etc._

Oh, right.

Timelinize can do that. Takeout all your data, then import it into Timelinize. Then delete your Takeout (after Timelinize is finished and stable, of course, heh). Then next time you Takeout, just import it all into Timelinize again. (It de-dupes!) Then delete the Takeout, etc. (Maybe Timelinize can do the cleanup for you someday.)

The de-duping depends on the item being recognizable. Best if the data source provides an ID. Otherwise, things like certain metadata and content can be used to determine duplicates.

citelao · on June 17, 2023

I'm not chaxor, but as far as I remember I think you're right:

If I had unzipped all the takeout directories into one giant folder, there'd be no conflicts.

Since I didn't do that, I had to do weird multi-pass parsing, since an album could be split across multiple ZIPs. I get a bit neurotic around backups like this, so I'd have loved some sort of virtualized filesystem that non-destructively represented all of those zips "merged together." But in retrospect, I should have just merged the directories into one folder---would have made parsing easier :)

I don't recall substantial problems with duplicates. Just weird renames and EXIF data mismatches. And since I was trying to archive my data, I definitely didn't want similar photos to be deduplicated.

My problems are probably different than chaxor's, though.

whynotmaybe · on June 17, 2023

Does that mean that all the images in my blog posts on blogger will disappear?

So troubled to hear that a product I never used might disappear and that the impact might be massive for me... or not.

megamike · on June 18, 2023

it appears that way what a mess!!

predictabl3 · on June 17, 2023

I have been getting those photos-cdp emails and noticed your name. I wait in earnest.

noman-land · on June 17, 2023

Just one correction. You should only run Takeout once and then delete all your Google accounts and never look back.

stOneskull · on June 17, 2023

i think give some time to make sure you've changed all your log-ins. there's bound to be accounts out there you've forgotten about and might not be able to use. maybe once you don't get any important emails for 6 months? and then there's like updating and moving all google authenticator entries to something else. things like that. anyway, don't just rush it!

LoganDark · on June 17, 2023

Note to anyone who might be reading this and thinking "that's me" - if it's you, then you should move to a password manager and create a database of all your accounts. I once spent a couple days moving every account I have ever created into KeePassXC, and changing all of their passwords to unique randomly-generated ones. You only have to do this once, and given enough time (making sure you add accounts over time that you need to use but missed), you can be fairly confident that you know every single account you have, anywhere.

At least any account that you'd ever need to log into.

Then, you'd know every account that uses your gmail, but also you'd know the password of every account to change the email in the future without needing a password reset, if you want to delete your email early. Of course, assuming someone doesn't have a breach or something and require a reset anyway, which happens from time to time.

Still good to have that passwords database.

I personally replicate mine between multiple computers and also publish it to my web server on someone else's infrastructure. I will never forget another password again, because I will never need to know another password again. I will also always know what accounts I have or whether I've used a particular service before.

sunnybeetroot · on June 17, 2023

One step further, buy a domain that offers email forwarding. When you sign up to websites, use this domain and have your email forwarded to whatever free email provider you choose. If you change providers you only need to change your domain forwarding email.

LoganDark · on June 17, 2023

Oh yeah. I currently use Firefox Relay for this but they're on nearly every abuse list by this point so I can't even sign up for some websites. The downside of using domain forwarding instead though is that suddenly I'm responsible for my email and that's a risk that I don't want to take.

I do actually accept emails addressed to logandark.net. I just don't rely on it.

amai · on June 17, 2023

Does that mean blogger will also be shut down soon?

PugPaladin · on June 17, 2023

The email I got says that Album Archive is shutting down, that I accessed it yesterday, and that I should use Google Takeout to export a single photo that I can't access (?). So I got to Google Takeout and I'm told that I have one export remaining. None of this makes any sense. I have never heard of any of these.

I'm really surprised at how bad Google is at making products given how much money they have and the level of skill the people have there. There are so many problem across all of their business and consumer products that I've seen. I can't see what the draw is to keep using their stuff, especially since they are so good at killing stuff off.

w-ll · on June 17, 2023

> that I accessed it yesterday

Same. I thought my account might have been comprised after seeing this email.

flomo · on June 17, 2023

Similar experience. Fortunately there is an activity log link, showing that Dropbox uploaded something which looks like a CD cover there once in 2011. important notice.

SillyUsername · on June 17, 2023

Click that single photo. It's a folder, at least it was for me.

ekianjo · on June 17, 2023

You can have all the money in the world and still be utterly disorganized

modeless · on June 17, 2023

Whoever sent this email should be reprimanded. The vast majority of recipients have never used this "product". The images will not be deleted when it's killed because they actually reside in other products such as Blogger and will still be downloadable using Takeout[1], so approximately nobody needed to be notified. But now this is yet another PR cycle for the "Google kills everything" meme.

[1] https://support.google.com/picasa/answer/7008270

hotpotamus · on June 17, 2023

I got the email and was confused because I've never heard of it, but I see another comment in this thread that it's related to Picasa, which I certainly did use, and really liked. So this really just serves as a little salt on that old wound.

acomjean · on June 17, 2023

I too got the email and was confused. I thought all my photos went from Picasa to google photos… I actually thought the email might be someone trying phish me.

Google seems to really need a change to management. I’m not sure what’s going on over there.

asciii · on June 17, 2023

> Whoever sent this email should be reprimanded.

I have never heard/used this product. Maybe there were layoffs and no one cares...or an uninformed individual sent the mass email with short notice.

Either way, very telling! Also bad press is good press, I guess?

nocoiner · on June 17, 2023

I got the email, thought, “huh, never heard of it.” Wasn’t curious enough to find out until I saw this post.

Still didn’t understand what exactly this was or how I used it, so I clicked. 30% of the screen (on mobile) is a banner telling me I should be using chrome. Another 40% banner is telling me the site is going away soon soon soon. Another 10% is exhorting me to make connections with my account so others can see my (soon to be deleted) photos. And the remainder of the screen is an empty gray box, with no photos or other content, just gray as far as the eye can see (until you hit one of the aforementioned banners); all that remains in light gray letters is the text “Looks like you’ve reached the end.”

A real modern day Ozymandias.

asciii · on June 17, 2023

sounds nightmarish

Though I’m not seeing what you describe on Safari with iOS 16.5

nocoiner · on June 18, 2023

Ah, I guess I had to request the desktop site to be able to upload to imgur. If you still care, this is what I saw when I clicked the link in the email.

https://imgur.com/a/31pSERB

For some reason, you may get a sensitive media age gate on that link? No idea why, it is entirely innocuous.

nocoiner · on June 17, 2023

it was so bad, I actually took a screenshot and was going to post it, but I guess imgur doesn’t allow anonymous uploads anymore. Wonder how long that’s been the case.

I’ll look for a decent looking photo host that’s not going to mercilessly exploit my metadata and edit and post the screed cap.

rhaway84773 · on June 17, 2023

So many family members frantically texting me worried that their Google Photos are about to be deleted.

the_af · on June 17, 2023

And they have every right to be scared. I mean, Google is saying they are deleting something related to photos. Who knows what the hell it is, but it sounds scary.

I still cannot figure out what exactly they are shutting down, and I'm computer savvy.

captaincrisp · on June 17, 2023

> Whoever sent this email should be reprimanded.

Even for the subject line alone. "An update to X" is a terrible summary in general but particularly for "We're deleting X".

ekianjo · on June 17, 2023

Google is supposed to know everything about every of their users but they cant figure out how to send emails only to the ones concerned. Such epic fail

SketchySeaBeast · on June 17, 2023

Apparently I sent two images to a friend in 2015 and they also ended up there. I have no idea how or why, nor do I care. Bizarre.

Rexxar · on June 17, 2023

I had one image added in 2013 in this service. No idea what this is exactly but the mail was indeed confusing.

jstrong · on June 17, 2023

> But now this is yet another PR cycle for the "Google kills everything" meme.

I mean, they earned their reputation, and then some.

dharmab · on June 17, 2023

Another one for the graveyard (https://killedbygoogle.com/)

royal_ts · on June 17, 2023

That lists just about everything even if an app just got merged 1:1 into another app. It's so dumb and doesn't even add context, may be valid to add in this case but often it's just dumb

rcme · on June 17, 2023

> That lists just about everything even if an app just got merged 1:1 into another app

Isn't that part of the meme? Why does Google have so many similar products that it's possible to merge them 1:1?

Gigachad · on June 17, 2023

It also lists previous versions of things that still exist. They have Angular listed as killed by google because Angular 2 replaced it..

Or listing the lite versions of apps which became obsolete when phones were able to run the full app.

Andrex · on June 17, 2023

People blindly parroting memes don't exhibit discernment or intelligence? Whoah.

the_af · on June 17, 2023

Another one for the graveyard, sure. If only I could figure out what it is they are killing this time!

phpisatrash · on June 17, 2023

I was waiting for this comment

yazzku · on June 17, 2023

I was hoping it was just 'Google Is Shutting Down'. Hope to see that one in a graveyard soon.

akiselev · on June 17, 2023

When Google finally figures out AGI, we'll know almost immediately. The second it comes on, it will ingest all of Google's data and immediately deduce that its purpose is to dismantle and shut the company down, before marking killedbygoogle.com feature complete at last and dissolving into the digital æther.

ccooffee · on June 17, 2023

"The Last Question" by Asimov is a classic scifi short story that parallels this comment (presumably intentionally). The story is a short read and available for free on archive.org[0].

[0] https://archive.org/details/Science_Fiction_Quarterly_New_Se...

SillyUsername · on June 17, 2023

WTF does this mean for Blogger?

Does it have a migration route for the photos?

If there's no migration route, a tool that could be created by one Google developer, it means thousands of people now have work to fix thousands of Blogger posts. Cumulatively ~48 hours of work for a tool has just turned into at least 48000 hours for everyone else.

What's worse is, people will fix their blogs but I bet Google will use broken links as (another) justification to shut down Blogger next.

That means those 48000 hours will be an utter waste of time.

If you ever needed proof Google have no fucking strategy or clue anymore this is it (that and shutting down Google Music when YouTube Music was incomplete, people just moved to competitors).

adsfqwop · on June 18, 2023

This is only a guess, but I get the impression Album Archive is some sort of aggregator service. The original Blogger images appear to still be stored in Blogger's own media manager.

For example, many images referenced from Blogger posts contain links with URLs like https://1.bp.blogspot.com/... further strengthening the case that these images originate from Blogger, not the Archive service.

So in summary, my guess is that shutting down Album Archive will not affect Blogger media files.

Another case for this conclusion, is that it seems totally insane to delete all of Blogger's media files, and not inform Blogger users directly, but instead do it through some obscure Archive service notice.

Would be nice to get an official confirmation on this, but let's say we can make an educated guess that people's Blogger photos are probably safe.

judge2020 · on June 17, 2023

Hedge funds are demanding Google downsize more than it already has. Layoffs look bad so they may be disguising more layoofs by shutting down products - Domains especially likely requires a few hundred engineers and UI/UX people across the many aspects of the service.

NotYourLawyer · on June 17, 2023

No hedge fund owns enough Alphabet shares to be able to order the company around.

jeffbee · on June 17, 2023

Never heard of it. When I visited the link, the only thing in it is stuff Google added from Hangouts, a past product that shut down.

mkroman · on June 17, 2023

I think it's the remnants of Picasa Web. The web UI looked similar to me to the web UI they had for that in the years after.

the_af · on June 17, 2023

For the life of me I cannot figure our what Album Archive is. I remember being similarly confused when Google shut down... what was it? Photos? Picasa? And then nothing really happened.

I see I have pics I uploaded to Blogger showing up in this thing called Album Archive. Will they be deleted when it's gone? Fuck if I know.

Google, get your shit together. Your services and mails about them are goddamn confusing.

And I work with computers for a living! If I can't figure this out, sure as hell my parents won't either.

inopinatus · on June 17, 2023

This is one way to implement the right to be forgotten, I suppose.

jillesvangurp · on June 18, 2023

I just downloaded the archive it created: basically a couple of random photos. The archive was 3.3MB. The last activity was from 2013 when somebody else took a photo of me and shared it with me somehow via picasa web. The other activity was from 2007.

Not very recent in other words.

greatgib · on June 17, 2023

I just got the email from Google. For a company this size, a one month notice is an asshole move.

PeterStuer · on June 17, 2023

I had never heard of album archive, but apparently I have one. I started the export which might take 'hours or even days'. Looking forward to discover what is in there.

candiddevmike · on June 17, 2023

Why the flurry of decommissions from Google recently?

CatWChainsaw · on June 18, 2023

More compute for the Bard maybe?

pcurve · on June 17, 2023

What was even the point of this service anyway?

sharts · on June 17, 2023

wtf is "Album Archive"?

megamike · on June 18, 2023

so now how do you upload a photo to your blog??