What are they building?! 🌌 🚀 🛰

7 years after this company closed they've still got their landing page up with vintage web design

@joe it works - put the access folder in the folder with your originals. I did this:

$ tree BagAccess
├── bag-info.txt
├── bagit.txt
├── data
│   ├── access
│   │   └── MARBLES.jpg
├── manifest-sha512.txt
└── tagmanifest-sha512.txt

and started an 'unzipped bag' transfer, chose 'Do Not Normalize' and got a DIP as well as an AIP
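If anyone wants to build a test bag with that layout, here's a rough sketch using only the Python standard library. It just writes the payload under data/, puts the access copies in data/access/, and generates the sha512 manifests. The function names and the way files are passed in are made up for illustration, and it hasn't been checked against Archivematica itself, so treat it as a starting point rather than the way to do it.

```python
import hashlib
from pathlib import Path


def sha512_of(path: Path) -> str:
    """Return the hex SHA-512 digest of a file."""
    return hashlib.sha512(path.read_bytes()).hexdigest()


def write_manifest(bag: Path, name: str, files: list[Path]) -> None:
    """Write a BagIt-style manifest: '<digest>  <path relative to bag>'."""
    lines = [f"{sha512_of(f)}  {f.relative_to(bag).as_posix()}" for f in sorted(files)]
    (bag / name).write_text("\n".join(lines) + "\n", encoding="utf-8")


def make_access_bag(bag: Path, originals: dict[str, bytes], access: dict[str, bytes]) -> Path:
    """Hypothetical helper: originals go in data/, access copies in data/access/."""
    data = bag / "data"
    (data / "access").mkdir(parents=True, exist_ok=True)
    for name, content in originals.items():
        (data / name).write_bytes(content)
    for name, content in access.items():
        (data / "access" / name).write_bytes(content)

    payload = [p for p in data.rglob("*") if p.is_file()]
    octets = sum(p.stat().st_size for p in payload)

    (bag / "bagit.txt").write_text(
        "BagIt-Version: 0.97\nTag-File-Character-Encoding: UTF-8\n", encoding="utf-8"
    )
    (bag / "bag-info.txt").write_text(
        f"Payload-Oxum: {octets}.{len(payload)}\n", encoding="utf-8"
    )
    write_manifest(bag, "manifest-sha512.txt", payload)

    # The tag manifest covers the tag files, including the payload manifest.
    tags = [bag / "bagit.txt", bag / "bag-info.txt", bag / "manifest-sha512.txt"]
    write_manifest(bag, "tagmanifest-sha512.txt", tags)
    return bag
```

Calling make_access_bag(Path("BagAccess"), {"MARBLES.tif": ...}, {"MARBLES.jpg": ...}) should give a folder you can point an 'unzipped bag' transfer at; the .tif name is only a placeholder for whatever your original is.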


Does anyone know how this works in Archivematica (if it does) for bags?

Specifically, where would the access folder go in the bag in order for Archivematica to recognize it during a transfer?

Thanks @joe for writing this incredibly useful FRED guide. Looks like a must-read resource for anyone working on reading/imaging born-digital content from old media!

It seems like it would be easy enough to do with the data you can get from their API. But I'm all about making less work for myself

For context, I have a bunch of URLs and I want to see whether they've been crawled, and how often, so I can prioritize which ones to set up as seeds

Does anyone have a script that can take a URL and determine if it's been crawled by IA and then print the number of times with the date range? e.g. URL was crawled 12 times between 2002 and 2018
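Here's a minimal sketch of what such a script could look like, assuming the Wayback Machine CDX API is the data source. It asks for just the timestamp field, counts the captures, and reports the first and last year. The function names are made up, "crawled" here really means "captures returned by the CDX query" (so redirects and error captures count too), and there's no retry or rate-limit handling.

```python
import json
import sys
import urllib.parse
import urllib.request

CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def fetch_timestamps(url: str) -> list[str]:
    """Return the capture timestamps (YYYYMMDDhhmmss) the CDX API lists for url."""
    query = urllib.parse.urlencode({"url": url, "output": "json", "fl": "timestamp"})
    with urllib.request.urlopen(f"{CDX_ENDPOINT}?{query}", timeout=60) as resp:
        body = resp.read().decode("utf-8").strip()
    rows = json.loads(body) if body else []
    # With output=json the first row is a header naming the fields.
    return [row[0] for row in rows[1:]]


def summarize(url: str, timestamps: list[str]) -> str:
    """Turn a list of capture timestamps into a one-line summary."""
    if not timestamps:
        return f"{url} has no captures"
    years = sorted(int(ts[:4]) for ts in timestamps)
    count = len(timestamps)
    plural = "" if count == 1 else "s"
    return f"{url} was crawled {count} time{plural} between {years[0]} and {years[-1]}"


if __name__ == "__main__":
    # Usage: python crawl_counts.py example.com another.example.org
    for target in sys.argv[1:]:
        print(summarize(target, fetch_timestamps(target)))
```

Keeping the summary separate from the fetch means you can also feed it timestamps from a saved CDX dump instead of hitting the API for every URL on a long seed list.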

It's always nice when you send an email and someone agrees to query their database for you instead of having to figure out a way to scrape their site. I've now got a 300+ URL seed list to dig into!

oh hey, who knew readpst worked with ost files too. Yay mbox conversion 📬

GLAM friends! A friend let me know that there's an opening for a records management job at the Vancouver company she works for.

Looks like the iPres 2018 keynote speakers have been posted:

"And here is a dark truth of planning for β€œclimate resilience.” Decisions about which areas will be protected are not only about whose safety will be guaranteed; they also involve transnational concerns like reassuring global investors and preserving manufacturing supply chains."

A scheduled web archiving crawl contained a legacy scoping rule to accept all URLs containing "blogs" and has resulted in over 500,000 hosts from ... whoops

Found out about the dark-grey theme for MS Office and my eyes are feeling better already 😎 🌚

I can only assume that the "SystemKernelPanic.rtf" file saved to the desktop on one of our processing computers is meant as a daily memento mori for me and other digital archivists to come