Copying files isn't always a straightforward process (or, some things I've learned working with digital archives) https://blog.suppliedtitle.org/2021/10/19/copying-files-isnt-always-a-straightforward-process-or-some-things-ive-learned-working-with-digital-archives.html
Doc-minded colleagues: does anyone know of good + accessible tools or libraries for creating help-text overlays or guided actions on a web site?
Like, something that can be integrated into a front-end client that enables a "tutorial" mode with scripted actions and text you can advance through to explain the interface
Follow up to my earlier post about archiving and deleting my Facebook account, with suggestions on how I'd approach doing the same thing today https://blog.suppliedtitle.org/2021/10/08/so-you-want-to-archive-your-facebook-account.html
Watching in amazement as Ilya Kreymer talks at the Archiving the Black Web workshop about https://github.com/webrecorder/browsertrix-crawler which lets you crawl a website with multiple headless browsers and take the collected data and view it with https://replayweb.page -- while letting you even view a screencast of what the browsers are doing.
New blog, with some thoughts on a recent report on the Significant Properties of Spreadsheets https://www.bitsgalore.org/2021/09/24/on-the-significant-properties-of-spreadsheets
Government Labels https://inkdroid.org/2021/09/16/gov/ In the lead up to the 2020 US Presidential Election Twitter implemented new labels for government officials, organizations and state-affiliated media accounts. This was a follow on from their previous ban on state-backed political advertising in 2019.
By their own description Twitter apply these labels to:
Accounts of key government officials, including foreign ministers, institutional entities, ambassadors, official spokespeople, and key
Does anyone have a good workflow for converting articles & blog posts on the web to epub (preferred) or pdf for later reading on an ereader? #crosspost
New blog post -
PDF processing and analysis with open-source tools (an attempt at working various PDF-related working notes and lists into something more structured) https://www.bitsgalore.org/2021/09/06/pdf-processing-and-analysis-with-open-source-tools
Once a dollar disappears into Bezos's bank account, it is frozen in amber. No matter how many Subzero fridges Bezos fills with vintage Veuve, he'll barely dent his fortune.
Those dollars that pile up in the accounts of the wealthy are like oily rags piling up in the economy's garage. They can't be used for consumption, so they're pumped into assets, causing massive spikes in things like housing, raising the cost of living for everyone else.
Some thoughts about *that* talk _Why "Web Archiving" is No Longer a Useful Concept or Phrase_
sh is a kinda genius replacement for Python's subprocess module:
Does anyone happen to know what https://githubmemory.com is and how it works? I've been noticing it popping up in my search results more and more lately, and am ... confused.
hello fediverse! I'm looking for openstreetmap users (or rather: editors) that would be willing to talk to me for a journalistic article (in the Luxembourgish leftist weekly "woxx") about their experience.
(you can also just reply to this toot and tell me why you started to edit osm/whats your motivation to do it/what problems you see)
boosts more than welcome
I'm just noticing that browsertrix-crawler added an experimental feature to provide a browser screencast of what web content is being crawled. I can see how this could be super useful for site-specific behaviors that scroll pages, and click on detail links, etc.
I have some exciting professional news to share!
The Sloan Foundation has generously funded "Collaborating on Software Archiving for Institutions" #CoSAI, with 3 streams of work:
1) technical development on open source, decentralized toolkit for software archiving & replay; 2) community building and education on software archiving; and 3) optimizing machine and human workflows
Psyched that this is a big collaborative project, feat. the one and only @wilkie
digipres.club is a space for folks interested in productive conversations about, well, digital preservation! If you enjoy talking about how to do memory work with computers, or even with cardboard boxes of old photos, you belong with us on digipres.club. Many of us are/were Twitter users looking for an inclusive and community supported approach to social media. If any of these things sound good to you, consider joining us now.