New blog post - Mapping the Dutch web domain (all of it!) https://www.bitsgalore.org/2020/09/09/mapping-the-dutch-web-domain
New blog post on how I tried to restore (and subsequently capture to WARC) one of the oldest and most unique Dutch homepages from the mid-'90s https://www.bitsgalore.org/2020/06/30/restoring-liesbets-virtual-home
For those in search of some next-level digital preservation weirdness, look no further than my review of the ISO/IEC TS 22424 standard on #EPUB 3 preservation https://www.bitsgalore.org/2020/04/30/iso-iec-ts-22424-standard-on-epub3-preservation
Does Microsoft OneDrive export large ZIP files that are corrupt? UPDATED version of my earlier blog: it turns out that one unexpected ZIP64 field value results in ZIP files that are unreadable by most extraction tools and libraries https://bitsgalore.org/2020/03/11/does-microsoft-onedrive-export-large-ZIP-files-that-are-corrupt
My father’s hammer, and a small story of maintenance and digital preservation: https://blogs.bl.uk/webarchive/2020/03/theseus-data-store.html
Does Microsoft OneDrive export large ZIP files that are corrupt? New blog: https://www.bitsgalore.org/2020/03/11/does-microsoft-onedrive-export-large-ZIP-files-that-are-corrupt (if anyone has any more info or test results on this I'd love to know!)
Offline digital data carriers in the KB deposit collection, new blog https://www.bitsgalore.org/2020/02/20/offline-digital-carriers-kb-deposit-collection
Really useful blog post on how to add tracker-free commenting functionality to static Jekyll web sites using GitHub issues https://aristath.github.io/blog/static-site-comments-using-github-issues-api (I can confirm this actually works, as I used it for my own blog at https://www.bitsgalore.org/)
New bloggage - web domain geolocation and spatial analysis with QGIS https://www.bitsgalore.org/2020/02/11/web-domain-geolocation-and-spatial-analysis
One for the #jpeg2000 crowd - the action-packed official 2.0 release of jpylyzer is out now! More info here: https://jpylyzer.openpreservation.org/2019/11/20/Release-of-jpylyzer-2-0-0
We just published the jpylyzer 2 release candidate, which includes (among other things) support for raw codestream validation. More info here: https://openpreservation.org/news/jpylyzer-2-release-candidate-out-now/ #jpeg2000
Here's a weird and slightly spooky 1-minute trailer(!) I made for the upcoming jpylyzer 2.0.0 release (don't miss out on the audio on this one) https://youtu.be/gIutpFxGy28
Recovering '90s Data Tapes - Experiences From the KB Web Archaeology project https://www.bitsgalore.org/2019/09/09/recovering-90s-data-tapes-experiences-kb-web-archaeology Web-friendly version of the paper I wrote for the upcoming #iPres2019 conference in Amsterdam. Contains links to lots of tape-related resources
Now in blog form! "rsync, GUIs, power, control, design, and decisions" https://bits.ashleyblewer.com/blog/2019/06/29/rsync-guis-power-control-design-and-decisions/
Attention jpylyzer users - I'm considering some changes to jpylyzer's output format for an upcoming 2.0 release in November. See this note for details: https://gist.github.com/bitsgalore/300a295572606c17fa763335a255efaf If you have any comments or suggestions just let me know!
One for the floppy crowd - a simple disk imaging workflow tool (basically a simple graphical front-end around dd and ddrescue) https://www.bitsgalore.org/2019/04/10/a-simple-disk-imaging-workflow-tool
New bloggage - a simple workflow tool for imaging optical media using readom and ddrescue https://www.bitsgalore.org/2019/03/22/a-simple-workflow-tool-for-imaging-optical-media-using-readom-and-ddrescue
UPDATE: OPF call was this morning, and codestream validation in jpylyzer will happen! https://github.com/openpreserve/jpylyzer/issues/113 (already did a quick test last week which confirmed this is easy to implement)
An email I got yesterday about validating JPEG 2000-encoded video frames made me think about adding an option to jpylyzer for validating raw JPEG 2000 codestreams (through the Python API and the CLI). Is this something anyone would find useful? (Next week there'll be a call with OPF about the priorities for an upcoming jpylyzer release by the end of 2019, so if this is something people are interested in, let me know so I can try to push it up the priority list!)
Digital preservation, file formats.