For those in search of some next-level digital preservation weirdness, look no further than my review of the ISO/IEC TS 22424 standard on #EPUB 3 preservation https://www.bitsgalore.org/2020/04/30/iso-iec-ts-22424-standard-on-epub3-preservation
Does Microsoft OneDrive export large ZIP files that are corrupt? UPDATED version of my earlier blog, turns out that one unexpected ZIP64 field value results in ZIP files that are unreadable by most extraction tools and libraries https://bitsgalore.org/2020/03/11/does-microsoft-onedrive-export-large-ZIP-files-that-are-corrupt
My father’s hammer l, and a small story of maintenance and digital preservation: https://blogs.bl.uk/webarchive/2020/03/theseus-data-store.html
@mtnsic Oh, that's really interesting, might give this a try at some point. Very strange though, since as far as I'm aware most open-source zip libraries have supported ZIP64 for ages, which makes me wonder what those cloud providers are using instead.
Does Microsoft OneDrive export large ZIP files that are corrupt? New blog: https://www.bitsgalore.org/2020/03/11/does-microsoft-onedrive-export-large-ZIP-files-that-are-corrupt (if anyone has any more info or test results on this I'd love to know!)
Offline digital data carriers in the KB deposit collection, new blog https://www.bitsgalore.org/2020/02/20/offline-digital-carriers-kb-deposit-collection
Really useful blog post on how to add tracker-free commenting functionality to static Jekyll web sites using Github issues https://aristath.github.io/blog/static-site-comments-using-github-issues-api (I can confirm this actually works, as I used it for my own blog at https://www.bitsgalore.org/)
@andrewjbtw Like all Java applications I'm aware of, initialization of the Java VM is a pain if you want to process large numbers of files, but you can get around that by running the Tika server application. Once that's fired up, you can then process individual files using HTTP requests, which is pretty fast. See for more info + some examples here: https://cwiki.apache.org/confluence/display/TIKA/TikaJAXRS
New bloggage - web domain geolocation and spatial analysis with QGIS https://www.bitsgalore.org/2020/02/11/web-domain-geolocation-and-spatial-analysis
One for the #jpeg2000 crowd - the action-packed official 2.0 release of jpylyzer is out now! More info here: https://jpylyzer.openpreservation.org/2019/11/20/Release-of-jpylyzer-2-0-0
We just published the jpylyzer 2 release candidate, which includes (among other things) support for raw codestream validation. More info here: https://openpreservation.org/news/jpylyzer-2-release-candidate-out-now/ #jpeg2000
Here's a weird and slightly spooky 1-minute trailer(!) I made for the upcoming jpylyzer 2.0.0 release (don't miss out on the audio on this one) https://youtu.be/gIutpFxGy28
Recovering '90s Data Tapes - Experiences From the KB Web Archaeology project https://www.bitsgalore.org/2019/09/09/recovering-90s-data-tapes-experiences-kb-web-archaeology Web-friendly version of the paper I wrote for the upcoming #iPres2019 conference in Amsterdam. Contains links to lots of tape-related resources
@The_BFOOL Can't you just set up an autoforward in gmail (explained here https://www.lifewire.com/how-to-forward-your-gmail-email-to-another-email-address-1171906) to your ProtonMail address? Then always use your ProtonMail address for sending/replying, so over time people will stop using the gmail one.
Now in blog form! "rsync, GUIs, power, control, design, and decisions" https://bits.ashleyblewer.com/blog/2019/06/29/rsync-guis-power-control-design-and-decisions/
Attention jpylyzer users - I'm considering some changes to jpylyzer's output format for an upcoming 2.0 release in November. See this note for details: https://gist.github.com/bitsgalore/300a295572606c17fa763335a255efaf If you have any comments or suggestions just let me know!
One for the floppy crowd - a simple disk imaging workflow tool (basically a simple graphical front-end around dd and ddrescue) https://www.bitsgalore.org/2019/04/10/a-simple-disk-imaging-workflow-tool
Digital preservation, file formats.
digipres.club is a space for folks interested in productive conversations about, well, digital preservation! If you enjoy talking about how to do memory work with computers, or even with cardboard boxes of old photos, you belong with us on digipres.club. Many of us are/were Twitter users looking for an inclusive and community supported approach to social media. If any of these things sound good to you, consider joining us now.