One for the floppy crowd - a simple disk imaging workflow tool (basically a simple graphical front-end around dd and ddrescue) https://www.bitsgalore.org/2019/04/10/a-simple-disk-imaging-workflow-tool
New bloggage - a simple workflow tool for imaging optical media using readom and ddrescue https://www.bitsgalore.org/2019/03/22/a-simple-workflow-tool-for-imaging-optical-media-using-readom-and-ddrescue
UPDATE: OPF call was this morning, and codestream validation in jpylyzer will happen! https://github.com/openpreserve/jpylyzer/issues/113 (already did a quick test last week which confirmed this is easy to implement)
An email I got yesterday about validating JPEG 2000-encoded video frames made me think about adding an option to jpylyzer for validating raw JPEG 2000 codestreams (through the Python API and the CLI). Is this something anyone would find useful? (Next week there'll be a call with OPF about the priorities for an upcoming jpylyzer release by the end of 2019, so if this is something people are interested let me know so I can try to push it up the priority list!)
Working on a lecture for next week inspired me to properly condense all my bookmarked emulation resources into one basket, and put the list out there for everyone.
If you've got favored blogs/papers/guides for emulation, please contribute and share!!
Following a couple of days of digging, dusting and editing, restored versions of all preservation-themed blogs I've written since 2010 are now available here https://www.bitsgalore.org/ Will probably use this as my main blogging platform from now on.
Wrote a new blog post on recovering ’90s data tapes, with links to full workflow descriptions (including hardware configuration), easy to use recovery software and tonnes of other tape-related resources http://openpreservation.org/blog/2019/01/31/roll-the-tape-recovering-90s-data-tapes-in-bitcurator/
Out now: new release of the tapeimgr tape extraction tool. Now with improved metadata support (incl entry of identifiers, descriptions and annotations through GUI or CLI). Default output dir is now configurable as well https://github.com/KBNLresearch/tapeimgr
Out now - tapeimgr 0.3.0 https://github.com/KBNLresearch/tapeimgr Main change is that it now runs without root access; see release notes for more details https://github.com/KBNLresearch/tapeimgr/releases/tag/0.3.0
Python-based tape extraction tool (GUI + CLI) is shaping up nicely (code will follow once I've sorted out some remaining details) ....
New bloggage - Crawling offline web content: the NL-menu case http://openpreservation.org/blog/2018/07/11/crawling-offline-web-content-the-nl-menu-case/
Last fall I wrote a bunch of Apache Tika mimetype patterns for detecting various versions of Lotus 1-2-3 and Quattro Pro spreadsheets. Almost forgot about this, then I just saw they're finally merged into Tika's master branch https://github.com/apache/tika/pull/209#event-1723687193
this rings so true omg "Contrary to common belief, the volume of face-to-face interaction decreased significantly (approx. 70%) in [two field studies transitions to open office plans], with an associated increase in electronic interaction. In short, rather than prompting increasingly vibrant face-to-face collaboration, open architecture appeared to trigger a natural human response to socially withdraw from officemates and interact instead over email and IM." http://rstb.royalsocietypublishing.org/content/373/1753/20170239
Does anyone have any thoughts advice on how to crawl a website from localhost with wget, while preserving all files in the source directory? More context here: http://qanda.digipres.org/1166/crawl-website-localhost-preserving-files-source-directory
W3C is looking for a developer/technical lead for the #epubcheck tool. See RFP for more info https://github.com/IDPF/epubcheck/wiki/Epubcheck-Development-Update-and-Maintenance-Request-for-Proposal #epub
Thanks @joe for writing this incredibly useful FRED guide https://josephcarrano.wordpress.com/2018/06/28/a-guide-to-fred-and-digital-preservation-sundries/ Looks like a must-read resource for anyone working on reading/imaging born-digital content from old media!
For those interested in the omSipCreator tool (which creates ingest-ready SIPS with METS files from batches of optical media images), I did some major refactoring on the code and added some more documentation. All available at https://github.com/KBNLresearch/omSipCreator
Digital preservation, file formats.
digipres.club is a space for folks interested in productive conversations about, well, digital preservation! If you enjoy talking about how to do memory work with computers, or even with cardboard boxes of old photos, you belong with us on digipres.club. Many of us are/were Twitter users looking for an inclusive and community supported approach to social media. If any of these things sound good to you, consider joining us now.