New blog post: I quit twitter and now my brain functions again! Everyone should do this https://lissertations.net/post/1651
Hey. Did I mention you can come to work with me on here? You totally should apply. Otherwise please signal boost to under represented groups and yes they can HMU https://main-princeton.icims.com/jobs/12544/library-it-operations-engineer/job
Bookmarked: More Thoughts on Pre-recording Conference Talks | Disruptive Library Technology Jester https://dltj.org/article/pre-recording-conference-talks-redux/#the-tree-of-tweets A nice example from @DataG on using the replay-web-page web component to display a web archive in a blog post. No Wayback infrastructure required! Just static files on the web, in your browser.
Here's a nice little example from Peter Murray of using the <replay-web-page> web component to embed a web archive recording (WARC data) in a blog post.
command line tricks
Today I learned that `ssh` interacts with stdin/stdout normally. So you can pipe data from machine1 into a command on machine2!
For example, my local machine is a Mac that uses `pbcopy` to put stuff on the clipboard. Ex:
$ echo 'foo' | pbcopy
puts "foo" on my clipboard. If I'm ssh'ed in to a remote machine & I'm running sshd locally, I can do
$ echo 'foo' | ssh myLocalMachineIP pbcopy
and that puts the output of the script from the remote machine onto my local clipboard!
Some researchers referred me to this site as a sustainable repository for machine leaning models etc. https://huggingface.co/
Checked the Internet Archive, and in 2019 it was "The social AI who learns to chit-chat, talks sassy and trades selfies with you." http://web.archive.org/web/20190730145501/https://huggingface.co/
Not quite ready to put all my eggs in that basket. 😀
The COVID vaccination dataset from the Maryland Department of Health that we've been using in class is already a 404. They changed the shape of the dataset to now include single dose vaccinations, and changed the title, which changed the URL. I guess it's good for students to get a feel for how volatile data on the web can be, even when published by local authorities. Maybe they will do better than us.
It feels like tooting about conferences isn't really a thing on 🐘, but I just want to say that there are probably quite a few folks on here who would have enjoyed #CollabW21 this week and might want to keep an eye out for other events from the Software Sustainability Institute.
It's by far the friendliest conference I know, exceptionally well-planned online delivery, and relevant to anyone who writes software for research, from 5-line analysis scripts up. https://www.software.ac.uk/cw21
I've always been impressed with DigitalOcean documentation but is there a good guide that answers more general questions like, what do I need to understand to run my own server? I'd like to do things like run my own personal nextcloud but I'm not sure how much I need to know about security or what the time commitment might be. So I continue to run some of my own stuff off a raspberry pi on a local network. I do have my blog on a hosting provider but it's static and I don't manage the server.
#NFT that just vanish after someone took your money. #WhereIsMySurprisedFace https://www.vice.com/en/article/pkdj79/peoples-expensive-nfts-keep-vanishing-this-is-why
I'm looking for a resource that is similar to a privacy toolkit for consumers but that is focused on small website owners.
I'm hoping for something that will for example tell you about Matomo as alternative to Google Analytics. That will tell you how to avoid placing Facebook tracking pixels and how to spot your site using other resources (like fonts and libraries) that track your users. Maybe even recommend some static-site systems?
Does anyone know of something like that?
my weekend proof-of-concept: upload mediainfo.json about your mysterious a/v collections, and get some charts to help you understand what you're working with!
What's the plan for this #GLAMDataSci thing, anyway? Glad you asked! Here's a roadmap for how it should shape up for the rest of 2021:
Right then, I've got as far as I can on my own with the #GLAMDataSci network (which was never going to be very far… 😆), and now it's your turn! It's time to run a launch workshop to plan the network's first 6 months or so, and I need your help.
In return, you’ll get to join and shape a community that will help you and others develop confidence, and skills in stats and data science. We want everyone to be able to make the most of data, ethically, in the Cultural Heritage sector. 1/2
Especially needed right now is a handful of folks to plan and run that workshop, but just taking an hour or two out of your day to attend and contribute to that workshop would be incredibly valuable. Sign up to our mailing list at https://tinyletter.com/glamdatasci to hear about that.
More details here on things that we need and ways to get in touch: https://glamdatasci.network/2021/03/help-needed-preparing-to-launch/
OPF had a good discussion led by the national archives of Sweden in a webinar "A Call to Help: collecting obsolete equipment and playback devices" this morning.
Made me think (with even more conviction) that the "loss" of expertise with older tech is not really a loss at all, it's 1) a misunderstanding of how tech knowledge has always been gained (through time + practice) and 2) an utter failure of search and discovery systems, esp for web archives
digipres.club is a space for folks interested in productive conversations about, well, digital preservation! If you enjoy talking about how to do memory work with computers, or even with cardboard boxes of old photos, you belong with us on digipres.club. Many of us are/were Twitter users looking for an inclusive and community supported approach to social media. If any of these things sound good to you, consider joining us now.