"Introduction to the WARC" ICYMI: https://archive.org/details/introduction-to-the-warc
At least ~150 different Archive-It partners have collected COVID pages, sites, and posts. That's currently over 20,000 seeds and nearly 10TB of data stored. And growing obvs 📈 Assuming ~someone~ were to unify them at an access point 😏 what functionality might you want to see and use beyond full-text search?
People wring their hands a lot about IA, but in the end I have a copy of Matthew McConaughey in Unsolved Mysteries and you don't, so https://web.archive.org/web/20140221043225/http://www.youtube.com/watch?v=6_pJ8oiFeGs&gl=US&hl=en
Actual linked open data klaxon https://support.archive-it.org/hc/en-us/community/posts/360048039451-New-training-webinar-The-Archive-It-Partner-Data-API
Join us if you'd like to attend and/or help to organize a climate grief event in Austin around the edges of the SAA/CoSA meeting http://landscapelibrarian.com/climate-grief
An example of the problem with crowdsourced tagging with no review. My favorite tag is "CHECK OUT MY SOUNDCLOUD FAM ITS FIRE AF MAN YOULL BURN YOUR HANDS IF YOU TOUCH IT (LIT AF) http://edgerton-digital-collections.org/?s=hee-nc-57001#hee-nc-57001
Web archivist for IA