Digital Preservation

2023

Robust file transfers with Rclone
Move slow and keep things · 3340 words
building-web-archives Digital Preservation Web Archives
My Format Identification Misunderstandings
Those were not the DROID descriptions I was looking for. · 216 words
format-identification Digital Preservation
Speeding Up Format Identification
Fast versus fastidious? · 1137 words
format-identification Digital Preservation
More Magic From Siegfried
...and Roy! · 612 words
format-identification Digital Preservation

2018

Continuous, incremental, scalable, higher-quality web crawls with Heritrix
IIPC conference 2018 · 2412 words
Web Archives Digital Preservation
Story of a Bad Deed
A tiny digital mystery · 1075 words
digipres-lessons-learned Digital Preservation Keeping Codes Lessons Learned

2017

Sustaining the Software that Preserves Access to Web Archives
A post for International Digital Preservation Day 2017 · 907 words
Web Archives Digital Preservation
Driving Crawls With Web Annotations
Without competing with Pinboard · 1002 words
Web Archives Digital Preservation
Tools for Legal Deposit
Shifting toward domain scale · 736 words
Web Archives Digital Preservation
The Web Archive and the Catalogue
The Shelves and The Mine · 1670 words
mining-web-archives Data Mining Web Archives Digital Preservation webarchive-discovery