01
Introduction to Digital Preservation
September 8, Tue 5:30-9:30p
Topics
Introduction to digital technology and preservation
Digital Technology History Timeline, 1820 - 1995
Recommended Readings
Digital Preservation Coalition: Digital Preservation Handbook
ALA Freedom to Read Statement & Bill of Rights
Clark & Steadman: Alan Turing's Legacy (WIRED)
DP Workshop Digital Technology and Preservation Timeline
The Signal: Digital Preservation Pioneer: Clifford Lynch
Lynch: Accessibility and Integrity of Networked Information Collections (1993)
Class Discussion
Exercise: Digital technology timeline, 1995 - present
02
History of Digital Preservation Efforts, Intro to OAIS Model
September 15, Tue 5:30-9:30p
Topics
OAIS reference model, history, and context
History of digital preservation efforts and initiatives
Lab
Follow along exercise: Navigate file systems with the command line, create files, create folders, list files and directories (pwd, cd, mkdir, touch, ls, etc.)
Individual exercise: Create your own directory structures and files based on a theme of your choice. Create an inventory report with "ls".
Readings
Besser: Moving from Digital Collections to Interoperable Libraries (2002)
Bacoum: A Brief History of Digital Preservation
Lavoie: Meeting the Challenges of Digital Preservation: The OAIS Reference Model
alternate PDF download
Rosenthal: Requirements for Digital Preservation Systems
Reccommended
Lavoie: The Open Archival System Introductory Guide
Reference
1996 Task Force on Archiving of Digital Information
Reference Model for an Open Archival Information System (The Magenta Book)
Lab Exercise Files
w2-lab.zip
03
Software, Operating Systems & Computing Environments
September 22, Tue 5:30-9:30p
Topics
Operating systems and digital preservation:
UNIX GNU/Linux and Free Software Movement and Open Source Software
Discussion of Assignment #1
Lab
Introduction to the bash shell and command line structure
Command Line Scavenger Hunt
Readings
Finley: Linux Took Over the Web. Now, It’s Taking Over the World (WIRED)
Kelty: Two Bits: The Cultural Significance of Free Software (Chapter 3 only) – alternate download here
Lyons: Introduction to Using the Command Line
Recommended Reading
Raymond: The Cathedral and the Bazaar
Lab Exercise & Files
Command line scavenger hunt
cli-scavenger-hunt.zip
04
Digital Format Identification, NDSA Levels of Preservation
September 29, Tue 5:30-9:30p
Topics
NDSA Levels of Preservation
File Identification and Format Sustainability
Characteristics and Specifications of Digital Video
Lab
Encoder/Decoder exercise
Command-line file playback with mplayer, analysis with mediainfo
Review file sustainability factors: Sustainability of Digital Formats
Readings
NDSA Levels of Preservation
Peltzman: Expanding NDSA Levels of Preservation
Martin: What is a Digital File?
Lacinak: Primer on Codecs
Jackson: Formats Over Time, UK Libraries
Recommended/Reference
Rosenthal: Formats Through Time, DHSR Blog
Library of Congress: Sustainability of Digital Formats (Introduction, Sustainability)
Video Formats, Codecs and Containers
Nagels (BAVC): PAR, SAR, and DAR: Making Sense of Standard Definition (SD) video pixels
Digital Preservation Coalition: Digital Preservation Handbook, File Formats and Standards
Lab Exercise Files
w4-lab-files.zip
w4-lab-spreadsheet
05
Data Storage: Architecture & File Systems
October 6, Tue 5:30-9:30p
Topics
Digital File Systems and Storage Media
Data Storage Considerations
Networked Storage Architecture
Lab
Follow along exercise: Write a bash shell script using ATOM text editor
Individual exercise: Customize a bash shell script using ATOM (GUI) with a prepared script to perform a batch process (ffmpeg, disk usage, df, etc.)
Readings
Google – Pinheiro, Weber, Barroso: Failure Trends in a Large Disk Drive Population
Glicksman: Storage Architectures and Network
2017 NDSA Storage Survey & Report
Clipper Notes: LTO tape advantages over disk
Recommended/Reference
Backblaze – Klein: What Can 49,056 Hard Drives Tell Us?
Lab Exercises
2020-w5-lab.txt
06
Data Transfer, Fixity, & Integrity
October 13, Tue 5:30-9:30p
Topics
Safe file transfer and maintaining data integrity
Fixity and checksums for preservation of audiovisual digital objects
Lab
Follow along: “for loop”
Follow along exercise: File transfer (cp, mv, rsync)
Follow along exercise: Fixity and checksums (md5, sha1, hashdeep, framemd5)
Readings
Checking Your Digital Content: An NDSA Publication
Baily: Protect Your Data (The Signal)
Rice: Reconsidering Checksums
Havemeyer-King: Trojan Dots and DIY Solutions (NDSR Blog)
Recommended/Reference
Goldin: A gentle introduction to rsync, a free, powerful tool for media ingest
Lab Exercises
w6-fixity-transfer.zip
07
Midterm Presentations
October 20, Tue 5:30-9:30p
DUE – In Class
Assignment #1 Presentations
Note: Students have the option to give presentations asynchronously for this assignment, and are expected to provide answers and insights during an in-class Q&A.
Readings
Summers: Web as a Preservation Medium (Medium)
Fino-Radin: Rhizome Preservation
08
Intro to Web Archiving
October 27, Tue 5:30-9:30p
Topics
Introduction to the Internet and World Wide Web, Web Architecture and Preservation
Web Preservation and the Internet Archive
Lab
Web archiving with WGET
Rhizome Webrecorder demo
Readings & Resources
Rhizome: Old Web Today Portal (enter URL and date to explore archived web pages from the past)
Lasar: 25 Years of Hypercard (Ars Technica)
Berners-Lee: WorldWideWeb Executive Summary
McKeehan: Symmetrical Web Archiving with Webrecorder (NDSR Blog)
ArchiveIt: Known Challenges of Web Archiving
Reference
Berners-Lee: Original Proposal for Information Management for CERN
ArchiveIt: Scoping guidance for specific types of sites (Social Media, etc.)
ArchiveIt: Glossary of Web Archiving Terms
Bill Atkinson: Reflecting on Hypercard in 2016
Lab Exercise Files
2020-w8-lab.rtf
09 🇺🇸
No Class
November 03, Tue 5:30-9:30p
10
Preservation Metadata
November 10, Tue 5:30-9:30p
Topics
Introduction to preservation metadata
PREMIS, METS & XML
Hierarchical Listing of Semantic Units
Lab
Review of PREMIS/METS metadata records
Exploring NANO & FIND
Readings
Caplan: Understanding PREMIS
Amaral: METS for Transferable Metadata
Lavoie, Gardener: Preservation Metadata
Recommended/Reference
PREMIS Data Dictionary
Lab Exercise Files
2020-w10-lab.zip (10MB)
## 🎥
AMIA
November 16 – 21
11
Digital Repository Design & Microservices
November 17, Tue 5:30-9:30p
Topics
Microservices for digital preservation and repositories
Digital repository design
Lab
File Signatures and Hexercises
Gary Kessler: List of File Signatures
Corkami File Signature Posters
Readings
Handel: Data Migration, Digital Asset Management and Microservices at CUNY TV
Spalenka: Some Assembly Required: Micro-services and Digital Preservation
Cramer & Kott: Designing and Implementing Second Generation Digital Preservation Services (Standford Univeristy)
Recommended
Abrams, et al: Preservation Is Not a Place
Abrams, et al: An Emergent Micro-Services Approach to Digital Curation Infrastructure
Microservice Links
Lab
w11-lab.zip
12 🦃
Digital Preservation Packaging & Automation
November 24, Tue 5:30-9:30p
Topics
Digital Preservation Packaging & the Bagit Specification
Automating Ingest of Submission Information Packages
Review ingest script
LTO Tape: Formats, indexes and LTFS
Lab
Review Ingest Script
Create "bags" and reports with Bagit command line software
Readings
Bagit: A Video Introduction
Gates: Using Bagit – The Patch Bay
Gates: Using Bagit in 2018 – The Patch Bay
Kim, Ross: Digital Forensics Formats: Seeking a Digital Preservation Storage Container Format
Recommended Readings
Lavoie: The fifth blackbird
Lab Exercise Files
2020-w12-lab.zip
Download bagger software
13
Trusted Digital Repositories & Sustainability
December 1, Tue 5:30-9:30p
Topics
TDR, TRAC and the TRAC Checklist
History of Digital Computing
Lab: macOS Disk Utility and diskutil command
Create, Erase, Reformat and Encrypt Disk Images
Readings
TRAC Checklist (read pages 1–8, skim/review remainder of document)
Rosenthal: TRAC Audit, Lessons Learned (DSHR Blog)
What is LOCKSS? and LOCKSS Preservation Principles
The CLOCKSS Archive and What is the Difference Between LOCKSS and CLOCKSS?
Center for Research Libraries: Certification Report on CLOCKSS
Recommended Readings
Lavoie: The fifth blackbird
Lab Exercise Files
w13-lab.rtf
14
Final Project Presentations
December 8, Tue 2:30-6:30p
721 Broadway, Room 635
DUE – In Class
Assignment #2 Presentations
Note: Students have the option to give presentations asynchronously for this assignment, and are expected to provide answers and insights during an in-class Q&A.
Last class!
##
Final Papers Due
December 11, Fri 6p
Last day of Fall 2020 classes
Sunday, December 13
Additional Recommended Readings
Rosenthal: The Medium Term Prospects for Long Term Storage
Pease, Amir, et al: The Linear Tape File System
Lazorchak: Digital Forensics and Digital Preservation: An Interview with Kam Woods of BitCurator
Educopia Institute: BagIt Usage Instructions
Internet Draft: BagIt File Packaging Format Specifications for the Internet Engineering Task Force (IETF)