Practice
- Format
- Defining Format stub
- Talking About Formats draft
- The Format Registry Landscape stub
- Migration
- Web Archives
- Archiving the Dynamic Web draft
- On Web Archiving stub
- Mining For Meaning stub
- UKWA Sustainable Access Plan stub
- What We Need
- Accessible Digital Preservation stub
- Fixing Fixity complete
- One Parser To Rule Them All stub
- What Do We Need? (part 2) stub
- What Do We Need stub
- DigiPresNews complete
- Archiving the Scholarly Record draft
- Ideas For Preservation Tools & Services
- Practice
- On Compression of Digitized Material stub
- Remixing the Planets Testbed stub
Experiments
- Meta
- Reconstruction stub
- Dawn of Emulation outline
- Finding Formats stub
- On Systems
- On Tools
- DiskImageInterlacer.java
- Experiments
War Stories
- Characterising PDFs stub
- Dissecting Domesday stub
- War stories
- Archiving Wikipedia During the SOPA Blackout stub
Fundamentals
- Communicating with the Future
- Communicating with the Future outline
- Codes stub
- What to Preserve? stub
- Making Plans stub
- Derivations
- Digital Preservation Lessons Learned
- Fundamentals
- Never Alone stub
- Not a Science stub
- Press Any Key stub
- Significant Properties And Authentic Transformations outline
Blog Posts
Ideas
- Space-time plots for digital preservation as communications.
- IDCC15 Call For Papers
- Due 13th October 2014.
- 10 pages (Research or Practice Papers) or a one-page abstract with a 5 min presentation (Data Papers).
- Themes:
- A decade of data curation – Papers should reflect on the developments that have taken place in the area of digital curation over the past ten years, and the implications these have for the future; reflections and synthesis of what has happened are welcome but all should aim to draw on this to identify future and current perspectives and action:
- Whatever happened to…?
- What were the lasting trends and passing phases?
- What lessons have we learned?
- What are the major advances that have been made?
- What are the next big challenges we need to tackle?
- Curation Infrastructure – Papers should describe institutional, consortia, national or international infrastructure supporting digital curation and research data management:
- Tools, systems and services that are in development
- Evaluations of existing tools
- Proposals for new approaches to large-scale service delivery
- Cutting edge research and exploration into new curation methods
- Working with challenging data – Papers should discuss work with particularly challenging or specialist forms of data:
- Data on a large scale: big data or large collections of long-tail data
- Complex data, models and formats
- Disciplinary data
- Ideas:
- Lessons Learned From Ten Years Of Digital Preservation
- Essentially the DP Myths papers, but reframed as lessons learned in an attempt to gain more traction. Also includes “Whatever happened to Significant Properties”. Challenges around representing minorities (mobile phone economics etc.), mid-value niches (Pages), not obsolescence itself…
- Format Identification In The Long Tail
- Comparing DROID and Tika, but moreover, using various correlates to explore and understand the likely formats in the long tail of application/octet-stream.
- Involves coding up the extractions.
- Obsolescence in the Web Archive
- Studying examples of obsolescence, leading to User-Driven Digital Preservation.
- Also include element trend analysis and script things?
- Involves researching formats and developing actions.
- Mining Meaning
- (as below)
- Web Archives as Scholarly Sources: Issues, Practices and Perspectives – Call For Papers
- Due 8th December 2014.
- 1,500 words (short) or 2,500 words (long).
- Themes:
- research methods for studying the archived web
- the evolution of language on the web
- the history(ies) of the web
- the changing structure of the web
- Ideas:
- Mining Meaning
- Making sense of 2 billion fragments of UK web history. Issues of interpretation and context etc.
Longwrites
- Practice
- Ref from Bitwiser: Good practice and Quirks Mode
- OAIS is clumsy, arbitrary boundary, not user centered.
- Experiments
- Bitwiser II: ignored v redundant via MC.
- Fundamentals
- Simplification pressure? e.g. Markdown or even Wikipedia? REST over SOAP. HTML over PDF? Others?
Sources
- Sources: W3ACT, Monitrix, The crawl, OpenWayback?
- Pull in requirements and issues, AIT stuff.
- Pull in Monitrix, BDT, IMAQA, WAT Mining.
- anjackson.net book
- xcltools HEAD
- Capturing Properties
- INTRANET: Capturing Properties