From Parallel Library Services

Revision as of 09:24, 12 May 2022

Imagining librarianship & experiments with document conversion
Location: At Varia (Gouwstraat 3, Rotterdam), and online
Date: November 24th, 2021
Time: 16:00-19:00 CET
Pad: https://pad.simonbrowne.biz/p/pls-meeting-4

Context

PDF (Portable Document Format) is a highly popular digital file format for ebooks. In this workshop, we created, queried and embedded metadata in a PDF by using tools such as Pandoc, ExifTool and of course Calibre, "the swiss army knife of document conversion".
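
As a rough sketch of the metadata step, the snippet below builds ExifTool command lines for writing and then querying a PDF's title and author. It assumes the `exiftool` command is installed; the file name `workshop.pdf`, the author value and the helper function names are hypothetical and only serve as illustration.

```python
def write_metadata_cmd(pdf_path, title, author):
    """Build an ExifTool command that sets the Title and Author tags."""
    return ["exiftool", f"-Title={title}", f"-Author={author}", pdf_path]


def read_metadata_cmd(pdf_path):
    """Build an ExifTool command that prints the Title and Author tags."""
    return ["exiftool", "-Title", "-Author", pdf_path]


if __name__ == "__main__":
    # Hypothetical file name and values; substitute a real PDF before running.
    pdf = "workshop.pdf"
    print(" ".join(write_metadata_cmd(pdf, "my new document", "Example Author")))
    print(" ".join(read_metadata_cmd(pdf)))
```

Each list can be passed to `subprocess.run(cmd, check=True)` to execute it. By default ExifTool keeps a backup of the file it modifies, with `_original` appended to the name.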

Activities

After some catching up on the contexts of our projects, we discussed the plan for today:

  • a tour of Calibre
  • hybrid publishing workflows
  • embedding metadata in PDFs
  • making digital files (EPUB, PDF with pandoc)
  • converting between file formats in Calibre

Our workshop was documented on a pad using Markdown to create structure. Markdown is a lightweight markup language that can be useful in hybrid publishing, where a single input (plain text) may have many outputs (file formats). From one document it is possible to create a variety of files, including EPUB, PDF, HTML and even Wikitext, the syntax MediaWiki uses.
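
The "one input, many outputs" idea can be sketched as a small helper that builds one Pandoc command per output format. This is an illustration rather than the exact commands used in the workshop: the source name `workshop.md`, the `.wiki` extension and the helper itself are assumptions. PDF output also needs a PDF engine (Pandoc defaults to `pdflatex`) to be installed.

```python
from pathlib import Path

# Pandoc writer names (the -t option) for extensions we want to set explicitly.
# PDF is not listed: Pandoc picks it up from the .pdf output extension.
WRITERS = {".epub": "epub", ".html": "html", ".wiki": "mediawiki"}


def pandoc_cmd(source, extension):
    """Build a pandoc command converting `source` to a file with `extension`."""
    target = str(Path(source).with_suffix(extension))
    cmd = ["pandoc", source, "--standalone", "-o", target]
    if extension in WRITERS:
        cmd += ["-t", WRITERS[extension]]
    return cmd


if __name__ == "__main__":
    # Hypothetical source file; every output is derived from this one document.
    for ext in (".epub", ".pdf", ".html", ".wiki"):
        print(" ".join(pandoc_cmd("workshop.md", ext)))
```

Keeping the format table in one place means a new output can be added without touching the source document.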

Markdown documents processed with Pandoc can begin with a YAML metadata block, which should include a title:

---
title: my new document
---
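
To make the shape of this block concrete, here is a minimal sketch that reads the title back out of a document. It only handles flat `key: value` lines and is not a full YAML parser; the function name `read_front_matter` is hypothetical.

```python
def read_front_matter(text):
    """Return the key/value pairs of a leading '---' delimited metadata block."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    metadata = {}
    for line in lines[1:]:
        # Pandoc accepts either '---' or '...' as the closing delimiter.
        if line.strip() in ("---", "..."):
            return metadata
        key, sep, value = line.partition(":")
        if sep:
            metadata[key.strip()] = value.strip()
    return {}  # no closing delimiter found: treat as no metadata block


if __name__ == "__main__":
    doc = "---\ntitle: my new document\n---\n\n# A heading\n"
    print(read_front_matter(doc).get("title"))
```

For real documents a YAML library would be the safer choice, since values may be quoted, nested or span several lines.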

After the metadata block, Markdown uses a simple syntax for headings, paragraphs, bold and italic, lists (ordered and unordered), hyperlinks, and many more elements that can easily be converted to multiple file formats. This is part of a hybrid publishing workflow, whereby

Figure (traditional_workflow.png): A traditional publishing workflow, with linear content creation and intense design activity to produce many formats

Figure (markdown_workflow.png): A "single source" publishing workflow, using a markup language such as Markdown to create content and design in parallel, with multiple formats to export to

We began by catching up on our projects, recording thoughts in a pad:

Pls-workshop-04.png

From this file we made a PDF:

File:Workshop 04.md.pdf