Difference between revisions of "Convert text documents with Pandoc"
Line 1: | Line 1: | ||
https://pandoc.org | https://pandoc.org | ||
Pandoc is a "universal document converter" which converts from one markup language to another. | Pandoc is a "universal document converter" which converts from one markup language (e.g. [[HTML]], [[Markdown]] to another. Here are some basic recipes for converting documents. | ||
You can find instructions for installation on [https://pandoc.org/installing.html the Pandoc website] for your particular operating system. Once you have pandoc installed, open a terminal session to use its command line interface. | |||
More extensive documentation is available in [https://pandoc.org/MANUAL.html the official Pandoc manual] or through the [[command line]] by typing | More extensive documentation is available in [https://pandoc.org/MANUAL.html the official Pandoc manual] or through the [[command line]] by typing | ||
Line 9: | Line 9: | ||
<pre>man pandoc</pre> | <pre>man pandoc</pre> | ||
== | === Common pandoc arguments=== | ||
<code>-f</code> or <code>--from</code> Option which is followed by the input format; | |||
<code>-t</code> or <code>--to</code> Option which is followed by the output format; | |||
< | <code>-s</code> or <code>--standalone</code> Option produces output with an appropriate header and footer; | ||
<code>-o</code> or <code>--output</code> Option for file output | |||
== Convert plain text files (.txt) that are structured with [[Markdown]] to PDF == | |||
For this, you need to have an up-to-date version of BasicTeX installed. On Mac, first install with homebrew: | |||
<pre> | <pre>brew install BasicTex</pre> | ||
Then, do: | |||
< | <pre>pandoc MANUAL.txt --pdf-engine=xelatex -o example13.pdf</pre> | ||
== Convert downloaded wiki pages (plain text in the .wiki format) to HTML files == | |||
=== Example 1: Convert an HTML string to Markdown === | |||
Enter a string of HTML and pipe it to pandoc: | |||
< | <pre>echo "<h1>Hello Pandoc</h1><p>from html to markdown</p>" | pandoc -f html -t markdown</pre> | ||
=== Example 2: Convert a MediaWiki file to HTML=== | |||
# Save the content of a wiki page on to a [[Plain text|plain-text]] file, example: <code>page.wiki</code> | |||
# Convert: | |||
</ | |||
<pre>pandoc page.wiki -f mediawiki -t html -o page.html</pre> | |||
[[Category:Cookbook]] | [[Category:Cookbook]] |
Revision as of 14:29, 11 October 2021
Pandoc is a "universal document converter" which converts from one markup language (e.g. HTML, Markdown to another. Here are some basic recipes for converting documents.
You can find instructions for installation on the Pandoc website for your particular operating system. Once you have pandoc installed, open a terminal session to use its command line interface.
More extensive documentation is available in the official Pandoc manual or through the command line by typing
man pandoc
Common pandoc arguments
-f
or --from
Option which is followed by the input format;
-t
or --to
Option which is followed by the output format;
-s
or --standalone
Option produces output with an appropriate header and footer;
-o
or --output
Option for file output
Convert plain text files (.txt) that are structured with Markdown to PDF
For this, you need to have an up-to-date version of BasicTeX installed. On Mac, first install with homebrew:
brew install BasicTex
Then, do:
pandoc MANUAL.txt --pdf-engine=xelatex -o example13.pdf
Convert downloaded wiki pages (plain text in the .wiki format) to HTML files
Example 1: Convert an HTML string to Markdown
Enter a string of HTML and pipe it to pandoc:
echo "<h1>Hello Pandoc</h1><p>from html to markdown</p>" | pandoc -f html -t markdown
Example 2: Convert a MediaWiki file to HTML
- Save the content of a wiki page on to a plain-text file, example:
page.wiki
- Convert:
pandoc page.wiki -f mediawiki -t html -o page.html