WordMord: No Annotation is Alone

From Parallel Library Services
Revision as of 16:48, 26 January 2022 by Simon (talk | contribs) (Created page with "--- Work(d)session--- = WORDMORD = http://wordmord-ur.la/ WordMord believes that the violence of language is not eradicated by merely deleting/erasing words, but rather by t...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

--- Work(d)session---

WORDMORD

http://wordmord-ur.la/

WordMord believes that the violence of language is not eradicated by merely deleting/erasing words, but rather by transversing their violent imposition through specific practices that trouble and disrupt grammatical consistency, semantic norms, ‘correct’ pronunciation, ‘proper’ bodily posture. The rupture of linguistic limits suggests the possibility of experiencing language in its materiality.

WordMord poses questions on the relationship between language, technology, trauma and violence. The collective artistic research will evolve through workshops, presentations and artworks. Through collaborations with artists, activists and groups working on feminist coding,WordMord seeks to shape an online rhizomatic space as an active feminist archive. At the same time, it project will provide tools and methods towards a poetically subversive meta/para/re-writing of derogatory narratives and consequently of trauma and violence.

  1. WordMord´s initial research group: Vassiliea Stylianidou aka Franck-Lee Alli-Tis, Angeliki Diakrousi, Christina Karagianni, Stylianos Benetos aka Oýto Arognos, Mounologies: Eleni Diamantouli and Anna Delimpasi. It started in collaboration with the #CNMFPP in 2019.

ANNOTATE THE WEB

https://web.hypothes.is/ Hypothes.is is a 501(c) open-source software project that aims to collect comments about statements made in any web-accessible content, and filter and rank those comments to assess each statement's credibility. [from Wikipedia]

XPPL: https://issue.xpub.nl/06/ https://w-i-t-m.net/images/xppl_interface.jpg

   (especially Annotations interface by Angeliki https://pzwiki.wdka.nl/mediadesign/User:Angeliki/X-LIB/Annotations

RE(D)ACTION

https://parallel-library.simonbrowne.biz/calibre/read/49/pdf#page=126

Redaction Reaction Readaction

OuNuPo: https://issue.xpub.nl/05/

   (especially Chapter 4 - Natasha Berting   How Bias Spreads from the Canon to the Web + Erase / Replace)

NO ANNOTATION IS ALONE

"PDF as a democratic means of digitally publishing a criminal code, makes the law accessible to everyone. It is a static tool for sharing institutional knowledge that does not easily allow for editing and commenting on its content. Each digital tool contains a narrative as a guidance mechanism with specific technical and ideological constraints. The way of using a PDF is universally accepted and stored in collective memory. Similarly, the original content of this PDF is determined by entrenched decisions, based on the perpetuation of old habits, and in turn determines dominant social, institutional and ideological behaviours."

NO ANNOTATION IS ALONE TOOL

[Instructions made by WordMord]

Install

$ python3 -m venv venv $ . venv/bin/activate $ sudo apt-get install tesseract-ocr

$ tesseract --list-langs If the language you want is not there check languages here: https://tesseract-ocr.github.io/tessdoc/Data-Files-in-different-versions.html

Install the languages you want: $ sudo apt-get install tesseract-ocr-ell

$ sudo apt install imagemagick OR $ brew install imagemagick

$ pip install pillow maybe need to install first $ sudo apt-get install libjpeg8-dev

$ pip install reportlab

Annotate the PDF

$ convert [name0fyourFile].pdf [name0fyourFile].png

$ tesseract -l [language] [name0fyourFile].png [name0fyourFile] hocr

Make a new searchable pdf: $ python3 hocrtransform-invisible-PDF.py -i [name0fyourFile].png [HocrFile].hocr [NewPDF].pdf

hocr

hOCR is an open standard of data representation for formatted text obtained from optical character recognition (OCR). The definition encodes text, style, layout information, recognition confidence metrics and other information using Extensible Markup Language (XML) in the form of Hypertext Markup Language (HTML) or XHTML. [from Wikipedia]

View the text (produced as hocr) within the pdf ( from https://pzwiki.wdka.nl/mediadesign/Optical_character_recognition_with_Tesseract )

using hocrjs

We will use User Script instruction with Tampermonkey.

Installing viewer:

  • open Firefox
  • go to FF addons and search for Tampermonkey
  • install it
  • Browse to unpkg.com/hocrjs/dist/hocr.user.js
    • click "Install". It will install the script in your browser Tampermonkey
    • click the Tampermonkey and go to the "Dashboard". hocr-viewer should be enabled

View the hocr int the Firefox

  • change the extension of your hocr file from .hocr to .html
  • open the .html file in firefox

Edit the hocr file with a text editor and replace words [editedHocrFile].hocr

Make a new searchable pdf with an annotated hocr: $ python3 hocrtransform-invisible-PDF.py -i [name0fyourFile].png [editedHocrFile].hocr [NewAnnotatedPDF].pdf

Annotation replaces the initial content

$ python3 hocrtransform-visible-pdf.py -i [name0fyourFile].png [editedHocrFile].hocr [TransformedPDF].pdf