Copy-pasting code from the PDF version of the book#
Readers reaching the website from the PDF version of the book may have noticed that copy-pasting commands and scripts from the digital edition of the book may produce errors once the commands/scripts are run on the PC. This is most likely due to the formatting of the code in the PDF file, where - for typesetting reasons - the font employed generates incorrect results when pasted to plain-text. These can be the result of a number of reasons, including:
the presence of ligatures: one such example is the word
trafilatura
as in[c5.04]
- reported below in fig. 0.06 where the lettersf
andi
are included as one single character (representing the ligaturefi
) instead of two separate ones (fi
)the use of alternative characters: it is the case of
[c5.31]
where the characterEN DASH
(Unicode characterU+2013
, see fig. 0.07 for a depiction of how it appears in the PDF version) is used instead ofHYPHEN-MINUS
(Unicode characterU+002D
). While they may appear identical, they are in fact two different entities: this can be noted when put side-by-side-–
. In CLI commands and scripts only theHYPHEN-MINUS
one is accepted.

Figure 0.06 Example of ligature as included in the PDF version of the book#

Figure 0.07 Example of EN-DASH
as included in the PDF version of the book#