25 June 2012

Script for spell checking of pdfs.

In one shot
# pdftotext -raw -nopgbrk file.pdf - | aspell list | sort | uniq -c


Or exporting to txt first and then using the spell checker.
# pdftotext -raw -nopgbrk file.pdf file.txt
# aspell list < file.txt | sort | uniq -c

0 Comments:

Post a Comment

<< Home

Too Cool for Internet Explorer