[ILUG-Cochin.org] pdftotext and utf8
sanalkumar mr
sanalmadatheth at gmail.com
Sat Feb 21 14:30:28 IST 2009
Dear all.....Plz help me to solve this problem.
$ pdftotext -f 3 commonds.pdf rr.txt
When I executed the above command the file was converted to text file.But
when I tried it with a Malayalam pdf file it didn't give me the output.
$ pdftotext -f 1 -layout -enc utf8 madhyam_first.pdf ff.txt
The command executed is given above....and the error shown is given below
Error: Couldn't find unicodeMap file for the 'utf8' encoding
Error: Couldn't get text encoding
I visited man pages of pdftotext and some sites to learn more about that
http://www.cyberciti.biz/faq/converter-pdf-files-to-text-format-command/
,but didn't get help for the conversion of a Malayalam(utf8) pdf file.
and then I tried this
$ pdftotext -f 1 -layout -enc UTF-8 madhyam_first.pdf ff.txt
and I got a file named ff.txt , but that is not Malayalam
Plz help me to solve this issue
--
പറയാതെ പരിഭവം നീയൊന്നു മുടി കോതി-
ക്കഴിയുമ്പൊഴായുസ്സു തീരും
പറയാതെ പരിഭവം ഞാനൊന്നു നെടുവീര്പ്പിട്ടു-
കഴിയുമ്പോള് ലോകവും തീരും..........
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://ilug-cochin.org/pipermail/mailinglist_ilug-cochin.org/attachments/20090221/1f24d010/attachment.html>
More information about the Mailinglist
mailing list