python - PythonでTesseract OCRを使用したUnicodeDecodeError 'charmap'コーデック

翻译自：https://stackoverflow.com/questions/38023947 2016-06-25T00:52:39.110

1637 次

teseract-OCR を使用して、Python で画像ファイルに対して OCR を実行しようとしています。私の環境は、Windows マシン上の Python 3.5 Anaconda です。

コードは次のとおりです。

from PIL import Image
from pytesseract import image_to_string
out = image_to_string(Image.open('sample.png'))

私が得ているエラーは次のとおりです。

File "Anaconda3\lib\sitepackages\pytesseract\pytesseract.py", line 167, in image_to_string
return f.read().strip()
File "Anaconda3\lib\encodings\cp1252.py", line 23 in decode
return codecs.charmap_decode(input, self.errors, decoding_table)[0]
UnicodeDecodeError:'charmap' codec can't decode byte 0x81 in position 1583: character maps to <undefined>

ここに記載されている解決策を試しましたハックが機能していません

Mac OS でコードを試してみましたが、動作しています。

私はpytesseractの問題を調べました:これは未解決の問題です

ありがとう

python - PythonでTesseract OCRを使用したUnicodeDecodeError 'charmap'コーデック

1 に答える 1

Related

Reference