GitHub
Tesseract OCR: tesseract-ocr/tesseract: Tesseract Open Source OCR Engine (main repository) (github.com)
Tesseract User Manual: Tesseract User Manual | tessdoc (tesseract-ocr.github.io)
How to train LSTM Tesseract: tessdoc/TrainingTesseract-5.md at main · tesseract-ocr/tessdoc (github.com)
- Operating system: Windows 10
- Version information, from Command Prompt (cmd):
C:\Users\user>tesseract --version
tesseract v5.0.1.20220118
leptonica-1.78.0
libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.5.3) : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.0
Found AVX2
Found AVX
Found FMA
Found SSE4.1
Found libarchive 3.5.0 zlib/1.2.11 liblzma/5.2.3 bz2lib/1.0.6 liblz4/1.7.5 libzstd/1.4.5
Found libcurl/7.77.0-DEV Schannel zlib/1.2.11 zstd/1.4.5 libidn2/2.0.4 nghttp2/1.31.0
1. Install tesseract-ocr
2. Install opencv-python (optional)
python --version
pip install opencv-python
pip install pytesseract
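
With both packages installed, a first OCR call from Python might look like the sketch below. It assumes the chi_tra language pack is already in the tessdata folder and that tesseract is on PATH. The helper names build_config and ocr_image, and the choice of page segmentation mode 6, are illustrative assumptions rather than part of the original setup.

```python
def build_config(psm: int = 6, oem: int = 3) -> str:
    """Build the extra command-line options that pytesseract passes to tesseract."""
    # --oem 3 selects the default engine mode; --psm 6 treats the image as
    # a single uniform block of text. Both values are assumptions to tune.
    return f"--oem {oem} --psm {psm}"


def ocr_image(image_path: str, lang: str = "chi_tra") -> str:
    """Read an image with OpenCV and return the text recognized by Tesseract."""
    # Imported lazily because opencv-python is described as optional above.
    import cv2
    import pytesseract

    image = cv2.imread(image_path)
    if image is None:  # cv2.imread returns None instead of raising on failure
        raise FileNotFoundError(f"cannot read image: {image_path}")
    # Grayscale input is often easier for the recognizer than raw BGR.
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    return pytesseract.image_to_string(gray, lang=lang, config=build_config())
```

Calling ocr_image with the path of a scanned page (any image file of your own) should return the recognized text as a string. If pytesseract cannot locate the executable, setting pytesseract.pytesseract.tesseract_cmd to the full path of tesseract.exe is the usual workaround.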
3. Install the language pack from tessdata_best
Download: chi_tra.traineddata
Copy it to (default path): C:\Program Files\Tesseract-OCR\tessdata
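
To confirm the file landed in the right folder, a small check like the following could help. This is a minimal sketch that assumes the default install path shown above; has_language and installed_languages are hypothetical helper names.

```python
from pathlib import Path

# Default tessdata folder from this guide; adjust if Tesseract lives elsewhere.
DEFAULT_TESSDATA = Path(r"C:\Program Files\Tesseract-OCR\tessdata")


def has_language(lang: str, tessdata_dir: Path = DEFAULT_TESSDATA) -> bool:
    """Return True if <lang>.traineddata exists in the tessdata folder."""
    return (Path(tessdata_dir) / f"{lang}.traineddata").is_file()


def installed_languages(tessdata_dir: Path = DEFAULT_TESSDATA) -> list:
    """List the language codes of every .traineddata file in the folder."""
    return sorted(p.stem for p in Path(tessdata_dir).glob("*.traineddata"))
```

For example, has_language("chi_tra") should be True once the download has been copied over.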
4. Environment configuration
Add a TESSDATA_PREFIX environment variable:
- C:\Program Files\Tesseract-OCR\tessdata
Add the following to the PATH environment variable:
- C:\Program Files\Tesseract-OCR\tessdata
- C:\Program Files\Tesseract-OCR
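
The settings above can be checked from Python with a sketch along these lines. It assumes the default install directory used in this guide; check_environment and normalize are hypothetical names, and the comparison ignores letter case and trailing backslashes because Windows paths are case-insensitive.

```python
import ntpath
import os


def normalize(path: str) -> str:
    """Normalize a Windows path so equivalent spellings compare equal."""
    return ntpath.normcase(ntpath.normpath(path.strip().strip('"')))


def check_environment(environ, install_dir=r"C:\Program Files\Tesseract-OCR", sep=";"):
    """Return a list of problems found in the given environment mapping."""
    problems = []
    tessdata = ntpath.join(install_dir, "tessdata")

    prefix = environ.get("TESSDATA_PREFIX")
    if prefix is None:
        problems.append("TESSDATA_PREFIX is not set")
    elif normalize(prefix) != normalize(tessdata):
        problems.append(f"TESSDATA_PREFIX points to {prefix}, expected {tessdata}")

    # PATH entries are separated by ';' on Windows.
    entries = {normalize(e) for e in environ.get("PATH", "").split(sep) if e.strip()}
    if normalize(install_dir) not in entries:
        problems.append(f"PATH does not contain {install_dir}")
    return problems
```

Running check_environment(os.environ) in a fresh Python session returns an empty list when both variables match this guide.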
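5. Verify that tesseract is installed correctly
In Command Prompt, run: tesseract
Show the version: tesseract --version
List the installed language packs: tesseract --list-langs
Note: after changing environment variables, restart the computer for the new settings to take effect.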
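
The same checks can be scripted. The sketch below calls tesseract --list-langs through subprocess and extracts the language codes. It assumes the output begins with a header line starting with "List of available languages" followed by one code per line; parse_list_langs and check_install are hypothetical helper names.

```python
import shutil
import subprocess


def parse_list_langs(output: str) -> list:
    """Extract language codes from the text printed by `tesseract --list-langs`."""
    lines = [line.strip() for line in output.splitlines() if line.strip()]
    # Drop the header line; everything else is assumed to be a language code.
    return [line for line in lines if not line.startswith("List of available languages")]


def check_install(command: str = "tesseract"):
    """Return the installed language codes, or None if the command is not on PATH."""
    if shutil.which(command) is None:
        return None
    result = subprocess.run(
        [command, "--list-langs"], capture_output=True, text=True, check=True
    )
    # Combine both streams in case the list is written to stderr.
    return parse_list_langs(result.stdout + result.stderr)
```

If check_install() returns None, PATH most likely does not include the Tesseract-OCR folder yet. If the returned list lacks chi_tra, the traineddata file is probably not in the tessdata folder.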