识别英文数字验证码的程序（Python示例）

news/2024/11/30 12:52:04/文章来源:https://www.cnblogs.com/ocr12/p/18578299

在本篇文章中，我们将使用 Python 来实现一个英文数字验证码识别的程序。我们会使用 Tesseract OCR（Optical Character Recognition，光学字符识别）来识别验证码中的文本内容。

环境准备
首先，需要安装以下 Python 库：

Pillow：用于图像处理。
pytesseract：Tesseract 的 Python 包，用于文本识别。
Tesseract OCR：一个开源的 OCR 引擎。
安装 Python 库：

bash

pip install Pillow pytesseract
同时，您需要安装 Tesseract OCR 引擎。可以从 Tesseract GitHub页面下载并安装，或者使用包管理工具安装：

Windows：下载并安装 tesseract-ocr-w32-x64.exe。
Linux：可以使用以下命令安装：
bash

sudo apt-get install tesseract-ocr
MacOS：可以通过 Homebrew 安装：
bash
brew install tesseract
2. 验证码识别程序
创建一个 Python 脚本 captcha_recognition.py，并将以下代码写入其中：

python

import pytesseract
from PIL import Image
import matplotlib.pyplot as plt

设置 Tesseract 可执行文件路径（仅在 Windows 上需要）

pytesseract.pytesseract.tesseract_cmd = r'C:\Program Files\Tesseract-OCR\tesseract.exe'

读取验证码图像

image = Image.open('captcha_image.png')

显示原始图像

plt.imshow(image)
plt.axis('off') # 不显示坐标轴
plt.show()

进行 OCR 识别

captcha_text = pytesseract.image_to_string(image, config='--psm 6')

输出识别的验证码内容

print(f"识别的验证码是：{captcha_text.strip()}")
3. 代码解析
图像加载：使用 PIL 库的 Image.open() 方法加载验证码图像。
OCR 识别：使用 pytesseract.image_to_string() 方法来识别图像中的文本。config='--psm 6' 配置告诉 Tesseract 我们预计图像包含的是一行文本（即验证码的常见格式）。
可视化：用 matplotlib.pyplot 库显示原始图像，方便查看输入的验证码图像。
4. 如何运行
将验证码图像保存为 captcha_image.png（您可以使用任何实际的验证码图像）。

运行 Python 脚本：

bash

python captcha_recognition.py
程序将会显示验证码图像，并输出识别的验证码内容。