CV综述OCR任务---目录

tech2025-08-12 81

CV综述OCR任务---目录

图像任务OCR任务图像分类目标检测图像分割图像增强视频任务正文： OCR学习OCR参考资料:参考博客:典型应用常见挑战比赛经典OCR方法单字符识别方法序列识别方法tessernet 文字检测模型 Part (think about CV中的目标检测)faster-RCNN/YOLO/SSDCTPN (2016): Connectionist Text Proposal NetworkEAST 2017 : An Efficient and Accurate Scene Text DetectorPSENET：progressive scale expansion algorithmDB: Real-time Scene Text Detection with Differentiable Binarization 文字识别CRNN == CNN +RNN +CTC基于ａｔｔｅｎｔｉｏｎ文字识别

图像任务

OCR任务

图像分类

构建我的classification框架记录

目标检测

参考CV综述目标检测整理—目录

图像分割

参考CV综述图像分割整理—目录

图像增强

视频任务

正文： OCR

学习OCR

参考资料:

OCR文字识别实战 TextDetection文本检测数据集汇总

参考博客:

典型应用

车牌识别，路牌识别拍照搜题，拍照检查自然场景文字识别内容审核数码管数字识别方法整理

常见挑战

版式复杂扭曲形变手写涂改，笔记不均匀光照，反光，运动模糊

比赛

ICDAR

经典OCR方法

单字符识别方法

模式匹配特征提取+分类器CNN识别

序列识别方法

CRNNRAREattention_ocr

tessernet

论文: 源码: 特点:

文字检测模型 Part (think about CV中的目标检测)

faster-RCNN/YOLO/SSD

CTPN (2016): Connectionist Text Proposal Network

论文: Detecting Text in Natural Image with Connectionist Text Proposal Network@2016 源码: 参考博客: OCR技术（CTPN，SegLink，EAST文本检测）

特点:

connect proposalsSide-refinement检测小框代替直接检测大文本框对于非水平的文本的检测效果并不好

EAST 2017 : An Efficient and Accurate Scene Text Detector

论文: EAST: An Efficient and Accurate Scene Text Detector@20170710 源码: argman/EAST --tensorflow /flask 参考博客: OCR文本检测模型—EAST PVANET算法笔记

less channels with more layers论文：PVANET: Deep but Lightweight Neural Networks for Real-time Object Detection @20160930

特点:

提出了基于two-stage的文本检测方法：全卷积网络(FCN)和非极大值抑制(NMS)在检测长文本时的效果比较差，这主要是由于网络的感受野不够大；在检测曲线文本时，效果不是很理想

PSENET：progressive scale expansion algorithm

论文: Shape Robust Text Detection with Progressive Scale Expansion Network 源码: tensorflow_PSENet 参考博客: psenet解读 PSENet原理介绍

特点:

psenet作为一种基于分割的方法psenet的检测速度稍慢

DB: Real-time Scene Text Detection with Differentiable Binarization

论文: Real-time Scene Text Detection with Differentiable Binarization 源码:

参考博客: 《DB：Real-time Scene Text Detection with Differentiable Binarization》论文笔记

特点:

文字识别

CRNN == CNN +RNN +CTC

论文: An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition @20150721 源码: meijieru/crnn.pytorch 特点:

每次都要计算概率，速度慢end to end识别

参考博客 CRNN算法详解

基于ａｔｔｅｎｔｉｏｎ文字识别

attention_ocr

最新回复(0)