neural network applications in ocr daniel hentschel robert johnston center for imaging science...

22
Neural Network Neural Network Applications in OCR Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Upload: sydney-conley

Post on 20-Jan-2016

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Neural Network Applications Neural Network Applications in OCRin OCR

Daniel Hentschel

Robert JohnstonCenter for Imaging Science

Rochester Institute of Technology

Page 2: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

BackgroundBackground

Recently, many Ancient documents have been discovered. Deciphering these documents is integral to understanding our past and predicting our future as a human race.

Page 3: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

BackgroundBackground

Unfortunately, the majority of ancient Hebrew documents found are seriously degraded.

Page 4: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

BackgroundBackground

This degradation makes deciphering of the documents difficult. Many techniques have been developed to enhance the text in degraded manuscripts in order to improve readability.

The majority of this work has been carried out by Roger Easton and Robert Johnston at the Center for Imaging Science at RIT.

Note:

Page 5: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Question:Question:

Would it be beneficial to use a neural network to help in the deciphering of degraded ancient Hebrew documents?

Page 6: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Research PurposeResearch PurposeCreate an application to assist in

document restoration.Develop a neural network adept

at Hebrew OCR (optical character recognition).

Analyze the functionality of the network when studying degraded characters

Page 7: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Creating an ApplicationCreating an Application

Page 8: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Developed with Visual BasicDeveloped with Visual Basic

Page 9: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Calculate Input FeaturesCalculate Input FeaturesSegmentation.Horizontal and Vertical section averages.Length of the skeleton as a percentage of

the circumscribed rectangle perimeter.Complexity: Square root of the black area

divided by the length of the skeleton.Number of dead ends and intersections in

the skeleton of the character.

Page 10: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

SegmentationSegmentation

Simple segmentation. Not elegant, but functional.

Page 11: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Thinning AlgorithmThinning Algorithm

Adapted from “A Fast Thinning Algorithm For Characters” (Flores, Rezende, Carrijo, Yabu-tti)

Red - pixel being analyzed

Green/Blue - to be deleted

Page 12: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Workings of a Neural NetworkWorkings of a Neural Network

Input LayerHidden Layer

Output Layer

Page 13: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Nodes in a Neural NetworkNodes in a Neural Network

Inputs are multiplied by a weighting factor and a bias is added.

Weighted inputs are summed.Sum is applied to a function:

11

2Output

Input Weighted

e

Page 14: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Internal Function of a NodeInternal Function of a NodeNode Function

-1

-0.8

-0.6

-0.4

-0.2

0

0.2

0.4

0.6

0.8

1

-10 -8 -6 -4 -2 0 2 4 6 8 10

Input

Ou

tpu

t

Page 15: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Operation of a NodeOperation of a Node

Page 16: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Input to the Neural NetworkInput to the Neural Network9 input nodes

– Horizontal (3) and Vertical (2) histogram.– Length of the skeleton as a percentage of the

circumscribed rectangle perimeter.– Complexity: Square root of the black area divided

by the length of the skeleton.– Number of changes of intersections, and dead

ends encountered in the skeleton.

Page 17: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Output of the Neural NetworkOutput of the Neural Network

4 output nodes

Page 18: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

Hidden NodesHidden Nodes

3 hidden nodes works well for 4 characters

Page 19: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

ResultsResults

Training set:

Page 20: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

ResultsResults

Characters not in training set:.07

.00

.94

.04

.05

.05

.92

.05

.08

.04

.03

.94

.07

.95

.02

.02

.86

.05

.05

.02

.08

.04

.04

.92

.12

.77

.01

.13

Page 21: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

ConclusionConclusion

The neural network is quite effective at deciphering non-degraded text.

Not enough degraded characters have been studied yet to determine how well the network will perform

Future work:– More than 4 characters– Optimize inputs– Analyze degraded characters

Page 22: Neural Network Applications in OCR Daniel Hentschel Robert Johnston Center for Imaging Science Rochester Institute of Technology

The EndThe End