… Or Do Word Recognition!

As powerful as the neural revolution was in the 90s, new advances in OCR research have moved beyond it. The latest OCR software is now capable of recognizing total words, not individual characters! Such algorithms try to match the image of a whole word to a word: the contour and shape of pixel cloud X looks like the Dutch word “Familieleden”, the pixel cloud Y looks like the word “alleged” etc.

Word recognition of scanned line

This technology is called “Word Image Decoding”, it has some basic similarity with the algorithms used for speech recognition. It “decodes” the word images without a prior segmentation of the words into characters.

The specific strength of word image decoding lies in the recognition of degraded images where characters for instance touch (“ligatures”) or are broken. That’s because degraded word images are less defaced than degraded character shapes. An individual character can be so damaged that it’s barely recognizable — even to the human eye. A degraded word offers more clues: it’s unlikely that each character will be degraded to the same extent! As codes get broken bit by bit, difficult words get recognized by starting with the easier characters inside them.

Scanned image with broken characters

Scanned image with ligatures - touching characters

Because there ’s more context, it’s easier for the system to guess what the shapes mean. Given the example below, isn’t easier to recognize complete words than individual, segmented character shapes…?

Word recognition of broken, army-style characters

Submit feedback

X (Tweet)

Previous page — Next page

Where does this technology come from? — Reaping the benefits of the neural revolution — … Or do word recognition — Each expert casts his vote — Autolearning font shapes — Putting more feet on the street — … With a document revolution thrown in

Home page — Intro — Scanners — Images — History — OCR — Languages — Accuracy — Output — BCR — Pen scanners — Sitemap — Search — Contact – Feedback

The early years	Neural networks	Word recognition	Expert systems
Autolearning	Voting systems	Document revolution

Home page	Intro	Scanners	Images	History	OCR	Languages
Accuracy	Output	BCR	Pen scanners	Sitemap	Search	Contact – Feedback