Chapter 1 – Introducing Readiris
and drag your scanned documents to the Dock icon. They will be
processed on the spot.
General information
Readiris is based on the most advanced recognition technologies.
Font-independent text recognition is complemented by self-learning
techniques. The system is able to learn new characters and words
through contextual and linguistic analysis. This means that the
accuracy of the recognition system improves as recognition proceeds.
Readiris also recognizes tabular data and recreates them as
worksheets in your spreadsheet software or as table objects inside
your word processor; your numeric data are immediately ready for
further processing.
Readiris supports up to 125 languages. These cover all American and European
languages, including the Central-European, Baltic and
Cyrillic languages as well as Greek and Turkish. Optionally,
Readiris can read Hebrew documents and four Asian languages -
Japanese, Simplified and Traditional Chinese and Korean. Readiris
even copes with mixed alphabets: the software detects “Western”
words that occur in Greek, Cyrillic, Hebrew and Asian documents -
many proper names, brand names, etc. that cannot be transcribed are
written in Western characters.
Readiris uses linguistics during the recognition phase, not
afterwards. As a result, Readiris recognizes all kinds of documents
with top accuracy, including low-quality documents, faxes and dot
matrix printouts. It copes beautifully with badly scanned and copied
documents whose characters are too light or too dark. Joined
characters are resolved while fragmented characters, such as dot
matrix symbols, are recomposed.
Readiris also has a user verification function (Interactive learning).
When activated, it not only flags the characters the recognition
system is unsure of but also allows you to increase the system's
accuracy. All solutions you confirm