support@athento.com
icon-phone + 34 932 20 23 14
icon-search Enter a Keyword
icon-login Login

OCR

Get information from scanned documents quickly and easily

 


What is OCR?

OCR softwareApplications is aimed at the digitization of texts in images. Automatically identify symbols or characters belonging to a particular script, based on an image to be stored as data with which we can interact (edit, select, copy and paste) using a text editor program such as Open Office or Microsoft Office (Word), for example.

esquema_ocr

So if your computer has scanned documents in image formats (JPG, PNG, TIFF, etc.), Such as an ID, a payroll or an invoice, we can not process this information by searching, selecting text content or transmutation to another format (DOC, and Microsoft Office Excel. ODT or. ODS Open Office or TXT), except if we have a Software for Optical Character Recognition (OCR)

The OCR module can be integrated into a document management software solution, such as Nuxeo, Sharepoint, EMC Documentum, IBM FileNet and Alfresco. Also, the Athento ECM system incorporates this functionality by default

 

 

 

What advantages does OCR software provide?

The main advantage is the ability to search for content within a document scanned without OCR on the scanning tool. This means fast searches without having to waste time looking through the entire document, page by page, word for word, to find something specific.

In addition, this type of solution for organizations that already have scanning hardware (scanners) without OCR, don’t need to replace these tools with modern scanners, in many cases with the same quality of scan, and the unique contribution of software OCR on the device.

By centralizing, with OCR Software within a Document Management System, you can search directly on the image format files (such as a JPG) that contain text, and only use this software in a single location, the server hosting the Document Management Software.

 

How much can I save by implementing an OCR module?

In organizations with large numbers of scanners without OCR software, it allows the reuse of these components and not having to deal with buying a new fleet of OCR scanners, thus saving money and reducing theimpact on the environment (consumption of hardware and transport). In some cases, you can save thousands of devices, so the savings can be in the hundreds of thousands of dollars.

In terms of productivity, there are organizations (including banks, as we have seen in our experience) who have contracts and records in digitized format. TIFF or PDF formats without indexing. To check whether a document or a particular contract is the one we’re looking for, we must open and read without being able to search “full text” on all content.

 

OCR Vs. scanned documents without OCR or paper

Continuing working with documents in image format which can not automatically process information, such as selecting and copying text or search, is a significant loss in productivity, which will be increased as data volumes are growing. Of course, the information grows about 20% each year, so these files are becoming more frequent.

OCR technology solves these problems with a very high technological level, which allows recognition of text on scanned documents with a fairly low quality compared to other OCR systems.

Of course, compared to working with paper, the advantages are even greater, saving space, saving a highly polluting product such as paper, improving effective information management etc.

In addition, this module is perfect to be supplemented with modules such as Digital SignatureIntelligent Document Management and Workflow (BPM).

 

CMMI_Yerbabuena

Athento is certified
with CMMI

ENISA

Athento is supported by
ENISA

ISO 27001

Used data centers are certified
with iso 27001

Cloud Security Alliance

Athento is certified
with CSA

Logo PCI DSS

PCI DSS
Level 1 certified

Logo PCI DSS

SOC 1 type II and
SOC 2 type II certified