Follow Us to Stay Connected: Google Plus Twitter

 

Request Our Brochure
Our document management services facilitate instant and secure access to information with increased productivity, performance and profits. In your email please provide a description of the project and a contact phone number

Scanning Services Inquiry
Professionally equipped to manage your needs, we offer services that allow you to allocate resources efficiently while staying focused on your core business. In your email please provide a description of the project and a contact phone number

Optimizing PDFs for User Search - Optical Character Recognition

OCR or Optical Character Recognition refers to technologies that involve "reading" words from a scanned image by translating each character on an image into searchable text. OCR enables users to search for and retrieve information within a file or page. In addition, when a set of files is indexed, users are able to search for keywords across an entire document library and retrieve each page with exact precision. OCR enables users to execute searches in seconds, searches that once required hours or days to complete.

To provide our customers with the Optimal OCR Accuracy and Layout Retention, the Digital Documents Technology Team has developed dDOptimaOCR© 3.5, the most advanced Optical Character Recognition technologies and processes in the industry. dDOptimaOCR utilizes advanced OCR technologies and processes to enable six-sigma level character and formatting accuracy, and ensure that the highest quality image possible is presented to the OCR engines for conversion to text. Our OCR scanning services ensure that you will get the best quality available.

It is important to note that the quality and condition of a paper document collection are key factors in the successful recognition of characters to create readable text. Therefore, to enhance the quality of each original page, we start by focusing on the scan quality of each image -- removing noise such as borders, speckles, and skews. In addition, we utilize advanced color filter technologies to remove any page background colors, in conjunction with multi-light image capture technologies to remove any shadows cast by page creases that could impact image quality or recognition accuracy.

Once document scanning and processing are complete, an OCR text layer is added behind each image utilizing our dDOptimaOCR solution. This solution begins with an additional orientation filter to ensure that the best image is presented to the OCR engines. Next, the characters in the image are processed utilizing multi-engine OCR voting technologies that rank each character to determine the best text recognition fit. Then once a word is generated, it is filtered through a proprietary lexicon to ensure the highest quality results. Finally, this text is processed utilizing sophisticated layout retention technologies to represent the image text layout, providing the best possible text representation for pinpoint search and retrieval accuracy. Once these processes are complete, the Quality Control Team generates an OCR Benchmark Report detailing the accuracy of the OCR process and the quality of the results.

Need More Information?
Contact us Today to Request our Services or to Receive our Brochure

Contact Us Button

Other Ways to Connect
Call us: 703-288-5555

Follow Us to Stay Connected
Google Plus Twitter