Digital Documents, LLC

Document Scanning & Imaging Services Leaders

TO EMAIL US PLEASE CLICK HERE >
 

Home Contact Us Site Map
 
 

 

Services Quote
Professionally equipped to manage your needs, we offer services that allow you to allocate resources efficiently while staying focused on your core business.  To request a Quote, please click here to email us.  In your email please provide a description of the project and a contact phone number.

 

Our Brochure
To request our brochure, please click here to email us.  In your email please provide a description of the project and a contact phone number.


 
 

Optimizing PDFs for User Search

OCR or Optical Character Recognition refers to technologies that involve "reading" words from a scanned image by translating each character on an image into searchable text. OCR enables users to search for and retrieve information within a file or page. In addition, when a set of files is indexed, users are able to search for keywords across an entire document library and retrieve each page with exact precision. OCR enables users to execute searches in seconds, searches that once required hours or days to complete.

To provide our customers with the Optimal OCR Accuracy and Layout Retention, the Digital Documents Technology Team has developed dDOptimaOCR© 3.5, the most advanced Optical Character Recognition technologies and processes in the industry. dDOptimaOCR utilizes advanced OCR technologies and processes to enable six-sigma level character and formatting accuracy, and ensure that the highest quality image possible is presented to the OCR engines for conversion to text.

It is important to note that the quality and condition of a paper document collection are key factors in the successful recognition of characters to create readable text. Therefore, to enhance the quality of each original page, we start by focusing on the scan quality of each image -- removing noise such as borders, speckles, and skews. In addition, we utilize advanced color filter technologies to remove any page background colors, in conjunction with multi-light image capture technologies to remove any shadows cast by page creases that could impact image quality or recognition accuracy.

Once document scanning and processing are complete, an OCR text layer is added behind each image utilizing our dDOptimaOCR solution. This solution begins with an additional orientation filter to ensure that the best image is presented to the OCR engines. Next, the characters in the image are processed utilizing multi-engine OCR voting technologies that rank each character to determine the best text recognition fit. Then once a word is generated, it is filtered through a proprietary lexicon to ensure the highest quality results. Finally, this text is processed utilizing sophisticated layout retention technologies to represent the image text layout, providing the best possible text representation for pinpoint search and retrieval accuracy. Once these processes are complete, the Quality Control Team generates an OCR Benchmark Report detailing the accuracy of the OCR process and the quality of the results.

 
 

 
 

Home     Overview     Offerings     Methods     Guidelines     Benefits     Contact Us

As an industry leader, we provide ISO 32000 compliant document conversion services with a focus on value pricing and quality to support any size project.  Utilizing our Industry Best-Practice dDSpeedScan© "digitization" technologies, we offer the highest quality services in the industry.  With over thirty (30) years of industry experience, our seasoned Team serves global private and public sector organizations in their paperless office initiatives.  The services we provide include: Document Management Services, Knowledge Management Services, File Scanning, Imaging Services, Large-Format & Blueprint Scanning, Microfiche Conversion, Electronic File Conversion (to PDF, TIFF or 250 file formats), Multi-Engine Optical Character Recognition (OCR), and File Naming and Indexing Services.  By Converting Information into Digital Assets, we enable Organizations to Go Digital -- increasing their Productivity, Performance and Profits. 

Copyright 2001-2008 - Digital Documents, LLC

8000 Towers Crescent Drive, Vienna, Virginia / Fairfax, Virginia / McLean, Virginia (VA)

Washington DC (DC), Maryland (MD), New York (NY), Pennsylvania (PA),

Ohio (OH), Illinois (IL), North Carolina (NC), South Carolina (SC),

Georgia (GA), Florida (FL), Texas (TX) and California (CA)

Coast-to-Coast We Serve as Document Scanning & Imaging Services Leaders