This job configuration automates OCR processing in three steps:
Step 1 uses a full-page OCR process on each image. Field data is extracted from the full-page OCR using template and dictionary matching algorithms. This is done in Pre-Index mode to allow unattended processing. Data is saved to a database so it can be reviewed and corrected in Step 2.
Step 2 uses Database Update mode to find images with missing index values and allow the user to manually enter the correct data.
Step 3 uses a SimpleSearch configuration to search and view the indexed images, including full-text searches.
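
The three steps can be pictured with a small Python sketch. This is only an illustration of the general idea, not how SimpleIndex or SimpleSearch work internally: the field names, regular expressions, vendor list, table layout, and function names are all hypothetical, and SQLite stands in for whatever database the job actually uses.

```python
import re
import sqlite3

# Hypothetical template: one regular expression per index field.
TEMPLATES = {
    "invoice_no": re.compile(r"Invoice\s*(?:No\.?|#)\s*:?\s*(\d+)", re.IGNORECASE),
    "date": re.compile(r"\b(\d{2}/\d{2}/\d{4})\b"),
}

# Hypothetical dictionary of known values for the "vendor" field.
VENDORS = ["Acme Corp", "Globex", "Initech"]


def extract_fields(ocr_text):
    """Pull field values out of full-page OCR text (template + dictionary match)."""
    fields = {}
    for name, pattern in TEMPLATES.items():
        match = pattern.search(ocr_text)
        fields[name] = match.group(1) if match else None
    lowered = ocr_text.lower()
    # Dictionary match: first known vendor that appears anywhere on the page.
    fields["vendor"] = next((v for v in VENDORS if v.lower() in lowered), None)
    return fields


def pre_index(conn, pages):
    """Step 1: unattended pass that stores extracted fields and the full text."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS index_data ("
        "image TEXT PRIMARY KEY, invoice_no TEXT, date TEXT, "
        "vendor TEXT, full_text TEXT)"
    )
    for image, text in pages.items():
        f = extract_fields(text)
        conn.execute(
            "INSERT OR REPLACE INTO index_data VALUES (?, ?, ?, ?, ?)",
            (image, f["invoice_no"], f["date"], f["vendor"], text),
        )
    conn.commit()


def images_needing_review(conn):
    """Step 2: list images with at least one missing index value."""
    rows = conn.execute(
        "SELECT image FROM index_data "
        "WHERE invoice_no IS NULL OR date IS NULL OR vendor IS NULL "
        "ORDER BY image"
    )
    return [row[0] for row in rows]


def full_text_search(conn, term):
    """Step 3: naive full-text search over the stored OCR text."""
    rows = conn.execute(
        "SELECT image FROM index_data WHERE full_text LIKE ? ORDER BY image",
        (f"%{term}%",),
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    # Made-up OCR output for two scanned pages.
    pages = {
        "a.tif": "Acme Corp\nInvoice No: 1042\nDate 03/15/2021",
        "b.tif": "Globex shipping notice, no number here",
    }
    conn = sqlite3.connect(":memory:")
    pre_index(conn, pages)
    print(images_needing_review(conn))
    print(full_text_search(conn, "shipping"))
```

In this sketch the second page has no invoice number or date, so it is the one that would be queued for manual entry in Step 2.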
|How are Simple Software products licensed?|
|How do you configure OCR to read index information from MS Office or PDF documents?|
|Can SimpleIndex create searchable PDF Image+Text files with hidden text?|
|How do you configure full text searching in Retrieval mode?|
|Can OCR text be saved to MS Word or HTML formats?|
|I'm using full page OCR. The information is all appearing in the txt file but it is losing format about half way through. Data to the right is ending up at the end of the txt doc. Can this be fixed?|
|Some pages in my documents have unwanted barcodes that are being read. How can I exclude these from recognition?|
|How do you configure the Autofill feature?|
|Is it possible to read OCR or Barcodes only on specific pages instead of every page?|
|What is the point of SimpleQC?|