Audit and OCR Documents in SharePoint with Searchlight

Searchlight is a no-code auditing and OCR tool that makes all image and PDF files searchable in SharePoint. Start with an audit to identify all your non-searchable content, and then add OCR to unlock text in all your existing files.

Benefits

Full SharePoint Visibility

Audit your SharePoint library to discover the percentage of non-searchable, partially searchable, and fully searchable files you have.

Unlock Hidden Content

Make informed decisions by accessing all your SharePoint content. OCR non-searchable images, scans, and faxes into fully searchable PDFs.

High-Volume Document Auditing

Unlock content with an AI-powered OCR engine for your entire SharePoint library. Keep files indexed by applying OCR to newly added or recently edited documents.

Meet Compliance Standards

Remove the risk of sensitive data remaining hidden in non-searchable files. OCR and identify all confidential, personal, or legally protected information in your SharePoint library.

Key Features

Audit

Audit the document library to determine which documents are candidates for processing by examining each document’s searchability status and the document library’s processing settings.

Capabilities:
  • Searchability status: non-searchable (scans, faxes, TIFFs, and images), partially searchable, fully searchable
  • Input file support: PDF, TIFF, JPEG, BMP, MSG with PDF attachments
  • Process SharePoint lists
  • Exclude specific documents
  • Audit reporting
  • Filter documents by regular expression
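
To make the audit idea concrete, here is a minimal Python sketch of how documents could be grouped into the three searchability statuses, filtered by a regular expression, and summarized as percentages. It is an illustration only and not Searchlight's implementation: the function names, the `pages_with_text` and `total_pages` fields, and the sample file names are all assumptions made for this example.

```python
import re
from collections import Counter


def classify(pages_with_text: int, total_pages: int) -> str:
    """Label a document by how many of its pages carry a text layer (assumed rule)."""
    if total_pages <= 0:
        raise ValueError("total_pages must be positive")
    if pages_with_text == 0:
        return "non-searchable"
    if pages_with_text == total_pages:
        return "fully searchable"
    return "partially searchable"


def audit(documents, include_pattern=r".*"):
    """Return the percentage of each status among documents whose name matches the pattern."""
    pattern = re.compile(include_pattern, re.IGNORECASE)
    selected = [d for d in documents if pattern.search(d["name"])]
    if not selected:
        return {}
    counts = Counter(
        classify(d["pages_with_text"], d["total_pages"]) for d in selected
    )
    return {status: round(100 * n / len(selected), 1) for status, n in counts.items()}


# Hypothetical sample data; a real audit would read this from the library.
documents = [
    {"name": "invoice-001.pdf", "pages_with_text": 0, "total_pages": 3},
    {"name": "invoice-002.pdf", "pages_with_text": 2, "total_pages": 4},
    {"name": "contract.pdf", "pages_with_text": 5, "total_pages": 5},
    {"name": "scan.tiff", "pages_with_text": 0, "total_pages": 1},
]

# Audit only PDF files by filtering on the file name.
report = audit(documents, r"\.pdf$")
print(report)
```

Documents labelled non-searchable or partially searchable in a report like this would be the natural candidates for OCR.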

OCR

Searchlight’s optical character recognition (OCR) technology creates a text version of the file contents.

Capabilities:
  • Same high-quality OCR engine as Adobe Acrobat (Canon IRIS)
  • 120+ languages supported
  • A maximum number of languages per job
  • Advanced PDF compression (iHQC)
  • JPEG2000 compression
  • PDF/A validation
  • Retain bookmarks/metadata/viewer preferences
  • Process hidden text
  • Advanced preprocessing settings

Monitor and Schedule

Automate the document monitoring process, allowing Searchlight OCR to scan the document library and automatically handle new and updated documents.

Capabilities:
  • Job scheduling
  • Optional tagging support
  • Advanced job management and scheduling for multiple jobs
  • Email alerts and CSV reports
  • Processing history database
  • Monitoring status
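
The monitoring behavior described above, picking up new and updated documents on each scheduled run, can be pictured with a short Python sketch. It is a simplified illustration under stated assumptions, not Searchlight's actual logic: the `needs_processing` function, the in-memory `history` dictionary standing in for the processing history database, and the sample file names and timestamps are all hypothetical.

```python
from datetime import datetime


def needs_processing(documents, history):
    """Return names of documents that are new or were modified after their last processing time."""
    pending = []
    for doc in documents:
        last_processed = history.get(doc["name"])
        # A document is pending if it was never processed or has changed since.
        if last_processed is None or doc["modified"] > last_processed:
            pending.append(doc["name"])
    return pending


# Hypothetical processing history: document name -> time of last successful run.
history = {
    "report.pdf": datetime(2024, 3, 1, 9, 0),
    "edited.pdf": datetime(2024, 3, 1, 9, 0),
}

# Hypothetical library listing with last-modified timestamps.
documents = [
    {"name": "report.pdf", "modified": datetime(2024, 2, 20, 14, 30)},
    {"name": "new-scan.pdf", "modified": datetime(2024, 3, 5, 8, 15)},
    {"name": "edited.pdf", "modified": datetime(2024, 3, 4, 16, 45)},
]

pending = needs_processing(documents, history)
print(pending)
```

A scheduled job would run a check like this at each interval and then record the new processing time for every document it handles.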

Searchlight Can Do Much More

Archiving

Transform SharePoint files into fully searchable and electronically archived documents. Archive image, MSG, and PDF files into long-term veraPDF and ISO-compliant PDF/A standards (PDF/A-1, PDF/A-2, PDF/A-3).

Metadata Tagging

Identify relevant metadata by using automated metadata tagging. Retain current metadata or configure Searchlight’s metadata extractor module to add metadata to new and existing documents automatically.

Content Reuse

Reuse content from searchable PDFs by selecting and copying the newly generated text layer. Convert legacy documents into a searchable and usable format.

Multi-Language Support

Searchlight OCR supports more than 100 languages, including Asian languages, Arabic, Farsi, and Hebrew. The extended OCR engine can recognize multiple languages within a single document when they share an alphabet, e.g. French, German, and Italian.

PDF Compression

Images in the output PDF file can be compressed using JBIG2 (for black and white images) or MRC (for color images). This dramatically reduces the output size of PDFs and reduces the size of your SharePoint library.

Reporting

Generate scheduled reports and receive email alerts about the latest updates to your SharePoint library. Get a summary of the library status as a whole or details about specific job runs.

Trusted by thousands of high-profile organizations

Pfizer
Bank of America
BBC Worldwide
Siemens
IRS

Deployment Options

Choose between subscription plans for SharePoint Online and on-premises deployment.

SharePoint Online

For SharePoint Online (Office 365), choose between installing Searchlight OCR on your own local servers or on an Azure instance.

On-Premises Deployment

Deploy Searchlight for SharePoint On-Premises within your in-house IT infrastructure. The installation package covers all license types and SharePoint editions (2013–2019). To learn more, refer to the requirements and documentation.

SharePoint Online (Office 365) System Requirements

Supported OS

  • Windows 11 (x64)
  • Windows 10 (x64)
  • Windows Server 2012 (x64)
  • Windows Server 2016
  • Windows Server 2019
  • Windows Server 2022

Additional Tools

SharePoint Server Client Components SDK (x86 | x64)

SharePoint Migration

Support for the Windows file system allows documents to be preprocessed before uploading in large migrations.

Supported Operating Systems

  • Windows 10 (64-bit)
  • Windows Server 2016
  • Windows Server 2019
  • Windows Server 2022

Recommended Memory

  • Single Core Deployment: 8 GB RAM
  • 8 Core Deployment: 16 GB RAM
  • More than 8 Cores: ask support@aquaforest.com

Recommended CPU

  • Single Core Deployment: i5 processor
  • 8 Core Deployment: i7 processor
  • More than 8 Cores: ask support@aquaforest.com

Disk Space

950 MB

.NET Framework

4.7.2

Visual C++ Redistributable

The Visual C++ Redistributable package is required for deployment and development. The Aquaforest engine requires Visual C++ 2017 Redistributable (x86 | x64).

© Muhimbi Ltd. 2008 - 2024