GroupDocs.Search delivers enterprise-grade full-text search, document indexing, and text extraction to help teams create a fast, accurate search experience across PDFs, Office files, images (OCR), emails, archives, and more. Build end-user search features, create text indexes for quick document search, and keep results fresh with high-performance index updates.
- Recent updates improve OCR-backed indexing, speed up spell correction, and refine fuzzy search defaults.
- New guidance on distributed search networks and image-hash indexing for reverse image search.
- Expanded examples for regular expressions, homophone search, and keyboard layout correction.
High-performance APIs for full-text search, indexing, and highlighting on .NET Framework and .NET Core.
- GroupDocs.Search-for-.NET: Core API for indexing, spell check, fuzzy/regex/boolean search.
  Repository: https://github.com/groupdocs-search/GroupDocs.Search-for-.NET
- Showcases: ASP.NET MVC UI demonstrating search and highlighting.
  Examples: https://github.com/groupdocs-search/GroupDocs.Search-for-.NET/tree/master/Showcases/GroupDocs.Search-UI
using GroupDocs.Search;
using GroupDocs.Search.Results;

string indexFolder = @"c:\MyIndex\";
string documentsFolder = @"c:\MyDocuments\";

// Create the index in the given folder and add documents to it
Index index = new Index(indexFolder);
index.Add(documentsFolder);

// Regular expression search: the leading caret marks the query as a regex;
// the pattern matches two or more identical characters at the start of a word
SearchResult result = index.Search("^^(.)\\1{1,}");

Native Java API for indexing, fuzzy search, regex search, and OCR-backed text search.
Repository: https://github.com/groupdocs-search/GroupDocs.Search-for-Java
// Create the index in the given folder and add documents to it
com.groupdocs.search.Index index = new com.groupdocs.search.Index("indexDir");
index.add("docsDir");

// Run a text query with default search options
com.groupdocs.search.results.SearchResult result =
    index.search("keyboard layout correction", new com.groupdocs.search.options.SearchOptions());

Cross-platform bindings to integrate full-text search into scripting and server workflows.
Repositories:
- Python via .NET: https://github.com/groupdocs-search/GroupDocs.Search-for-PythonNet
- Node.js via Java: https://github.com/groupdocs-search/GroupDocs.Search-for-Node.js-via-Java
- Enterprise content search across mixed formats with text indexing and OCR.
- Compliance & legal discovery: highlight hits, export results, filter by metadata.
- Knowledge bases & customer support search with typo-tolerant and synonym search.
- E-discovery and audit: date/number range queries, faceted search, and filtering.
- Media libraries: reverse image search using indexed image hashes.
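
To make the reverse image search use case concrete, here is a minimal Java sketch of one common image-hashing technique: an average hash compared by Hamming distance. It is an illustration of the general idea only and does not show how GroupDocs.Search computes or stores image hashes. The class `ImageHashSketch`, its methods, and the file names are hypothetical, and the input is assumed to be an image already reduced to an 8x8 grayscale grid.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration class; not part of the GroupDocs.Search API.
final class ImageHashSketch {

    // Average hash: one bit per pixel of an 8x8 grayscale grid,
    // set when the pixel is at least as bright as the grid's mean.
    static long averageHash(int[][] gray8x8) {
        long sum = 0;
        for (int[] row : gray8x8) {
            for (int value : row) {
                sum += value;
            }
        }
        double mean = sum / 64.0;
        long hash = 0L;
        for (int[] row : gray8x8) {
            for (int value : row) {
                hash = (hash << 1) | (value >= mean ? 1L : 0L);
            }
        }
        return hash;
    }

    // Number of bit positions in which the two hashes differ.
    static int hammingDistance(long a, long b) {
        return Long.bitCount(a ^ b);
    }

    // Names of indexed images whose hash is within maxDistance bits of the query hash.
    static List<String> findSimilar(Map<String, Long> index, long queryHash, int maxDistance) {
        List<String> matches = new ArrayList<>();
        for (Map.Entry<String, Long> entry : index.entrySet()) {
            if (hammingDistance(entry.getValue(), queryHash) <= maxDistance) {
                matches.add(entry.getKey());
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        int[][] horizontal = new int[8][8]; // brightness grows left to right
        int[][] brighter = new int[8][8];   // same picture, uniformly brighter
        int[][] vertical = new int[8][8];   // brightness grows top to bottom
        for (int r = 0; r < 8; r++) {
            for (int c = 0; c < 8; c++) {
                horizontal[r][c] = c * 32;
                brighter[r][c] = c * 32 + 10;
                vertical[r][c] = r * 32;
            }
        }
        // The "index" is a map from image name to its precomputed hash.
        Map<String, Long> index = new LinkedHashMap<>();
        index.put("horizontal.png", averageHash(horizontal));
        index.put("vertical.png", averageHash(vertical));
        // Query with the brightened copy and allow up to 5 differing bits.
        System.out.println(findSimilar(index, averageHash(brighter), 5));
    }
}
```

Because the hash depends only on whether each pixel is above or below the mean, a uniform brightness change leaves it unchanged, which is why the brightened copy still matches its original. The distance threshold is a tuning choice: a larger value tolerates more visual difference at the cost of more false matches.
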
- 80+ formats, OCR-backed text extraction, metadata indexing, and custom extractors.
- Create text indexes in memory or on disk; update, merge, optimize, and compact indexes.
- Flexible search modes: simple, boolean, regex, fuzzy, wildcard, phrase with wildcards, faceted search.
- Spell check, keyboard layout correction, homophone search, and synonym search.
- Indexing controls: stop words, blended characters, whole-word characters, character replacement.
- Distributed search network support for load-balanced queries across indexes.
- Save extracted text with configurable compression; filter documents during indexing.
- Highlight results per document or per segment; search across multiple indexes simultaneously.
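
For readers new to text indexing, the following Java sketch shows the general idea behind two of the features above: an inverted index for exact term lookup, and fuzzy (typo-tolerant) lookup based on Levenshtein edit distance. It is a simplified illustration under stated assumptions, not the GroupDocs.Search implementation or API. The class `FuzzyIndexSketch`, its methods, and the sample documents are hypothetical, and a real engine would avoid scanning every term on each fuzzy query.

```java
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical illustration class; not part of the GroupDocs.Search API.
final class FuzzyIndexSketch {

    // Inverted index: lower-cased term -> names of the documents containing it.
    private final Map<String, Set<String>> postings = new TreeMap<>();

    // Split the text on anything that is not a letter or digit and record each term.
    void add(String documentName, String text) {
        for (String term : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
            if (!term.isEmpty()) {
                postings.computeIfAbsent(term, k -> new TreeSet<>()).add(documentName);
            }
        }
    }

    // Exact lookup: documents that contain the term as written (case-insensitive).
    Set<String> search(String term) {
        Set<String> found = postings.get(term.toLowerCase(Locale.ROOT));
        return found == null ? new TreeSet<>() : new TreeSet<>(found);
    }

    // Fuzzy lookup: documents containing any indexed term within maxEdits of the query term.
    Set<String> fuzzySearch(String term, int maxEdits) {
        String query = term.toLowerCase(Locale.ROOT);
        Set<String> hits = new TreeSet<>();
        for (Map.Entry<String, Set<String>> entry : postings.entrySet()) {
            if (editDistance(query, entry.getKey()) <= maxEdits) {
                hits.addAll(entry.getValue());
            }
        }
        return hits;
    }

    // Levenshtein distance computed with two rolling rows.
    static int editDistance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1), prev[j - 1] + cost);
            }
            int[] swap = prev;
            prev = curr;
            curr = swap;
        }
        return prev[b.length()];
    }

    public static void main(String[] args) {
        FuzzyIndexSketch index = new FuzzyIndexSketch();
        index.add("contract.pdf", "Indemnity clause and liability");
        index.add("memo.docx", "Liability waiver memo");
        System.out.println(index.search("liability"));         // exact term lookup
        System.out.println(index.fuzzySearch("indemnety", 1)); // tolerates one typo
    }
}
```

The edit budget (`maxEdits`) plays the same role as a fuzzy-search tolerance setting: a budget of 0 behaves like exact search, while larger values trade precision for recall.
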
- .NET: `Install-Package GroupDocs.Search` or `dotnet add package GroupDocs.Search`
- Java: Add the GroupDocs.Search dependency from Maven Central.
- Run sample projects from the repositories above to index documents and execute search queries.
- Documentation and tutorials: https://docs.groupdocs.com/search/
- Free support forum: https://forum.groupdocs.com/c/search
- Temporary license for full-feature evaluation: https://purchase.groupdocs.com/temporary-license
groupdocs-search document-indexing full-text-search search-text-in-document create-text-index quick-document-search regex-search fuzzy-search synonym-search ocr-search reverse-image-search enterprise-search metadata-search spell-check-search