groupdocs-search/GroupDocs.Search-for-Java

Java Library to Search & Index Documents

GroupDocs.Search for Java is a search and indexing library that performs basic and advanced search operations across a wide range of document formats to help you gain insight into your data. It indexes documents of many types and then searches them using different kinds of search queries. Advanced search techniques such as fuzzy, synonym, boolean and full-text search are supported.

Directory | Description
Examples | Java examples and sample documents for you to get started quickly.

Document Indexing Features

  • 80+ supported file formats.
  • Create index in memory or on disk.
  • Update index to take into account changed, deleted and added documents.
  • Merge several indexes into one.
  • Optimize index to improve search performance.
  • Indexing password-protected documents.
  • Indexing with stop words.
  • Support for indexing additional fields.
  • Support for blended characters.
  • Support for characters indexed as a whole word.
  • Support for character replacement during indexing.
  • Support for custom text extractors.
  • Option for compact and metadata index.
  • Ability to save extracted text in the index with different levels of compression.
  • Document filtering during indexing.
  • Deleting indexed paths from index.
  • Ability to separately extract data from documents and index them.
  • Support for optical character recognition (OCR) of text in images.
  • Calculation and indexing of image hashes for reverse image search.
  • The ability to create a distributed search network that automatically balances the load across nodes.
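
To make the "indexing with stop words" item above more concrete, here is a minimal sketch of the general idea using only the Java standard library. It is not GroupDocs.Search code and says nothing about how the library implements indexing; the class name `StopWordIndexSketch`, the stop-word list and the tokenization rule are all assumptions made for this illustration.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Illustrative sketch only: a toy in-memory inverted index, not the GroupDocs.Search API.
final class StopWordIndexSketch {
    // Hypothetical stop-word list chosen for this example.
    static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("a", "an", "and", "the", "of", "to"));

    // Maps each indexed term to the names of the documents that contain it.
    private final Map<String, Set<String>> postings = new HashMap<>();

    // Splits the text into lower-cased tokens and indexes every token that is not a stop word.
    void add(String documentName, String text) {
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
            if (token.isEmpty() || STOP_WORDS.contains(token)) {
                continue; // stop words never reach the index
            }
            Set<String> documents = postings.get(token);
            if (documents == null) {
                documents = new TreeSet<>();
                postings.put(token, documents);
            }
            documents.add(documentName);
        }
    }

    // Returns the names of the documents containing the word (case-insensitive).
    Set<String> search(String word) {
        Set<String> documents = postings.get(word.toLowerCase(Locale.ROOT));
        return documents == null ? Collections.<String>emptySet() : documents;
    }

    public static void main(String[] args) {
        StopWordIndexSketch index = new StopWordIndexSketch();
        index.add("a.txt", "The history of search engines");
        index.add("b.txt", "Search and indexing");
        System.out.println(index.search("search"));
        System.out.println(index.search("the"));
    }
}
```

Because stop words are dropped at indexing time, a query for one of them finds nothing in this toy index, while the index itself stays smaller.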

Document Search Features

  • Simple word search.
  • Boolean search.
  • Regular expression search.
  • Faceted search.
  • Case sensitive search.
  • Flexible fuzzy search.
  • Synonym search.
  • Homophone search.
  • Wildcard search.
  • Phrase search with wildcards.
  • Search for different word forms.
  • Date range search.
  • Numeric range search.
  • Search by chunks (pages).
  • Document filtering in search result.
  • Search for different object types: text, numbers, dates, file names, document types, metadata fields, document creation/modification dates.
  • Combine different types of search into one search query.
  • Alias substitution in search queries.
  • Perform spell check during search.
  • Perform keyboard layout correction during search.
  • Search queries in text or flexible object form.
  • Highlight search results in the text of the entire document or in text segments.
  • Multiple simultaneous thread-safe searches.
  • Thread-safe search during indexing, updating or merging operations.
  • Search over several indexes simultaneously.
  • Built-in support for reverse image search.
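
As a rough illustration of what fuzzy search in the list above means, the following standard-library sketch matches a misspelled query against a list of terms using Levenshtein edit distance. This README does not state which algorithm GroupDocs.Search uses, so treat this as a generic example of the technique; the class `FuzzyMatchSketch` and its method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustrative sketch only: generic Levenshtein-based fuzzy matching, not the GroupDocs.Search API.
final class FuzzyMatchSketch {
    // Computes the Levenshtein distance (insertions, deletions, substitutions) between two strings.
    static int editDistance(String a, String b) {
        int[] previous = new int[b.length() + 1];
        int[] current = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            previous[j] = j; // distance from the empty prefix of a
        }
        for (int i = 1; i <= a.length(); i++) {
            current[0] = i; // distance to the empty prefix of b
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                current[j] = Math.min(
                        Math.min(current[j - 1] + 1, previous[j] + 1),
                        previous[j - 1] + cost);
            }
            int[] swap = previous;
            previous = current;
            current = swap;
        }
        return previous[b.length()];
    }

    // Returns the terms whose distance from the query is at most maxEdits, in input order.
    static List<String> fuzzyMatches(String query, List<String> terms, int maxEdits) {
        List<String> matches = new ArrayList<>();
        for (String term : terms) {
            if (editDistance(query, term) <= maxEdits) {
                matches.add(term);
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        List<String> terms = Arrays.asList("windows", "window", "linux");
        System.out.println(fuzzyMatches("windws", terms, 1));
    }
}
```

The `maxEdits` parameter plays the role of a tolerance setting: raising it admits more distant spellings at the cost of more false matches.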

Get Started with GroupDocs.Search for Java

GroupDocs.Search for Java requires Java SE 7 (1.7) or later. Please install Java first if you do not have it already.

GroupDocs hosts all Java APIs on GroupDocs Artifact Repository, so simply configure your Maven project to fetch the dependencies automatically.

Perform Case-Sensitive Search with a Query in Object Form

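// imports assumed from the GroupDocs.Search for Java package layout
import com.groupdocs.search.Index;
import com.groupdocs.search.SearchQuery;
import com.groupdocs.search.options.SearchOptions;
import com.groupdocs.search.results.SearchResult;

// placeholder paths; replace them with your own folders
String indexFolder = "path/to/index";
String documentsFolder = "path/to/documents";

// create an index in the specified folder
Index index = new Index(indexFolder);

// index documents from the specified folder
index.add(documentsFolder);

// enable case-sensitive search
SearchOptions options = new SearchOptions();
options.setUseCaseSensitiveSearch(true);

// create the search query in object form
SearchQuery query = SearchQuery.createWordQuery("Windows");

// search the index; only the exact casing "Windows" matches
SearchResult result = index.search(query, options);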

Home | Product Page | Documentation | Demos | API Reference | Examples | Blog | Search | Free Support | Temporary License

About

GroupDocs.Search for Java examples, plugins and showcase projects: https://products.groupdocs.com/search/java
