Creates a new Bloom search instance based on Bloom filters, which can be used to add documents and test the membership of search terms in the added set.
Bloom search options.
Readonly
errorError rate used in all Bloom filters to generate document signatures.
Readonly
fieldsA record containing the name of all indexable fields and their relative weight used to rank search results.
Collection containing document summaries and Bloom filter signatures used to search, with document shorthand reference identifier used as keys.
Readonly
minMinimum term cardinality used to calculate the Bloom filter size. This can be used to reduce false positives when dealing with small documents with sparse term frequency distribution.
Readonly
ngramsThe n-grams to store in the index. Defaults to 1
(no n-grams).
Readonly
seedHash seed to use in Bloom Filters, defaults to 0x00c0ffee
.
Readonly
summaryAn array with the names of fields to preserve as summary, and which are returned as search results for the matching documents. It is recommended to keep only fields necessary to identify a document (e.g. title, URL, short description) to keep space requirements down.
Readonly
termOptimises storage by grouping indexed terms into buckets according to term frequency in a document.
Indexes a single document with its unique reference identifier.
A unique reference identifier for the document. Adding another document with the same reference replaces the document on the search index.
The document to index.
Optional
language: stringLanguage identifier which is fed back into the
stemmer
and stopwords
callback functions to help
decide how to handle these steps.
Replaces the instance's index with an index from another instance. Its primary use case is to rehydrate the index from a static file or payload.
NB: Calling this method will not change any other attributes in the
instance. It is up to developers to ensure that the instances were
initialised with compatible options, in particular the stemmer
function. Incompatible stemmer
implementations may cause matches to
not be found in the rehydrated index.
Replacement index.
Removes an indexed document.
Reference identifier of the document to remove.
Scans the document index and returns a list of documents summaries (with
only the properties declared in the summary
option) that possibly match
one or more terms in the query.
Each search term is run through the provided stemmer
function to ensure
terms are processed in the same way as the tokens previously added to the
index's signature.
Terms to search.
Individual words are matched against the signature of each indexed document.
You may prefix each word with the +
operator to intersect results that
(probably) contain the required word, or use the -
operator to exclude
results containing the word.
If the ngrams
option is greater than 1
, you are also able to search for
exact phrases up to ngrams
words typed between quotes (for example,
"this phrase"
). Only documents containing these words in that sequence
are returned in the search results.
Optional
language: stringLanguage identifier for the search terms. This is used only to help choose the appropriate stemming algorithm, search results will not filtered by language.
Ordered list of document summaries, sorted by probable search term frequency.
Indexer and searcher based on Bloom filters.