nordlys.core.retrieval.indexer_mongo module¶
Mongo Indexer¶
This class is a tool for creating an index from a Mongo collection.
To use this class, you need to implement callback_get_doc_content()
function.
See indexer_fsdm
for an example usage of this class.
Author: | Faegheh Hasibi |
---|
-
class
nordlys.core.retrieval.indexer_mongo.
IndexerMongo
(index_name, mappings, collection, model='BM25')[source]¶ Bases:
object
-
build
(callback_get_doc_content, bulk_size=1000)[source]¶ Builds the DBpedia index from the mongo collection.
To speedup indexing, we index documents as a bulk. There is an optimum value for the bulk size; try to figure it out.
Parameters: - callback_get_doc_content – a function that get a documet from mongo and return the content for indexing
- bulk_size – Number of documents to be added to the index as a bulk
-