nordlys.core.data.facc.facc2mongo module

FACC to Mongo

Adds entity surface forms from the Freebase Annotated ClueWeb Corpora (FACC).

The input to this script is a set of (name variant, Freebase entity, count) triples. See data/facc1/README.md for how to prepare FACC data in this format.
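To make the input format concrete, the following is a minimal sketch of reading such triples. It assumes one triple per line with tab-separated fields, which is not confirmed here; data/facc1/README.md is the authority on the actual layout. The names `Triple` and `parse_triples` are hypothetical helpers and not part of the nordlys API.

```python
from typing import Iterable, Iterator, NamedTuple


class Triple(NamedTuple):
    """One (name variant, Freebase entity, count) record."""
    surface_form: str
    entity: str
    count: int


def parse_triples(lines: Iterable[str]) -> Iterator[Triple]:
    """Yield a Triple for each well-formed line.

    Assumption: fields are separated by tabs. Lines that do not have
    exactly three fields, or whose count is not an integer, are skipped.
    """
    for line in lines:
        parts = line.rstrip("\n").split("\t")
        if len(parts) != 3:
            continue  # blank or malformed line
        name, entity, count = parts
        try:
            yield Triple(name, entity, int(count))
        except ValueError:
            continue  # non-numeric count
```

Because the function accepts any iterable of strings, it can be fed an open file handle directly, so a large input file does not have to be loaded into memory at once.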

Authors: Krisztian Balog, Faegheh Hasibi

class nordlys.core.data.facc.facc2mongo.FACCToMongo(config)

Bases: object

Inserts FACC surface forms into MongoDB.

build()

Builds the surface form collection from FACC annotations.
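As an illustration of what building a surface form collection can involve, the sketch below groups triples by surface form and sums the counts per entity. It is not the FACCToMongo implementation: the function name `aggregate_surface_forms`, the optional lowercasing, and the resulting document shape are all assumptions made for the example, and the entity IDs in the usage lines are placeholders.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def aggregate_surface_forms(
    triples: Iterable[Tuple[str, str, int]],
    lowercase: bool = True,
) -> Dict[str, Dict[str, int]]:
    """Group (surface form, entity, count) triples by surface form.

    Returns a mapping: surface form -> {entity: total count}.
    Lowercasing the surface form is an assumption of this sketch.
    """
    forms: Dict[str, Dict[str, int]] = defaultdict(lambda: defaultdict(int))
    for name, entity, count in triples:
        key = name.lower() if lowercase else name
        forms[key][entity] += count  # merge repeated (form, entity) pairs
    # Convert nested defaultdicts to plain dicts for storage or printing.
    return {name: dict(entities) for name, entities in forms.items()}


# Usage with placeholder entity IDs:
docs = aggregate_surface_forms([
    ("New York", "/m/aaa", 3),
    ("new york", "/m/aaa", 2),
    ("new york", "/m/bbb", 1),
])
print(docs)
```

Each entry of the returned mapping could then be written to a MongoDB collection as one document per surface form, for example with pymongo's `update_one(..., upsert=True)`.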

nordlys.core.data.facc.facc2mongo.arg_parser()

nordlys.core.data.facc.facc2mongo.main(args)