lda_top_words

This is one of the test pipelines included in Pimlico’s repository. See Module test pipelines for more details.

Config file

The complete config file for this test pipeline:

# Extract lists of words from an LDA model
#
# These can be used for coherence evaluation.

[pipeline]
name=lda_top_words
release=latest

# Load trained model
[lda]
type=pimlico.datatypes.gensim.GensimLdaModel
dir=%(test_data_dir)s/datasets/gensim/lda

# Extract the top words for each topic
[top_words]
type=pimlico.modules.gensim.lda_top_words
input_model=lda
num_words=10

Modules

The following Pimlico module types are used in this pipeline: