Tokenized corpus to ID mapper
Path | pimlico.modules.corpora.vocab_mapper |
Executable | yes |
Inputs
Name | Type(s) |
---|---|
text | TarredCorpus<TokenizedDocumentType> |
vocab | Dictionary |
Outputs
Name | Type(s) |
---|---|
ids | IntegerListsDocumentCorpus |
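
The inputs and outputs above suggest the general idea: each token of a tokenized document is looked up in a vocabulary and replaced by its integer ID. The following is a minimal sketch of that idea in plain Python, not the module's actual implementation. The function name `map_tokens_to_ids`, the use of a plain `dict` in place of a `Dictionary`, and the `oov_id` parameter for out-of-vocabulary tokens are all assumptions made for illustration; how the real module treats unknown tokens is not stated here.

```python
from typing import Dict, Iterable, List, Optional


def map_tokens_to_ids(
    sentences: Iterable[Iterable[str]],
    vocab: Dict[str, int],
    oov_id: Optional[int] = None,
) -> List[List[int]]:
    """Map each token in a tokenized document to its vocabulary ID.

    Hypothetical helper: `vocab` is a plain token -> ID dict standing in
    for the vocab input. Tokens missing from `vocab` are replaced by
    `oov_id` if one is given, and dropped otherwise (an assumption).
    """
    mapped = []
    for sentence in sentences:
        ids = []
        for token in sentence:
            if token in vocab:
                ids.append(vocab[token])
            elif oov_id is not None:
                ids.append(oov_id)
        mapped.append(ids)
    return mapped


# A toy document: a list of sentences, each a list of tokens
vocab = {"the": 0, "cat": 1, "sat": 2}
doc = [["the", "cat", "sat"], ["the", "dog"]]

print(map_tokens_to_ids(doc, vocab))            # [[0, 1, 2], [0]]
print(map_tokens_to_ids(doc, vocab, oov_id=3))  # [[0, 1, 2], [0, 3]]
```

The result is one list of integers per sentence, which is the shape the name of the output type (integer lists per document) suggests.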