FastText embedding reader¶
Path | pimlico.modules.input.embeddings.fasttext |
Executable | yes |
Reads in embeddings from the FastText format, storing them in the format used internally in Pimlico for embeddings.
Can be used, for example, to read the pre-trained embeddings offered by Facebook AI.
Currently only reads the text format (.vec
), not the binary format (.bin
).
See also
pimlico.modules.input.embeddings.fasttext_gensim
:- An alternative reader that uses Gensim’s FastText format reading code and permits reading from the binary format, which contains more information.
Inputs¶
No inputs
Outputs¶
Name | Type(s) |
---|---|
embeddings | Embeddings |
Options¶
Name | Description | Type |
---|---|---|
path | (required) Path to the FastText embedding file | string |
limit | Limit to the first N words. Since the files are typically ordered from most to least frequent, this limits to the N most common words | int |