Pimlico guides
Core docs
Core Pimlico modules
Corpus manipulation
Embeddings
Gensim topic modelling
Input readers
Embeddings
Text corpora
20 Newsgroups
Europarl corpus reader
Huggingface text corpus
Raw text archives
Raw text files
XML files
Malt dependency parser
NLTK
OpenNLP tools
Output modules
Scikit-learn tools
spaCy
Document-level text filters
General utilities
Visualization tools
Command-line interface
API Documentation
Module test pipelines
Example pipelines
Future plans
Pimlico
Docs
»
Core Pimlico modules
»
Input readers
»
Text corpora
»
20 Newsgroups
Edit on GitHub
20 Newsgroups
ΒΆ
20 Newsgroups fetcher (sklearn)
Read the Docs
v: latest
Versions
latest
v0.9
v0.8
v0.7
v0.6
v0.5
v0.3
v0.2
python3
Downloads
pdf
html
On Read the Docs
Project Home
Builds
Free document hosting provided by
Read the Docs
.