Pimlico
v0.6
Pimlico guides
Core docs
Core Pimlico modules
CAEVO event extractor
C&C parser
Stanford CoreNLP
Corpus-reading
Human-readable formatting
Corpus document list filter
Corpus split
Corpus subset
Tar archive grouper
Tar archive grouper (filter)
Corpus vocab builder
Tokenized corpus to ID mapper
Embedding feature extractors and trainers
Feature set processing
Malt dependency parser
OpenNLP modules
R interfaces
Regular expressions
Scikit-learn tools
General utilities
Visualization tools
Future plans
Pimlico API Documentation
Pimlico
Docs
»
Core Pimlico modules
»
Corpus-reading
Edit on GitHub
Corpus-reading
ΒΆ
Base modules for reading input from textual corpora.
Human-readable formatting
Corpus document list filter
Corpus split
Corpus subset
Tar archive grouper
Tar archive grouper (filter)
Corpus vocab builder
Tokenized corpus to ID mapper
Read the Docs
v: v0.6
Versions
latest
stable
v0.6
v0.5
v0.3
v0.2
Downloads
On Read the Docs
Project Home
Builds
Free document hosting provided by
Read the Docs
.