pimlico.datatypes.formatters.tokenized module

class TokenizedDocumentFormatter(corpus, raw_data=False)
    Bases: pimlico.cli.browser.formatter.DocumentBrowserFormatter

    DATATYPE

class CharacterTokenizedDocumentFormatter(corpus, raw_data=False)
    Bases: pimlico.cli.browser.formatter.DocumentBrowserFormatter

    DATATYPE
        alias of pimlico.datatypes.tokenized.CharacterTokenizedDocumentType

class SegmentedLinesFormatter(corpus)
    Bases: pimlico.cli.browser.formatter.DocumentBrowserFormatter

    DATATYPE
        alias of pimlico.datatypes.tokenized.SegmentedLinesDocumentType