Splitting¶
summ.splitter.Splitter
¶
Splitters are responsible for taking a file and splitting it into a list of documents (chunks).
By defauly, we just split on double-newlines (paragraphs).
Source code in summ/splitter/splitter.py
20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 |
|
wrap(other: Splitter) -> Splitter
classmethod
¶
Wrap an existing splitter to chain processing.
Source code in summ/splitter/splitter.py
26 27 28 29 30 31 32 33 34 35 36 37 38 39 |
|
summ.splitter.OtterSplitter
¶
Bases: Splitter
Adds Splitter support for transcripts exported from otter.ai.
To filter out your own remarks, pass a list of speakers to exclude.
Source code in summ/splitter/otter.py
4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 |
|