"""
Wrappers for several pre-processing scripts from the Moses toolkit.

Copyright © 2016-2017, Luís Gomes <luismsgomes@gmail.com>

This package provides wrappers for the following Perl scripts:

``tokenizer.perl``
    class `mosestokenizer.tokenizer.MosesTokenizer`

``detokenizer.perl``
    class `mosestokenizer.detokenizer.MosesDetokenizer`

``split-sentences.perl``
    class `mosestokenizer.sentsplitter.MosesSentenceSplitter`

``normalize-punctuation.perl``
    class `mosestokenizer.punctnormalizer.MosesPunctuationNormalizer`
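
Each wrapper drives an external Perl script. As an illustration only, the
sketch below shows one way a wrapper could keep a line-oriented child process
alive and exchange one line at a time with it. It is not this package's
implementation: ``LineToolWrapper`` and ``CHILD_ARGV`` are hypothetical names,
and a Python one-liner stands in for the Perl script so that the example runs
without Moses or Perl installed.

```python
import subprocess
import sys

# Stand-in for a Perl script (assumption): a child process that
# upper-cases each input line and flushes after every line it prints.
CHILD_ARGV = [
    sys.executable,
    "-c",
    "import sys; [print(line.strip().upper(), flush=True) for line in sys.stdin]",
]


class LineToolWrapper:
    # Hypothetical helper, not part of mosestokenizer.

    def __init__(self, argv):
        # Start the child once and keep its pipes open between calls.
        self.proc = subprocess.Popen(
            argv,
            stdin=subprocess.PIPE,
            stdout=subprocess.PIPE,
            text=True,
            bufsize=1,  # line-buffered on the parent side
        )

    def __call__(self, line):
        # Send one line, then block until the child answers with one line.
        print(line, file=self.proc.stdin, flush=True)
        return self.proc.stdout.readline().rstrip()

    def close(self):
        # Closing stdin signals end of input, so the child exits its loop.
        self.proc.stdin.close()
        self.proc.wait()
        self.proc.stdout.close()

    def __enter__(self):
        return self

    def __exit__(self, *exc_info):
        self.close()


with LineToolWrapper(CHILD_ARGV) as shout:
    first = shout("hello world")   # "HELLO WORLD"
    second = shout("moses")        # "MOSES"
```

Keeping a single child process alive avoids paying the interpreter start-up
cost on every call, which matters when processing many short lines. The
pattern only works if the child flushes its output after each line.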

"""

from mosestokenizer.tokenizer import MosesTokenizer
from mosestokenizer.detokenizer import MosesDetokenizer
from mosestokenizer.sentsplitter import MosesSentenceSplitter
from mosestokenizer.punctnormalizer import MosesPunctuationNormalizer

__version__ = "1.0.0"

__all__ = [
    "MosesTokenizer",
    "MosesDetokenizer",
    "MosesSentenceSplitter",
    "MosesPunctuationNormalizer",
]