Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/marian-nmt/sentencepiece.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Expand)Author
2020-06-01Port absl::flat_hash_mapTaku Kudo
2020-05-31Use absl::flagsTaku Kudo
2020-05-23Surpress build warning, reproduced minloglevelTaku Kudo
2020-05-200.1.91 pre-releaseTaku Kudo
2020-05-17added interface to read from iterator/write to io bufferTaku Kudo
2020-05-12Fixed test failure error.Taku Kudo
2020-05-12Added new Pythonic interface.Taku Kudo
2020-05-10Added spec_parser test cases.Taku Kudo
2020-05-09Revert the default size of piece length.Taku Kudo
2020-05-09Fixed windows build failureTaku Kudo
2020-05-09Fixed windows build failureTaku Kudo
2020-05-08Fixed TF build error.Taku Kudo
2020-05-07Fixed test error.Taku Kudo
2020-05-07Initial release of 0.19. Merged internal sentencepiece.Taku Kudo
2020-04-24Prefer longest user_defined_symbol if ambigousTaku Kudo
2019-10-30Fix a typoKentaro Hayashi
2019-01-29Update trainer_interface.ccTaku Kudo
2019-01-18Update trainer_interface.ccTaku Kudo
2019-01-10remove control characters in the default nmt_* normalizersTaku Kudo
2019-01-10updated the documentTaku Kudo
2019-01-09added --treat_whitespace_as_suffix option to make _ be a suffix of word.Taku Kudo
2019-01-08emit relative path of file in LOG(INFO)Taku Kudo
2019-01-08Do not parse deprecated proto filedsTaku Kudo
2019-01-08added (Encode|Decode)AsSerializedProto interface so Python module can get ful...Taku Kudo
2019-01-08emits too-long-sentence error only onceTaku Kudo
2019-01-08use builtin protobuf-lite package in third_partyTaku Kudo
2018-12-10added third_party as a subdir of cmakeTaku Kudo
2018-12-09Fixed build break.Taku Kudo
2018-12-09deperecated mining_sentence_size and training_sentence_size. Load all sentenc...Taku Kudo
2018-12-08support to change the piece of unk/bos/eos/padTaku Kudo
2018-11-18remvoe the max size constraint for input sentence size etc.Taku Kudo
2018-11-13Fixed minor bugsTaku Kudo
2018-11-13Do not set UNUSED type to UNK/CONTROL/USER_DEFINED tokensTaku Kudo
2018-11-13Fixed build error on Ubutu trustyTaku Kudo
2018-11-11added --max_sentence_length flagTaku Kudo
2018-11-10added --split_by_number flagTaku Kudo
2018-11-09bug fix around user-defined-symbolsTaku Kudo
2018-11-08stop normalization for user_defined_symbolsTaku Kudo
2018-11-06added --minloglevel flagTaku Kudo
2018-10-28pushed new nfkc_cf.tsvTaku Kudo
2018-09-16performance tuningsrTaku Kudo
2018-09-07Added --use_all_vocab=true flag for WORD/CHAR modelTaku Kudo
2018-09-02Added self testing feature.Taku Kudo
2018-09-01Update sentencepiece_processor_test.ccTaku Kudo
2018-09-01Added is_binary flag to filesystem API.Taku Kudo
2018-09-01Introduced new Filesystem API phase 2.Taku Kudo
2018-09-01Introduced new Filesystem API.Taku Kudo
2018-08-14Fixed build break.Taku Kudo
2018-08-14Do not allow the token to include NULL character.Taku Kudo
2018-08-14Added make_py_wheel_mac.pyTaku Kudo