Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/marian-nmt/sentencepiece.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Expand)Author
2018-12-08support to change the piece of unk/bos/eos/padTaku Kudo
2018-11-18remvoe the max size constraint for input sentence size etc.Taku Kudo
2018-11-13Fixed minor bugsTaku Kudo
2018-11-13Do not set UNUSED type to UNK/CONTROL/USER_DEFINED tokensTaku Kudo
2018-11-13Fixed build error on Ubutu trustyTaku Kudo
2018-11-11added --max_sentence_length flagTaku Kudo
2018-11-10added --split_by_number flagTaku Kudo
2018-11-09bug fix around user-defined-symbolsTaku Kudo
2018-11-08stop normalization for user_defined_symbolsTaku Kudo
2018-11-06added --minloglevel flagTaku Kudo
2018-10-28pushed new nfkc_cf.tsvTaku Kudo
2018-09-16performance tuningsrTaku Kudo
2018-09-07Added --use_all_vocab=true flag for WORD/CHAR modelTaku Kudo
2018-09-02Added self testing feature.Taku Kudo
2018-09-01Update sentencepiece_processor_test.ccTaku Kudo
2018-09-01Added is_binary flag to filesystem API.Taku Kudo
2018-09-01Introduced new Filesystem API phase 2.Taku Kudo
2018-09-01Introduced new Filesystem API.Taku Kudo
2018-08-14Fixed build break.Taku Kudo
2018-08-14Do not allow the token to include NULL character.Taku Kudo
2018-08-14Added make_py_wheel_mac.pyTaku Kudo
2018-08-09fixed build errorTaku Kudo
2018-08-06Fix for WindowsTaku Kudo
2018-08-05Enable unittest on WindowsTaku Kudo
2018-08-03Added JoinPath and StrCatTaku Kudo
2018-08-01Added u8 literal to UTF8 string.Taku Kudo
2018-08-01Minor fixes for windowsTaku Kudo
2018-08-01Support to load big blob data for Windows.Taku Kudo
2018-08-01Updated version numberTaku Kudo
2018-07-31Prepare windows buildTaku Kudo
2018-07-31Remove setjmp/longjmpTaku Kudo
2018-07-27invalid test causing all pieces to be identified as non valid because on firs...Jean A. Senellart
2018-07-26Added --unk_surface option to allow user to change unknown surface string.Taku Kudo
2018-07-25Add messages to tcmallocTaku Kudo
2018-07-24Switched to cmakeTaku Kudo
2018-07-12Added new API to get bos/eos/unk/pad idsTaku Kudo
2018-06-29Changed the hard-limit of --mining_sentence_sizeTaku Kudo
2018-06-29Added normalization with Unicode case foldingTaku Kudo
2018-06-22Add LoadFromSerialiedProtoTaku Kudo
2018-06-20Fixes the usage of strerror_rTaku Kudo
2018-06-19Avoids copy in Python2 Unicode mode.Taku Kudo
2018-06-19Update sentencepiece_processor.hTaku Kudo
2018-06-18Fixed build error on clang.Taku Kudo
2018-06-18Introduced minimum string_wrapper to remove extra string copyTaku Kudo
2018-06-18remove src/stringpiece.hTaku Kudo
2018-06-18Uses abs::string_view instead of StringPieceTaku Kudo
2018-06-17Minor fixesTaku Kudo
2018-06-16Support snake case in Python moduleTaku Kudo
2018-06-16Minor fixes.Taku Kudo
2018-06-11Minor style fixes.Taku Kudo