Welcome to
mirror list
, hosted at
ThFree Co
, Russian Federation.
github.com/marian-nmt/sentencepiece.git - Unnamed repository; edit this file 'description' to name the repository.
index
:
github.com/marian-nmt/sentencepiece.git
anthonyaue/document_minexport_build
anthonyaue/remove_cr
anthonyaue/test_change
casing
gmaster
master
mjd/base64
mjd/casing
mjd/casing2
mjd/casing3
mjd/oldmaster
mjd/oldmaster2
noproto
rename-version
rjai/casing
rjai/fix_case_encoding_arg
sr
zhaogao/modify_batch_file
Unnamed repository; edit this file 'description' to name the repository.
www-data
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
Age
Commit message (
Expand
)
Author
2018-08-03
Added JoinPath and StrCat
Taku Kudo
2018-08-01
Added u8 literal to UTF8 string.
Taku Kudo
2018-08-01
Minor fixes for windows
Taku Kudo
2018-08-01
Support to load big blob data for Windows.
Taku Kudo
2018-08-01
Updated version number
Taku Kudo
2018-07-31
Prepare windows build
Taku Kudo
2018-07-31
Remove setjmp/longjmp
Taku Kudo
2018-07-27
invalid test causing all pieces to be identified as non valid because on firs...
Jean A. Senellart
2018-07-26
Added --unk_surface option to allow user to change unknown surface string.
Taku Kudo
2018-07-25
Add messages to tcmalloc
Taku Kudo
2018-07-24
Switched to cmake
Taku Kudo
2018-07-12
Added new API to get bos/eos/unk/pad ids
Taku Kudo
2018-06-29
Changed the hard-limit of --mining_sentence_size
Taku Kudo
2018-06-29
Added normalization with Unicode case folding
Taku Kudo
2018-06-22
Add LoadFromSerialiedProto
Taku Kudo
2018-06-20
Fixes the usage of strerror_r
Taku Kudo
2018-06-19
Avoids copy in Python2 Unicode mode.
Taku Kudo
2018-06-19
Update sentencepiece_processor.h
Taku Kudo
2018-06-18
Fixed build error on clang.
Taku Kudo
2018-06-18
Introduced minimum string_wrapper to remove extra string copy
Taku Kudo
2018-06-18
remove src/stringpiece.h
Taku Kudo
2018-06-18
Uses abs::string_view instead of StringPiece
Taku Kudo
2018-06-17
Minor fixes
Taku Kudo
2018-06-16
Support snake case in Python module
Taku Kudo
2018-06-16
Minor fixes.
Taku Kudo
2018-06-11
Minor style fixes.
Taku Kudo
2018-06-11
Support an empty normalziation and other minor fixes
Taku Kudo
2018-06-11
added missing include
Taku Kudo
2018-06-09
Uses NMT_NFKC rule by default.
Taku Kudo
2018-06-08
Allows to define duplicated user defined symbols
Taku Kudo
2018-06-07
Support user defined symbols in Char/BPE
Taku Kudo
2018-06-06
Added --generate_vocabulary option to spm_encode
Taku Kudo
2018-06-06
Support vocab restriction feature in BPE model.
Taku Kudo
2018-06-06
Support vocab restriction feature
Taku Kudo
2018-06-04
Minor style fixes
Taku Kudo
2018-06-04
Updated normalizer
Taku Kudo
2018-05-13
Made DecodeUTF8 more strict.
Taku Kudo
2018-05-11
s/PopulateNormalizationSpec/PopulateNormalizerSpec/
Taku Kudo
2018-05-11
Fixed build errors
Taku Kudo
2018-05-10
CHECK to util::Status migration for Builder
Taku Kudo
2018-05-06
CHECK to Status migration for Trainer.
Taku Kudo
2018-05-05
Changed the Makefile rule for protobuf
Taku Kudo
2018-05-03
Update sentencepiece_processor.h
Taku Kudo
2018-05-01
Set normalization_rule in once place
Taku Kudo
2018-04-30
Reimplement Trainer with Proto reflection
Taku Kudo
2018-04-28
Uses util::Status to propagate error messages
Taku Kudo
2018-04-18
Fix typo
Graham Neubig
2018-04-17
Moved the spec verifier and increases the sentencepiece_length param
Taku Kudo
2018-04-16
Add --hard_vocab_limit flag.
Taku Kudo
2018-04-09
Merge pull request #53 from google/sr
Taku Kudo
[next]