Welcome to
mirror list
, hosted at
ThFree Co
, Russian Federation.
github.com/marian-nmt/sentencepiece.git - Unnamed repository; edit this file 'description' to name the repository.
index
:
github.com/marian-nmt/sentencepiece.git
anthonyaue/document_minexport_build
anthonyaue/remove_cr
anthonyaue/test_change
casing
gmaster
master
mjd/base64
mjd/casing
mjd/casing2
mjd/casing3
mjd/oldmaster
mjd/oldmaster2
noproto
rename-version
rjai/casing
rjai/fix_case_encoding_arg
sr
zhaogao/modify_batch_file
Unnamed repository; edit this file 'description' to name the repository.
www-data
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
Age
Commit message (
Expand
)
Author
2022-02-21
add one header file to installation
Marcin Junczys-Dowmunt
2021-08-31
Fix Surface String to Token Mappings for Case Encoding (#12)
Rohit Jain
2021-08-27
Disable denormalizer flags (#13)
Rohit Jain
2021-07-16
Enable toggling Case Encoding flag from C++ Train API (#11)
Rohit Jain
2021-06-16
Enables --encode_unicode_case option for case-aware sentence piece (#10)
Marcin Junczys-Dowmunt
2020-10-24
add SetRandomGeneratorSeed
Taku Kudo
2020-10-23
fixed build break.
Taku Kudo
2020-10-23
validate the range of piece in Python module
Taku Kudo
2020-10-21
fixed typo.
Taku Kudo
2020-10-21
move sentencepiece python moduel to sub directory.
Taku Kudo
2020-10-17
Fix FTBFS on armel, mips, powerpc, m68k and sh4
Kentaro Hayashi
2020-10-13
changed macro big endian
Taku Kudo
2020-10-13
changed macro big endian
Taku Kudo
2020-10-13
changed macro big endian
Taku Kudo
2020-10-13
changed macro big endian
Taku Kudo
2020-10-13
support big-endian architecture
Taku Kudo
2020-10-13
support big-endian architecture
Taku Kudo
2020-10-13
merges internal changes to github
Taku Kudo
2020-10-03
Fix type of generate_vocabulary option
Guillaume Klein
2020-09-04
clear description for alpha of BPE-dropout
zengl
2020-08-27
fix typo
Yohei Tamura
2020-06-26
Added split_digits to SentencePieceTrainer
mingruimingrui
2020-06-16
rollback proto version
Taku Kudo
2020-06-08
upgrade protobuf
Taku Kudo
2020-06-08
Fixed compile error on Solaris.
Taku Kudo
2020-06-02
Fix build break.
Taku Kudo
2020-06-01
Port absl::flat_hash_map
Taku Kudo
2020-05-31
Use absl::flags
Taku Kudo
2020-05-23
Surpress build warning, reproduced minloglevel
Taku Kudo
2020-05-20
0.1.91 pre-release
Taku Kudo
2020-05-17
added interface to read from iterator/write to io buffer
Taku Kudo
2020-05-12
Fixed test failure error.
Taku Kudo
2020-05-12
Added new Pythonic interface.
Taku Kudo
2020-05-10
Added spec_parser test cases.
Taku Kudo
2020-05-09
Revert the default size of piece length.
Taku Kudo
2020-05-09
Fixed windows build failure
Taku Kudo
2020-05-09
Fixed windows build failure
Taku Kudo
2020-05-08
Fixed TF build error.
Taku Kudo
2020-05-07
Fixed test error.
Taku Kudo
2020-05-07
Initial release of 0.19. Merged internal sentencepiece.
Taku Kudo
2020-04-24
Prefer longest user_defined_symbol if ambigous
Taku Kudo
2019-10-30
Fix a typo
Kentaro Hayashi
2019-01-29
Update trainer_interface.cc
Taku Kudo
2019-01-18
Update trainer_interface.cc
Taku Kudo
2019-01-10
remove control characters in the default nmt_* normalizers
Taku Kudo
2019-01-10
updated the document
Taku Kudo
2019-01-09
added --treat_whitespace_as_suffix option to make _ be a suffix of word.
Taku Kudo
2019-01-08
emit relative path of file in LOG(INFO)
Taku Kudo
2019-01-08
Do not parse deprecated proto fileds
Taku Kudo
2019-01-08
added (Encode|Decode)AsSerializedProto interface so Python module can get ful...
Taku Kudo
[next]