Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/marian-nmt/sentencepiece.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2022-10-27Merge pull request #15 from alexandremuzio/patch-1HEADmasterRoman Grundkiewicz
Rename python/VERSION to python/VERSION.txt
2022-10-25Rename VERSION to VERSION.txtalexandremuzio
Installing python package fails with below error. This change addresses this issue ``` × python setup.py egg_info did not run successfully. │ exit code: 1 ╰─> [10 lines of output] Traceback (most recent call last): File "<string>", line 2, in <module> File "<pip-setuptools-caller>", line 34, in <module> File "/home/alferre/code/sentencepiece/python/setup.py", line 111, in <module> version=version(), File "/home/alferre/code/sentencepiece/python/setup.py", line 36, in version with codecs.open('VERSION.txt', 'r', 'utf-8') as f: File "/opt/conda/envs/ptca/lib/python3.8/codecs.py", line 905, in open file = builtins.open(filename, mode, buffering) FileNotFoundError: [Errno 2] No such file or directory: 'VERSION.txt' [end of output] ```
2022-09-02Merge pull request #14 from marian-nmt/rename-versionRoman Grundkiewicz
Rename VERSION to VERSION.txt
2022-09-02Rename VERSION to VERSION.txtrename-versionRoman Grundkiewicz
2022-02-21add one header file to installationMarcin Junczys-Dowmunt
2021-08-31Fix Surface String to Token Mappings for Case Encoding (#12)Rohit Jain
Co-authored-by: Marcin Junczys-Dowmunt <marcinjd@microsoft.com> Co-authored-by: Rohit Jain <Rohit.Jain@microsoft.com>
2021-08-27Disable denormalizer flags (#13)Rohit Jain
Co-authored-by: Rohit Jain <Rohit.Jain@microsoft.com>
2021-07-16Enable toggling Case Encoding flag from C++ Train API (#11)Rohit Jain
* Enable toggling Case Encoding flag from C++ Train API * Fixing issue with hardcoding truth value of encode_decode_case flag
2021-06-16Enables --encode_unicode_case option for case-aware sentence piece (#10)Marcin Junczys-Dowmunt
* Enables --encode_unicode_case option for case-aware sentence piece * Example: This IS a TEST OF THE CASING gets converted internally to Tthis Uis a Atest of the casing before segmentation. * This is fully reversible.
2021-06-02Merge pull request #8 from rjai/rjai/spm_studentRohit Jain
Allow for generating alternatively named SPM binaries with CMake
2021-05-25Update CMakeFiles to allow for configurable artifact namesRohit Jain
2021-05-23Update cmakeRohit Jain
2021-05-23Adding alternative project name for spm latest to prevent lib conflictsRohit Jain
2020-10-24udpate clang build rule.gmasterTaku Kudo
2020-10-24Merge branch 'master' of https://github.com/google/sentencepieceTaku Kudo
2020-10-24add SetRandomGeneratorSeedTaku Kudo
2020-10-23Update README.mdTaku Kudo
2020-10-23fixed build break.Taku Kudo
2020-10-23validate the range of piece in Python moduleTaku Kudo
2020-10-22Fixed build error on Mac.Taku Kudo
2020-10-21fixed typo.Taku Kudo
2020-10-21move sentencepiece python moduel to sub directory.Taku Kudo
2020-10-21support pip install src-pckageTaku Kudo
2020-10-21Merge pull request #563 from kenhys/fix-ftbfs-portsTaku Kudo
cmake: fix FTBFS on armel, mips, powerpc, m68k and sh4
2020-10-20Merge pull request #562 from kenhys/add-multiarchTaku Kudo
cmake: use GNUInstallDirs.cmake on UNIX
2020-10-17test: update expected install pathKentaro Hayashi
2020-10-17cmake: use GNUInstallDirs.cmake on UNIXKentaro Hayashi
It enables to support Multiarch path (/usr/lib/<triplet>) well.
2020-10-17Fix FTBFS on armel, mips, powerpc, m68k and sh4Kentaro Hayashi
2020-10-17Merge pull request #556 from equivalence1/fix_readme_sil_symbolTaku Kudo
Fix space symbol in code snippet: _ -> ▁
2020-10-15Fix SIL symbol in code snippet: _ -> _Dmitrii Kravchenko
2020-10-14add support py3.9 for macTaku Kudo
2020-10-14add support py3.9 for macTaku Kudo
2020-10-14add support py3.9 for win/macTaku Kudo
2020-10-13changed macro big endianTaku Kudo
2020-10-13changed macro big endianTaku Kudo
2020-10-13changed macro big endianTaku Kudo
2020-10-13changed macro big endianTaku Kudo
2020-10-13enables s390x platformTaku Kudo
2020-10-13support big-endian architectureTaku Kudo
2020-10-13support big-endian architectureTaku Kudo
2020-10-13merges internal changes to githubTaku Kudo
2020-10-12remove s390x as it is big endianTaku Kudo
2020-10-12Fixed typo.Taku Kudo
2020-10-12Merge pull request #542 from AWSjswinney/patch-1Taku Kudo
Update make_py_wheel.sh
2020-10-12use docer for arm etcTaku Kudo
2020-10-12add arm64, ppc64le, s390x arch experimentally.Taku Kudo
2020-10-12remove tensorflow moduleTaku Kudo
2020-10-05Merge pull request #552 from guillaumekln/fix-generate-vocabularyTaku Kudo
Fix type of generate_vocabulary option
2020-10-03Fix type of generate_vocabulary optionGuillaume Klein
2020-10-02Merge pull request #551 from stephantul/masterTaku Kudo
Add options list for training to documentation