diff options
author | Jaume Zaragoza <ZJaume@users.noreply.github.com> | 2021-06-15 10:28:55 +0300 |
---|---|---|
committer | GitHub <noreply@github.com> | 2021-06-15 10:28:55 +0300 |
commit | d36b1548478debfad02638f1187a6e23c96a6471 (patch) | |
tree | d65277f0112ba7f02507ab0c3bea1cde08f5468a | |
parent | 7dcc0fef9d0304f97c5805eeb5ce2afb6db69c8a (diff) |
Update README
-rw-r--r-- | README.md | 4 |
1 files changed, 2 insertions, 2 deletions
@@ -8,7 +8,7 @@ indicates the likelihood of a pair of sentences being mutual translations (with Sentence pairs considered very noisy are scored with 0. Although a training tool (`bicleaner-ai-train`) is provided, you may want to use the available ready-to-use language packages. -Please, visit https://github.com/bitextor/bicleaner-data/releases/latest or use `./utils/download-pack.sh` to download the latest language packages. +Please, visit https://github.com/bitextor/bicleaner--ai-data/releases/latest or use `./utils/download-pack.sh` to download the latest language packages. Visit our [Wiki](https://github.com/bitextor/bicleaner-ai/wiki/How-to-train-your-Bicleaner-AI) for a detailed example on Bicleaner training. ## What is New? @@ -105,7 +105,7 @@ bicleaner-ai-classify [-h] * positional arguments: * `input`: Tab-separated files to be classified (default line format: `URL1 URL2 SOURCE_SENTENCE TARGET_SENTENCE [EXTRA_COLUMNS]`, tab-separated). When input is -, reads standard input. * `output`: Output of the classification (default: standard output). When output is -, writes standard output. - * `metadata`: Training metadata (YAML file), generated by `bicleaner-ai-train` or [downloaded](https://github.com/bitextor/bicleaner-data/releases/latest) as a part of a language pack. You just need to `untar` the language pack for the pair of languages of the file you want to clean. The tar file contains the YAML metadata file. + * `metadata`: Training metadata (YAML file), generated by `bicleaner-ai-train` or [downloaded](https://github.com/bitextor/bicleaner-ai-data/releases/latest) as a part of a language pack. You just need to `untar` the language pack for the pair of languages of the file you want to clean. The tar file contains the YAML metadata file. There's a script that can download and unpack it for you, use: ```bash $ ./utils/download-pack.sh en cs ./models |