From 66001fb7156999c8313855b2fabfd1259355b78a Mon Sep 17 00:00:00 2001
From: ZJaume
Date: Wed, 27 Jul 2022 11:12:23 +0000
Subject: Update hardrules and installation instructions

---
 README.md        | 17 ++++-------------
 requirements.txt |  2 +-
 2 files changed, 5 insertions(+), 14 deletions(-)

diff --git a/README.md b/README.md
index be46422..9693db7 100644
--- a/README.md
+++ b/README.md
@@ -47,21 +47,12 @@ The use of XLMRoberta and 1:10 positive to negative ratio were inspired in the w
 - TensorFlow >= 2.6.5
 - CUDA 11.2 (for training and inference with full models)
 
-Bicleaner AI is written in Python and can be installed using `pip`:
+Bicleaner AI is written in Python and can be installed using `pip`.
+It also requires the [KenLM](https://github.com/kpu/kenlm) Python bindings with support for 7-gram language models.
+You can easily install it by running the following command:
 
 ```bash
-pip install bicleaner-ai
-```
-
-Bicleaner AI requires the [KenLM](https://github.com/kpu/kenlm) Python bindings with support for 7-gram language models. You can easily install it by running the following commands:
-
-```bash
-git clone https://github.com/kpu/kenlm
-cd kenlm
-pip install . --install-option="--max_order 7"
-mkdir -p build && cd build
-cmake .. -DKENLM_MAX_ORDER=7 -DCMAKE_INSTALL_PREFIX:PATH=/your/prefix/path
-make -j all install
+pip install bicleaner-ai https://github.com/kpu/kenlm/archive/master.zip --install-option="--max_order 7"
 ```
 
 Hardrules uses [FastSpell](https://github.com/mbanon/fastspell) that requires `python-dev` and `libhunspell-dev`:
diff --git a/requirements.txt b/requirements.txt
index 1d07d4b..8373732 100644
--- a/requirements.txt
+++ b/requirements.txt
@@ -5,7 +5,7 @@ pytest
 toolwrapper
 joblib
 sacremoses
-bicleaner-hardrules>=2.1
+bicleaner-hardrules>=2.3,<3.0
 sentencepiece
 tensorflow>=2.6.5
 glove-python-binary==0.2.0
-- 
cgit v1.2.3