github.com/MaartenGr/KeyBERT.git
Age | Commit message | Author
2022-11-03 | v0.7 (#135) [HEAD, v0.7.0, master] | Maarten Grootendorst
  * Added option to extract and pass word/document embeddings for faster iteration
  * Focused on making the documentation a bit nicer (visualizations, etc.)
  * Fixed #71
  * Fixed #122, #136
2022-07-27 | v0.6.0 (#120) [v0.6.0] | Maarten Grootendorst
  * Major speedup, up to 2x to 5x when passing multiple documents (for MMR and MaxSum) compared to single documents
  * Same results whether passing a single document or multiple documents
  * MMR and MaxSum now work when passing a single document or multiple documents
  * Improved documentation
  * Added 🤗 Hugging Face Transformers
  * Highlighting support for Chinese texts
  * Now uses the CountVectorizer for creating the tokens
  * This should also improve the highlighting for most applications and higher n-grams
  * Fix #106
  * Fix #116
2022-07-11 | Add missing optional dependencies (gensim, use, and spacy) (#114) | yusuke1997
2022-03-31 | v0.5.1 (#97) [v0.5.1] | Maarten Grootendorst
  * Added CountVectorizer tips and tricks page, including `KeyphraseVectorizers`
  * Added general styling: `black`, `flake8`, `pre-commit`
2021-09-28 | v0.5 (#58) [v0.5.0] | Maarten Grootendorst
  * Guided KeyBERT
  * Update default SBERT model
2021-06-30 | v0.4 (#43) [v0.4.0] | Maarten Grootendorst
  * Use paraphrase-MiniLM-L6-v2 as the default embedding model
  * Highlight a document's keywords
  * Added FAQ
2021-05-10 | v0.3 (#32) [v0.3.0] | Maarten Grootendorst
  * Use candidate words instead of extracting those from the documents
  * Spacy, Gensim, USE, and Custom Backends were added
  * Improved imports
  * Fix encoding error when locally installing KeyBERT #30
  * Improved documentation (ReadMe & MKDocs)
  * Add the main tutorial as a shield
  * Typos #31, #35
2021-02-09 | v0.2 (#23) [v0.2.0] | Maarten Grootendorst
  * Add similarity scores to the output
  * Add Flair as a possible back-end
  * Update documentation + improved testing
2020-12-02 | Add custom countvectorizer (#14) | Maarten Grootendorst
2020-10-28 | Add diversity parameter in max sum sim (#7) [v0.1.2] | Maarten Grootendorst
  * Add diversity parameter in max sum sim
  * Remove 3.7 testing due to timeout errors
2020-10-27 | Added max sum similarity (#6) | Maarten Grootendorst
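
For readers unfamiliar with the term, max sum similarity can be sketched roughly as follows: keep the candidates most similar to the document, then pick the subset whose members are least similar to one another. This is an illustrative pure-Python sketch and not the code from this commit; the function name `max_sum_selection`, the `nr_candidates` parameter, and the toy vectors are assumptions made for the example.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def max_sum_selection(doc_vec, cand_vecs, candidates, top_n=2, nr_candidates=3):
    """Hypothetical helper: choose top_n candidates that are relevant yet mutually dissimilar."""
    # Step 1: rank candidates by similarity to the document and keep a small pool.
    relevance = [cosine(doc_vec, v) for v in cand_vecs]
    pool = sorted(range(len(candidates)), key=lambda i: relevance[i], reverse=True)
    pool = pool[:nr_candidates]
    if len(pool) <= top_n:
        return [candidates[i] for i in pool]

    # Step 2: among all top_n-sized subsets of the pool, take the one whose
    # summed pairwise similarity is lowest (i.e. the most diverse subset).
    def pairwise_sum(combo):
        return sum(cosine(cand_vecs[i], cand_vecs[j]) for i, j in combinations(combo, 2))

    best = min(combinations(pool, top_n), key=pairwise_sum)
    return [candidates[i] for i in best]


# Toy 2-D "embeddings" (made up for illustration): "a" and "b" are near-duplicates.
doc = [1.0, 0.0]
words = ["a", "b", "c"]
vecs = [[1.0, 0.1], [1.0, 0.2], [0.5, 1.0]]
print(max_sum_selection(doc, vecs, words, top_n=2, nr_candidates=3))
```

A larger pool gives the search more room to trade relevance for diversity, at the cost of checking more combinations.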
2020-10-27 | Add MMR (#4) | Maarten Grootendorst
  * Added MMR
  * Update documentation and pypi version
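
As background on MMR (Maximal Marginal Relevance), the idea can be sketched as a greedy loop: start with the candidate most similar to the document, then repeatedly add the candidate that best balances relevance to the document against similarity to what is already selected. The sketch below is a generic pure-Python illustration and not the implementation added in this commit; the function name `mmr`, the `diversity` weighting, and the toy vectors are assumptions.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def mmr(doc_vec, cand_vecs, candidates, top_n=2, diversity=0.5):
    """Hypothetical helper: greedy MMR selection; diversity is assumed to lie in [0, 1]."""
    if not candidates:
        return []
    relevance = [cosine(doc_vec, v) for v in cand_vecs]

    # Seed with the single most relevant candidate.
    first = max(range(len(candidates)), key=lambda i: relevance[i])
    selected = [first]
    remaining = [i for i in range(len(candidates)) if i != first]

    while remaining and len(selected) < top_n:
        def score(i):
            # Penalise a candidate by its closest match among already selected ones.
            redundancy = max(cosine(cand_vecs[i], cand_vecs[j]) for j in selected)
            return (1 - diversity) * relevance[i] - diversity * redundancy

        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)

    return [candidates[i] for i in selected]


# Toy 2-D "embeddings" (made up for illustration): "a" and "b" are near-duplicates.
doc = [1.0, 0.0]
words = ["a", "b", "c"]
vecs = [[1.0, 0.1], [1.0, 0.2], [0.5, 1.0]]
print(mmr(doc, vecs, words, top_n=2, diversity=0.0))
print(mmr(doc, vecs, words, top_n=2, diversity=0.9))
```

With `diversity=0.0` the loop reduces to plain relevance ranking; raising it pushes the selection away from near-duplicate candidates.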
2020-10-22 | Init commit | MaartenGr