Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/stanfordnlp/stanza.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2022-11-04Adjust learning rate, don't print out infinite ppltrans_lmJohn Bauer
2022-11-04sum losses when training by length. use sum loss instead of mean so that unn...John Bauer
2022-11-04Have defaults for both IT and VIJohn Bauer
2022-11-04log training args in the trans_lmJohn Bauer
2022-11-04attempt to use transformer lm rerankerJohn Bauer
2022-11-04maybe use a TQDM when scoring stuffJohn Bauer
2022-11-04Small end to end unit testJohn Bauer
2022-11-04Put the rest of the model config into args and save the config.John Bauer
2022-11-04Load in a parse tree LM datasetJohn Bauer
2022-11-04Code changes to make the demo work - can refactor things to make loading and ...John Bauer
2022-11-04Copy chunks of tutorial from pytorch.org as a module - does not run yet becau...John Bauer
2022-11-03Add min_len and max_len args to tokenize_wiki.py. Skip one line wiki docs, s...John Bauer
2022-11-02Fix format error in log lineJohn Bauer
2022-11-01slice in a more generic manner when copying model. makes it easier to make f...John Bauer
2022-11-01Set this option in the partitioned test so that it still tests this code path...John Bauer
2022-11-01lattn_partitioned == False should affect the input proj dimension as wellJohn Bauer
2022-11-01Add an argument for partitioning / not partitioning lattnJohn Bauer
2022-11-01Oops, this was incorrectJohn Bauer
2022-11-01Log some stats after all models are created for training (move the log line)John Bauer
2022-11-01Use some words from the silver dataset (currently |gold| words are added, eve...John Bauer
2022-10-31Add a suffix argument to the renormalize scriptJohn Bauer
2022-10-31Script to renormalize Vietnamese diacriticsJohn Bauer
2022-10-30Add a separate argument for --silver_epoch_size, just in case people want thatJohn Bauer
2022-10-30Add notes on silver words for the delta embeddingJohn Bauer
2022-10-30Since we just ran into a bug where checkpoints were not correctly loaded, add...John Bauer
2022-10-30update commentJohn Bauer
2022-10-30Track how many batches a model gets trained for. Backdoor test for the silve...John Bauer
2022-10-30Rough draft of using silver trees.John Bauer
2022-10-29Move uses_xpos() to the model itself, add it Ensemble. Will make it easier t...John Bauer
2022-10-29Try smaller chunks for the parse_text. One giant chunk ran out of GPUJohn Bauer
2022-10-29Add a couple hopefully helpful log lines to the parse_text operationJohn Bauer
2022-10-29Connect model ensembles to the predict_text functionalityJohn Bauer
2022-10-29oops, model was supposed to be set to eval() when run in predict_text modeJohn Bauer
2022-10-29refactor predict dir,file,format args so they can be used elsewhere if neededJohn Bauer
2022-10-29Refactor an unnecessary duplication of argumentsJohn Bauer
2022-10-29Add functionality to turn a tokenized text file into a file of parse treesJohn Bauer
2022-10-29ignore em dashes in Wikipedia, as that seems to be listsJohn Bauer
2022-10-29Add a useful doc on how to build batches from tagged wordsJohn Bauer
2022-10-29Use reasonable defaults for EN and VI ensembles. Can add other languages as ...John Bauer
2022-10-29Oops, logger was missing in the retagging.py moduleJohn Bauer
2022-10-29A script for tokenizing a Wikipedia file and writing it outJohn Bauer
2022-10-28Accept a single file for wiki processing in selftrain_wiki.pyJohn Bauer
2022-10-28fix bug in the lt/gt finding (it can start a line). use FoundationCache to s...John Bauer
2022-10-28Add a rotation to make N non-overlapping dev sets with the remainder being tr...John Bauer
2022-10-28Simplify reading & writing loop. Will make it easier to 'rotate' the dev setJohn Bauer
2022-10-28Add a prototype of model emsembling. Better would be to integrate it with ru...John Bauer
2022-10-28Refactor the retagging args & pipeline creation into a separate modeuleJohn Bauer
2022-10-28Keep scores when parsing a block of sentences.John Bauer
2022-10-28fix a commentJohn Bauer
2022-10-27Order the pretrains so that resource files are made with a consistent md5sumJohn Bauer