Age | Commit message (Expand) | Author |
2022-11-04 | Adjust learning rate, don't print out infinite ppltrans_lm | John Bauer |
2022-11-04 | sum losses when training by length. use sum loss instead of mean so that unn... | John Bauer |
2022-11-04 | Have defaults for both IT and VI | John Bauer |
2022-11-04 | log training args in the trans_lm | John Bauer |
2022-11-04 | attempt to use transformer lm reranker | John Bauer |
2022-11-04 | maybe use a TQDM when scoring stuff | John Bauer |
2022-11-04 | Small end to end unit test | John Bauer |
2022-11-04 | Put the rest of the model config into args and save the config. | John Bauer |
2022-11-04 | Load in a parse tree LM dataset | John Bauer |
2022-11-04 | Code changes to make the demo work - can refactor things to make loading and ... | John Bauer |
2022-11-04 | Copy chunks of tutorial from pytorch.org as a module - does not run yet becau... | John Bauer |
2022-11-03 | Add min_len and max_len args to tokenize_wiki.py. Skip one line wiki docs, s... | John Bauer |
2022-11-02 | Fix format error in log line | John Bauer |
2022-11-01 | slice in a more generic manner when copying model. makes it easier to make f... | John Bauer |
2022-11-01 | Set this option in the partitioned test so that it still tests this code path... | John Bauer |
2022-11-01 | lattn_partitioned == False should affect the input proj dimension as well | John Bauer |
2022-11-01 | Add an argument for partitioning / not partitioning lattn | John Bauer |
2022-11-01 | Oops, this was incorrect | John Bauer |
2022-11-01 | Log some stats after all models are created for training (move the log line) | John Bauer |
2022-11-01 | Use some words from the silver dataset (currently |gold| words are added, eve... | John Bauer |
2022-10-31 | Add a suffix argument to the renormalize script | John Bauer |
2022-10-31 | Script to renormalize Vietnamese diacritics | John Bauer |
2022-10-30 | Add a separate argument for --silver_epoch_size, just in case people want that | John Bauer |
2022-10-30 | Add notes on silver words for the delta embedding | John Bauer |
2022-10-30 | Since we just ran into a bug where checkpoints were not correctly loaded, add... | John Bauer |
2022-10-30 | update comment | John Bauer |
2022-10-30 | Track how many batches a model gets trained for. Backdoor test for the silve... | John Bauer |
2022-10-30 | Rough draft of using silver trees. | John Bauer |
2022-10-29 | Move uses_xpos() to the model itself, add it Ensemble. Will make it easier t... | John Bauer |
2022-10-29 | Try smaller chunks for the parse_text. One giant chunk ran out of GPU | John Bauer |
2022-10-29 | Add a couple hopefully helpful log lines to the parse_text operation | John Bauer |
2022-10-29 | Connect model ensembles to the predict_text functionality | John Bauer |
2022-10-29 | oops, model was supposed to be set to eval() when run in predict_text mode | John Bauer |
2022-10-29 | refactor predict dir,file,format args so they can be used elsewhere if needed | John Bauer |
2022-10-29 | Refactor an unnecessary duplication of arguments | John Bauer |
2022-10-29 | Add functionality to turn a tokenized text file into a file of parse trees | John Bauer |
2022-10-29 | ignore em dashes in Wikipedia, as that seems to be lists | John Bauer |
2022-10-29 | Add a useful doc on how to build batches from tagged words | John Bauer |
2022-10-29 | Use reasonable defaults for EN and VI ensembles. Can add other languages as ... | John Bauer |
2022-10-29 | Oops, logger was missing in the retagging.py module | John Bauer |
2022-10-29 | A script for tokenizing a Wikipedia file and writing it out | John Bauer |
2022-10-28 | Accept a single file for wiki processing in selftrain_wiki.py | John Bauer |
2022-10-28 | fix bug in the lt/gt finding (it can start a line). use FoundationCache to s... | John Bauer |
2022-10-28 | Add a rotation to make N non-overlapping dev sets with the remainder being tr... | John Bauer |
2022-10-28 | Simplify reading & writing loop. Will make it easier to 'rotate' the dev set | John Bauer |
2022-10-28 | Add a prototype of model emsembling. Better would be to integrate it with ru... | John Bauer |
2022-10-28 | Refactor the retagging args & pipeline creation into a separate modeule | John Bauer |
2022-10-28 | Keep scores when parsing a block of sentences. | John Bauer |
2022-10-28 | fix a comment | John Bauer |
2022-10-27 | Order the pretrains so that resource files are made with a consistent md5sum | John Bauer |