github.com/stanfordnlp/stanza.git
Age  Commit message  Author
2022-11-03  Add a flag to control how many tags to use when labeling shift transitions [con_shift_tags2]  John Bauer
2022-11-03  Move convert_trees back to transition_sequence  John Bauer
2022-11-03  Add a tag label to a Shift  John Bauer
2022-11-03  Tag dropout - turn X% of tags into UNK on each training loop  John Bauer
2022-11-03  Throw out wiki docs of length 2 as well when building a silver dataset  John Bauer
2022-11-03  Add min_len and max_len args to tokenize_wiki.py. Skip one line wiki docs, s...  John Bauer
2022-11-02  Fix format error in log line  John Bauer
2022-11-01  slice in a more generic manner when copying model. makes it easier to make f...  John Bauer
2022-11-01  Set this option in the partitioned test so that it still tests this code path...  John Bauer
2022-11-01  lattn_partitioned == False should affect the input proj dimension as well  John Bauer
2022-11-01  Add an argument for partitioning / not partitioning lattn  John Bauer
2022-11-01  Oops, this was incorrect  John Bauer
2022-11-01  Log some stats after all models are created for training (move the log line)  John Bauer
2022-11-01  Use some words from the silver dataset (currently |gold| words are added, eve...  John Bauer
2022-10-31  Add a suffix argument to the renormalize script  John Bauer
2022-10-31  Script to renormalize Vietnamese diacritics  John Bauer
2022-10-30  Add a separate argument for --silver_epoch_size, just in case people want that  John Bauer
2022-10-30  Add notes on silver words for the delta embedding  John Bauer
2022-10-30  Since we just ran into a bug where checkpoints were not correctly loaded, add...  John Bauer
2022-10-30  update comment  John Bauer
2022-10-30  Track how many batches a model gets trained for. Backdoor test for the silve...  John Bauer
2022-10-30  Rough draft of using silver trees.  John Bauer
2022-10-29  Move uses_xpos() to the model itself, add it Ensemble. Will make it easier t...  John Bauer
2022-10-29  Try smaller chunks for the parse_text. One giant chunk ran out of GPU  John Bauer
2022-10-29  Add a couple hopefully helpful log lines to the parse_text operation  John Bauer
2022-10-29  Connect model ensembles to the predict_text functionality  John Bauer
2022-10-29  oops, model was supposed to be set to eval() when run in predict_text mode  John Bauer
2022-10-29  refactor predict dir,file,format args so they can be used elsewhere if needed  John Bauer
2022-10-29  Refactor an unnecessary duplication of arguments  John Bauer
2022-10-29  Add functionality to turn a tokenized text file into a file of parse trees  John Bauer
2022-10-29  ignore em dashes in Wikipedia, as that seems to be lists  John Bauer
2022-10-29  Add a useful doc on how to build batches from tagged words  John Bauer
2022-10-29  Use reasonable defaults for EN and VI ensembles. Can add other languages as ...  John Bauer
2022-10-29  Oops, logger was missing in the retagging.py module  John Bauer
2022-10-29  A script for tokenizing a Wikipedia file and writing it out  John Bauer
2022-10-28  Accept a single file for wiki processing in selftrain_wiki.py  John Bauer
2022-10-28  fix bug in the lt/gt finding (it can start a line). use FoundationCache to s...  John Bauer
2022-10-28  Add a rotation to make N non-overlapping dev sets with the remainder being tr...  John Bauer
2022-10-28  Simplify reading & writing loop. Will make it easier to 'rotate' the dev set  John Bauer
2022-10-28  Add a prototype of model emsembling. Better would be to integrate it with ru...  John Bauer
2022-10-28  Refactor the retagging args & pipeline creation into a separate modeule  John Bauer
2022-10-28  Keep scores when parsing a block of sentences.  John Bauer
2022-10-28  fix a comment  John Bauer
2022-10-27  Order the pretrains so that resource files are made with a consistent md5sum  John Bauer
2022-10-27  Add an extraction for bartpho  John Bauer
2022-10-27  Sort files when converting VLSP22 so that output is the same across platforms  John Bauer
2022-10-27  Adjust attention masks for vi phobert  John Bauer
2022-10-27  add a line for using phobert on the vlsp tagger dataset  John Bauer
2022-10-26  Existing POS models with bert would be broken when looking at the args, so in...  John Bauer
2022-10-26  Fix retag package when using relabel_tags  John Bauer