github.com/stanfordnlp/stanza.git
Age | Commit message | Author
2022-05-19 | Simplify - can use torch tensors directly rather than first creating np arrays [refactor_dataloader] | John Bauer
2022-05-19 | Pytorch dataloader | John Bauer
2022-05-19 | Start to refactor pieces of the tokenizer dataset into pieces so we can make ... | John Bauer
2022-05-19 | numpy instead of torch is slightly faster in the small sentence regime, very ... | John Bauer
2022-05-17 | labels is unused in tokenizer predict | John Bauer
2022-05-17 | Remove unused import | John Bauer
2022-05-16 | Add the skip_newlines test to the file reading version of the tokenizer data ... | John Bauer
2022-05-16 | Abstract away labels() rather than having the eval code know the format of th... | John Bauer
2022-05-16 | more specific Exception type | John Bauer
2022-05-16 | Get rid of the input_data field - was only used for tests, and the tests don'... | John Bauer
2022-05-16 | For the MWT test, use the fake tokenizer files rather than putting in the fak... | John Bauer
2022-05-16 | Factor out a method to write the input to temp files in a tokenizer test | John Bauer
2022-05-16 | Add a tiny bit of doc | John Bauer
2022-05-16 | Merge pull request #1029 from stanfordnlp/refactor_tok | John Bauer
2022-05-16 | Run some basic tests on the dictionary in the ZH tokenizer | John Bauer
2022-05-16 | Rearrange - not necessary for this to be an inner function | John Bauer
2022-05-16 | whitespace | John Bauer
2022-05-16 | torch tensors instead of lists of numbers | John Bauer
2022-05-15 | Separate out the label creation - no need to make a fake string of 0s at runtime | John Bauer
2022-05-15 | Add a test which pokes the DataLoader object to make sure it is processing da... | John Bauer
2022-05-15 | Separate the next() functionality which advances an unfinished batch into a s... | John Bauer
2022-05-14 | whitespace update | John Bauer
2022-05-14 | use save_name for consistency | John Bauer
2022-05-14 | Use save_name to load if load_name is not set | John Bauer
2022-05-14 | zh -> zh-hans to match the names of other models | John Bauer
2022-05-14 | Use weighted_f1 by default to pick a best model | John Bauer
2022-05-14 | Oops, got the test file name mixed up with train | John Bauer
2022-05-14 | Add bert to sentiment training. Includes loading it in pipelines and at test... | John Bauer
2022-05-14 | Log average loss even when doing an interim analysis | John Bauer
2022-05-14 | Simplify loading of charlm in sentiment | John Bauer
2022-05-14 | Remove unused const | John Bauer
2022-05-14 | Use save_name and load_name in run_sentiment.py in order to simplify the name... | John Bauer
2022-05-14 | Cache charlms in the FoundationCache, including a test of the loading. | John Bauer
2022-05-14 | In conparser, load charlm in the load() method | John Bauer
2022-05-14 | add pytest as requirement | J38
2022-05-14 | update to new conda set up | J38
2022-05-14 | write temp directories to test working dir not present dir | J38
2022-05-12 | create id_tensor directly on the device needed | John Bauer
2022-05-12 | Hindi bert model is missing max_length | John Bauer
2022-05-12 | Use the dash replacement for the HI bert as well | John Bauer
2022-05-12 | Merge pull request #1026 from stanfordnlp/charlm_refactor | John Bauer
2022-05-12 | Simplify opening .xz vs regular text files | John Bauer
2022-05-12 | Add a test of save & load charlm | John Bauer
2022-05-12 | Move the charlm build code to char_model.py to be near the code it uses. Als... | John Bauer
2022-05-12 | Simplify a bit - the phobert notes are now hidden in the bert_embedding module | John Bauer
2022-05-11 | Some small notes on running hiner | John Bauer
2022-05-11 | bert can be separate from word emb | John Bauer
2022-05-11 | Technically this is more correcter -- no list in default parameters | John Bauer
2022-05-11 | Add HiNER processing (which is just a conll -> json conversion) | John Bauer
2022-05-10 | Add `str()` and `in` methods to multivocab | John Bauer
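
The 2022-05-11 entry "no list in default parameters" most likely refers to the general Python pitfall of mutable default arguments: a default list is built once, when the function is defined, and is then shared by every call that omits the argument. The sketch below illustrates the pitfall and the usual `None` sentinel fix. The function names `add_tag_shared` and `add_tag` are hypothetical and are not taken from the stanza code; the actual change in that commit is not shown in this log.

```python
def add_tag_shared(tag, tags=[]):
    # Pitfall: the default list is created once at definition time,
    # so every call that omits `tags` appends to the same list object.
    tags.append(tag)
    return tags


def add_tag(tag, tags=None):
    # Usual fix: use None as the sentinel and build a fresh list per call.
    if tags is None:
        tags = []
    tags.append(tag)
    return tags


# Hypothetical usage: two calls to the first version share one list.
first = add_tag_shared("a")
second = add_tag_shared("b")
shared_same_object = first is second

# Two calls to the fixed version get independent lists.
fresh_a = add_tag("a")
fresh_b = add_tag("b")
```

With the sentinel version, callers who do pass their own list still see it modified in place; only the implicit sharing between unrelated calls goes away.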