Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/stanfordnlp/stanza.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2021-03-24small fixppf_dataPfeiffer_T480s
2021-03-24started refractoring to pytorch dataloaderPfeiffer_T480s
2021-03-24Add a note on something that doesn't seem to helppydataloaderJohn Bauer
2021-03-24Refactor - put everything in torch_dataJohn Bauer
2021-03-24Maybe a little faster?John Bauer
2021-03-24One fewer thing to send backJohn Bauer
2021-03-24This should be a little faster, but need to untangle the tensorsJohn Bauer
2021-03-23pass back strings or lists of chunks rather than including the tuples with fa...John Bauer
2021-03-23Refactor to make profiling easierJohn Bauer
2021-03-23Attempt to make a pytorch dataloader for bulk_process - not an improvement yetJohn Bauer
2021-03-23Merge pull request #650 from stanfordnlp/vi_fixJohn Bauer
2021-03-23This whitespace annoyed meJohn Bauer
2021-03-23This should be slightly fasterJohn Bauer
2021-03-23This should be faster for Chinese or any other skip_newline languageJohn Bauer
2021-03-23Fix inconsistency issue between vi and the rest of the languages on how conse...Peng Qi
2021-03-19Add more comments to tokenizer output_predictions functionPeng Qi
2021-03-19Add comments to tokenizer data loaderPeng Qi
2021-03-18Add a hopefully useful error message when a FileNotFoundError occursJohn Bauer
2021-03-17Merge pull request #647 from stanfordnlp/fix_parensJohn Bauer
2021-03-17Fix a problem in the Chinese tokenizer by re.escaping all input textJohn Bauer
2021-03-17Merge pull request #645 from stanfordnlp/charlm_inputJohn Bauer
2021-03-17Add some docs on how to run thisJohn Bauer
2021-03-17Read in .xz file as well as .txt filesJohn Bauer
2021-03-17Improve output someJohn Bauer
2021-03-17Add a flag (on by default) to write converted files as .xzJohn Bauer
2021-03-17Add an optional output directoryJohn Bauer
2021-03-17Rearrange some if statements to make it easier to read the action part of the...John Bauer
2021-03-17Add the proxy parameter to the corenlp download script - incidentally, this f...John Bauer
2021-03-17Oops, previous efficiency changes were forgetting to update this field. Save...John Bauer
2021-03-17Unused?John Bauer
2021-03-16Merge pull request #644 from stanfordnlp/fix_bulk_mwtJohn Bauer
2021-03-16Improve the MWT bulk_process by using the superclass, then updating the countsJohn Bauer
2021-03-16Only create the pipeline once during the testJohn Bauer
2021-03-16Don't recreate all of the word & token objects. Saves a noticeable amount of...John Bauer
2021-03-16Add a couple more fields to the bulk mwt testJohn Bauer
2021-03-16This shortcut saves a bit of timeJohn Bauer
2021-03-16Add a test which confirms that bulk_process is working with an MWT languageJohn Bauer
2021-03-16Merge pull request #643 from stanfordnlp/fix_bulk_mwtJohn Bauer
2021-03-16Add a specific bulk_process for MWTJohn Bauer
2021-03-15Skip MWT tokens with empty word lists when bulk processing a language with mwtJohn Bauer
2021-03-15Merge pull request #642 from stanfordnlp/fix_bulk_mwtJohn Bauer
2021-03-15Get resources from main instead of master going forwardJohn Bauer
2021-03-09Add proxies parameters when downloading the model (#638)Mr.Tan
2021-03-08Merge branch 'master' into devJohn Bauer
2021-03-06This is unusedJohn Bauer
2021-03-04fix tokenizer batch off-by-one issuePeng Qi
2021-03-03Add a model_dir argument to correspond to the model_dir argument when downloa...John Bauer
2021-03-02All zh- languages (including zh-hant) should have the same skip_newline behaviorJohn Bauer
2021-03-02Tokenizer substring fix (#632)Peng Qi
2021-02-18Merge pull request #627 from stanfordnlp/tokenize-dataloader-paraaugJohn Bauer