github.com/stanfordnlp/stanza.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Expand)	Author
2021-03-24	small fixppf_data	Pfeiffer_T480s
2021-03-24	started refractoring to pytorch dataloader	Pfeiffer_T480s
2021-03-24	Add a note on something that doesn't seem to helppydataloader	John Bauer
2021-03-24	Refactor - put everything in torch_data	John Bauer
2021-03-24	Maybe a little faster?	John Bauer
2021-03-24	One fewer thing to send back	John Bauer
2021-03-24	This should be a little faster, but need to untangle the tensors	John Bauer
2021-03-23	pass back strings or lists of chunks rather than including the tuples with fa...	John Bauer
2021-03-23	Refactor to make profiling easier	John Bauer
2021-03-23	Attempt to make a pytorch dataloader for bulk_process - not an improvement yet	John Bauer
2021-03-23	Merge pull request #650 from stanfordnlp/vi_fix	John Bauer
2021-03-23	This whitespace annoyed me	John Bauer
2021-03-23	This should be slightly faster	John Bauer
2021-03-23	This should be faster for Chinese or any other skip_newline language	John Bauer
2021-03-23	Fix inconsistency issue between vi and the rest of the languages on how conse...	Peng Qi
2021-03-19	Add more comments to tokenizer output_predictions function	Peng Qi
2021-03-19	Add comments to tokenizer data loader	Peng Qi
2021-03-18	Add a hopefully useful error message when a FileNotFoundError occurs	John Bauer
2021-03-17	Merge pull request #647 from stanfordnlp/fix_parens	John Bauer
2021-03-17	Fix a problem in the Chinese tokenizer by re.escaping all input text	John Bauer
2021-03-17	Merge pull request #645 from stanfordnlp/charlm_input	John Bauer
2021-03-17	Add some docs on how to run this	John Bauer
2021-03-17	Read in .xz file as well as .txt files	John Bauer
2021-03-17	Improve output some	John Bauer
2021-03-17	Add a flag (on by default) to write converted files as .xz	John Bauer
2021-03-17	Add an optional output directory	John Bauer
2021-03-17	Rearrange some if statements to make it easier to read the action part of the...	John Bauer
2021-03-17	Add the proxy parameter to the corenlp download script - incidentally, this f...	John Bauer
2021-03-17	Oops, previous efficiency changes were forgetting to update this field. Save...	John Bauer
2021-03-17	Unused?	John Bauer
2021-03-16	Merge pull request #644 from stanfordnlp/fix_bulk_mwt	John Bauer
2021-03-16	Improve the MWT bulk_process by using the superclass, then updating the counts	John Bauer
2021-03-16	Only create the pipeline once during the test	John Bauer
2021-03-16	Don't recreate all of the word & token objects. Saves a noticeable amount of...	John Bauer
2021-03-16	Add a couple more fields to the bulk mwt test	John Bauer
2021-03-16	This shortcut saves a bit of time	John Bauer
2021-03-16	Add a test which confirms that bulk_process is working with an MWT language	John Bauer
2021-03-16	Merge pull request #643 from stanfordnlp/fix_bulk_mwt	John Bauer
2021-03-16	Add a specific bulk_process for MWT	John Bauer
2021-03-15	Skip MWT tokens with empty word lists when bulk processing a language with mwt	John Bauer
2021-03-15	Merge pull request #642 from stanfordnlp/fix_bulk_mwt	John Bauer
2021-03-15	Get resources from main instead of master going forward	John Bauer
2021-03-09	Add proxies parameters when downloading the model (#638)	Mr.Tan
2021-03-08	Merge branch 'master' into dev	John Bauer
2021-03-06	This is unused	John Bauer
2021-03-04	fix tokenizer batch off-by-one issue	Peng Qi
2021-03-03	Add a model_dir argument to correspond to the model_dir argument when downloa...	John Bauer
2021-03-02	All zh- languages (including zh-hant) should have the same skip_newline behavior	John Bauer
2021-03-02	Tokenizer substring fix (#632)	Peng Qi
2021-02-18	Merge pull request #627 from stanfordnlp/tokenize-dataloader-paraaug	John Bauer