Welcome to
mirror list
, hosted at
ThFree Co
, Russian Federation.
github.com/stanfordnlp/stanza.git - Unnamed repository; edit this file 'description' to name the repository.
index
:
github.com/stanfordnlp/stanza.git
142_resources_patch
NFC
am-notes
aws_sagemaker_tooling
azshan
beam
bert_mix
charlm_cache
charlm_checkpoint
con_add_pattn
con_attn2
con_bigrams
con_checkpoint
con_classifier
con_classifier_reranking
con_focal
con_freeze
con_kbest
con_lattn
con_lattn2
con_mixed_pattn
con_mlp
con_mlp_inputs
con_multitask
con_pattn_lr
con_pattn_replace
con_restart_transitions
con_self_gan
con_shift_tags
con_shift_tags2
con_shift_transitions
con_simple_transformer
con_simple_unary
con_skip
con_trans
con_tree_lstm
con_tree_lstm2
con_tree_lstm3
con_tstack
con_vector_dropout
con_vit
con_warmup
con_warmup_2
con_warmup_lattn
dataloader_local
de_ner
dev
elmo2
elmo_many
fewer_cuda
fix_unit_tests
gh-pages
gh-pages-sent
hebrew_combined
hi-layered-ner
hi-ner-cleaned
hi-shuffle
hi_ner
hi_ner_final
inorder_unary
kazakh_ner
kk_trans
lattn_issue
m1
main
marathi
margin_penalty
masakhane
masks
masks2
masks3
ner_bert
ner_bert_copy
ner_wv
ninf_langid
nner
no_header_pt
numeric_re
ordered_dict
pattn_issue
pos_bert
pos_charlm
ppf_data
pydataloader
refactor_dataloader
refactor_lstm
refactor_tok
refactor_tokenizer
refactor_tokkenizer_2
runner-demo
semgrex_search_visualization
sentence_ids
sentiment
sentiment_charlm
sentiment_lstm
sentiment_trainer
sindhi
spanish_sent
t5
t5b
tagger_mha
thai-sybrnn
thai_ner
tiny_ud
token
tokens
tr_ner
trans_lm
tweet
ug_ner
update_stanza
updated_eval
updated_eval_2
vi_bert_last
visualization
wandb
word_lstm_pattn
wordinput-sentsegmenter
xpos
Unnamed repository; edit this file 'description' to name the repository.
www-data
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2021-03-24
small fix
ppf_data
Pfeiffer_T480s
2021-03-24
started refractoring to pytorch dataloader
Pfeiffer_T480s
2021-03-24
Add a note on something that doesn't seem to help
pydataloader
John Bauer
2021-03-24
Refactor - put everything in torch_data
John Bauer
2021-03-24
Maybe a little faster?
John Bauer
2021-03-24
One fewer thing to send back
John Bauer
2021-03-24
This should be a little faster, but need to untangle the tensors
John Bauer
2021-03-23
pass back strings or lists of chunks rather than including the tuples with fa...
John Bauer
2021-03-23
Refactor to make profiling easier
John Bauer
2021-03-23
Attempt to make a pytorch dataloader for bulk_process - not an improvement yet
John Bauer
2021-03-23
Merge pull request #650 from stanfordnlp/vi_fix
John Bauer
2021-03-23
This whitespace annoyed me
John Bauer
2021-03-23
This should be slightly faster
John Bauer
2021-03-23
This should be faster for Chinese or any other skip_newline language
John Bauer
2021-03-23
Fix inconsistency issue between vi and the rest of the languages on how conse...
Peng Qi
2021-03-19
Add more comments to tokenizer output_predictions function
Peng Qi
2021-03-19
Add comments to tokenizer data loader
Peng Qi
2021-03-18
Add a hopefully useful error message when a FileNotFoundError occurs
John Bauer
2021-03-17
Merge pull request #647 from stanfordnlp/fix_parens
John Bauer
2021-03-17
Fix a problem in the Chinese tokenizer by re.escaping all input text
John Bauer
2021-03-17
Merge pull request #645 from stanfordnlp/charlm_input
John Bauer
2021-03-17
Add some docs on how to run this
John Bauer
2021-03-17
Read in .xz file as well as .txt files
John Bauer
2021-03-17
Improve output some
John Bauer
2021-03-17
Add a flag (on by default) to write converted files as .xz
John Bauer
2021-03-17
Add an optional output directory
John Bauer
2021-03-17
Rearrange some if statements to make it easier to read the action part of the...
John Bauer
2021-03-17
Add the proxy parameter to the corenlp download script - incidentally, this f...
John Bauer
2021-03-17
Oops, previous efficiency changes were forgetting to update this field. Save...
John Bauer
2021-03-17
Unused?
John Bauer
2021-03-16
Merge pull request #644 from stanfordnlp/fix_bulk_mwt
John Bauer
2021-03-16
Improve the MWT bulk_process by using the superclass, then updating the counts
John Bauer
2021-03-16
Only create the pipeline once during the test
John Bauer
2021-03-16
Don't recreate all of the word & token objects. Saves a noticeable amount of...
John Bauer
2021-03-16
Add a couple more fields to the bulk mwt test
John Bauer
2021-03-16
This shortcut saves a bit of time
John Bauer
2021-03-16
Add a test which confirms that bulk_process is working with an MWT language
John Bauer
2021-03-16
Merge pull request #643 from stanfordnlp/fix_bulk_mwt
John Bauer
2021-03-16
Add a specific bulk_process for MWT
John Bauer
2021-03-15
Skip MWT tokens with empty word lists when bulk processing a language with mwt
John Bauer
2021-03-15
Merge pull request #642 from stanfordnlp/fix_bulk_mwt
John Bauer
2021-03-15
Get resources from main instead of master going forward
John Bauer
2021-03-09
Add proxies parameters when downloading the model (#638)
Mr.Tan
2021-03-08
Merge branch 'master' into dev
John Bauer
2021-03-06
This is unused
John Bauer
2021-03-04
fix tokenizer batch off-by-one issue
Peng Qi
2021-03-03
Add a model_dir argument to correspond to the model_dir argument when downloa...
John Bauer
2021-03-02
All zh- languages (including zh-hant) should have the same skip_newline behavior
John Bauer
2021-03-02
Tokenizer substring fix (#632)
Peng Qi
2021-02-18
Merge pull request #627 from stanfordnlp/tokenize-dataloader-paraaug
John Bauer
[next]