Welcome to
mirror list
, hosted at
ThFree Co
, Russian Federation.
github.com/stanfordnlp/stanza.git - Unnamed repository; edit this file 'description' to name the repository.
index
:
github.com/stanfordnlp/stanza.git
142_resources_patch
NFC
am-notes
aws_sagemaker_tooling
azshan
beam
bert_mix
charlm_cache
charlm_checkpoint
con_add_pattn
con_attn2
con_bigrams
con_checkpoint
con_classifier
con_classifier_reranking
con_focal
con_freeze
con_kbest
con_lattn
con_lattn2
con_mixed_pattn
con_mlp
con_mlp_inputs
con_multitask
con_pattn_lr
con_pattn_replace
con_restart_transitions
con_self_gan
con_shift_tags
con_shift_tags2
con_shift_transitions
con_simple_transformer
con_simple_unary
con_skip
con_trans
con_tree_lstm
con_tree_lstm2
con_tree_lstm3
con_tstack
con_vector_dropout
con_vit
con_warmup
con_warmup_2
con_warmup_lattn
dataloader_local
de_ner
dev
elmo2
elmo_many
fewer_cuda
fix_unit_tests
gh-pages
gh-pages-sent
hebrew_combined
hi-layered-ner
hi-ner-cleaned
hi-shuffle
hi_ner
hi_ner_final
inorder_unary
kazakh_ner
kk_trans
lattn_issue
m1
main
marathi
margin_penalty
masakhane
masks
masks2
masks3
ner_bert
ner_bert_copy
ner_wv
ninf_langid
nner
no_header_pt
numeric_re
ordered_dict
pattn_issue
pos_bert
pos_charlm
ppf_data
pydataloader
refactor_dataloader
refactor_lstm
refactor_tok
refactor_tokenizer
refactor_tokkenizer_2
runner-demo
semgrex_search_visualization
sentence_ids
sentiment
sentiment_charlm
sentiment_lstm
sentiment_trainer
sindhi
spanish_sent
t5
t5b
tagger_mha
thai-sybrnn
thai_ner
tiny_ud
token
tokens
tr_ner
trans_lm
tweet
ug_ner
update_stanza
updated_eval
updated_eval_2
vi_bert_last
visualization
wandb
word_lstm_pattn
wordinput-sentsegmenter
xpos
Unnamed repository; edit this file 'description' to name the repository.
www-data
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2022-08-05
Add a trainer for the charlm - useful for saving and loading everything for c...
charlm_checkpoint
John Bauer
2022-08-04
Run the charlm for a couple iterations and make sure that doesn't barf
John Bauer
2022-08-04
More descriptive error for empty training data
John Bauer
2022-08-04
Make vars() part of the parse_args call - will simplify use in tests
John Bauer
2022-08-04
Lower learning rate seems to learn better for models of all sizes
John Bauer
2022-08-04
Throw an error if the training or dev data files are empty
John Bauer
2022-08-03
Rough outline of a semgrex interface demo program
John Bauer
2022-08-03
Add the graphIndex and semgrexIndex from CoreNLP 4.5.0 to make the semgrex in...
John Bauer
2022-08-03
Make the error for an unfinished language (hopefully) more useful
John Bauer
2022-08-03
Make a special error type for missing language instead of throwing a ValueError
John Bauer
2022-08-03
Add Saraiki (and fix an alphabetization error)
John Bauer
2022-08-02
Update run_depparse to download word vectors as well
John Bauer
2022-08-02
Automatically download POS (with charlm & wordvec) when redoing depparse
John Bauer
2022-08-02
Oops, need to tell the Trainer where to get the charlm if not in the default ...
John Bauer
2022-08-02
Add a piece of doc
John Bauer
2022-08-02
Download pretrain when training POS
John Bauer
2022-08-02
Add a few more useful output statements to the prepare scripts
John Bauer
2022-08-02
Add a hopefully useful status line to the preparation script
John Bauer
2022-08-02
2.10 is the most recent version
John Bauer
2022-08-02
Explicitely remove unsaved modules from the optimizer, although this doesn't ...
John Bauer
2022-08-01
Add an option for an alternate output directory if needed
John Bauer
2022-08-01
Convert oscar 2022 files to txt by extracting the content fields
John Bauer
2022-08-01
Making combining the input the default for the lattn layer
John Bauer
2022-07-30
Rename Fragment -> SentimentDatum to make a more understandable name for pote...
John Bauer
2022-07-30
Further refactor - put the utility method in the utility methods file
John Bauer
2022-07-30
Refactor - the MR sentiment dataset can use the 'write_dataset' function
John Bauer
2022-07-30
Also remove ufeff
John Bauer
2022-07-30
Strip words ... only changes one word in bn_daffodil
John Bauer
2022-07-29
Refactor read_datasets from the bn_daffodil NER script. May be useful for bu...
John Bauer
2022-07-29
bert & roberta attention masks
John Bauer
2022-07-29
Restore a sentence to VIT that seems to be fixed in the latest updates
John Bauer
2022-07-29
Update to use the latest version of Italian VIT constituency treebank from ELRA
John Bauer
2022-07-29
Some minor updates to the VIT constituency processing based on more text upda...
John Bauer
2022-07-29
Update comments and paths for building constituency it_vit to reflect the lat...
John Bauer
2022-07-29
Add a bunch of languages represented in fasttext vectors
John Bauer
2022-07-29
exceptions and logging with periods separated from the language names for rea...
John Bauer
2022-07-27
Latest POS models change this word to NOUN
John Bauer
2022-07-26
Integrate xlnet
John Bauer
2022-07-26
Use the next token instead of the last token for the endpoint in bert - makes...
John Bauer
2022-07-25
Oops, bugfix - the start & end tokens take up space in the bert tokenizer
John Bauer
2022-07-24
Updated tagger tags Opal as a noun (perhaps we should choose a more stable te...
John Bauer
2022-07-22
Save NER models if the training hasn't gone on long enough to hit a checkpoint
John Bauer
2022-07-21
Load the pretrained charlm, adds it as inputs to the POS model
John Bauer
2022-07-20
Small whitespace change
John Bauer
2022-07-20
Upgrade convert_pretrain to read from .csv files
John Bauer
2022-07-20
Add a reader for .csv files to the pretrained embedding reading function
John Bauer
2022-07-20
Move logging inside read_from_file
John Bauer
2022-07-20
Failed lines are now dropped inside read_from_file
John Bauer
2022-07-20
Test that the en_ewt with an unexpected tagset logs an error. All other test...
John Bauer
2022-07-20
Put the xpos vocab test in a class so we can level it up
John Bauer
[next]