Welcome to
mirror list
, hosted at
ThFree Co
, Russian Federation.
github.com/stanfordnlp/stanza.git - Unnamed repository; edit this file 'description' to name the repository.
index
:
github.com/stanfordnlp/stanza.git
142_resources_patch
NFC
am-notes
aws_sagemaker_tooling
azshan
beam
bert_mix
charlm_cache
charlm_checkpoint
con_add_pattn
con_attn2
con_bigrams
con_checkpoint
con_classifier
con_classifier_reranking
con_focal
con_freeze
con_kbest
con_lattn
con_lattn2
con_mixed_pattn
con_mlp
con_mlp_inputs
con_multitask
con_pattn_lr
con_pattn_replace
con_restart_transitions
con_self_gan
con_shift_tags
con_shift_tags2
con_shift_transitions
con_simple_transformer
con_simple_unary
con_skip
con_trans
con_tree_lstm
con_tree_lstm2
con_tree_lstm3
con_tstack
con_vector_dropout
con_vit
con_warmup
con_warmup_2
con_warmup_lattn
dataloader_local
de_ner
dev
elmo2
elmo_many
fewer_cuda
fix_unit_tests
gh-pages
gh-pages-sent
hebrew_combined
hi-layered-ner
hi-ner-cleaned
hi-shuffle
hi_ner
hi_ner_final
inorder_unary
kazakh_ner
kk_trans
lattn_issue
m1
main
marathi
margin_penalty
masakhane
masks
masks2
masks3
ner_bert
ner_bert_copy
ner_wv
ninf_langid
nner
no_header_pt
numeric_re
ordered_dict
pattn_issue
pos_bert
pos_charlm
ppf_data
pydataloader
refactor_dataloader
refactor_lstm
refactor_tok
refactor_tokenizer
refactor_tokkenizer_2
runner-demo
semgrex_search_visualization
sentence_ids
sentiment
sentiment_charlm
sentiment_lstm
sentiment_trainer
sindhi
spanish_sent
t5
t5b
tagger_mha
thai-sybrnn
thai_ner
tiny_ud
token
tokens
tr_ner
trans_lm
tweet
ug_ner
update_stanza
updated_eval
updated_eval_2
vi_bert_last
visualization
wandb
word_lstm_pattn
wordinput-sentsegmenter
xpos
Unnamed repository; edit this file 'description' to name the repository.
www-data
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2022-11-11
Add a margin_loss term
margin_penalty
John Bauer
2022-11-11
Turn the split dataset name for vlsp22 into a -x-y format. Makes it easier t...
John Bauer
2022-11-11
Add notes on a relu vs gelu experiment
John Bauer
2022-11-11
Split the retagging operation into chunks. The tqdm is no longer as smooth, ...
John Bauer
2022-11-09
Addition of extra nonlinearities for experiments (#1149)
Hung Bui
2022-11-09
BrokenPipeError is more appropriate - fits the errors that come out of subpro...
John Bauer
2022-11-08
Slightly update a score
John Bauer
2022-11-08
Add a flag to use or not use .xz
John Bauer
2022-11-07
Add a variant of multihead attention where there's one key or one key per label
John Bauer
2022-11-07
Change convert_pretrain to use argparse so it has a nice --help method
John Bauer
2022-11-07
More notes on TOP_DOWN experiments
John Bauer
2022-11-07
Skip blank lines
John Bauer
2022-11-07
Change CompoundUnary to use a constructor similar to the OpenConstituent cons...
John Bauer
2022-11-07
Update some experiment numbers
John Bauer
2022-11-07
Simplify the operation of unary transitions in the event we are using TOP_DOW...
John Bauer
2022-11-07
Log the open nodes used in a model
John Bauer
2022-11-06
Add a flag to remove all sentences which don't fit in a bert tokenizer when p...
John Bauer
2022-11-06
Add the ability to quiet the logging
John Bauer
2022-11-06
Update error to ValueError (more appropriate) and log what the unexpected typ...
John Bauer
2022-11-06
Update log line & allow list of str instead of list of tuples
John Bauer
2022-11-05
Add some doc on the transition schemes
John Bauer
2022-11-05
Discard Devanagari text from the VI wikipedia
John Bauer
2022-11-05
A couple comments on how the NER training is organized
John Bauer
2022-11-05
Allow unknown compound transitions composed of known transitions in the dev o...
John Bauer
2022-11-05
Also chuck some sentences with long words
John Bauer
2022-11-05
throw out long JA sentences as well when tokenizing Wikipedia
John Bauer
2022-11-05
Specifically exclude one sentence from VI Wikipedia which makes Bert sad
John Bauer
2022-11-04
Refactor the tokenization method from tokenize_wiki.py Reuse it to add an op...
John Bauer
2022-11-04
AddSinulsoidalEncoding as a module
John Bauer
2022-11-04
The tokenization script was changed to account for length and emdash, so it i...
John Bauer
2022-11-03
Throw out wiki docs of length 2 as well when building a silver dataset
John Bauer
2022-11-03
Add min_len and max_len args to tokenize_wiki.py. Skip one line wiki docs, s...
John Bauer
2022-11-02
Fix format error in log line
John Bauer
2022-11-01
slice in a more generic manner when copying model. makes it easier to make f...
John Bauer
2022-11-01
Set this option in the partitioned test so that it still tests this code path...
John Bauer
2022-11-01
lattn_partitioned == False should affect the input proj dimension as well
John Bauer
2022-11-01
Add an argument for partitioning / not partitioning lattn
John Bauer
2022-11-01
Oops, this was incorrect
John Bauer
2022-11-01
Log some stats after all models are created for training (move the log line)
John Bauer
2022-11-01
Use some words from the silver dataset (currently |gold| words are added, eve...
John Bauer
2022-10-31
Add a suffix argument to the renormalize script
John Bauer
2022-10-31
Script to renormalize Vietnamese diacritics
John Bauer
2022-10-30
Add a separate argument for --silver_epoch_size, just in case people want that
John Bauer
2022-10-30
Add notes on silver words for the delta embedding
John Bauer
2022-10-30
Since we just ran into a bug where checkpoints were not correctly loaded, add...
John Bauer
2022-10-30
update comment
John Bauer
2022-10-30
Track how many batches a model gets trained for. Backdoor test for the silve...
John Bauer
2022-10-30
Rough draft of using silver trees.
John Bauer
2022-10-29
Move uses_xpos() to the model itself, add it Ensemble. Will make it easier t...
John Bauer
2022-10-29
Try smaller chunks for the parse_text. One giant chunk ran out of GPU
John Bauer
[next]