Welcome to
mirror list
, hosted at
ThFree Co
, Russian Federation.
github.com/stanfordnlp/stanza.git - Unnamed repository; edit this file 'description' to name the repository.
index
:
github.com/stanfordnlp/stanza.git
142_resources_patch
NFC
am-notes
aws_sagemaker_tooling
azshan
beam
bert_mix
charlm_cache
charlm_checkpoint
con_add_pattn
con_attn2
con_bigrams
con_checkpoint
con_classifier
con_classifier_reranking
con_focal
con_freeze
con_kbest
con_lattn
con_lattn2
con_mixed_pattn
con_mlp
con_mlp_inputs
con_multitask
con_pattn_lr
con_pattn_replace
con_restart_transitions
con_self_gan
con_shift_tags
con_shift_tags2
con_shift_transitions
con_simple_transformer
con_simple_unary
con_skip
con_trans
con_tree_lstm
con_tree_lstm2
con_tree_lstm3
con_tstack
con_vector_dropout
con_vit
con_warmup
con_warmup_2
con_warmup_lattn
dataloader_local
de_ner
dev
elmo2
elmo_many
fewer_cuda
fix_unit_tests
gh-pages
gh-pages-sent
hebrew_combined
hi-layered-ner
hi-ner-cleaned
hi-shuffle
hi_ner
hi_ner_final
inorder_unary
kazakh_ner
kk_trans
lattn_issue
m1
main
marathi
margin_penalty
masakhane
masks
masks2
masks3
ner_bert
ner_bert_copy
ner_wv
ninf_langid
nner
no_header_pt
numeric_re
ordered_dict
pattn_issue
pos_bert
pos_charlm
ppf_data
pydataloader
refactor_dataloader
refactor_lstm
refactor_tok
refactor_tokenizer
refactor_tokkenizer_2
runner-demo
semgrex_search_visualization
sentence_ids
sentiment
sentiment_charlm
sentiment_lstm
sentiment_trainer
sindhi
spanish_sent
t5
t5b
tagger_mha
thai-sybrnn
thai_ner
tiny_ud
token
tokens
tr_ner
trans_lm
tweet
ug_ner
update_stanza
updated_eval
updated_eval_2
vi_bert_last
visualization
wandb
word_lstm_pattn
wordinput-sentsegmenter
xpos
Unnamed repository; edit this file 'description' to name the repository.
www-data
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2022-05-30
Script to remove unmodified pieces of pretrain from NER model
ner_wv
John Bauer
2022-05-30
Notes on which embeddings are used for which NER, in the form of a map of def...
John Bauer
2022-05-30
Ignore unknown embedding words based on a switch (not sure this is useful or ...
John Bauer
2022-05-30
Separate delta embedding from base NER embedding
John Bauer
2022-05-30
Update version number: removing 90% of the embedding from the NER models mean...
John Bauer
2022-05-30
Use save_name to check if a model already exists
John Bauer
2022-05-29
Try to generalize wikiner reading - currently the download format is a
John Bauer
2022-05-28
Merge pull request #1031 from stanfordnlp/refactor_dataloader
John Bauer
2022-05-28
Simplify - can use torch tensors directly rather than first creating np arrays
John Bauer
2022-05-28
Pytorch dataloader
John Bauer
2022-05-28
Start to refactor pieces of the tokenizer dataset into pieces so we can make ...
John Bauer
2022-05-28
Set default JA NER to GSD
John Bauer
2022-05-27
Merge pull request #1038 from stanfordnlp/ja_ner
John Bauer
2022-05-27
Convert the Megagon ja_gsd
John Bauer
2022-05-27
Generalize the conll -> iob conversion a bit
John Bauer
2022-05-26
Add arguments for epsilon and beta2 to initializing an AdamW optimizer
John Bauer
2022-05-26
Add the ability to read a single file from a zipfile
John Bauer
2022-05-26
Corner case: when limiting word vectors to pretty close to the length of vect...
John Bauer
2022-05-25
Add the ability to read .gz files to the pretrain conversion
John Bauer
2022-05-24
Adjust tab stops
John Bauer
2022-05-23
do not install transformers library by default; now support m1 macos
yuhui-zh15
2022-05-23
Add a method to get the keys in a multivocab
John Bauer
2022-05-23
basic __str__ for a Pipeline
John Bauer
2022-05-23
germeval2014 looks like a more reliable dataset, so make that the default
John Bauer
2022-05-19
numpy instead of torch is slightly faster in the small sentence regime, very ...
John Bauer
2022-05-17
labels is unused in tokenizer predict
John Bauer
2022-05-17
Remove unused import
John Bauer
2022-05-16
Add the skip_newlines test to the file reading version of the tokenizer data ...
John Bauer
2022-05-16
Abstract away labels() rather than having the eval code know the format of th...
John Bauer
2022-05-16
more specific Exception type
John Bauer
2022-05-16
Get rid of the input_data field - was only used for tests, and the tests don'...
John Bauer
2022-05-16
For the MWT test, use the fake tokenizer files rather than putting in the fak...
John Bauer
2022-05-16
Factor out a method to write the input to temp files in a tokenizer test
John Bauer
2022-05-16
Add a tiny bit of doc
John Bauer
2022-05-16
Merge pull request #1029 from stanfordnlp/refactor_tok
John Bauer
2022-05-16
Run some basic tests on the dictionary in the ZH tokenizer
John Bauer
2022-05-16
Rearrange - not necessary for this to be an inner function
John Bauer
2022-05-16
whitespace
John Bauer
2022-05-16
torch tensors instead of lists of numbers
John Bauer
2022-05-15
Separate out the label creation - no need to make a fake string of 0s at runtime
John Bauer
2022-05-15
Add a test which pokes the DataLoader object to make sure it is processing da...
John Bauer
2022-05-15
Separate the next() functionality which advances an unfinished batch into a s...
John Bauer
2022-05-14
whitespace update
John Bauer
2022-05-14
use save_name for consistency
John Bauer
2022-05-14
Use save_name to load if load_name is not set
John Bauer
2022-05-14
zh -> zh-hans to match the names of other models
John Bauer
2022-05-14
Use weighted_f1 by default to pick a best model
John Bauer
2022-05-14
Oops, got the test file name mixed up with train
John Bauer
2022-05-14
Add bert to sentiment training. Includes loading it in pipelines and at test...
John Bauer
2022-05-14
Log average loss even when doing an interim analysis
John Bauer
[next]