Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/stanfordnlp/stanza.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
BranchCommit messageAuthorAge
142_resources_patchPut all needed pretrains in default.zip in case the NER uses a different pret...John Bauer19 months
NFCNFC instead of NFDJohn Bauer18 months
am-notesUpdate workspace.xmlanwesham-lab3 years
aws_sagemaker_toolingDumping out useless print statements and import statementsSecroLoL19 months
azshantesting git controlsunknown23 months
beamBeam training loss - rough draftJohn Bauer19 months
bert_mixMix with N vectors instead of just 1John Bauer18 months
charlm_cacheAdd bert to sentiment training. Includes loading it in pipelines and at test...John Bauer2 years
charlm_checkpointAdd a trainer for the charlm - useful for saving and loading everything for c...John Bauer21 months
con_add_pattnFinetune a model that was learned with no pattn to now use pattnJohn Bauer23 months
con_attn2ATTN method to build larger constituents out of smaller constituentsJohn Bauer2 years
con_bigramsmaybe try to avoid the bigram explodingJohn Bauer2 years
con_checkpointAlways save checkpoints. Always load from a checkpoint if one exists.John Bauer20 months
con_classifierLoad a parser as an argument to the parser.John Bauer19 months
con_classifier_rerankingRough draft of using the classifier as a rerankerJohn Bauer19 months
con_focalUse a pip installable focal loss library instead?John Bauer19 months
con_freezeRather than freezing, just learn reaaaaaly slowlyJohn Bauer21 months
con_kbestAdd a second type of reranker (need to make these general)John Bauer2 years
con_lattnExperiment with not doing weight decay at all for pattnJohn Bauer23 months
con_lattn2Move the LayerNorm outside of the positional encodings. Set the d_model used...John Bauer21 months
con_mixed_pattnLearning the mixing factor rather than hardcoding it to 0.1John Bauer20 months
con_mlpAttempt the MLP again (forgot that it might be more useful in the case of ber...John Bauer2 years
con_mlp_inputsCombine all inputs using MLPs rather than concatJohn Bauer2 years
con_multitaskBegin making it so you can use multiple treebanks and annotation schemesJohn Bauer21 months
con_pattn_lrlow lr and low weight decay for the norms of pattnJohn Bauer23 months
con_pattn_replacePotentially multiply p into c in the partitioned transformerJohn Bauer2 years
con_restart_transitionsRestart transitions when restarting trainingJohn Bauer20 months
con_self_ganAdd a loss to make the model try to approximate what the classifier would lik...John Bauer2 years
con_shift_tagsAdd a flag to control how many tags to use when labeling shift transitionsJohn Bauer20 months
con_shift_tags2Add a flag to control how many tags to use when labeling shift transitionsJohn Bauer18 months
con_shift_transitionsAdd a label to a ShiftJohn Bauer20 months
con_simple_transformerAdd a simple MHA to the modelJohn Bauer2 years
con_simple_unaryRemove the specialized unary transformJohn Bauer2 years
con_skipUse the dynamic LSTM for constituents. Not sure it helps, is definitely slowJohn Bauer2 years
con_transuse two transitions as input for the LSTM. not sure this will help at allJohn Bauer20 months
con_tree_lstmAttempt to come up with an initial tree_cx for the TREE_LSTM methodJohn Bauer20 months
con_tree_lstm2Add a TREE_LSTM node combination method.John Bauer20 months
con_tree_lstm3FixJohn Bauer2 years
con_tstackTransformer stack. Currently only has one head of attention. TODO: Probably...John Bauer20 months
con_vector_dropoutDon't remove pattn... we rely on dropout to prevent pattn from going to 0John Bauer20 months
con_vitIT WORKSgit diffgit diffJohn Bauer2 years
con_warmuplow weight decay for the normsJohn Bauer23 months
con_warmup_2AdaDelta warmup for the conparser. Motivation: AdaDelta results inJohn Bauer23 months
con_warmup_lattnAdd an option to build the lattn out of the entire input, not just pattnJohn Bauer23 months
dataloader_localSupport variants and pretokenized as beforeJohn Bauer3 years
de_nerAttempt to compensate for German BERT tokenizers not handling soft hyphen ver...John Bauer2 years
devFix broken unittestJohn Bauer18 months
elmo2Add support for elmoformanylangs to sentimentJohn Bauer21 months
elmo_manyAllow different size input transforms in the NERJohn Bauer3 years
fewer_cudaRemove some cuda() calls in favor of getting the device instead. Will make i...John Bauer20 months
fix_unit_testsadd pytest as requirementJ382 years
gh-pagesUpdate an obsolete commentJohn Bauer19 months
gh-pages-sentAdd a (not quite complete) sentiment model pageJohn Bauer21 months
hebrew_combinedAdd the capacity to build he_combined models from UD_Hebrew-IAHLTwiki and a f...John Bauer21 months
hi-layered-neryeahanwesham-lab3 years
hi-ner-cleanedianwesham-lab3 years
hi-shuffleUpdate convert_fire_2013.pyanwesham-lab3 years
hi_nerUpdate convert_fire_2013.pyanwesham-lab3 years
hi_ner_finalUpdate convert_fire_2013.pyanwesham-lab2 years
inorder_unarywill combine PreterminalUnary with CompoundUnaryJohn Bauer18 months
kazakh_nerwhitespaceJohn Bauer21 months
kk_transless noisy, context for the outputJohn Bauer21 months
lattn_issueReplace all inputs with the lattn inputs if it's on. Problem: this does not ...John Bauer18 months
m1fix unittest?yuhui-zh1524 months
mainBump version number to release a few small changesJohn Bauer20 months
marathiAdd a processing for the MR l3cube sentimentJohn Bauer23 months
margin_penaltyAdd a margin_loss termJohn Bauer18 months
masakhaneAdd a preparation script for Masakhane NERJohn Bauer21 months
masksfix bert_embedding.pyKantapong Kotchum21 months
masks2Just to compare masked or not-masked for xlnetJohn Bauer21 months
masks3Another attempt at xlnet masksJohn Bauer21 months
ner_bertAdd bert embeddings to the bottom layer of the NER.Vy2 years
ner_bert_copyIn situations where we already have a bert_tokenizer loaded, don't load it ag...John Bauer2 years
ner_wvScript to remove unmodified pieces of pretrain from NER modelJohn Bauer24 months
ninf_langidMask illegal langauges by setting them to -ninf. 0 means that illegal langua...John Bauer22 months
nneradd ner_max_depth argument which will be called laterKantapong Kotchum21 months
no_header_ptAdd a test of the no-header pretrainJohn Bauer22 months
numeric_refix that it takes forever to tokenize a really long non-numeric tokenJohn Bauer23 months
ordered_dictDO NOT MERGE - this keeps a large object on the GPU between testsJohn Bauer20 months
pattn_issuemore heads? should run some experimentsJohn Bauer18 months
pos_bertUpgrading POS models to use Bert will require a resources version bumpJohn Bauer20 months
pos_charlmAdd a pos-specific charlm map for the medical EN datasets and the one dataset...John Bauer22 months
ppf_datasmall fixPfeiffer_T480s3 years
pydataloaderAdd a note on something that doesn't seem to helpJohn Bauer3 years
refactor_dataloaderSimplify - can use torch tensors directly rather than first creating np arraysJohn Bauer24 months
refactor_lstmAlso refactor a constituent_lstm_stack. The unary transitions are a little w...John Bauer20 months
refactor_tokturn the labels into a separate arrayJohn Bauer24 months
refactor_tokenizerSeparate out the label creation - no need to make a fake string of 0s at runtimeJohn Bauer2 years
refactor_tokkenizer_2Refactor a Dataset & Dataloader... although it is only used to prepare the se...John Bauer2 years
runner-demodemoJ3824 months
semgrex_search_visualizationDumping out old and unneeded files from developmentSecroLoL21 months
sentence_idsProcess sentence ids from the corpus, if available.John Bauer2 years
sentimentSwitch the VI model to use words tokenized from the stanza tokenizer rather t...John Bauer2 years
sentiment_charlmsimplify a bitJohn Bauer2 years
sentiment_lstm2d conv. Uses the width of a conv feature to rescale the outputJohn Bauer21 months
sentiment_trainerRefactor a Trainer object out of the classifier.py main program. In addition...John Bauer21 months
sindhiUpdate a few broken tags from the Sindhi NER dataset and add a little documen...John Bauer20 months
spanish_sentWrapper for converting Spanish TASS2020 for sentimentJohn Bauer21 months
t5start t5 integration... needs more masksJohn Bauer22 months
t5bAttempt to integrate T5 into NER/conparse. T5-small doesn't work well at all...John Bauer22 months
tagger_mhaPotentially use a MHA layer in the taggerJohn Bauer18 months
thai-sybrnnUpdate model.pyGordon3 years
thai_nerMerge branch 'dev' into thai_nerJohn Bauer22 months
tiny_udAdd expected xpos vocabs for the tiny treebanksJohn Bauer21 months
tokenoopsJohn Bauer19 months
tokensWrite spaces as their own tokens in Thai? Probably needs a lot of workJohn Bauer22 months
tr_nerAdd more debug to the watch_regex optionJohn Bauer2 years
trans_lmAdjust learning rate, don't print out infinite pplJohn Bauer18 months
tweetCopy over the changes from TweebankNLP. Will merge or edit them laterJohn Bauer20 months
ug_nerAug_17Arman Aydin21 months
update_stanzaUpdated syllable token + character token model frameworkFTdiscovery3 years
updated_evalpointless refactoringJohn Bauer21 months
updated_eval_2Update compose_ete_results.py to allow multiple input filesJohn Bauer21 months
vi_bert_lastUse the last word piece instead of the firstJohn Bauer18 months
visualizationRemoving unnecessary print statementsSecroLoL22 months
wandbGeneralize the sentiment csv reading code and move it to process_utilsJohn Bauer23 months
word_lstm_pattnMove the pattn & lattn after the word lstm. The position information should ...John Bauer18 months
wordinput-sentsegmenterUpdate trainer.pyGordon3 years
xposAutomatically determine the vocab type if it isn't already knownJohn Bauer22 months