github.com/stanfordnlp/stanza.git
Age | Commit message | Author
2021-08-06 | Update convert_fire_2013.py [hi_ner] | anwesham-lab
2021-08-06 | Update convert_fire_2013.py | anwesham-lab
2021-08-06 | Update convert_fire_2013.py | anwesham-lab
2021-08-05 | Update convert_fire_2013.py | anwesham-lab
2021-08-05 | Update convert_fire_2013.py | anwesham-lab
2021-08-05 | modify randomized data distribution style | anwesham-lab
2021-08-05 | Update convert_fire_2013.py | anwesham-lab
2021-08-05 | Update convert_fire_2013.py | anwesham-lab
2021-08-04 | Update convert_fire_2013.py | anwesham-lab
2021-08-04 | Update convert_fire_2013.py | anwesham-lab
2021-08-04 | Update convert_fire_2013.py | anwesham-lab
2021-07-29 | Update convert_fire_2013.py | anwesham-lab
2021-07-29 | Update convert_fire_2013.py | anwesham-lab
2021-07-29 | Update convert_fire_2013.py | anwesham-lab
2021-07-28 | Update convert_fire_2013.py | anwesham-lab
2021-07-28 | location | anwesham-lab
2021-07-28 | Update .gitignore | anwesham-lab
2021-07-25 | One attempt at reducing the tag space for hi_ner. Not super great | John Bauer
2021-07-25 | Add some explanation to the logging output for the NER scores | John Bauer
2021-07-25 | Add processing for it_fbk. Uses the .tsv file they sent us and their recomme... | John Bauer
2021-07-25 | Add the ability for the ner model to upscale basic (no B- or I-) tagging -> B... | John Bauer
2021-07-25 | Add a processing step for NHCLT datasets. Currently Afrikaans is the most us... | John Bauer
2021-07-25 | Make the matrix more readable when there are a ton of categories | John Bauer
2021-07-25 | Format ints differently from floats in the confusion matrix | John Bauer
2021-07-25 | Add a confusion matrix over tokens to the output of the ner_tagger | John Bauer
2021-07-25 | Add a flag for finetuning from a different load name from the save_name | John Bauer
2021-07-25 | If given an empty list, simply return an empty list when sort is called. Fix... | John Bauer
2021-07-24 | Merge pull request #766 from stanfordnlp/thai_lst20_redo | John Bauer
2021-07-23 | Add a test of empty text for the pipeline | John Bauer
2021-07-23 | Add indentation to the json rather than saving it in one large dump | John Bauer
2021-07-23 | Fix command line for hindi datasets | John Bauer
2021-07-23 | Process gz files as well as .txt and .txt.xz in the charlm | John Bauer
2021-07-22 | Adjust orchid preparation script to always include spaces after sentences | John Bauer
2021-07-22 | Add a test which checks that the orchid results are consistent | John Bauer
2021-07-22 | Add a longer test for a couple different variations on processing text | John Bauer
2021-07-22 | Add an option to split clauses into sentences if a space is between clauses | John Bauer
2021-07-22 | Add more notes on how the tokenization boundaries are determined | John Bauer
2021-07-22 | Add an option to add spaces after the sentence ends (which is actually more c... | John Bauer
2021-07-22 | Add a lot of notes on how the characters are expected to line up in the test | John Bauer
2021-07-21 | Attempt to add a helpful error explaining where it looked for LST20 | John Bauer
2021-07-21 | Add a tiny test for part of the LST20 preparation | John Bauer
2021-07-21 | Make the retokenization an option for the lst20 dataset | John Bauer
2021-07-21 | Use pythainlp to resplit lst20 sentences as well | John Bauer
2021-07-21 | Refactor some of the processing code which uses pythainlp | John Bauer
2021-07-21 | Revert "Adjust the newpar title" | John Bauer
2021-07-20 | Standardize the final short_name of the hindi ner dataset regardless of which... | John Bauer
2021-07-20 | Add a few extra cases to treebank_to_short_name so that calling on an already... | John Bauer
2021-07-19 | Add some more command lines to the prepare_ner_dataset.py doc | John Bauer
2021-07-19 | Improve prepare_ner_dataset doc | John Bauer
2021-07-19 | Merge pull request #765 from stanfordnlp/thai | John Bauer