github.com/stanfordnlp/stanza.git
Age | Commit message | Author
2021-07-28 | ihi-ner-cleaned | anwesham-lab
2021-07-28 | Update .gitignore | anwesham-lab
2021-07-27 | Add vlsp pos dataset option for VLSP WS task (#772) | vythaihn
2021-07-25 | Add some explanation to the logging output for the NER scores | John Bauer
2021-07-25 | Add processing for it_fbk. Uses the .tsv file they sent us and their recomme... | John Bauer
2021-07-25 | Add the ability for the ner model to upscale basic (no B- or I-) tagging -> B... | John Bauer
2021-07-25 | Add a processing step for NHCLT datasets. Currently Afrikaans is the most us... | John Bauer
2021-07-25 | Make the matrix more readable when there are a ton of categories | John Bauer
2021-07-25 | Format ints differently from floats in the confusion matrix | John Bauer
2021-07-25 | Add a confusion matrix over tokens to the output of the ner_tagger | John Bauer
2021-07-25 | Add a flag for finetuning from a different load name from the save_name | John Bauer
2021-07-25 | If given an empty list, simply return an empty list when sort is called. Fix... | John Bauer
2021-07-24 | Merge pull request #766 from stanfordnlp/thai_lst20_redo | John Bauer
2021-07-23 | Add a test of empty text for the pipeline | John Bauer
2021-07-23 | Add indentation to the json rather than saving it in one large dump | John Bauer
2021-07-23 | Fix command line for hindi datasets | John Bauer
2021-07-23 | Process gz files as well as .txt and .txt.xz in the charlm | John Bauer
2021-07-22 | Adjust orchid preparation script to always include spaces after sentences | John Bauer
2021-07-22 | Add a test which checks that the orchid results are consistent | John Bauer
2021-07-22 | Add a longer test for a couple different variations on processing text | John Bauer
2021-07-22 | Add an option to split clauses into sentences if a space is between clauses | John Bauer
2021-07-22 | Add more notes on how the tokenization boundaries are determined | John Bauer
2021-07-22 | Add an option to add spaces after the sentence ends (which is actually more c... | John Bauer
2021-07-22 | Add a lot of notes on how the characters are expected to line up in the test | John Bauer
2021-07-21 | Attempt to add a helpful error explaining where it looked for LST20 | John Bauer
2021-07-21 | Add a tiny test for part of the LST20 preparation | John Bauer
2021-07-21 | Make the retokenization an option for the lst20 dataset | John Bauer
2021-07-21 | Use pythainlp to resplit lst20 sentences as well | John Bauer
2021-07-21 | Refactor some of the processing code which uses pythainlp | John Bauer
2021-07-21 | Revert "Adjust the newpar title" | John Bauer
2021-07-20 | Standardize the final short_name of the hindi ner dataset regardless of which... | John Bauer
2021-07-20 | Add a few extra cases to treebank_to_short_name so that calling on an already... | John Bauer
2021-07-19 | Add some more command lines to the prepare_ner_dataset.py doc | John Bauer
2021-07-19 | Improve prepare_ner_dataset doc | John Bauer
2021-07-19 | Merge pull request #765 from stanfordnlp/thai | John Bauer
2021-07-19 | Refactor some to make it easier to test the lst20 script | John Bauer
2021-07-19 | Don't make new text files for datasets which already produced text files | John Bauer
2021-07-19 | Integrate lst20 into the tokenization script | John Bauer
2021-07-19 | Move process_lst20 to tokenization | John Bauer
2021-07-19 | Add a script which converts the LST20 dataset for tokenization | John Bauer
2021-07-19 | Test updates based on changes to the underlying data, which changed the resul... | John Bauer
2021-07-19 | Test updates based on changes to the underlying data, which changed the resul... | John Bauer
2021-07-17 | Merge pull request #764 from stanfordnlp/move_orchid | John Bauer
2021-07-17 | Problem with space separation | John Bauer
2021-07-17 | Add NewPar to new paragraphs. | John Bauer
2021-07-17 | Connect BEST to the conversion script | John Bauer
2021-07-17 | Add th_orchid to the prepare_tokenizer_treebank script | John Bauer
2021-07-17 | Move thai orchid & best tokenization to the tokenization specific directory | John Bauer
2021-07-16 | Merge pull request #763 from stanfordnlp/vlsp_tokenizer | John Bauer
2021-07-16 | Adjust the newpar title | John Bauer