github.com/stanfordnlp/stanza.git

Age	Commit message	Author
2022-05-30	Script to remove unmodified pieces of pretrain from NER model (ner_wv)	John Bauer
2022-05-30	Notes on which embeddings are used for which NER, in the form of a map of def...	John Bauer
2022-05-30	Ignore unknown embedding words based on a switch (not sure this is useful or ...	John Bauer
2022-05-30	Separate delta embedding from base NER embedding	John Bauer
2022-05-30	Update version number: removing 90% of the embedding from the NER models mean...	John Bauer
2022-05-30	Use save_name to check if a model already exists	John Bauer
2022-05-29	Try to generalize wikiner reading - currently the download format is a	John Bauer
2022-05-28	Merge pull request #1031 from stanfordnlp/refactor_dataloader	John Bauer
2022-05-28	Simplify - can use torch tensors directly rather than first creating np arrays	John Bauer
2022-05-28	Pytorch dataloader	John Bauer
2022-05-28	Start to refactor pieces of the tokenizer dataset into pieces so we can make ...	John Bauer
2022-05-28	Set default JA NER to GSD	John Bauer
2022-05-27	Merge pull request #1038 from stanfordnlp/ja_ner	John Bauer
2022-05-27	Convert the Megagon ja_gsd	John Bauer
2022-05-27	Generalize the conll -> iob conversion a bit	John Bauer
2022-05-26	Add arguments for epsilon and beta2 to initializing an AdamW optimizer	John Bauer
2022-05-26	Add the ability to read a single file from a zipfile	John Bauer
2022-05-26	Corner case: when limiting word vectors to pretty close to the length of vect...	John Bauer
2022-05-25	Add the ability to read .gz files to the pretrain conversion	John Bauer
2022-05-24	Adjust tab stops	John Bauer
2022-05-23	do not install transformers library by default; now support m1 macos	yuhui-zh15
2022-05-23	Add a method to get the keys in a multivocab	John Bauer
2022-05-23	basic __str__ for a Pipeline	John Bauer
2022-05-23	germeval2014 looks like a more reliable dataset, so make that the default	John Bauer
2022-05-19	numpy instead of torch is slightly faster in the small sentence regime, very ...	John Bauer
2022-05-17	labels is unused in tokenizer predict	John Bauer
2022-05-17	Remove unused import	John Bauer
2022-05-16	Add the skip_newlines test to the file reading version of the tokenizer data ...	John Bauer
2022-05-16	Abstract away labels() rather than having the eval code know the format of th...	John Bauer
2022-05-16	more specific Exception type	John Bauer
2022-05-16	Get rid of the input_data field - was only used for tests, and the tests don'...	John Bauer
2022-05-16	For the MWT test, use the fake tokenizer files rather than putting in the fak...	John Bauer
2022-05-16	Factor out a method to write the input to temp files in a tokenizer test	John Bauer
2022-05-16	Add a tiny bit of doc	John Bauer
2022-05-16	Merge pull request #1029 from stanfordnlp/refactor_tok	John Bauer
2022-05-16	Run some basic tests on the dictionary in the ZH tokenizer	John Bauer
2022-05-16	Rearrange - not necessary for this to be an inner function	John Bauer
2022-05-16	whitespace	John Bauer
2022-05-16	torch tensors instead of lists of numbers	John Bauer
2022-05-15	Separate out the label creation - no need to make a fake string of 0s at runtime	John Bauer
2022-05-15	Add a test which pokes the DataLoader object to make sure it is processing da...	John Bauer
2022-05-15	Separate the next() functionality which advances an unfinished batch into a s...	John Bauer
2022-05-14	whitespace update	John Bauer
2022-05-14	use save_name for consistency	John Bauer
2022-05-14	Use save_name to load if load_name is not set	John Bauer
2022-05-14	zh -> zh-hans to match the names of other models	John Bauer
2022-05-14	Use weighted_f1 by default to pick a best model	John Bauer
2022-05-14	Oops, got the test file name mixed up with train	John Bauer
2022-05-14	Add bert to sentiment training. Includes loading it in pipelines and at test...	John Bauer
2022-05-14	Log average loss even when doing an interim analysis	John Bauer