diff --git a/README.md b/README.md
index 793691d..41119bd 100644
--- a/README.md
+++ b/README.md
@@ -5,18 +5,17 @@
+
+
-
-
| LLMs | +Model | + + +Data | + +|||
|---|---|---|---|---|---|
| + | License | +Commercial Use | +Other notable restrictions | +License | +Corpus | +
| Encoder-only | +|||||
| BERT series of models (general domain) | +Apache 2.0 | +β | ++ | Public | +BooksCorpus, English Wikipedia | +
| RoBERTa | +MIT license | +β | ++ | Public | +BookCorpus, CC-News, OpenWebText, STORIES | +
| ERNIE | +Apache 2.0 | +β | ++ | Public | +English Wikipedia | +
| SciBERT | +Apache 2.0 | +β | ++ | Public | +BERT corpus, 1.14M papers from Semantic Scholar | +
| LegalBERT | +CC BY-SA 4.0 | +β | ++ | Public (except data from the Case Law Access Project) | +EU legislation, US court cases, etc. | +
| BioBERT | +Apache 2.0 | +β | ++ | PubMed | +PubMed, PMC | +
| Encoder-Decoder | +|||||
| T5 | +Apache 2.0 | +β | ++ | Public | +C4 | +
| Flan-T5 | +Apache 2.0 | +β | ++ | Public | +C4, Mixture of tasks (Fig 2 in paper) | +
| BART | +Apache 2.0 | +β | ++ | Public | +RoBERTa corpus | +
| GLM | +Apache 2.0 | +β | ++ | Public | +BooksCorpus and English Wikipedia | +
| ChatGLM | +ChatGLM License | +β | +No use for illegal purposes or military research, no harm to the public interest of society | +N/A | +1T tokens of Chinese and English corpus | +
| Decoder-only | +|||||
| GPT2 | +Modified MIT License | +β | +Use GPT-2 responsibly and clearly indicate your content was created using GPT-2. | +Public | +WebText | +
| GPT-Neo | +MIT license | +β | ++ | Public | +Pile | +
| GPT-J | +Apache 2.0 | +β | ++ | Public | +Pile | +
| ---> Dolly | +CC BY NC 4.0 | +β | ++ | CC BY NC 4.0, Subject to terms of Use of the data generated by OpenAI | +Pile, Self-Instruct | +
| ---> GPT4ALL-J | +Apache 2.0 | +β | ++ | Public | +GPT4All-J dataset | +
| Pythia | +Apache 2.0 | +β | ++ | Public | +Pile | +
| ---> Dolly v2 | +MIT license | +β | ++ | Public | +Pile, databricks-dolly-15k | +
| OPT | +OPT-175B LICENSE AGREEMENT | +β | +No development relating to surveillance research and military, no harm to the public interest of society | +Public | +RoBERTa corpus, the Pile, PushShift.io Reddit | +
| ---> OPT-IML | +OPT-175B LICENSE AGREEMENT | +β | +same as OPT | +Public | +OPT corpus, Extended version of Super-NaturalInstructions | +
| YaLM | +Apache 2.0 | +β | ++ | Unspecified | +Pile, texts in Russian collected by the team | +
| BLOOM | +The BigScience RAIL License | +β | +No generating verifiably false information with the purpose of harming others; no generating content without expressly disclaiming that the text is machine generated | +Public | +ROOTS corpus (Laurençon et al., 2022) | +
| ---> BLOOMZ | +The BigScience RAIL License | +β | +same as BLOOM | +Public | +ROOTS corpus, xP3 | +
| Galactica | +CC BY-NC 4.0 | +β | ++ | N/A | +The Galactica Corpus | +
| LLaMA | +Non-commercial bespoke license | +β | +No development relating to surveillance research and military, no harm to the public interest of society | +Public | +CommonCrawl, C4, Github, Wikipedia, etc. | +
| ---> Alpaca | +CC BY NC 4.0 | +β | ++ | CC BY NC 4.0, Subject to terms of Use of the data generated by OpenAI | +LLaMA corpus, Self-Instruct | +
| ---> Vicuna | +CC BY NC 4.0 | +β | ++ | Subject to terms of Use of the data generated by OpenAI; Privacy Practices of ShareGPT | +LLaMA corpus, 70K conversations from ShareGPT.com | +
| ---> GPT4ALL | +GPL Licensed LLaMa | +β | ++ | Public | +GPT4All dataset | +
| OpenLLaMA | +Apache 2.0 | +β | ++ | Public | +RedPajama | +
| CodeGeeX | +The CodeGeeX License | +β | +No use for illegal purposes or military research | +Public | +Pile, CodeParrot, etc. | +
| StarCoder | +BigCode OpenRAIL-M v1 license | +β | +No generating verifiably false information with the purpose of harming others; no generating content without expressly disclaiming that the text is machine generated | +Public | +The Stack | +
| MPT-7B | +Apache 2.0 | +β | ++ | Public | +mC4 (English), The Stack, RedPajama, S2ORC | +
| Falcon | +TII Falcon LLM License | +β /β | +Available under a license allowing commercial use | +Public | +RefinedWeb | +