The History of Large Language Models - Timeline

Let us know how you would edit the timeline!

Feb 28, 2023

See the complete timeline here: https://voicebot.ai/large-language-models-history-timeline/

There is a lot of talk about large language models (LLM) but little discussion about where they originated and how the technology progressed. Synthedia and Voicebot.ai compiled a timeline of key technology developments that helped move the industry forward to the LLMs you know today.

Apparently, a timelines of LLM history is also an LFS (large file size). It’s a bit too large for an email newsletter, so you just get the snippet above and below here. You can see the entire timeline on Voicebot.ai here.

See the full timeline

Just the most recent entries to the timeline.

Let us know what you think. What would you add? What would you say doesn’t deserve to be in the timeline? We look forward to your input.

Synthedia

How Will Truth Prevail? New Methods Emerge for Truth Checking in ChatGPT and LLMs

“They will do no wrong. They will tell no lies.” Bible NIV, Zephaniah 3:13, cited directly. “Ask me no questions, and I’ll tell you no fibs.” From The Rehearsal by George Villiers c.1672 as answered by ChatGPT ☹️ “Ask me no questions, and I’ll tell you no fibs.” From She Stoops to Conquer by Oliver Goldsmith, c.1773, cited by numerous online sources and …

2 years ago · 4 likes · Bret Kinsella

Uwe PLEBAN

Mar 10, 2023

Great timeline. Comments on inaccuracies concerning OpenAI models, plus spelling issues.

A. The GPT-1 model is dated 2016, but this cannot be correct, because it uses the transformer architecture described in the Google paper of 2017. The OpenAI announcement of GPT-1 is dated June 11, 2018. Reference here: https://openai.com/research/language-unsupervised . The paper itself has no date attached; at the bottom of page 1 it merely says "Preprint. Work in progress." So your timeline should have the entry for GPT-1 moved down by 2 years. Note: There is an OpenAI research web page on Generative Models dated June 16, 2016 here: https://openai.com/research/generative-models, but it discusses these models in the context of image generation (GANs and friends), not for text generation.

B. Misspelling: Universal Setnence Encoder - should be "Sentence"

C. Bert should always be spelled BERT.

D. GPT-2 was announced on February 14, 2019 - link here: https://openai.com/research/better-language-models. Your timeline dates it to 2018. The final release of the 1.5B parameter model was actually in November 2019.

E. The announcement of GPT-3 is dated May 28, 2020. Link: https://openai.com/research/language-models-are-few-shot-learners. Your timeline says June 2020.

F. The OpenAI announcement of Codex is dated July 07, 2021. Link here: https://openai.com/research/evaluating-large-language-models-trained-on-code. Your timeline says August 2020.

G. The evolution of GPT-3 includes WebGPT and InstructGPT. You may want to include their announcements by OpenAI on your timeline. WebGPT (Dec. 16, 2021): https://openai.com/research/webgpt; InstructGPT(Jan. 27, 2022): https://openai.com/research/instruction-following

Expand full comment

1 reply by Bret Kinsella

Howard Horvath

Mar 2, 2023Edited

I think there will be a tremendous demand for industry-specific LLMs, possibly further tuned to a particular state or even city user base. And given we've already seen the capacity/processing power of previous mainframes reduced to a size and cost (over time) that allows us to now carry them around in our pocket - I'm wondering if we'll ever see the day people will have access to their own personal LLM tuned to what the individual wants/needs most.

1 reply

2 more comments...

The History of Large Language Models - Timeline

Let us know how you would edit the timeline!

Discussion about this post