Blog posts

The Research Path to GPT-4, Part 2

7 minute read

Published: March 21, 2024

TLDR: This post follows the thread of papers authored by Alec Radford that ultimately led to GPT-4. It observes that the original motivation for next-token prediction was as a representation learning mechanism, and that there appears to have been a gradual (and somewhat accidental) realization that these models could be used for much more…

The Research Path to GPT-4, Part 1

4 minute read

Published: March 19, 2024

TLDR: This post follows the thread of papers authored by Alec Radford that ultimately led to GPT-4. It observes that the original motivation for next-token prediction was as a representation learning mechanism, and that there appears to have been a gradual (and somewhat accidental) realization that these models could be used for much more…

Machine Learning Eras and their Bottlenecks

4 minute read

Published: February 28, 2024

TLDR: Making sense of where we are in AI research by looking at the bottlenecks of each machine learning era so far, and where this suggests we’re headed.

Tim Pearce
