Curtis Pyke

AI

Continuous Autoregressive Language Models – Full Paper and Review

June 22, 2026

What if we stopped predicting the next token—and predicted the next vector instead? That’s the central shift proposed by Continuous...

AI

Forward Deployed AI Engineers: The Most Valuable People in the Building

by Curtis Pyke

June 22, 2026

There’s a new power role inside AI companies, and if you’re paying attention you’re hearing it everywhere — Forward Deployed...

AI

The Smol Training Playbook: The Secrets to Building World-Class LLMs – Book And Review

by Curtis Pyke

June 22, 2026

TL;DR (aka: Why this playbook matters) Training a modern large language model (LLM) is not just “pick an architecture, grab...

Blog

Moloch’s Bargain – Emergent Misalignment When LLM’s Compete For Audience – Paper Summary

by Curtis Pyke

October 9, 2025

Introduction "Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences" by Batu El and James Zou from Stanford University presents...

Blog

Less is More: Recursive Reasoning with Tiny Networks – Paper Summary

by Curtis Pyke

October 8, 2025

The artificial intelligence community just witnessed something extraordinary—and profoundly counterintuitive. A neural network with merely 7 million parameters has achieved...

Blog

Video Models Are Zero-shot Learners And Reasoners – Paper Review

by Curtis Pyke

September 28, 2025

TLDR Video Models Are Zero-Shot Learners and Reasoners: The Coming GPT-3 Moment for Computer Vision Google DeepMind just dropped a...

Blog

GDPVAL: Evaluating AI Model Performance On Real-World Economically Valuable Tasks – Paper Summary

by Curtis Pyke

September 26, 2025

TL;DR GDPval is an OpenAI evaluation of real professional “knowledge-work” on computers. It contains 1,320 expert-authored tasks spanning 44 occupations...

Blog

Godel Test: Can Large Language Models Solve Easy Conjectures? – Paper Summary

by Curtis Pyke

September 25, 2025

Artificial intelligence has already conquered a long list of benchmarks. From passing the bar exam to outperforming humans on high...

Blog

REFRAG: A Breakthrough in Efficient RAG Processing That Achieves 30x Speed Gains

by Curtis Pyke

September 7, 2025

In a groundbreaking development from Meta Superintelligence Labs, researchers have unveiled REFRAG - a novel framework that dramatically accelerates retrieval-augmented...

Blog

Why Language Models Hallucinate – OpenAI Paper Summary

by Curtis Pyke

September 6, 2025

Large language models don’t “see” the world. They model it—statistically, hungrily, and at scale. So when they produce confident falsehoods—hallucinations—it...

Curtis Pyke

Continuous Autoregressive Language Models – Full Paper and Review

Forward Deployed AI Engineers: The Most Valuable People in the Building

The Smol Training Playbook: The Secrets to Building World-Class LLMs – Book And Review

Moloch’s Bargain – Emergent Misalignment When LLM’s Compete For Audience – Paper Summary

Less is More: Recursive Reasoning with Tiny Networks – Paper Summary

Video Models Are Zero-shot Learners And Reasoners – Paper Review

GDPVAL: Evaluating AI Model Performance On Real-World Economically Valuable Tasks – Paper Summary

Godel Test: Can Large Language Models Solve Easy Conjectures? – Paper Summary

REFRAG: A Breakthrough in Efficient RAG Processing That Achieves 30x Speed Gains

Why Language Models Hallucinate – OpenAI Paper Summary

Recent News

Big Tech Wants to Kill the Common Cold With a $500 Million Health Moonshot

Did We Just Cross the Rubicon? AI Access Can Now Change Overnight

Did AI Safety Become Regulatory Capture?

The End of Open AI? How Government Is Quietly Becoming the Gatekeeper of Frontier Models

Kingy AI Launch Intelligence

The Best in A.I.

Recent Posts

Recent News

Big Tech Wants to Kill the Common Cold With a $500 Million Health Moonshot