Artificial intelligence

LLM Evals Course Lesson 5: How to Evaluate Complex Architectures

Notes from lesson 5 of Hamel and Shreya's LLM evaluation course - evaluating retrieval quality, generation quality, and common pitfalls in RAG systems.

The Decade of Agents: Swyx's AI Engineer Paris Keynote

Swyx argues for 2025-2035 as the decade of AI agents, backed by unprecedented infrastructure investment and converging technical definitions.

LLM Evals Course Lesson 4: Multi-turn and Collaborative Evaluation

Notes from lesson 4 of Hamel and Shreya's LLM evaluation course - handling multi-turn conversations and building evaluation criteria through collaboration.

Building Domain-Specific Annotation Tools with FastHTML: Lessons from Isaac Flath

How Isaac Flath built a medical flashcard annotation tool for AnkiHub using FastHTML, and why custom annotation tools beat generic ones for complex domains.

LLM Evals Course Lesson 3: Building Automated Evaluators

Notes from lesson 3 of Hamel and Shreya's LLM evaluation course - implementing automated evaluators, building reliable LLM-as-judge systems, and avoiding common pitfalls.

Three Things I Learned About Voice Agents from Kwindla Kramer

Three things I learned about voice agent architecture, context limitations, and latency trade-offs.