Claude Agent SDK Part 5: Editing Files with Checkpointing
Adding the ability for the agent to create posts that follow my templates, with the ability to recover from mistakes.
Adding the ability for the agent to create posts that follow my templates, with the ability to recover from mistakes.
Building context profiles and usage tracking that works with the SDK's design.
Discovering the trade-offs between agency and control when building on the Claude Agent SDK.
Notes from lesson 3 of Hamel and Shreya's LLM evaluation course - implementing automated evaluators, building reliable LLM-as-judge systems, and avoiding common pitfalls.
Three thigns I learned about voice agent architecture, context limitations, and latency trade-offs.
A few things from Evals Course office hrs following lesson 2 of Hamel and Shreya's LLM evaluation course.
Pearson FT published AI Demystified offers a gentle introduction for business leaders who want to understand how AI might impact their field.
Notes from lesson 2 of Hamel and Shreya's LLM evaluation course - covering error analysis, open and axial coding, and systematic approaches to understanding where AI systems fail.
Notes from the first lesson of Parlance Lab's Maven course on evaluating LLM applications - covering the Three Gulfs model and why eval is where most people get stuck.