thingsithinkithink

Artificial intelligence

Claude Agent SDK Part 5: Editing Files with Checkpointing

Adding the ability for the agent to create posts that follow my templates, with the ability to recover from mistakes.

Claude Agent SDK Part 4: Implementing Context Profiles

Building context profiles and usage tracking that works with the SDK's design.

Claude Agent SDK Part 3: The Context Control Problem

Discovering the trade-offs between agency and control when building on the Claude Agent SDK.


Recent Posts

LLM Evals Course Lesson 3: Building Automated Evaluators

Notes from lesson 3 of Hamel and Shreya's LLM evaluation course - implementing automated evaluators, building reliable LLM-as-judge systems, and avoiding common pitfalls.

Three Things I Learned About Voice Agents from Kwindla Kramer

Three things I learned about voice agent architecture, context limitations, and latency trade-offs.

LLM Evals Course: Lesson 2b (office hrs)

A few takeaways from the office hours that followed lesson 2 of Hamel and Shreya's LLM evaluation course.

AI Demystified Book by Antonio Weiss

AI Demystified, published by Pearson FT, offers a gentle introduction for business leaders who want to understand how AI might impact their field.

LLM Evals Lesson 2 Error Analysis

Notes from lesson 2 of Hamel and Shreya's LLM evaluation course - covering error analysis, open and axial coding, and systematic approaches to understanding where AI systems fail.

Hamel & Shreya's LLM Evals Course: Lesson 1

Notes from the first lesson of Parlance Labs' Maven course on evaluating LLM applications - covering the Three Gulfs model and why eval is where most people get stuck.