thingsithinkithink

Artificial intelligence

View All
LLM Evals Course Lesson 7: Interfaces for Human Review

LLM Evals Course Lesson 7: Interfaces for Human Review

Notes from lesson 7 of Hamel and Shreya's LLM evaluation course - interface design principles and strategic sampling.

Building an AI Sandbox with Docker

Building an AI Sandbox with Docker

How to set up a persistent Docker environment for AI coding tools without losing your authentication every time you restart the container.

LLM Evals Course Lesson 6: Complex Pipelines and CI/CD

LLM Evals Course Lesson 6: Complex Pipelines and CI/CD

Notes from lesson 6 of Hamel and Shreya's LLM evaluation course - debugging agentic systems, handling complex data modalities, and implementing CI/CD for production LLM applications.


Recent Post

LLM Evals Course Lesson 7: Interfaces for Human Review

LLM Evals Course Lesson 7: Interfaces for Human Review

Notes from lesson 7 of Hamel and Shreya's LLM evaluation course - interface design principles and strategic sampling.

Building an AI Sandbox with Docker

Building an AI Sandbox with Docker

How to set up a persistent Docker environment for AI coding tools without losing your authentication every time you restart the container.

LLM Evals Course Lesson 6: Complex Pipelines and CI/CD

LLM Evals Course Lesson 6: Complex Pipelines and CI/CD

Notes from lesson 6 of Hamel and Shreya's LLM evaluation course - debugging agentic systems, handling complex data modalities, and implementing CI/CD for production LLM applications.

LLM Evals Course Lesson 5: How to Evaluate Complex Architectures

LLM Evals Course Lesson 5: How to Evaluate Complex Architectures

Notes from lesson 5 of Hamel and Shreya's LLM evaluation course - evaluating retrieval quality, generation quality, and common pitfalls in RAG systems.

The Decade of Agents: Swyx's AI Engineer Paris Keynote

The Decade of Agents: Swyx's AI Engineer Paris Keynote

Swyx argues for 2025-2035 as the decade of AI agents, backed by unprecedented infrastructure investment and converging technical definitions.

LLM Evals Course Lesson 4: Multi-turn and Collaborative Evaluation

LLM Evals Course Lesson 4: Multi-turn and Collaborative Evaluation

Notes from lesson 4 of Hamel and Shreya's LLM evaluation course - handling multi-turn conversations and building evaluation criteria through collaboration.