thingsithinkithink

Artificial intelligence

View All
LLM Evals Course Lesson 4: Multi-turn and Collaborative Evaluation

LLM Evals Course Lesson 4: Multi-turn and Collaborative Evaluation

Notes from lesson 4 of Hamel and Shreya's LLM evaluation course - handling multi-turn conversations and building evaluation criteria through collaboration.

Building Domain-Specific Annotation Tools with FastHTML: Lessons from Isaac Flath

Building Domain-Specific Annotation Tools with FastHTML: Lessons from Isaac Flath

How Isaac Flath built a medical flashcard annotation tool for AnkiHub using FastHTML, and why custom annotation tools beat generic ones for complex domains.

LLM Evals Course Lesson 3: Building Automated Evaluators

LLM Evals Course Lesson 3: Building Automated Evaluators

Notes from lesson 3 of Hamel and Shreya's LLM evaluation course - implementing automated evaluators, building reliable LLM-as-judge systems, and avoiding common pitfalls.


Recent Post

Regular Expression Basics

Regular Expression Basics

Notes and examples on regular expressions

Improving Python Function Usability with wrapping

Improving Python Function Usability with wrapping

A simple approach to modifying Python functions for better usability without altering their core behaviour.

Off-the-Radar AI links for the End of the Year

Off-the-Radar AI links for the End of the Year

Sharing a few lesser-known (but I liked them) things that I don't see other people talking about.

Things I Think I Think

Things I Think I Think

I need a space to think, learn, and work in public.