thingsithinkithink

Artificial intelligence

View All
LLM Evals Course Lesson 4: Multi-turn and Collaborative Evaluation

LLM Evals Course Lesson 4: Multi-turn and Collaborative Evaluation

Notes from lesson 4 of Hamel and Shreya's LLM evaluation course - handling multi-turn conversations and building evaluation criteria through collaboration.

Building Domain-Specific Annotation Tools with FastHTML: Lessons from Isaac Flath

Building Domain-Specific Annotation Tools with FastHTML: Lessons from Isaac Flath

How Isaac Flath built a medical flashcard annotation tool for AnkiHub using FastHTML, and why custom annotation tools beat generic ones for complex domains.

LLM Evals Course Lesson 3: Building Automated Evaluators

LLM Evals Course Lesson 3: Building Automated Evaluators

Notes from lesson 3 of Hamel and Shreya's LLM evaluation course - implementing automated evaluators, building reliable LLM-as-judge systems, and avoiding common pitfalls.


Recent Post

Is Psychology or Politics Behind Project Failures?

Is Psychology or Politics Behind Project Failures?

How Big Things Get Done (Ch2): Exploring the forces that drive us to think fast and act slow

How AI is making its way into Government

How AI is making its way into Government

Exploring AI's role in government: from sanctioned projects to unapproved staff use, and its creeping integration into everyday tools.

Does OpenAI's loss on ChatGPT Pro mean they're doomed?

Does OpenAI's loss on ChatGPT Pro mean they're doomed?

I don't think so. 🤷

Think Slow, Act Fast: The Paradox of Project Planning

Think Slow, Act Fast: The Paradox of Project Planning

How Big Things Get Done (ch1)

Two AI things from Kent Hendricks' annual 'Things I learned' blog

Two AI things from Kent Hendricks' annual 'Things I learned' blog

impact on jobs and carbon emissions...

Open Models in 2024: Progress and Challenges

Open Models in 2024: Progress and Challenges

Open models achieved a lot in 2024, Luca from the AI Institute gives a good overview.