
LLM Evals Course Lesson 4: Multi-turn and Collaborative Evaluation
Notes from lesson 4 of Hamel and Shreya's LLM evaluation course - handling multi-turn conversations and building evaluation criteria through collaboration.
Notes from lesson 4 of Hamel and Shreya's LLM evaluation course - handling multi-turn conversations and building evaluation criteria through collaboration.
How Isaac Flath built a medical flashcard annotation tool for AnkiHub using FastHTML, and why custom annotation tools beat generic ones for complex domains.
Notes from lesson 3 of Hamel and Shreya's LLM evaluation course - implementing automated evaluators, building reliable LLM-as-judge systems, and avoiding common pitfalls.
How Big Things Get Done (Ch2): Exploring the forces that drive us to think fast and act slow
Exploring AI's role in government: from sanctioned projects to unapproved staff use, and its creeping integration into everyday tools.
I don't think so. 🤷
How Big Things Get Done (ch1)
impact on jobs and carbon emissions...
Open models achieved a lot in 2024, Luca from the AI Institute gives a good overview.