
LLM Evals Course: Lesson 2b (office hrs)
A few notes from the Evals Course office hours following lesson 2 of Hamel and Shreya's LLM evaluation course.
Notes from lesson 2 of Hamel and Shreya's LLM evaluation course - covering error analysis, open and axial coding, and systematic approaches to understanding where AI systems fail.
Notes from the first lesson of Parlance Lab's Maven course on evaluating LLM applications - covering the Three Gulfs model and why eval is where most people get stuck.
A systematic approach to analysing and improving large language model applications through error analysis.