LLM Evals Course Lesson 6: Complex Pipelines and CI/CD
Notes from lesson 6 of Hamel and Shreya's LLM evaluation course - debugging agentic systems, handling complex data modalities, and implementing CI/CD for production LLM applications.
Notes from lesson 5 of Hamel and Shreya's LLM evaluation course - evaluating retrieval quality, generation quality, and common pitfalls in RAG systems.
Swyx argues for 2025-2035 as the decade of AI agents, backed by unprecedented infrastructure investment and converging technical definitions.
Bavaro's approach to strategy: Vision, Strategic Framework, and Roadmap.
Are frameworks actually useful? Exploring how they enable communication, engagement, and focused thinking.
Simon Willison was a guest on Logan Kilpatrick's Google podcast. Topics covered: AI as a 'cyborg enhancement', the non-intuitive challenges of mastering LLM use, and the legitimate need for uncensored language models in fields like journalism.
How Big Things Get Done (Ch2): Exploring the forces that drive us to think fast and act slow
Exploring AI's role in government: from sanctioned projects to unapproved staff use, and its creeping integration into everyday tools.
I don't think so. 🤷