
Error Analysis for Improving LLM Applications
A systematic approach to analysing and improving large language model applications through error analysis.
A systematic approach to analysing and improving large language model applications through error analysis.
Why evaluation-driven experimentation creates better roadmaps in AI products.
Simon Willison was a guest on Logan Kilpatrick's Google podcast. Topics covered: AI as a 'cyborg enhancement', the non-intuitive challenges of mastering LLM use, and the legitimate need for uncensored language models in fields like journalism.
How Big Things Get Done (Ch2): Exploring the forces that drive us to think fast and act slow
Exploring AI's role in government: from sanctioned projects to unapproved staff use, and its creeping integration into everyday tools.
I don't think so. 🤷
How Big Things Get Done (ch1)
impact on jobs and carbon emissions...
Open models achieved a lot in 2024, Luca from the AI Institute gives a good overview.