·7 min read
Evals are the new unit tests
The real engineering discipline behind shipping reliable LLM features is not prompting — it is evals. They are to LLM apps what unit tests are to deterministic code, except you measure pass-rates and distributions instead of exact equality.
#ai
#llm
#evals
#testing
#opinion
#ai-assisted