Shallow Evals and How to Make Them Right
I am coming back from Trace, Braintrust's flagship event at the California Academy of Sciences in San Francisco. There is a big question I have been sitting with as I work with evals, and the conversations at Trace sharpened it even further. The question is this: WHO is