Evalytic
Pytest for
AI outputs.
Evaluate images, text, RAG, and agents with LLM judges, local metrics, and CI gates.
Know if your AI is good before your users tell you.
"A neon sign reading 'MIDNIGHT CAFE' above a door in a rainy Tokyo alley at night"
Visual evaluation is the most mature public workflow today. Text, RAG, and agent support are now available in the same SDK.
$ pip install evalytic
copy