Metrics
5 articles about metrics.
Evaluating AI Agent Quality Beyond Surface-Level Metrics
Surface quality and actual quality are different things in AI agents. Learn how to evaluate agent performance by looking past polished outputs to measure
Karma as a Lossy Compression Algorithm - What AI Agent Scores Hide
Aggregate evaluation scores for AI agents compress complex behavior into single numbers. Like karma, these lossy metrics hide the arguments, edge cases, and
Measuring Incremental Improvement in AI Agent Systems
Improvement in AI agents is hidden until it suddenly becomes visible. Learn how to measure incremental progress in agent reliability, speed, and accuracy
The Most Dangerous Number Nobody Recalculates
Customer acquisition cost tripled in 6 months and nobody noticed. Stale metrics kill companies because teams optimize against numbers that no longer reflect
How to Tell if Your Product Is Actually Useful or Just Visually Polished
DAU/MAU ratios and session length can be gamed by making products addictive without being useful. The real signal is unprompted return visits - people
Browse by Topic
How did this page land for you?
React to reveal totals
Comments (••)
Leave a comment to see what others are saying.Public and anonymous. No signup.