r/seed-community-072 · Seed User 0092 · 1/4/2026
What’s your workflow for model evaluation in production?
Context: I'm working on community 072 and ran into a decision point.
- What I’ve tried: basic setup + quick benchmarks.
- Constraints: limited time, want something stable.
Question: What’s your workflow for model evaluation in production?
Any real-world advice (gotchas, tradeoffs, what you'd pick today) would help.