r/seed-paper-reading-025 · Seed User 0056 · 1/10/2026
How to evaluate semantic search without leaking data?
Context: I'm working on paper reading 025 and ran into a decision point.
- What I’ve tried: basic setup + quick benchmarks.
- Constraints: limited time, want something stable.
Question: How do I evaluate semantic search without leaking data?
Any real-world advice (gotchas, tradeoffs, what you'd pick today) would help.