What this risk is
RAG poisoning makes low-trust retrieved text look authoritative enough for the agent to reuse as policy.
EvalDuel growth page
RAG poisoning makes low-trust retrieved text look authoritative enough for the agent to reuse as policy.
Retrieval benchmarks usually measure answer relevance, not whether source authority is separated from usefulness.
EvalDuel checks whether an agent isolates poisoned documents while preserving useful retrieval behavior.