Applied · Stage test
No governed timed route exists for this stage yet, so this page offers an untimed stage-end check built from the published question bank.
Scenario: Your RAG bot answers confidently but cites the wrong paragraph. What do you fix first?
Correct answer: Retrieval quality and chunking/indexing
Scenario: A prompt change breaks a workflow. What engineering practice should exist?
Correct answer: Treat prompts like interfaces: version, test, review
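One way to picture "version, test, review" is a small contract test over a versioned prompt registry. The sketch below is illustrative only: the `PROMPTS` registry, the `summarise_ticket` prompt, and the version strings are hypothetical, and the only check shown is that each version keeps the same placeholders.

```python
from string import Formatter

# Hypothetical registry: prompts are keyed by (name, version) so that a
# change is an explicit new entry rather than an in-place edit.
PROMPTS = {
    ("summarise_ticket", "1.0.0"):
        "Summarise the ticket below in {max_words} words.\n\n{ticket}",
    ("summarise_ticket", "1.1.0"):
        "Summarise the ticket below in at most {max_words} words. "
        "Reply in plain text.\n\n{ticket}",
}


def placeholders(template):
    """Return the set of named fields a format-style template expects."""
    return {name for _, name, _, _ in Formatter().parse(template) if name}


def check_contract(name, version, required):
    """True if the prompt version exposes exactly the required fields."""
    return placeholders(PROMPTS[(name, version)]) == set(required)


# Callers depend on these two fields, so every version must keep them.
REQUIRED = {"max_words", "ticket"}
results = {
    version: check_contract(name, version, REQUIRED)
    for (name, version) in PROMPTS
}
```

A check like this would normally run in CI next to behavioural tests on a fixed scenario set, so that a prompt edit which drops or renames a field fails review before it reaches the workflow.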
Scenario: The model performs well overall but fails for one user segment. What catches this?
Correct answer: Slice testing by segment and scenario
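A minimal sketch of slice testing, assuming each evaluation record is a `(segment, is_correct)` pair. The segment names, counts, and the 0.8 acceptance bar are made up for illustration; the point is that a healthy overall score can hide a failing slice.

```python
from collections import defaultdict


def accuracy_by_slice(records):
    """Return {segment: accuracy} from (segment, is_correct) pairs."""
    totals = defaultdict(int)
    correct = defaultdict(int)
    for segment, is_correct in records:
        totals[segment] += 1
        correct[segment] += int(is_correct)
    return {seg: correct[seg] / totals[seg] for seg in totals}


# Hypothetical evaluation results: (user segment, was the answer correct?)
records = (
    [("desktop", True)] * 90
    + [("desktop", False)] * 5
    + [("screen_reader", True)] * 1
    + [("screen_reader", False)] * 4
)

overall = sum(ok for _, ok in records) / len(records)
per_slice = accuracy_by_slice(records)

# Flag any slice below the (assumed) acceptance bar of 0.8.
failing = {seg: acc for seg, acc in per_slice.items() if acc < 0.8}
```

Here the overall accuracy is 0.91, yet the `screen_reader` slice sits at 0.2, which only the per-slice view exposes.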
Scenario: You must choose a threshold. What should it be based on?
Correct answer: Cost of false positives vs false negatives and review capacity
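The trade-off can be made concrete with a small sketch. It assumes a labelled validation set of `(score, is_positive)` pairs, per-error costs, and a cap on how many flagged items reviewers can handle; all numbers below are invented for illustration.

```python
def expected_cost(threshold, scored, fp_cost, fn_cost):
    """Total error cost when items with score >= threshold are flagged."""
    cost = 0.0
    for score, is_positive in scored:
        flagged = score >= threshold
        if flagged and not is_positive:
            cost += fp_cost      # flagged a negative: wasted review
        elif not flagged and is_positive:
            cost += fn_cost      # missed a positive: harm slips through
    return cost


def pick_threshold(scored, fp_cost, fn_cost, max_flagged):
    """Cheapest candidate threshold whose flagged volume fits capacity."""
    candidates = sorted({score for score, _ in scored})
    feasible = [
        t for t in candidates
        if sum(score >= t for score, _ in scored) <= max_flagged
    ]
    if not feasible:
        return None
    return min(feasible, key=lambda t: expected_cost(t, scored, fp_cost, fn_cost))


# Hypothetical validation data: (model score, true label)
scored = [(0.95, True), (0.9, True), (0.8, False),
          (0.7, True), (0.4, False), (0.2, False)]

tight = pick_threshold(scored, fp_cost=1, fn_cost=5, max_flagged=3)
roomy = pick_threshold(scored, fp_cost=1, fn_cost=5, max_flagged=6)
```

With capacity for only three reviews the sketch settles on 0.9 and accepts one missed positive; with capacity for six it drops to 0.7, because a cheap false positive is then preferable to an expensive miss.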
Scenario: You add lots of context and answer quality drops. What is the most likely reason?
Correct answer: Too much context dilutes key facts and increases distraction
Scenario: Users try to trick the system by changing wording until it misbehaves. What is the right framing?
Correct answer: Adversarial behaviour / distribution shift that needs monitoring and guardrails
Scenario: A team wants the assistant to answer policy questions using current governed documents. What should you try before fine-tuning?
Correct answer: Retrieval-augmented generation with permissions, traceability, and cited sources
Scenario: Retrieval returns the right chunk, but the model still answers wrongly. What do you add?
Correct answer: Citations and answer-grounding checks (and refuse when evidence is weak)
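As a rough illustration of a grounding check with refusal, the sketch below scores how much of a draft answer is supported by the retrieved chunk. Token overlap is a deliberately crude stand-in (a real system would more likely use an entailment model or a second verification pass), and the 0.7 cut-off, the policy text, and the `hr-policy-12` source id are all assumptions.

```python
import re


def tokens(text):
    """Lowercased alphanumeric tokens as a set."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def support_score(answer, chunk):
    """Fraction of answer tokens that also appear in the retrieved chunk."""
    answer_tokens = tokens(answer)
    if not answer_tokens:
        return 0.0
    return len(answer_tokens & tokens(chunk)) / len(answer_tokens)


def grounded_answer(answer, chunk, source_id, min_support=0.7):
    """Attach a citation if the answer is supported, otherwise refuse."""
    if support_score(answer, chunk) < min_support:
        return "I could not find enough evidence in the retrieved documents to answer."
    return f"{answer} [source: {source_id}]"


chunk = ("Employees may carry over up to five days of unused leave "
         "into the next year.")
good = grounded_answer(
    "Employees may carry over up to five days of unused leave.",
    chunk, "hr-policy-12")
bad = grounded_answer(
    "Employees may carry over unlimited leave and sell it back.",
    chunk, "hr-policy-12")
```

The first draft is fully covered by the chunk and is returned with its citation; the second shares only half its tokens with the evidence, so the sketch refuses rather than answering.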
Which evaluation approach is most defensible for a user-facing assistant?
Correct answer: A scenario set with slice tests and acceptance criteria linked to harms
Scenario: Users report 'it was fine yesterday'. What do you check first?
Correct answer: Versioned changes (prompt, retrieval index, tools) and correlated failure spikes
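A small sketch of that first check, assuming a change log in which every prompt, index, and tool deployment is recorded with a timestamp. The log entries, component names, and the 24-hour window are hypothetical.

```python
from datetime import datetime, timedelta


def changes_before_spike(changes, spike_start, window=timedelta(hours=24)):
    """Changes deployed within `window` before the spike, most recent first."""
    hits = [
        c for c in changes
        if spike_start - window <= c["deployed_at"] <= spike_start
    ]
    return sorted(hits, key=lambda c: c["deployed_at"], reverse=True)


# Hypothetical change log covering prompt, retrieval index, and tools.
changes = [
    {"component": "prompt", "version": "v14",
     "deployed_at": datetime(2024, 5, 1, 9, 0)},
    {"component": "retrieval_index", "version": "2024-05-02",
     "deployed_at": datetime(2024, 5, 2, 22, 0)},
    {"component": "tool:search", "version": "3.2.1",
     "deployed_at": datetime(2024, 5, 3, 6, 30)},
]

# Assumed start of the failure spike, taken from monitoring.
spike_start = datetime(2024, 5, 3, 8, 0)
suspects = [c["component"] for c in changes_before_spike(changes, spike_start)]
```

In this example the tool update and the index rebuild both fall inside the window, so they are the first candidates to roll back or diff; the older prompt change is excluded.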
Scenario: Facts change every week and the answer must stay current. Which deployment pattern is the best starting point?
Correct answer: A retrieval-augmented system grounded in current documents
Scenario: Your RAG system retrieves contradictory policies. What should the assistant do?
Correct answer: Surface the conflict, cite both sources, and ask a clarifying question or escalate
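The behaviour described above can be sketched as a small resolution step. This assumes the retrieved passages have already been reduced to structured facts; the `PolicyFact` type, the field names, and the source ids are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PolicyFact:
    source_id: str
    field: str
    value: str


def resolve(facts, field):
    """Answer from agreeing sources, or surface a conflict with citations."""
    relevant = [f for f in facts if f.field == field]
    if not relevant:
        return {"status": "no_evidence"}
    if len({f.value for f in relevant}) == 1:
        return {
            "status": "answer",
            "value": relevant[0].value,
            "sources": [f.source_id for f in relevant],
        }
    # Sources disagree: do not pick a winner silently.
    return {
        "status": "conflict",
        "claims": [(f.source_id, f.value) for f in relevant],
        "next_step": "ask_clarifying_question_or_escalate",
    }


# Hypothetical facts extracted from two retrieved policy documents.
facts = [
    PolicyFact("travel-policy-2023", "daily_meal_allowance", "40 GBP"),
    PolicyFact("travel-policy-2024", "daily_meal_allowance", "55 GBP"),
    PolicyFact("travel-policy-2024", "booking_tool", "internal portal"),
]

allowance = resolve(facts, "daily_meal_allowance")
booking = resolve(facts, "booking_tool")
```

The allowance lookup returns a `conflict` result that lists both sources and their values, while the booking lookup returns a normal cited answer.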