Applied · Stage test

AI Intermediate stage test

No governed timed route exists for this stage yet, so this page gives you an honest untimed stage-end check built from the published bank.

Format: Untimed self-check
Questions: 12
Best time to use it: After the stage modules and practice

Question 1

Scenario: Your RAG bot answers confidently but cites the wrong paragraph. What do you fix first?

  1. Retrieval quality and chunking/indexing
  2. Model size
  3. Font size
  4. GPU driver
Reveal answer

Correct answer: Retrieval quality and chunking/indexing
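Wrong-paragraph citations are usually a retrieval problem, and chunking is one of the first levers: if a fact is split across chunk boundaries, the right paragraph may never be retrieved whole. A minimal sketch of overlapping chunking (the function name and parameters are illustrative, not from any particular library):

```python
def chunk_text(text: str, size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into overlapping character windows so a fact that
    straddles a boundary still appears whole in at least one chunk."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than chunk size")
    chunks = []
    step = size - overlap
    for start in range(0, len(text), step):
        chunks.append(text[start:start + size])
        if start + size >= len(text):
            break
    return chunks

doc = "a" * 450
pieces = chunk_text(doc, size=200, overlap=50)
print(len(pieces))  # 3
```

Real systems chunk on semantic boundaries (headings, paragraphs) rather than raw characters, but the overlap idea is the same.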

Question 2

Scenario: A prompt change breaks a workflow. What engineering practice should exist?

  1. Treat prompts like interfaces: version, test, review
  2. Never change prompts
  3. Only use longer prompts
  4. Disable monitoring
Reveal answer

Correct answer: Treat prompts like interfaces: version, test, review
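"Treat prompts like interfaces" means they live in version control with tests that fail before a breaking change ships. A minimal sketch, with an illustrative in-memory registry (`PROMPTS`, `render`, and the template text are assumptions for the example):

```python
# Prompts stored as versioned, testable artefacts rather than inline strings.
PROMPTS = {
    ("summarise", "v2"): "Summarise the document below in {max_words} words:\n{document}",
}

def render(name: str, version: str, **kwargs) -> str:
    template = PROMPTS[(name, version)]
    # str.format raises KeyError on a missing placeholder, so a broken
    # contract fails loudly in tests instead of silently in production.
    return template.format(**kwargs)

def test_summarise_v2_has_required_slots():
    # A prompt edit that drops a placeholder breaks this test before release.
    out = render("summarise", "v2", max_words=50, document="the text")
    assert "50" in out and "the text" in out

test_summarise_v2_has_required_slots()
print("prompt contract ok")
```

In practice the same checks run in CI with golden outputs or behavioural assertions per prompt version.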

Question 3

Scenario: The model performs well overall but fails for one user segment. What catches this?

  1. Slice testing by segment and scenario
  2. Only aggregate accuracy
  3. Add more emojis
  4. Use a bigger context window
Reveal answer

Correct answer: Slice testing by segment and scenario
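Slice testing just means computing metrics per segment instead of one aggregate, so a failing slice cannot hide inside a good average. A minimal sketch over labelled eval results (the data and segment names are made up for illustration):

```python
from collections import defaultdict

def slice_accuracy(results):
    """results: iterable of (segment, correct) pairs.
    Returns per-segment accuracy so weak slices stay visible
    instead of being averaged away."""
    totals, hits = defaultdict(int), defaultdict(int)
    for segment, correct in results:
        totals[segment] += 1
        hits[segment] += int(correct)
    return {seg: hits[seg] / totals[seg] for seg in totals}

results = ([("en", True)] * 90 + [("en", False)] * 10 +
           [("de", True)] * 5 + [("de", False)] * 5)
print(slice_accuracy(results))  # {'en': 0.9, 'de': 0.5}
```

Here the aggregate accuracy looks healthy, while the `de` slice is at coin-flip level; only the per-slice view catches it.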

Question 4

Scenario: You must choose a threshold. What should it be based on?

  1. Cost of false positives vs false negatives and review capacity
  2. The highest possible number
  3. What looks good in a demo
  4. The model name
Reveal answer

Correct answer: Cost of false positives vs false negatives and review capacity
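Choosing a threshold this way can be made mechanical: sweep candidate thresholds, price each one by its false positives and false negatives, and discard any that flag more items than reviewers can handle. A sketch under those assumptions (function name and cost model are illustrative):

```python
def pick_threshold(scores_labels, fp_cost, fn_cost, max_flagged=None):
    """scores_labels: (score, is_positive) pairs from an eval set.
    Returns (cost, threshold) minimising expected cost, optionally
    capped by how many flagged items the review team can handle."""
    candidates = sorted({s for s, _ in scores_labels})
    best = (float("inf"), None)
    for t in candidates:
        fp = sum(1 for s, y in scores_labels if s >= t and not y)
        fn = sum(1 for s, y in scores_labels if s < t and y)
        flagged = sum(1 for s, _ in scores_labels if s >= t)
        if max_flagged is not None and flagged > max_flagged:
            continue  # exceeds review capacity
        best = min(best, (fp * fp_cost + fn * fn_cost, t))
    return best

data = [(0.9, True), (0.8, True), (0.7, False), (0.4, True), (0.2, False)]
print(pick_threshold(data, fp_cost=1, fn_cost=5))  # (1, 0.4)
```

With misses priced five times higher than false alarms, the sweep prefers a low threshold that catches every positive at the price of one false positive.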

Question 5

Scenario: You add lots of context and answer quality drops. What is the most likely reason?

  1. Too much context dilutes key facts and increases distraction
  2. More context always improves accuracy
  3. The GPU is out of memory
  4. The prompt became encrypted
Reveal answer

Correct answer: Too much context dilutes key facts and increases distraction

Question 6

Scenario: Users try to trick the system by changing wording until it misbehaves. What is the right framing?

  1. Adversarial behaviour / distribution shift that needs monitoring and guardrails
  2. A harmless UX issue only
  3. A database indexing problem
  4. A compiler bug
Reveal answer

Correct answer: Adversarial behaviour / distribution shift that needs monitoring and guardrails

Question 7

Scenario: A team wants the assistant to answer policy questions using current governed documents. What should you try before fine-tuning?

  1. Retrieval augmented generation with permissions, traceability, and cited sources
  2. Full model retraining immediately
  3. Random prompt changes with no retrieval layer
  4. Disable citations so the answer sounds smoother
Reveal answer

Correct answer: Retrieval augmented generation with permissions, traceability, and cited sources

Question 8

Scenario: Retrieval returns the right chunk, but the model still answers wrongly. What do you add?

  1. Citations and answer-grounding checks (and refuse when evidence is weak)
  2. More temperature
  3. A bigger logo
  4. Disable tests
Reveal answer

Correct answer: Citations and answer-grounding checks (and refuse when evidence is weak)
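A grounding check sits between retrieval and the final answer: if the answer is not supported by the cited evidence, refuse rather than ship it. The sketch below uses crude word overlap as the support signal; production systems use entailment models or citation verification, so treat the function and threshold as illustrative only:

```python
def grounded_answer(answer: str, evidence: str, min_overlap: float = 0.5):
    """Refuse unless enough answer words appear in the retrieved
    evidence. A deliberately crude stand-in for a real grounding check."""
    words = {w.lower().strip(".,") for w in answer.split()}
    support = {w.lower().strip(".,") for w in evidence.split()}
    overlap = len(words & support) / max(len(words), 1)
    if overlap < min_overlap:
        return "I can't confirm that from the cited source."
    return answer

evidence = "Refunds are processed within 14 days of the request."
print(grounded_answer("Refunds are processed within 14 days.", evidence))   # passes through
print(grounded_answer("Refunds are instant and automatic.", evidence))      # refusal
```

The important part is the shape: the refusal path exists and fires when evidence is weak, instead of letting the model answer confidently from memory.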

Question 9

Which evaluation approach is most defensible for a user-facing assistant?

  1. A scenario set with slice tests and acceptance criteria linked to harms
  2. One overall benchmark score
  3. Only speed tests
  4. Only subjective demo feedback
Reveal answer

Correct answer: A scenario set with slice tests and acceptance criteria linked to harms

Question 10

Scenario: Users report 'it was fine yesterday'. What do you check first?

  1. Versioned changes (prompt, retrieval index, tools) and correlated failure spikes
  2. Only model temperature
  3. Only the marketing page
  4. Only the GPU type
Reveal answer

Correct answer: Versioned changes (prompt, retrieval index, tools) and correlated failure spikes
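The first move for "it was fine yesterday" is to line failure timestamps up against a versioned change log. A toy sketch of that correlation (the change log, failure times, and six-hour window are all invented for the example):

```python
from datetime import datetime, timedelta

changes = [
    ("prompt v3 -> v4", datetime(2024, 5, 1, 9, 0)),
    ("retrieval index rebuild", datetime(2024, 5, 2, 14, 0)),
]
failures = [datetime(2024, 5, 2, 14, 5), datetime(2024, 5, 2, 14, 20)]

def suspects(changes, failures, window_hours=6):
    """Return changes that landed shortly before observed failures."""
    out = []
    for name, when in changes:
        hits = sum(1 for f in failures
                   if timedelta(0) <= f - when <= timedelta(hours=window_hours))
        if hits:
            out.append((name, hits))
    return out

print(suspects(changes, failures))  # [('retrieval index rebuild', 2)]
```

None of this works unless prompts, indexes, and tool configs are actually versioned with timestamps, which is the real lesson of the question.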

Question 11

Scenario: Facts change every week and the answer must stay current. Which deployment pattern is the best starting point?

  1. A retrieval-augmented system grounded in current documents
  2. A frozen model with no retrieval layer
  3. Batch predictions with no source refresh
  4. Manual copy and paste into the prompt every day
Reveal answer

Correct answer: A retrieval-augmented system grounded in current documents

Question 12

Scenario: Your RAG system retrieves contradictory policies. What should the assistant do?

  1. Surface the conflict, cite both sources, and ask a clarifying question or escalate
  2. Pick one at random to keep flowing
  3. Hide citations and answer confidently
  4. Ignore retrieval and answer from memory
Reveal answer

Correct answer: Surface the conflict, cite both sources, and ask a clarifying question or escalate
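The answer-assembly step can enforce this behaviour directly: if retrieved claims disagree, return both citations and a clarifying question instead of silently picking a winner. A minimal sketch (the data shape and wording are illustrative; real conflict detection would compare normalised claims, not raw strings):

```python
def answer_policy(chunks):
    """chunks: list of (source_id, claim) pairs from retrieval.
    Surfaces conflicts with citations rather than guessing."""
    claims = {claim for _, claim in chunks}
    if len(claims) > 1:
        cited = "; ".join(f"{src}: '{claim}'" for src, claim in chunks)
        return ("These sources conflict: " + cited +
                ". Which policy applies to your case, or should I escalate?")
    src, claim = chunks[0]
    return f"{claim} (source: {src})"

print(answer_policy([("HR-2023", "Leave is 20 days."),
                     ("HR-2024", "Leave is 25 days.")]))
```

Escalation and clarifying questions are product decisions, but the key property is that the conflict reaches the user with both sources attached.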