Practice and strategy · Stage test

AI Advanced stage test

No governed timed route exists for this stage yet, so this page gives you an honest untimed stage-end check built from the published bank.

Format Untimed self-check

Questions 12

Best time to use it After the stage modules and practice

Question 1

Scenario: An agent can call tools. What control limits blast radius fastest?

Scoped permissions + logging + stop conditions
Give it admin access
Disable logs
Let it learn in production

Reveal answer

Correct answer: Scoped permissions + logging + stop conditions

Question 2

Why must LLM systems log tool use?

Tool misuse is a primary failure mode
It makes the UI prettier
It reduces latency automatically
It replaces governance

Reveal answer

Correct answer: Tool misuse is a primary failure mode

Question 3

Governance is most defensibly framed as:

Paperwork to satisfy compliance
Decision quality under uncertainty with evidence and review triggers
A way to slow teams down
Something you do after an incident only

Reveal answer

Correct answer: Decision quality under uncertainty with evidence and review triggers

Question 4

Scenario: You must store evidence but protect privacy. What is the defensible approach?

Store the minimum needed with retention, access controls, and clear purpose
Store everything forever just in case
Store nothing at all
Store only screenshots

Reveal answer

Correct answer: Store the minimum needed with retention, access controls, and clear purpose

Question 5

A system passes a benchmark but fails in production. What was tested incorrectly?

The model alone, not the socio-technical system (pipelines, UI, humans, incentives)
Only the UI, not the model
Only the GPU, not the code
Only the database, not the network

Reveal answer

Correct answer: The model alone, not the socio-technical system (pipelines, UI, humans, incentives)

Question 6

Scenario: An agent starts looping tool calls. What is the safest immediate control?

A kill switch / stop condition and scoped permissions
Increase temperature
Give it more tools
Turn off logging

Reveal answer

Correct answer: A kill switch / stop condition and scoped permissions

Question 7

What makes an AI incident response plan credible?

Clear triggers, owners, evidence to collect, and rollback steps
A vague statement that safety matters
Only model retraining
Only a policy document

Reveal answer

Correct answer: Clear triggers, owners, evidence to collect, and rollback steps

Question 8

Which signal is most useful for detecting retrieval problems early?

Evidence mismatch rate (answers not supported by cited chunks)
GPU utilisation
Number of UI clicks
Number of features in the index

Reveal answer

Correct answer: Evidence mismatch rate (answers not supported by cited chunks)

Question 9

Scenario: Leadership wants 'one model for everything'. What is the first system-level risk to raise?

Different domains have different harms/constraints; one-size-fits-all increases systemic risk
It will always reduce incidents
It guarantees faster delivery
It eliminates the need for evaluation

Reveal answer

Correct answer: Different domains have different harms/constraints; one-size-fits-all increases systemic risk

Question 10

Scenario: You need to prove safe operation to an auditor. What evidence is most defensible?

Versioned policies, monitoring dashboards, incident logs, and review triggers
A promise that the model is safe
Only a benchmark score
Only screenshots of the UI

Reveal answer

Correct answer: Versioned policies, monitoring dashboards, incident logs, and review triggers

Question 11

Scenario: Product wants to disable refusal behaviour because it 'hurts conversion'. What is the governance response?

Treat it as a risk acceptance decision: document harms, evidence, owner, and review trigger
Let sales decide without documentation
Disable monitoring to avoid scrutiny
Change nothing and hope for the best

Reveal answer

Correct answer: Treat it as a risk acceptance decision: document harms, evidence, owner, and review trigger

Question 12

Scenario: An agent suggests a plan. What is the most defensible way to execute it?

Ask the agent to propose steps, but require explicit confirmation for each high-impact tool action
Let the agent run end-to-end without review
Disable logs to reduce cost
Give the agent admin permissions to avoid blockers

Reveal answer

Correct answer: Ask the agent to propose steps, but require explicit confirmation for each high-impact tool action