Foundations · Stage test

AI Foundations stage test

No governed timed route exists for this stage yet, so this page gives you an honest untimed stage-end check built from the published bank.

Format Untimed self-check

Questions 12

Best time to use it After the stage modules and practice

Question 1

Scenario: A model is 98% accurate but still causes harm. What is your first suspicion?

The model is overfitting because it trained too long
Errors are concentrated in a minority group or high-impact cases
The model needs more parameters
The UI is probably the only problem

Reveal answer

Correct answer: Errors are concentrated in a minority group or high-impact cases

Question 2

Scenario: A spam model relies heavily on number of links. Why is that risky?

Links are always a sign of spam
The model may learn a shortcut correlated in training but not causal
Link counting is too expensive to compute
It violates encryption

Reveal answer

Correct answer: The model may learn a shortcut correlated in training but not causal

Question 3

Scenario: You trained and tested on data from the same week. What failure can appear later?

Drift as real inputs change
The GPU will overheat
The model becomes deterministic
The labels become encrypted

Reveal answer

Correct answer: Drift as real inputs change

Question 4

Scenario: A model output is used to automatically reject applications. What is the safer default?

Full automation with no appeal path
Human review for high-impact cases with accountability and monitoring
Raise the temperature for more creativity
Only collect more data and ignore governance

Reveal answer

Correct answer: Human review for high-impact cases with accountability and monitoring

Question 5

Labels are created by humans under time pressure. What is the predictable risk?

Label noise and bias
Perfect ground truth
Fewer features
Lower compute cost

Reveal answer

Correct answer: Label noise and bias

Question 6

Scenario: You accidentally trained on features created after the outcome date. What happened?

Label leakage that makes tests look unrealistically good
Better generalisation
Lower variance automatically
Safer deployment by default

Reveal answer

Correct answer: Label leakage that makes tests look unrealistically good

Question 7

Scenario: Only 1% of cases are positive. Accuracy is 99%. What should you check next?

Precision/recall and threshold trade-offs
Only model size
Only GPU type
Only prompt wording

Reveal answer

Correct answer: Precision/recall and threshold trade-offs

Question 8

Scenario: A stakeholder asks for full automation to cut costs. What is the first governance question?

What is the worst credible harm and who is accountable for it?
Which cloud vendor is used?
How many parameters does the model have?
Can we remove monitoring to save time?

Reveal answer

Correct answer: What is the worst credible harm and who is accountable for it?

Question 9

Scenario: You want to store chat logs to improve the model. What is the most defensible default?

Collect the minimum needed with clear purpose, retention, and access controls
Collect everything forever because it might be useful
Collect nothing and keep no operational evidence
Email transcripts to the whole team for faster iteration

Reveal answer

Correct answer: Collect the minimum needed with clear purpose, retention, and access controls

Question 10

Scenario: The model is confident even when wrong. What metric helps you detect this?

Calibration (reliability) analysis
Only throughput
Only token count
Only model size

Reveal answer

Correct answer: Calibration (reliability) analysis

Question 11

Scenario: You are not sure the model is safe. What rollout approach reduces harm fastest?

A staged rollout with monitoring, guardrails, and a rollback plan
Big-bang launch to learn faster
Turn off logging to reduce privacy risk
Disable the appeal path to reduce support load

Reveal answer

Correct answer: A staged rollout with monitoring, guardrails, and a rollback plan

Question 12

Scenario: Users treat model outputs as truth. What product change reduces over-reliance?

Show uncertainty limits, require confirmation for high-impact actions, and provide sources/alternatives
Hide explanations to keep the UI clean
Increase temperature for confidence
Remove all warnings to improve adoption

Reveal answer

Correct answer: Show uncertainty limits, require confirmation for high-impact actions, and provide sources/alternatives