Question 1
Scenario: A model is 98% accurate but still causes harm. What is your first suspicion?
Reveal answer
Correct answer: Errors are concentrated in a minority group or high-impact cases
Foundations · Stage test
No governed timed route exists for this stage yet, so this page gives you an honest untimed stage-end check built from the published bank.
Scenario: A model is 98% accurate but still causes harm. What is your first suspicion?
Correct answer: Errors are concentrated in a minority group or high-impact cases
Scenario: A spam model relies heavily on number of links. Why is that risky?
Correct answer: The model may learn a shortcut correlated in training but not causal
Scenario: You trained and tested on data from the same week. What failure can appear later?
Correct answer: Drift as real inputs change
Scenario: A model output is used to automatically reject applications. What is the safer default?
Correct answer: Human review for high-impact cases with accountability and monitoring
Labels are created by humans under time pressure. What is the predictable risk?
Correct answer: Label noise and bias
Scenario: You accidentally trained on features created after the outcome date. What happened?
Correct answer: Label leakage that makes tests look unrealistically good
Scenario: Only 1% of cases are positive. Accuracy is 99%. What should you check next?
Correct answer: Precision/recall and threshold trade-offs
Scenario: A stakeholder asks for full automation to cut costs. What is the first governance question?
Correct answer: What is the worst credible harm and who is accountable for it?
Scenario: You want to store chat logs to improve the model. What is the most defensible default?
Correct answer: Collect the minimum needed with clear purpose, retention, and access controls
Scenario: The model is confident even when wrong. What metric helps you detect this?
Correct answer: Calibration (reliability) analysis
Scenario: You are not sure the model is safe. What rollout approach reduces harm fastest?
Correct answer: A staged rollout with monitoring, guardrails, and a rollback plan
Scenario: Users treat model outputs as truth. What product change reduces over-reliance?
Correct answer: Show uncertainty limits, require confirmation for high-impact actions, and provide sources/alternatives