Data course stage
Stage 2. Applied Data
Move into models, pipelines, and applied analytics while keeping reliability in view.
3 hours across 9 modules
Stage 2 Applied as a four-cluster supply chain map
Nine Applied modules group into four practical clusters that turn governed source data into a contracted product.
Stage 2 Applied builds the supply chain that turns governed source data into a contracted product: pipelines, governance, interoperability, analysis. Each cluster has its own canonical anchor; the stage finishes when a learner can defend each clause of a data contract.
Module path
Work through the modules in order. The stage visual above gives the map; each module opens with the detailed visuals that carry the concept.
- 0.5h
A1
Architectures and pipelines
- Explain ETL vs ELT and when each is appropriate
- Describe data lineage and why it matters for trust
- 0.5h
A2
Governance and stewardship
- Describe DAMA DMBOK knowledge areas
- Explain data catalogues and master data management
- 0.25h
A3
Interoperability and standards
- Explain the role of APIs in data interoperability
- Describe common data exchange standards
- 0.25h
A4
Analysis and insight
- Distinguish descriptive, diagnostic, predictive, and prescriptive analytics
- Explain feature engineering in machine learning contexts
- 0.5h
A5
Probability and distributions
- Explain normal distributions and what they tell you about data
- Describe correlation vs causation with examples
- 0.25h
A6
Inference and experiments
- Design a valid A/B test with proper controls
- Explain p-values and their limitations
- 0.25h
A7
Data modelling basics
- Create conceptual and logical data models
- Apply normalisation rules to reduce redundancy
- 0.25h
A8
Data as a product
- Explain Dehghani's data mesh principles
- Describe data contracts and their purpose
- 0.25h
A9
Risk, ethics, and strategic value
- Explain the right to erasure under GDPR
- Connect data ethics to business risk and reputation