What data is and why it matters
Data starts as recorded observations, for example numbers on a meter, text in a form, or pixels in a photo.
Course summary
Use this page to revisit what each stage gives you and return to the exact weak point that needs another pass.
Stage 1 of 3
Start with the language, formats, and habits that make data useful across teams.
Data starts as recorded observations, for example numbers on a meter, text in a form, or pixels in a photo.
I want a simple model in your head that stays useful even when the tools change, and DIKW works because it forces you to separate raw observations from meaning before.
Data work goes wrong when people are casual about units.
Computers store everything using bits (binary digits) because hardware can reliably tell two states apart.
Interoperability is a boring word for a very expensive problem.
Open data is not “everything on the internet”.
Visualisation is part of data literacy.
Quality means data is accurate (close to the truth), complete (not missing key pieces), and timely (fresh enough to be useful).
Data starts at collection, gets stored, processed, shared, and eventually archived or deleted.
Roles exist so someone is accountable for quality, access, and change.
Ethics matters from the first data point.
Stage 2 of 3
Move into models, pipelines, and applied analytics while keeping reliability in view.
Data architecture is how data is organised, moved, and protected across systems.
Governance is agreeing how data is handled so people can work quickly without being reckless.
Interoperability means systems understand each other.
Analysis is asking good questions of data and checking that the answers hold up.
Data work is mostly uncertainty management.
Inference is the art of learning about a bigger reality from limited observations.
Modelling is not magic.
A mature organisation treats important datasets like products.
Data risk is broader than security.
Stage 3 of 3
Join up data architecture, streaming, governance, and product thinking for real systems.
Maths in data systems describes patterns, uncertainty, and change.
Models are simplified representations of reality.
Inference is about drawing conclusions while admitting uncertainty.
Data systems distribute to handle scale and resilience.
Regulation exists to protect people and markets.
Data creates value when it improves decisions, products, and relationships.