Unicode Standard Published
October 1991StandardsStandard publishedDate precision, monthEvidence grade, primary1 primary source
Drivers:
Globalisation of computing required universal text representation. Software companies needed consistent character handling. The web's growth demanded reliable multilingual support.
Unicode is the system that allows computers to display text in any language. Before Unicode, different countries used different systems, and text often appeared as gibberish when sent between countries. Unicode gives every character in every language a unique number, so computers everywhere can display text correctly.
Unicode Standard Published event plate
Structured atlas record showing date, domain, evidence grade, source count, and predecessor and successor links.
Forecasts and counterfactuals stay labelled as opinion in the event data. Source: Computer History Museum.
Before
Different character encoding schemes existed for different languages (ASCII, ISO-8859 variants, Shift-JIS, etc.). Software could not reliably handle multilingual text. Documents became corrupted when transferred between systems with different encodings.
What changed
Unicode provided a single character encoding standard capable of representing virtually all writing systems. It assigned unique code points to over 140,000 characters from scripts worldwide. Unicode became the foundation for internationalised software and the web.
How it happened
The Unicode Consortium was founded in 1988 by engineers from Xerox and Apple. Unicode 1.0 was published in October 1991. It was synchronised with ISO/IEC 10646 to ensure a single universal character set. UTF-8 encoding (1992) enabled efficient representation compatible with ASCII.
Outcomes
- Enabled truly multilingual computing
- Became universal encoding for the web (UTF-8)
- Unified character handling across platforms
- Preserved scripts for endangered languages
Limitations
- Complex for some scripts (combining characters)
- Han unification remains controversial
- Emoji standardisation adds ongoing work
- Legacy encoding issues persist
Lessons learnt
- Universal standards enable global communication
- Backward compatibility (UTF-8 with ASCII) aids adoption
- Character encoding is fundamental infrastructure
- Ongoing maintenance required for living standards
Stakeholders and artefacts
Organisations
- Unicode Consortiumstandards_bodyDeveloped and maintains standard
- ISO/IECstandards_bodyISO 10646 synchronisation
Individuals
- Joe BeckerCo-founder, XeroxCo-created Unicode
- Lee CollinsCo-founder, AppleCo-created Unicode
- Mark DavisCo-founder, AppleCo-created Unicode, president of Consortium
Artefacts
- UnicodespecificationUniversal character encoding standard
- UTF-8specificationVariable-width encoding for Unicode
- Code PointspecificationUnique identifier for each Unicode character
Key terms
On this course
Read in the path Standards Bodies: How Technology Gets Standardised.