Unicode Standard Published

October 1991.Standards.Standard published.Date precision, month.Evidence grade, primary.1 primary source

Drivers:

Interoperability needStandardisation

Globalisation of computing required universal text representation. Software companies needed consistent character handling. The web's growth demanded reliable multilingual support.

Unicode is the system that allows computers to display text in any language. Before Unicode, different countries used different systems, and text often appeared as gibberish when sent between countries. Unicode gives every character in every language a unique number, so computers everywhere can display text correctly.

Unicode Standard Published event plate

Structured atlas record showing date, domain, evidence grade, source count, and predecessor and successor links.

Forecasts and counterfactuals stay labelled as opinion in the event data. Source: Computer History Museum.

Before

Different character encoding schemes existed for different languages (ASCII, ISO-8859 variants, Shift-JIS, etc.). Software could not reliably handle multilingual text. Documents became corrupted when transferred between systems with different encodings.

What changed

Unicode provided a single character encoding standard capable of representing virtually all writing systems. It assigned unique code points to over 140,000 characters from scripts worldwide. Unicode became the foundation for internationalised software and the web.

How it happened

The Unicode Consortium was founded in 1988 by engineers from Xerox and Apple. Unicode 1.0 was published in October 1991. It was synchronised with ISO/IEC 10646 to ensure a single universal character set. UTF-8 encoding (1992) enabled efficient representation compatible with ASCII.

Outcomes

Enabled truly multilingual computing
Became universal encoding for the web (UTF-8)
Unified character handling across platforms
Preserved scripts for endangered languages

Limitations

Complex for some scripts (combining characters)
Han unification remains controversial
Emoji standardisation adds ongoing work
Legacy encoding issues persist

Lessons learnt

Universal standards enable global communication
Backward compatibility (UTF-8 with ASCII) aids adoption
Character encoding is fundamental infrastructure
Ongoing maintenance required for living standards

Stakeholders and artefacts

Organisations

Unicode Consortiumstandards_bodyDeveloped and maintains standard
ISO/IECstandards_bodyISO 10646 synchronisation

Individuals

Joe BeckerCo-founder, XeroxCo-created Unicode
Lee CollinsCo-founder, AppleCo-created Unicode
Mark DavisCo-founder, AppleCo-created Unicode, president of Consortium

Artefacts

UnicodespecificationUniversal character encoding standard
UTF-8specificationVariable-width encoding for Unicode
Code PointspecificationUnique identifier for each Unicode character

Key terms

UnicodeUTF-8character encodinginternationalisationcode point

On this course

Read in the path Standards Bodies: How Technology Gets Standardised.

Sources

1"The Unicode Standard: A Technical Introduction". Unicode Consortium, 2024.authoritativewww.unicode.org/standard/principles.html