Skip to content

Unicode Standard Published

October 1991.Standards.Standard published.Date precision, month.Evidence grade, primary.1 primary source

Drivers:

Interoperability needStandardisation

Globalisation of computing required universal text representation. Software companies needed consistent character handling. The web's growth demanded reliable multilingual support.

Unicode is the system that allows computers to display text in any language. Before Unicode, different countries used different systems, and text often appeared as gibberish when sent between countries. Unicode gives every character in every language a unique number, so computers everywhere can display text correctly.

Unicode Standard Published event plate

Structured atlas record showing date, domain, evidence grade, source count, and predecessor and successor links.

Event plate: Unicode Standard Published Convergence-divergence layout. The central hero card carries the event year, type, title, evidence grade, domain and era band. 0 predecessor cards on the left feed in with red arrows labelled "absorbs". 0 successor cards on the right derive with red arrows labelled "spawns". Key terms below the hero pin the vocabulary the event introduced. EVENT PLATE Source: https://www.unicode.org/standard/principles.html 1991 - STANDARD PUBLISHED Unicode StandardPublished primary evidence Domain: AI and machine learning Era band: E6 AI-scale systems KEY TERMS - VOCABULARY THE EVENT INTRODUCED Unicode UTF-8 character encoding internationalisation Convergence-divergence: predecessors absorbed, successors spawned Hero card carries year, evidence and domain. 0 predecessors flow in from the left; 0 successors flow out to the right. Key termsbelow pin the vocabulary the event introduced.

Forecasts and counterfactuals stay labelled as opinion in the event data. Source: Computer History Museum.

Before

Different character encoding schemes existed for different languages (ASCII, ISO-8859 variants, Shift-JIS, etc.). Software could not reliably handle multilingual text. Documents became corrupted when transferred between systems with different encodings.

What changed

Unicode provided a single character encoding standard capable of representing virtually all writing systems. It assigned unique code points to over 140,000 characters from scripts worldwide. Unicode became the foundation for internationalised software and the web.

How it happened

The Unicode Consortium was founded in 1988 by engineers from Xerox and Apple. Unicode 1.0 was published in October 1991. It was synchronised with ISO/IEC 10646 to ensure a single universal character set. UTF-8 encoding (1992) enabled efficient representation compatible with ASCII.

Outcomes

  • Enabled truly multilingual computing
  • Became universal encoding for the web (UTF-8)
  • Unified character handling across platforms
  • Preserved scripts for endangered languages

Limitations

  • Complex for some scripts (combining characters)
  • Han unification remains controversial
  • Emoji standardisation adds ongoing work
  • Legacy encoding issues persist

Lessons learnt

  • Universal standards enable global communication
  • Backward compatibility (UTF-8 with ASCII) aids adoption
  • Character encoding is fundamental infrastructure
  • Ongoing maintenance required for living standards

Stakeholders and artefacts

Organisations

  • Unicode Consortiumstandards_bodyDeveloped and maintains standard
  • ISO/IECstandards_bodyISO 10646 synchronisation

Individuals

  • Joe BeckerCo-founder, XeroxCo-created Unicode
  • Lee CollinsCo-founder, AppleCo-created Unicode
  • Mark DavisCo-founder, AppleCo-created Unicode, president of Consortium

Artefacts

  • UnicodespecificationUniversal character encoding standard
  • UTF-8specificationVariable-width encoding for Unicode
  • Code PointspecificationUnique identifier for each Unicode character

Key terms

UnicodeUTF-8character encodinginternationalisationcode point

On this course

Read in the path Standards Bodies: How Technology Gets Standardised.

Sources

1"The Unicode Standard: A Technical Introduction". Unicode Consortium, 2024.authoritativewww.unicode.org/standard/principles.html