This is a continuation of the story about the Unicode standard. Read The ASCII table and The 8th bit if you haven’t already.


Last time I finished with a horrible story of overlapping encoding standards, and a general confusion in the software world. It was a world where a French scientists was unable to talk with a German friend.

The hero who solved all of the problems was the Unicode standard. It started with a simple idea. Collect all the symbols, create a big table, and assign a number to each of them. Oh, and try to be compatible with existing standards as much as possible.

When I said all the symbols, I really meant all the symbols. Music notes, Klingon and Emoji charactes are only some bizzare examples. Collecting all these symbols is much easier said than done, so there exists a consortium a.k.a the Unicode Consortium who is responsible for the whole standard.


There are various ways to represent Unicode characters in different contexts. Here is a list of common representations for the symbol below


Screenshot from 2014-10-09 01:00:50.png

 Interesting facts

And for the end here is the list of all the Unicode characters.

Happy encoding!


Now read this

The ASCII table

Originally I wanted to write about Unicode, but I have realized that it is better to start out with the basics, and you can’t get more basic than the ASCII table. As computers can only represent two kind of values — true or false — some... Continue →