mysql - What's the difference between utf8_unicode_ci and utf8mb4_0900_ai_ci

Question

Welcome To Ask or Share your Answers For Others

mysql - What's the difference between utf8_unicode_ci and utf8mb4_0900_ai_ci

1 Answer

深蓝 · Answer 1 · 2021-03-06T05:07:50+0000

The encoding is the same. That is, the bytes look the same.
The character set is different. utf8mb4 has more characters.
The collation (how comparisions are done) is different.
The perfomance is different, but it rarely matters.

utf8_unicode_ci implies the CHARACTER SET utf8, which includes only the 1-, 2-, and 3-byte UTF-8 characters. Hence it excludes most Emoji and some Chinese characters.

utf8mb4_unicode_ci implies the CHARACTER SET utf8mb4 is the corresponding COLLATION for the 4-byte CHARACTER SET utf8mb4.

The Unicode organization has been evolving the specification over the years. Here are the mappings from its "versions" to MySQL Collations:

4.0   _unicode_
5.20  _unicode_520_
9.0   _0900_

Most of the differences will be in areas that most people never encounter. One example: At some point, a change allowed Emoji to be distinguished and ordered in some manner.

The suffix (MySQL doc):

_bin      -- just compare the bits; don't consider case folding, accents, etc
_ci       -- explicitly case insensitive (A=a) and implicitly accent insensitive (a=á)
_ai_ci    -- explicitly case insensitive and accent insensitive
_as (etc) -- accent-sensitive (etc)

Performance:

_bin         -- simple, fast
_general_ci  -- fails to compare multiple letters; eg ss=?, so somewhat fast
...          -- slower
_900_        -- (8.0) much faster because of a rewrite

However: The speed of collation is usually the least of the performance issues in queries. INDEXes, JOINs, subqueries, table scans, etc are much more critical to performance.

Categories

mysql - What's the difference between utf8_unicode_ci and utf8mb4_0900_ai_ci

mysql - What's the difference between utf8_unicode_ci and utf8mb4_0900_ai_ci

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags