By default, short messages are sent using the GSM alphabet (GSM 03.38), defined by 3GPP TS 23.038. This setting can be changed by issuing the AT+CSMP
command (Set Text Mode Parameters; I won't examine it in depth in this answer).
More specifically, the 7-bit alphabet is used, so the device can pack 8 characters into 7 bytes (8 × 7 = 56 bits), saving "precious space" to send some more information: a 140-byte payload holds 160 characters.
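To make the packing concrete, here is a minimal sketch of how septets can be squeezed into octets, least significant bits first. The function name `pack_septets` is mine, not part of any modem API, and the example relies on the fact that plain lowercase letters have the same code in the GSM alphabet as in ASCII.

```python
def pack_septets(septets):
    """Pack 7-bit values into bytes, filling each byte from its low bits up."""
    bits = 0    # bit accumulator
    nbits = 0   # number of valid bits currently in the accumulator
    out = bytearray()
    for s in septets:
        bits |= (s & 0x7F) << nbits
        nbits += 7
        while nbits >= 8:
            out.append(bits & 0xFF)
            bits >>= 8
            nbits -= 8
    if nbits:
        out.append(bits & 0xFF)  # last partial byte, padded with zero bits
    return bytes(out)


# Assumption: a-z share their codes with ASCII, so ord() is good enough here.
packed = pack_septets([ord(c) for c in "hellohello"])
print(packed.hex().upper())  # E8329BFD4697D9EC37
```

Ten characters fit in nine bytes here, and any eight characters fit in exactly seven.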
This alphabet is a clever derivation of 7-bit ASCII. A set of codes that ASCII uses for "useless" characters (mostly control characters) is instead used for specific characters from alphabets such as Danish:
| Danish character | GSM 03.38 code |
|---|---|
| Ø | 0x0B |
| ø | 0x0C |
| Å | 0x0E |
| å | 0x0F |
| Æ | 0x1C |
| æ | 0x1D |
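As a rough illustration of the remapping, the sketch below converts text to GSM 03.38 codes for a deliberately small subset: the Danish letters from the table plus ASCII letters, digits and space, which keep their ASCII values. The names `DANISH_GSM_CODES` and `to_gsm_septets` are hypothetical helpers of mine; a real encoder needs the full table from 3GPP TS 23.038.

```python
# Codes for the Danish letters in the GSM 03.38 default alphabet.
# Note: Å is 0x0E; 0x0D is carriage return (CR) in this alphabet.
DANISH_GSM_CODES = {
    "Ø": 0x0B, "ø": 0x0C,
    "Å": 0x0E, "å": 0x0F,
    "Æ": 0x1C, "æ": 0x1D,
}


def to_gsm_septets(text):
    """Map text to GSM 03.38 codes (illustrative subset only)."""
    codes = []
    for ch in text:
        if ch in DANISH_GSM_CODES:
            codes.append(DANISH_GSM_CODES[ch])
        elif ch.isascii() and (ch.isalnum() or ch == " "):
            # ASCII letters, digits and space sit at the same position
            codes.append(ord(ch))
        else:
            raise ValueError(f"character {ch!r} is outside this sketch's subset")
    return codes


print([hex(c) for c in to_gsm_septets("rød")])  # ['0x72', '0xc', '0x64']
```

Characters such as `@` or `$` are rejected on purpose, because their GSM codes differ from ASCII and are not covered here.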