unicode_text/example.txt

135 lines
2.8 KiB
Plaintext
Raw Blame History

This file contains invisible Unicode characters

This file contains invisible Unicode characters that are indistinguishable to humans but may be processed differently by a computer. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

Azeri:
Zəfər, jaketini də papağını da götür,
bu axşam hava çox soyuq olacaq.
Chinese:
子曰:「學而時習之,不亦說乎?
有朋自遠方來,不亦樂乎?
人不知而不慍,不亦君子乎?」
Czech:
Příliš žluťoučký kůň úpěl ďábelské ódy.
Danish:
Høj bly gom vandt fræk sexquiz på wc.
Finnish:
Törkylempijävongahdus.
Greek:
Ξεσκεπάζω τὴν ψυχοφθόρα βδελυγμία
German (non-ASCII letters):
Heizölrückstoßabdämpfung
Hebrew (does not work yet):
נקודה מודגשת
Icelandic:
Kæmi ný öxi hér, ykist þjófum nú bæði
víl og ádrepa.
Irish:
d'Ith cat mór dubh na héisc lofa go pras
Russian:
В чащах юга жил бы цитрус?
Да, но фальшивый экземпляр!
Hungarian:
Árvíztűrő tükörfúrógép
Hiragana:
いろはにほへとちりぬるを
わかよたれそつねならむ
うゐのおくやまけふこえて
あさきゆめみしゑひもせす
Katakana:
イロハニホヘト チリヌルヲ
ワカヨタレソ ツネナラム
ウヰノオクヤマ ケフコエテ
アサキユメミシ ヱヒモセスン
Georgian:
გთხოვთ ახლავე გაიაროთ რეგისტრაცია
Ethiopic:
ሰማይ አይታረስ ንጉሥ አይከሰስ።
Runic:
ᚻᛖ ᚳᚹᚫᚦ ᚦᚫᛏ ᚻᛖ ᛒᚢᛞᛖ ᚩᚾ ᚦᚫᛗ ᛚᚪᚾᛞᛖ
Various latin alphabet variants:
𝐛𝐨𝐥𝐝 𝖋𝖗𝖆𝖐𝖙𝖚𝖗 𝒊𝒕𝒂𝒍𝒊𝒄 𝓼𝓬𝓻𝓲𝓹𝓽
𝕕𝕠𝕦𝕓𝕝𝕖-𝕤𝕥𝕣𝕦𝕔𝕜 𝚖𝚘𝚗𝚘𝚜𝚙𝚊𝚌𝚎
 
Emoji:
🌀 🌁 🌂 🌃 🌄 🌅 🌆 🌇
🐀 🐁 🐂 🐃 🐄 🐅 🐆 🐇
🔀 🔁 🔂 🔃 🔄 🔅 🔆 🔇
😀 😁 😂 😃 😄 😅 😆 😇
Line breaking:
U+000A LINE FEED
LINE 1
LINE 2
U+000A LINE FEED
LINE 1
LINE 2
U+000D CARRIAGE RETURN
U+000A LINE FEED
LINE 1
LINE 2
U+0085 NEXT LINE
LINE 1… LINE 2
U+2029 PARAGRAPH SEPARATOR
LINE 1 LINE 2
Horizontal tabulator:
1 2 3 4 5 6
t a b
tt aa bb
ttt aaa bbb
tttt aaaa bbbb
ttttt aaaaa bbbbb
tttttt aaaaaa bbbbbb
Combining characters:
NFC: à é î õ ū
NFD: à é î õ ū
U+FF17 FULLWIDTH DIGIT SEVEN
U+20DE COMBINING ENCLOSING SQUARE
7⃞
U+01B5 LATIN CAPITAL LETTER Z WITH STROKE
U+0327 COMBINING CEDILLA
U+0308 COMBINING DIAERESIS
Ƶ̧̈
U+25CC DOTTED CIRCLE
U+0EB5 LAO VOWEL SIGN II
◌ີ
U+00A0 NO-BREAK-SPACE
U+0EB5 LAO VOWEL SIGN II
U+0EC8 LAO TONE MAI EK
 ີ່
U+0E81 LAO LETTER KO
U+0EB3 LAO VOWEL SIGN AM
ກຳ
Thai:
พ่อขุนรามคำแหงมหาราช
Mayalayam:
ബ്qരഹ്മപുരത്തേക്ക്