<?php
$examples = array(
'Valid ASCII' => "a",
'Valid 2 Octet Sequence' => "\xc3\xb1",
'Invalid 2 Octet Sequence' => "\xc3\x28",
'Invalid Sequence Identifier' => "\xa0\xa1",
'Valid 3 Octet Sequence' => "\xe2\x82\xa1",
'Invalid 3 Octet Sequence (in 2nd Octet)' => "\xe2\x28\xa1",
'Invalid 3 Octet Sequence (in 3rd Octet)' => "\xe2\x82\x28",
'Valid 4 Octet Sequence' => "\xf0\x90\x8c\xbc",
'Invalid 4 Octet Sequence (in 2nd Octet)' => "\xf0\x28\x8c\xbc",
'Invalid 4 Octet Sequence (in 3rd Octet)' => "\xf0\x90\x28\xbc",
'Invalid 4 Octet Sequence (in 4th Octet)' => "\xf0\x28\x8c\x28",
'Valid 5 Octet Sequence (but not Unicode!)' => "\xf8\xa1\xa1\xa1\xa1",
'Valid 6 Octet Sequence (but not Unicode!)' => "\xfc\xa1\xa1\xa1\xa1\xa1",
);
mb_substitute_character(0x3013);
foreach ($examples as $k => $v) {
echo "{$k}\n";
echo 'raw: ' . $v . '\n';
echo 'default: ' . mb_convert_encoding($v, 'UTF-8', 'UTF-8') . '\n';
echo "\n";
}
- Output for 5.4.0 - 5.4.45, 5.5.0 - 5.5.38, 5.6.0 - 5.6.40, 7.0.0 - 7.0.33, 7.1.0 - 7.1.33, 7.2.0 - 7.2.33, 7.3.0 - 7.3.33, 7.4.0 - 7.4.33, 8.0.0 - 8.0.30, 8.1.0 - 8.1.28, 8.2.0 - 8.2.18, 8.3.0 - 8.3.6
- Valid ASCII
raw: a\ndefault: a\n
Valid 2 Octet Sequence
raw: ñ\ndefault: ñ\n
Invalid 2 Octet Sequence
raw: �(\ndefault: 〓(\n
Invalid Sequence Identifier
raw: ��\ndefault: 〓〓\n
Valid 3 Octet Sequence
raw: ₡\ndefault: ₡\n
Invalid 3 Octet Sequence (in 2nd Octet)
raw: �(�\ndefault: 〓(〓\n
Invalid 3 Octet Sequence (in 3rd Octet)
raw: �(\ndefault: 〓(\n
Valid 4 Octet Sequence
raw: 𐌼\ndefault: 𐌼\n
Invalid 4 Octet Sequence (in 2nd Octet)
raw: �(��\ndefault: 〓(〓〓\n
Invalid 4 Octet Sequence (in 3rd Octet)
raw: �(�\ndefault: 〓(〓\n
Invalid 4 Octet Sequence (in 4th Octet)
raw: �(�(\ndefault: 〓(〓(\n
Valid 5 Octet Sequence (but not Unicode!)
raw: �����\ndefault: 〓〓〓〓〓\n
Valid 6 Octet Sequence (but not Unicode!)
raw: ������\ndefault: 〓〓〓〓〓〓\n
- Output for 4.3.0 - 4.3.11, 4.4.0 - 4.4.9, 5.0.0 - 5.0.5, 5.1.0 - 5.1.6, 5.2.0 - 5.2.17, 5.3.0 - 5.3.29
- Valid ASCII
raw: a\ndefault: a\n
Valid 2 Octet Sequence
raw: ñ\ndefault: ñ\n
Invalid 2 Octet Sequence
raw: �(\ndefault: (\n
Invalid Sequence Identifier
raw: ��\ndefault: \n
Valid 3 Octet Sequence
raw: ₡\ndefault: ₡\n
Invalid 3 Octet Sequence (in 2nd Octet)
raw: �(�\ndefault: (\n
Invalid 3 Octet Sequence (in 3rd Octet)
raw: �(\ndefault: (\n
Valid 4 Octet Sequence
raw: 𐌼\ndefault: 𐌼\n
Invalid 4 Octet Sequence (in 2nd Octet)
raw: �(��\ndefault: (\n
Invalid 4 Octet Sequence (in 3rd Octet)
raw: �(�\ndefault: (\n
Invalid 4 Octet Sequence (in 4th Octet)
raw: �(�(\ndefault: ((\n
Valid 5 Octet Sequence (but not Unicode!)
raw: �����\ndefault: �����\n
Valid 6 Octet Sequence (but not Unicode!)
raw: ������\ndefault: ������\n
preferences:
298.78 ms | 404 KiB | 459 Q