<?php
$examples = array(
'Valid ASCII' => "a",
'Valid 2 Octet Sequence' => "\xc3\xb1",
'Invalid 2 Octet Sequence' => "\xc3\x28",
'Invalid Sequence Identifier' => "\xa0\xa1",
'Valid 3 Octet Sequence' => "\xe2\x82\xa1",
'Invalid 3 Octet Sequence (in 2nd Octet)' => "\xe2\x28\xa1",
'Invalid 3 Octet Sequence (in 3rd Octet)' => "\xe2\x82\x28",
'Valid 4 Octet Sequence' => "\xf0\x90\x8c\xbc",
'Invalid 4 Octet Sequence (in 2nd Octet)' => "\xf0\x28\x8c\xbc",
'Invalid 4 Octet Sequence (in 3rd Octet)' => "\xf0\x90\x28\xbc",
'Invalid 4 Octet Sequence (in 4th Octet)' => "\xf0\x28\x8c\x28",
'Valid 5 Octet Sequence (but not Unicode!)' => "\xf8\xa1\xa1\xa1\xa1",
'Valid 6 Octet Sequence (but not Unicode!)' => "\xfc\xa1\xa1\xa1\xa1\xa1",
);
mb_substitute_character(0x3013);
foreach ($examples as $k => $v) {
echo "{$k}\n";
echo 'raw: ' . $v . "\n";
echo 'default: ' . mb_convert_encoding($v, 'UTF-8', 'UTF-8') . "\n";
echo "\n";
}
- Output for 5.4.0 - 5.4.45, 5.5.0 - 5.5.38, 5.6.0 - 5.6.28, 7.0.0 - 7.0.20, 7.1.0 - 7.1.10, 7.2.0 - 7.2.33, 7.3.12 - 7.3.33, 7.4.0 - 7.4.33, 8.0.0 - 8.0.30, 8.1.0 - 8.1.28, 8.2.0 - 8.2.18, 8.3.0 - 8.3.6
- Valid ASCII
raw: a
default: a
Valid 2 Octet Sequence
raw: ñ
default: ñ
Invalid 2 Octet Sequence
raw: �(
default: 〓(
Invalid Sequence Identifier
raw: ��
default: 〓〓
Valid 3 Octet Sequence
raw: ₡
default: ₡
Invalid 3 Octet Sequence (in 2nd Octet)
raw: �(�
default: 〓(〓
Invalid 3 Octet Sequence (in 3rd Octet)
raw: �(
default: 〓(
Valid 4 Octet Sequence
raw: 𐌼
default: 𐌼
Invalid 4 Octet Sequence (in 2nd Octet)
raw: �(��
default: 〓(〓〓
Invalid 4 Octet Sequence (in 3rd Octet)
raw: �(�
default: 〓(〓
Invalid 4 Octet Sequence (in 4th Octet)
raw: �(�(
default: 〓(〓(
Valid 5 Octet Sequence (but not Unicode!)
raw: �����
default: 〓〓〓〓〓
Valid 6 Octet Sequence (but not Unicode!)
raw: ������
default: 〓〓〓〓〓〓
- Output for 4.3.0 - 4.3.11, 4.4.0 - 4.4.9, 5.0.0 - 5.0.5, 5.1.0 - 5.1.6, 5.2.0 - 5.2.17, 5.3.0 - 5.3.29
- Valid ASCII
raw: a
default: a
Valid 2 Octet Sequence
raw: ñ
default: ñ
Invalid 2 Octet Sequence
raw: �(
default: (
Invalid Sequence Identifier
raw: ��
default:
Valid 3 Octet Sequence
raw: ₡
default: ₡
Invalid 3 Octet Sequence (in 2nd Octet)
raw: �(�
default: (
Invalid 3 Octet Sequence (in 3rd Octet)
raw: �(
default: (
Valid 4 Octet Sequence
raw: 𐌼
default: 𐌼
Invalid 4 Octet Sequence (in 2nd Octet)
raw: �(��
default: (
Invalid 4 Octet Sequence (in 3rd Octet)
raw: �(�
default: (
Invalid 4 Octet Sequence (in 4th Octet)
raw: �(�(
default: ((
Valid 5 Octet Sequence (but not Unicode!)
raw: �����
default: �����
Valid 6 Octet Sequence (but not Unicode!)
raw: ������
default: ������
preferences:
185.32 ms | 404 KiB | 353 Q