1

数週間前、私たちのウェブサイトにある古いリストに奇妙なキャラクターがいることに気付きました

二重引用符 (") と等しいこのような文字があります ?? のような文字があります

それらを修理したいのですが、方法がわかりません。

このコードを作成する場合、preg_replaceを使用するときから

$text = 'this is a listingTitle  with an encoding problem';
$test = preg_replace("/ÂÂ/",'"',$text);
echo$test;

$test には文字列にそのエンコーディング エラーがまだ含まれているため、preg_replace のようにエンコーディングの問題が表示されません。

これらのエンコーディングエラーを修復するために何ができるかについて考えている人はいますか? または、私が進めるべき方法はありますか

ありがとう

ここで編集------------------------------------------------ --

さて、私が今やろうとしているのは、各文字を utf8 に戻すことです。何らかの理由で、私の文字列にはいくつかのエンコーディングが混在しています..

これが文字列と、文字列を構成する char の内訳です。

Milwaukee 2415-21 M12 コードレス リチウムイオン 3/8Â 直角ドリル/ドライバー キット w

そしてこれがブレイクダウンです

atChar[0] = 'M' encoding is = ASCII
atChar[1] = 'i' encoding is = ASCII
atChar[2] = 'l' encoding is = ASCII
atChar[3] = 'w' encoding is = ASCII
atChar[4] = 'a' encoding is = ASCII
atChar[5] = 'u' encoding is = ASCII
atChar[6] = 'k' encoding is = ASCII
atChar[7] = 'e' encoding is = ASCII
atChar[8] = 'e' encoding is = ASCII
atChar[9] = ' ' encoding is = ASCII
atChar[10] = '2' encoding is = ASCII
atChar[11] = '4' encoding is = ASCII
atChar[12] = '1' encoding is = ASCII
atChar[13] = '5' encoding is = ASCII
atChar[14] = '-' encoding is = ASCII
atChar[15] = '2' encoding is = ASCII
atChar[16] = '1' encoding is = ASCII
atChar[17] = ' ' encoding is = ASCII
atChar[18] = 'M' encoding is = ASCII
atChar[19] = '1' encoding is = ASCII
atChar[20] = '2' encoding is = ASCII
atChar[21] = ' ' encoding is = ASCII
atChar[22] = 'C' encoding is = ASCII
atChar[23] = 'o' encoding is = ASCII
atChar[24] = 'r' encoding is = ASCII
atChar[25] = 'd' encoding is = ASCII
atChar[26] = 'l' encoding is = ASCII
atChar[27] = 'e' encoding is = ASCII
atChar[28] = 's' encoding is = ASCII
atChar[29] = 's' encoding is = ASCII
atChar[30] = ' ' encoding is = ASCII
atChar[31] = 'L' encoding is = ASCII
atChar[32] = 'i' encoding is = ASCII
atChar[33] = 't' encoding is = ASCII
atChar[34] = 'h' encoding is = ASCII
atChar[35] = 'i' encoding is = ASCII
atChar[36] = 'u' encoding is = ASCII
atChar[37] = 'm' encoding is = ASCII
atChar[38] = '-' encoding is = ASCII
atChar[39] = 'I' encoding is = ASCII
atChar[40] = 'o' encoding is = ASCII
atChar[41] = 'n' encoding is = ASCII
atChar[42] = ' ' encoding is = ASCII
atChar[43] = '3' encoding is = ASCII
atChar[44] = '/' encoding is = ASCII
atChar[45] = '8' encoding is = ASCII
atChar[46] = 'Â' encoding is = UTF-8
atChar[47] = '' encoding is = 
atChar[48] = ' ' encoding is = ASCII
atChar[49] = 'R' encoding is = ASCII
atChar[50] = 'i' encoding is = ASCII
atChar[51] = 'g' encoding is = ASCII
atChar[52] = 'h' encoding is = ASCII
atChar[53] = 't' encoding is = ASCII
atChar[54] = ' ' encoding is = ASCII
atChar[55] = 'A' encoding is = ASCII
atChar[56] = 'n' encoding is = ASCII
atChar[57] = 'g' encoding is = ASCII
atChar[58] = 'l' encoding is = ASCII
atChar[59] = 'e' encoding is = ASCII
atChar[60] = ' ' encoding is = ASCII
atChar[61] = 'D' encoding is = ASCII
atChar[62] = 'r' encoding is = ASCII
atChar[63] = 'i' encoding is = ASCII
atChar[64] = 'l' encoding is = ASCII
atChar[65] = 'l' encoding is = ASCII
atChar[66] = '/' encoding is = ASCII
atChar[67] = 'D' encoding is = ASCII
atChar[68] = 'r' encoding is = ASCII
atChar[69] = 'i' encoding is = ASCII
atChar[70] = 'v' encoding is = ASCII
atChar[71] = 'e' encoding is = ASCII
atChar[72] = 'r' encoding is = ASCII
atChar[73] = ' ' encoding is = ASCII
atChar[74] = 'K' encoding is = ASCII
atChar[75] = 'i' encoding is = ASCII
atChar[76] = 't' encoding is = ASCII
atChar[77] = ' ' encoding is = ASCII
atChar[78] = 'w' encoding is = ASCII

私は今何ができますか?

4

2 に答える 2

11

githubで人気のあるエンコーディングを特徴とするこのPHPクラスエンコーディングを確認してください

使用法:

$utf8_string = Encoding::toUTF8($utf8_or_latin1_or_mixed_string);

$latin1_string = Encoding::toLatin1($utf8_or_latin1_or_mixed_string);

また:

$utf8_string = Encoding::fixUTF8($garbled_utf8_string);

例:

echo Encoding::fixUTF8("Fédération Camerounaise de Football");
echo Encoding::fixUTF8("FÃédÃération Camerounaise de Football");
echo Encoding::fixUTF8("FÃÃédÃÃération Camerounaise de Football");
echo Encoding::fixUTF8("FÃÃÃédÃÃÃération Camerounaise de Football");

出力します:

Fédération Camerounaise de Football
Fédération Camerounaise de Football
Fédération Camerounaise de Football
Fédération Camerounaise de Football

アップデート:

これを確認してください:(私はこれが機能したことを確認しました)

$output = 'this is a listingTitle  with an encoding problem';
$output = preg_replace('/[^(\x20-\x7F)]*/','', $output);
echo($output);

出力:

this is a listingTitle with an encoding problem.

出力画像:

ここに画像の説明を入力

于 2013-09-11T15:20:22.623 に答える