java - Changing the default encoding for String(byte[])

Question

Is there a way to change the encoding used by the String(byte[]) constructor ?

In my own code I use String(byte[],String) to specify the encoding but I am using an external library that I cannot change.

String src = "with accents: é à";
byte[] bytes = src.getBytes("UTF-8");
System.out.println("UTF-8 decoded: "+new String(bytes,"UTF-8"));
System.out.println("Default decoded: "+new String(bytes));

The output for this is :

UTF-8 decoded: with accents: é à
Default decoded: with accents: Ã© Ã

I have tried changing the system property file.encoding but it does not work.

score 7 · Accepted Answer

JVM を起動する前にロケールを変更する必要があります。見る：

Java、バグ ID 4163515

一部の場所では、JVM の起動時に file.encoding 変数を設定することでこれを行うことができることを暗示しているようです。

java -Dfile.encoding=UTF-8 ...

...しかし、私はこれを自分で試していません。最も安全な方法は、オペレーティングシステムで環境変数を設定することです。

score 1 · Accepted Answer

defaultCharset()から引用

デフォルトの文字セットは、仮想マシンの起動時に決定され、通常は基盤となるオペレーティングシステムのロケールと文字セットに依存します。

ほとんどの OS では、環境変数を使用して文字セットを設定できます。

score 1 · Accepted Answer

これが必要だと思います: System.setProperty("file.encoding", "UTF-8");

いくつかの問題は解決しましたが、まだ別の問題があります。SO が ISO-8859-1 の場合、文字「í」と「Í」は正しく変換されません。起動時にJVMオプションを使用するだけで解決します。現在、NetBeans IDE の Java コンソールだけが、特殊文字を表示するときに文字セットをクラッシュさせています。

java - Changing the default encoding for String(byte[])

3 に答える 3

Related

Reference