Get Multilingual Data from ByteBuffer

Question

I am receiving ByteBuffers in an UDP Java application.

Now the data in this ByteBuffer can be any string in any language or any special chars separated by zero .

I use following code to get Strings from it.

public String getString() {
byte[] remainingBytes = new byte[this.byteBuffer.remaining()];
this.byteBuffer.slice().get(remainingBytes);
String dataString = new String(remainingBytes);
int stringEnd = dataString.indexOf(0);

if(stringEnd == -1) {
    return null;
} else {
    dataString = dataString.substring(0, stringEnd);
    this.byteBuffer.position(this.byteBuffer.position() + dataString.getBytes().length + 1);

    return dataString;
}
}

These strings are stored in MySQL DB with everything set as UTF8 .

IF i run application in Windows then special chars like ® are displayed but chinese are not.

On adding VM argument -Dfile.encoding=UTF8 chinese are displayed but chars like ® are shown as ?? etc.

Please Help.

Edit:

Input Strings in UDP packet are variable-length byte field, encoded in UTF-8, terminated by 0x00

For JDBC also i use useUnicode=true&characterEncoding=UTF-8

Answer 1

Not sure, but dataString contains only data till this zero, because stringEnd shows on first zero postion but not behind.

dataString = dataString.substring(0, stringEnd+1);

or

char specChar = dataString.substring(stringEnd, stringEnd+1); and it should return only special character, but as I said in the biggining, not sure...

Answer 2

String dataString = new String(remainingBytes); is wrong. You should almost never do that. You should find out what encoding was used to put the bytes into the UDP packet, and use the same encoding on that line:

String dataString = new String(remainingBytes, encoding); // e.g. "UTF-8"

Edit: based on your updated question, encoding should be "UTF-8"

Get Multilingual Data from ByteBuffer

Question

2 answers

solution1
0 2012-08-16 12:04:04

solution2
0 ACCPTED 2012-08-16 12:06:05

Get Multilingual Data from ByteBuffer

Question

2 answers

solution1 0 2012-08-16 12:04:04

solution2 0 ACCPTED 2012-08-16 12:06:05

solution1
0 2012-08-16 12:04:04

solution2
0 ACCPTED 2012-08-16 12:06:05