Page 1 of 1

Obtaining a hexadecimal dump of a Unicode Text Document

Posted: Fri Jan 02, 2009 10:03 am
by William
As part of the High Plane Unicode Mapping thread viewtopic.php?f=3&t=2527 I produced a Unicode Text Document from WordPad.
12345rainbow.txt
(16 Bytes) Downloaded 558 times
I remember that some years ago I saved a Unicode Text Document from WordPad and managed to produce a hexadecimal dump of the bytes in the file, which I could then study so as to understand how WordPad is storing the characters in the file.

I would like to study the bytes in the attached file to try to determine how the characters are stored in the file.

Unfortunately i cannot remember how I obtained the hexadecimal dump of the file.

Could someone suggest a method to produce a hexadecimal dump of the file please?

William Overington

2 January 2009

Re: Obtaining a hexadecimal dump of a Unicode Text Document

Posted: Fri Jan 02, 2009 11:10 am
by Erwin Denissen
William wrote:Could someone suggest a method to produce a hexadecimal dump of the file please?
Here is a hex representation of your file that contains 16 bytes:

Code: Select all

FFFE310032003300 34003500B8DB0DDC ÿþ1 2 3 4 5 ¸Û Ü
You can use this utility to view a file in it's hexadecimal presentation:
ViewHex.zip
(245.72 KiB) Downloaded 600 times

Re: Obtaining a hexadecimal dump of a Unicode Text Document

Posted: Fri Jan 02, 2009 12:23 pm
by William
Thank you.

William Overington

2 January 2009

Re: Obtaining a hexadecimal dump of a Unicode Text Document

Posted: Fri Jan 02, 2009 12:45 pm
by Erwin Denissen
You're welcome.

The first two bytes are a byte-order mark. 0xFF 0xFE indicates the file has a UTF-16 (little-endian order) encoding.