Obtaining a hexadecimal dump of a Unicode Text Document

Post by **William** » Fri Jan 02, 2009 10:03 am

As part of the High Plane Unicode Mapping thread (viewtopic.php?f=3&t=2527), I produced a Unicode Text Document from WordPad.

Attachment: 12345rainbow.txt (16 bytes)

I remember that some years ago I saved a Unicode Text Document from WordPad and managed to produce a hexadecimal dump of the bytes in the file. I could then study the dump to understand how WordPad stores the characters in the file.

I would like to study the bytes in the attached file to try to determine how the characters are stored in the file.

Unfortunately, I cannot remember how I obtained the hexadecimal dump of the file.

Could someone suggest a method to produce a hexadecimal dump of the file please?

William Overington

2 January 2009

Post by **Erwin Denissen** » Fri Jan 02, 2009 11:10 am

William wrote: Could someone suggest a method to produce a hexadecimal dump of the file please?

Here is a hex representation of your file, which contains 16 bytes:

FFFE310032003300 34003500B8DB0DDC ÿþ1 2 3 4 5 ¸Û Ü

You can use this utility to view a file in its hexadecimal representation:

Attachment: ViewHex.zip (245.72 KiB)
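
For anyone without the utility to hand, a dump like the one above can also be produced with a short script. The following is only a minimal sketch in Python and is not how ViewHex works internally. The function name `hex_dump` and the output layout (offset, hex bytes, then printable ASCII with a dot for everything else) are assumptions chosen for illustration. The sample data is the 16 bytes quoted above.

```python
def hex_dump(data: bytes, width: int = 16) -> str:
    """Return a hex dump of data: offset, hex bytes, printable ASCII."""
    lines = []
    for offset in range(0, len(data), width):
        chunk = data[offset:offset + width]
        # Two uppercase hex digits per byte, separated by spaces.
        hex_part = " ".join(f"{b:02X}" for b in chunk)
        # Show printable ASCII as-is and replace every other byte with a dot.
        text_part = "".join(chr(b) if 32 <= b < 127 else "." for b in chunk)
        # Pad the hex column so the text column lines up on a short last row.
        lines.append(f"{offset:08X}  {hex_part:<{width * 3 - 1}}  {text_part}")
    return "\n".join(lines)


# The 16 bytes of the file as quoted in this thread.
sample = bytes.fromhex("FFFE310032003300" "34003500B8DB0DDC")
print(hex_dump(sample))
# 00000000  FF FE 31 00 32 00 33 00 34 00 35 00 B8 DB 0D DC  ..1.2.3.4.5.....
```

To dump a real file, read it in binary mode first, for example `open("12345rainbow.txt", "rb").read()`, and pass the result to `hex_dump`.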

Post by **William** » Fri Jan 02, 2009 12:23 pm

Thank you.

William Overington

2 January 2009

Post by **Erwin Denissen** » Fri Jan 02, 2009 12:45 pm

You're welcome.

The first two bytes are a byte-order mark. The sequence 0xFF 0xFE indicates that the file is encoded as UTF-16 in little-endian byte order.
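
The effect of the byte-order mark can be checked by decoding the 16 bytes listed earlier in this thread. The sketch below is an illustration in Python, not a description of what WordPad does; the variable names are hypothetical. It uses the BOM to pick the byte order, decodes the remaining bytes as UTF-16, and lists the resulting code points. The last four bytes, B8 DB 0D DC, read in little-endian order as the surrogate pair 0xDBB8 0xDC0D, which combines into a single code point above U+FFFF.

```python
import codecs

# The 16 bytes of the file as quoted in this thread.
data = bytes.fromhex("FFFE31003200330034003500B8DB0DDC")

# Use the byte-order mark to choose the byte order.
if data.startswith(codecs.BOM_UTF16_LE):      # FF FE
    encoding = "utf-16-le"
elif data.startswith(codecs.BOM_UTF16_BE):    # FE FF
    encoding = "utf-16-be"
else:
    raise ValueError("no UTF-16 byte-order mark found")

# Skip the 2-byte BOM and decode the rest.
text = data[2:].decode(encoding)
code_points = [f"U+{ord(ch):04X}" for ch in text]
print(encoding, code_points)
# utf-16-le ['U+0031', 'U+0032', 'U+0033', 'U+0034', 'U+0035', 'U+FE00D']

# The same surrogate-pair arithmetic done by hand.
high, low = 0xDBB8, 0xDC0D
combined = 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)
print(f"U+{combined:04X}")
# U+FE00D
```

So the file holds the digits 1 to 5 followed by one character from Plane 15, which is a private use plane.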
