As part of the High Plane Unicode Mapping thread viewtopic.php?f=3&t=2527 I produced a Unicode Text Document from WordPad.
I remember that some years ago I saved a Unicode Text Document from WordPad and managed to produce a hexadecimal dump of the bytes in the file, which I could then study so as to understand how WordPad is storing the characters in the file.
I would like to study the bytes in the attached file to try to determine how the characters are stored in the file.
Unfortunately i cannot remember how I obtained the hexadecimal dump of the file.
Could someone suggest a method to produce a hexadecimal dump of the file please?
William Overington
2 January 2009
Obtaining a hexadecimal dump of a Unicode Text Document
-
- Moderator
- Posts: 11155
- Joined: Fri Oct 04, 2002 12:41 am
- Location: Bilthoven, The Netherlands
- Contact:
Re: Obtaining a hexadecimal dump of a Unicode Text Document
Here is a hex representation of your file that contains 16 bytes:William wrote:Could someone suggest a method to produce a hexadecimal dump of the file please?
Code: Select all
FFFE310032003300 34003500B8DB0DDC ÿþ1 2 3 4 5 ¸Û Ü
-
- Top Typographer
- Posts: 2038
- Joined: Tue Sep 14, 2004 6:41 pm
- Location: Worcestershire, England
- Contact:
Re: Obtaining a hexadecimal dump of a Unicode Text Document
Thank you.
William Overington
2 January 2009
William Overington
2 January 2009
-
- Moderator
- Posts: 11155
- Joined: Fri Oct 04, 2002 12:41 am
- Location: Bilthoven, The Netherlands
- Contact:
Re: Obtaining a hexadecimal dump of a Unicode Text Document
You're welcome.
The first two bytes are a byte-order mark. 0xFF 0xFE indicates the file has a UTF-16 (little-endian order) encoding.
The first two bytes are a byte-order mark. 0xFF 0xFE indicates the file has a UTF-16 (little-endian order) encoding.