Thank you Richard for the font handling hints. I hope that you or somebody else will get the existing Tirhuta font to be Unicode conformant, ready for the keyboard layouts that will inevitably exist at some future stage of history (if relevant) for Maithili native speakers and scholars all around the world. Personally the font issue is far beyond of what I can do, given that I’ve never programmed the least standalone software, let alone a compiler. I’m just modifying keyboard driver C sources to some small extent. Here is an idea of what I’ve been able to do in the meantime:
Iʼve taken the Hindi Traditional Windows keyboard layout, and tried to get Excel match all Tirhuta characters thereupon, on the basis of the Unicode character names. As Richard W expected from the roadmap on, there are unsupported characters in Tirhuta, that do not match to any Devanagari character as present on the Hindi Traditional keyboard layout. As William suggested, Iʼll now post character lists, even if not complete ones, only the ones that are non-obvious to me.
The Tirhuta characters not matching Hindi Traditional Devanagari are the following twelve:
11480 TIRHUTA ANJI
11488 TIRHUTA LETTER VOCALIC RR
11489 TIRHUTA LETTER VOCALIC L
1148A TIRHUTA LETTER VOCALIC LL
114B6 TIRHUTA VOWEL SIGN VOCALIC RR
114B7 TIRHUTA VOWEL SIGN VOCALIC L
114B8 TIRHUTA VOWEL SIGN VOCALIC LL
114BA TIRHUTA VOWEL SIGN SHORT E
114BD TIRHUTA VOWEL SIGN SHORT O
114C4 TIRHUTA SIGN AVAGRAHA
114C5 TIRHUTA GVANG
114C6 TIRHUTA ABBREVIATION SIGN
On the other hand, there are Devanagari characters on the Hindi Traditional Windows keyboard layout that do not match any Tirhuta character. They are listed here:
0949 DEVANAGARI VOWEL SIGN CANDRA O
090D DEVANAGARI LETTER CANDRA E
0945 DEVANAGARI VOWEL SIGN CANDRA E
0911 DEVANAGARI LETTER CANDRA O
0931 DEVANAGARI LETTER RRA
0933 DEVANAGARI LETTER LLA
095F DEVANAGARI LETTER YYA
Now I guess that TIRHUTA LETTER VOCALIC LL and TIRHUTA LETTER VOCALIC RR might match DEVANAGARI LETTER LLA and DEVANAGARI LETTER RRA, which are in Shift of DEVANAGARI LETTER LA and DEVANAGARI LETTER RA on N and J keys, while AltGr places are left empty. So TIRHUTA VOWEL SIGN VOCALIC LL and TIRHUTA VOWEL SIGN VOCALIC RR in turn might take place in the AltGr shift state on these keys. After that, eight Tirhuta characters are left.
The problem with TIRHUTA LETTER VOCALIC L, that prevents it from being treated in the same manner, is that there is also a TIRHUTA LETTER LA, that matches DEVANAGARI LETTER LA, which is in the Base shift state where LLA is in Shift, that is on the N key. There is no other key with an L on Hindi Traditional. However, the key B to the left of LA and LLA is empty in Shift, so that we can put TIRHUTA LETTER VOCALIC L there and TIRHUTA VOWEL SIGN VOCALIC L above in AltGr, correspondingly to the above. After that, half of the unsupported Tirhuta characters are still left:
11480 TIRHUTA ANJI
114BA TIRHUTA VOWEL SIGN SHORT E
114BD TIRHUTA VOWEL SIGN SHORT O
114C4 TIRHUTA SIGN AVAGRAHA
114C5 TIRHUTA GVANG
114C6 TIRHUTA ABBREVIATION SIGN
TIRHUTA VOWEL SIGN SHORT E and TIRHUTA VOWEL SIGN SHORT O can be placed in AltGr on A and S keys above TIRHUTA VOWEL SIGN E and TIRHUTA VOWEL SIGN O which are in Base, and TIRHUTA LETTER E and TIRHUTA LETTER O which are in Shift, matching Devanagari counterparts there.
The TIRHUTA SIGN AVAGRAHA is not special of Tirhuta, as there is a DEVANAGARI SIGN AVAGRAHA too, encoded at U+093D, but not present on the Hindi Traditional keyboard layout as it ships with Windows. However, on the Windows Devanagari Inscript keyboard layout, it is in Shift+AltGr on the Period key, along with DEVANAGARI DOUBLE DANDA in AltGr, that is used in Tirhuta but not present on Hindi Traditional neither, which layout doubles the ASCII period and greater-than in these shift states.
The same applies to TIRHUTA ABBREVIATION SIGN, whose Devanagari version is U+0970 and is on AltGr+Comma on Devanagari Inscript. It can be mapped the same way on Tirhuta keyboard layout.
As for TIRHUTA ANJI by contrast, there is none in Devanagari, but well in Bengali, which script is mentioned as to be close to Tirhuta. But on none of the three Bengali layouts shipped with Windows, there is any U+0980 BENGALI ANJI.
And even more, the TIRHUTA GVANG is unique, as to date, there is no other gvang in any script encoded in Unicode.
Mapping these two characters needs some knowledge about their function and use, so that they may be placed on the appropriate key with respect to ASCII punctuation marks if applicable. There seems to be no ready information on the internet about them. Only based on their glyphic resemblance, I would place them on the same keys together with S (with respect to a stacked double long s) and W.
Then there is a list of characters borrowed from other scripts, additionally to danda and double danda, used in Tirhuta according to TUS §15.10:
09F4 BENGALI CURRENCY NUMERATOR ONE
09F5 BENGALI CURRENCY NUMERATOR TWO
09F6 BENGALI CURRENCY NUMERATOR THREE
09F7 BENGALI CURRENCY NUMERATOR FOUR
09F8 BENGALI CURRENCY NUMERATOR ONE LESS THAN THE DENOMINATOR
09F9 BENGALI CURRENCY DENOMINATOR SIXTEEN
1CF2 VEDIC SIGN ARDHAVISARGA
A830 NORTH INDIC FRACTION ONE QUARTER
A831 NORTH INDIC FRACTION ONE HALF
A832 NORTH INDIC FRACTION THREE QUARTERS
A833 NORTH INDIC FRACTION ONE SIXTEENTH
A834 NORTH INDIC FRACTION ONE EIGHTH
A835 NORTH INDIC FRACTION THREE SIXTEENTHS
A836 NORTH INDIC QUARTER MARK
A837 NORTH INDIC PLACEHOLDER MARK
A838 NORTH INDIC RUPEE MARK
A839 NORTH INDIC QUANTITY MARK
The first six are on the Bengali keyboard in AltGr and Shift+AltGr on A, S, and D keys. Thus there will be a collision with TIRHUTA VOWEL SIGN SHORT E and TIRHUTA VOWEL SIGN SHORT O on the first two keys. The AltGr shift state being mostly empty on letter keys, currency numerators and denominator might be either placed the same way as on Bengali but in the row above, on Q, W, and E keys, or shifted two keys to the right to end up on D, F, and G, or distributed over six keys in the row above, from Q through Y, all in AltGr (which would have been hard to achieve on the Bengali keyboard as there are already several AltGr key positions assigned).
VEDIC SIGN ARDHAVISARGA might be well mapped on the Minus key on which the visarga already is (in Shift), in AltGr because the minus sign there is already available in the Base shift state and has been doubled in AltGr to complete, by the principle of having in the two AltGr shift states the ASCII punctuations following the US-English layout.
To correctly map the North Indic number, quantity and currency signs, a Compose solution is generally the most appropriate. This does not require any dedicated Compose key, because Compose may technically be a simple dead key on whatever key position. This can be on AltGr+Space, because the ZWJ and ZWNJ are on the 1 and 2 keys on the Hindi layout, as Richard W already pointed, so that mapping them on the space bar is not required. This choice is an example of what should be subject to feedback in any case.
The problem with ligatures as a part of dead key sequences is resolved as soon as chained dead keys are available (like when .klc sources or C sources are edited and compiled by KbdUTool or directly by the compilers).
The Tirhuta digits, I suggest to place in the Base shift state, and ASCII digits in AltGr, not conversely as actually on the Hindi traditional layout. The clue is then to convert the unused CapsLock key to a KanaLock, which toggles the layout to a US-English one as currently used in India, following the method that I found yesterday on the KbdEdit web site, except that Tirhuta will be in Base, and English in Kana. If an English with CapsLock is desired (but improbably, given the amount of hate already accumulated on this toggle key by English users), English must be in Base, Tirhuta in Kana, and KanaLock on the uppermost leftmost key, that is not mapped in Hindi Traditional except for the ASCII grave and tilde characters, which will have to be remapped on AltGr+1 and AltGr+2, along with all AltGr key positions outside the letter keys, for some currency and technical characters, section and degree sign and so on.
The Numpad will be equally mapped on all eight shift states (which are ten in fact, if adding Ctrl and Shift+Ctrl, the latter of which is used in Hindi Traditional for ZWJ and ZWNJ). The choice there is between having Tirhuta in Base or in Shift, ASCII in Base or in Shift or in Kana only, and where to have super- and subscripts, hex letters lowercase and/or uppercase, and so on. Currency numerators and denominator may be on the numpad in AltGr shift states, or in Shift above Tirhuta digits, as an alternate solution avoiding dead keys (and the related Compose solution), which are said to be dreadful to many users who are not in the habits of using them. On the other hand, for a numpad without Tirhuta, I can apply the one I have available for French (and as far as the numpad is concerned, for universal Latin). That depends entirely on the expected usage.
But unlike what I supposed, I canʼt do it within one day, even when hoping not to spend one week, as I must finish the French keyboard layout within a useful delay too, and ethically as far as I am concerned—unlike most other people—this is prioritary in my work (however, ethics may turn out to show that I too should prioritize the Tirhuta keyboard layout, in which case I’ll conform).
Best,
Marcel