As ever, it wasn't until I had a real necessity for such a program a few days ago that I got around to writing a brand new version.
- Thaana Transcoder 1.0 Installer (184KB, MS Windows) - Thaana Transcoder 1.0 Executable only (78.8KB, MS Windows) Enjoy Latin Thaana Converter is a small, simple software for Microsoft Windows that performs transliteration on latinized (i.e.
This new release carries a new name (which I think is a more technically correct name for what it does) and sports a few aesthetic changes but is functionally almost exactly the same as the original - it is basically a recompile of my old code within the . Automated transliteration of Latin Thaana is not an entirely easy task.
Look up table based algorithms are simple to implement but are unable to correctly handle cases of sukun, present issues with most other fili and generally have a host of other problems as well.
Latin Thaana Converter utilizes a finite state machine and its transliteration mappings are based on a more extensive scheme extracted from an analysis of a body of Latin Thaana-to-Thaana sample data.
It maybe worth mentioning that the analysis had revealed that upto 4 characters were being used (and needed) for some Thaana transliterations.
However, it must be said that the quality of transliteration from this is limited by the accuracy and diversity of the sample data I had used and hence is by no means perfect. The converted text appears in the "Text in Thaana" box.
Since writing this program in 2003, I have experimented with probabilistic FSMs and also put machine learning techniques to the task with better results. - Latin Thaana Converter 2.0 Installer (126KB, MS Windows) - Latin Thaana Converter 2.0 Executable only (22.8KB, MS Windows) Hope someone finds it useful Immi has amalgamated some code for Dhivehi document converting from Hassan and myself, to make a nifty little utility to convert from either MLS format or Accent format into more universal document formats such as Rich Text.
I plan to write more extensively on Thaana transliteration algorithms at a later time... Copy-paste or type the Latin Thaana text into the "Text in Latin Thaana" box. The tool is available for download from the Technova website.
The utility is a bit rough at the moment and notably has Unicode support left out.
I received two feature requests in the email since I released the Thaana Transcoder late last month and both asked for the same thing; Unicode input text transcoding to Accent compatible ASCII output.
It just so happened that the feature requested was something that I had actually programmed in, more or less, but left out in the public 1.0 release because I wasn't sure anyone would need it.
So here is a new release with the said feature included and also bundles in a minor bug fix.