MTL Toolbox: Difference between revisions

From Taioaan Wiki
Jump to navigation Jump to search
(→‎Dictionaries (Jixtiern): il MTL Explainer)
(78 intermediate revisions by the same user not shown)
Line 1: Line 1:
'''MTL Toolbox''' is a set of scripts written in PHP that aid in understanding of [[Modern Taiwanese Language]] (MTL) and researching Taiwanese vocabulary ([[gwsuu]]).  
'''MTL Toolbox''' (https://learntaiwanese.org/MTLtoolbox/about.html) is software and data to help work with written Taiwanese using the [[MTL]] writing system and other romanizations for Taiwanese.


==Dictionaries (Jixtiern)==
== Features ==
*'''Modern Taiwanese-English Dictionary''' {{TE|}} - a search engine for the [[Hiexntai-buun Dictionary]]. It accepts MTL, English and Harnji queries. Links to [[MTL Explainer]], MoE audio, plus text-to-speech (TTS).
* six Taiwanese dictionaries spanning from [[Taioaan Jidpurn-sitai|Japanese era]] to present day
* '''[http://learntaiwanese.org/MTLtoolbox/MTL_TaiChi.php Gateway to the Tw-Ch (Taihoa) Dictionary]''' - converts an MTL word to a [[POJ]] word, looks it up in the POJ [[Taibuun/Hoabuun Svoarterng Sutiern]], and converts results from POJ to MTL. Also accepts [[Harnji]] for Taiwanese and [[Mandarin]] as search terms. Big5 encoding.
* full-text search engine accepts written Taiwanese as well as English, and ''[[Harnji]]''
* audio from government-compiled dictionary: [[Dictionary of Frequently-Used Taiwan Minnan|DFT]]
* word unjoiner to aid learning and searching at syllable level
* ''Seven Tones'' soundboard: [[table of all MLT finals]] with examples


==MTL learning tools==
== How to search ==
*'''[http://learntaiwanese.org/MTLtoolbox/POJvsMTLdemo.php POJ vs MTL Demonstrator]'''
* We describe use of "Taiwanese–English dictionaries full-text search"
**Converts a POJ UTF-8 word into POJ ASCII, [[MTLN]], [[MTLP]], and MTL formats.
**Can decompose an MTL word in the reverse direction.
*'''MTL Word Details''' {{x|}} - explains the syllables within MTL word and where [[tone sandhi]] occurs.
*'''MTL Text to Speech (TTS)''' {{tts|}} - simply paste an MTL sentence after the "q=" and listen right away!


== MTL/POJ Conversion ==
=== Typical usage ===
* '''[http://learntaiwanese.org/MTLtoolbox/POJUtexttoMTL.php POJ to MTL Converter]''' - converts [[POJ]] sentences (in UTF-8 encoding) to MTL.
* input: Taiwanese word (often consists of two syllables [[tone sandhi|joined together]]), for example:
* '''[http://learntaiwanese.org/MTLtoolbox/MTLtexttoPOJN.php MTL to POJ Converter]''' - converts MTL sentences to [[POJ]] in ASCII format.
** {{x|køefcie}}
* the Toolbox "unjoins" words by database lookup
* the results from historical works should be about the same as for:
** {{x|køea cie}}


==See Also==
* more examples:
*[[Taiwanese-English Dictionary]]
** {{x|chviafmng}}
** {{x|tøsia}}
** {{x|Taioaan}}
 
=== Monosyllable ===
* for a [[monosyllable]], exact matches are displayed by default, for example ([[Practical Taiwanese Conversation|PTC]]):
** {{x|goar}}
** {{x|lie}}
** {{x|ee}}
 
* "Monosyllable mode" normally allows only monosyllable results. To see more entries with this syllable, click "Khahzøe"
 
* if the syllable is a DFT monosyllable, a navigation bar displays adjacent DFT monosyllables in alphabetical order
 
== Data ==
Local copies of:
* HTB: ''[[Hiexntai-buun Dictionary]]''
* DFT: ''[[Dictionary of Frequently-Used Taiwan Minnan]]'' (in [[TL]]. We added MTL annotations and annotated over 5800 definitions in English for monosyllables)
* MK: ''[[Maryknoll Taiwanese-English Dictionary]]'' (in [[POJ]]. We added MTL annotations)
* EDUTECH: [[Liim Keahioong]] (2001-2003) ''EDUTECH: Taiwanese-English Dictionary Searched with Concise Atonal Spelling'' (in [[MLT]] with [[Talk:Øe|unified spellings]] (øe))
* [[Bernard L.M. Embree|Embree, Bernard L. M.]] (1973). ''[[A Dictionary of Southern Min]]: based on current usage in Taiwan and checked against the earlier works of Carstairs Douglas, Thomas Barclay, and Ernest Tipson''. Hong Kong: Hong Kong Language Institute. (in POJ. We added MLT annotations)
* Lim08: ''[[Tai-Nichi Daijiten]]'' translated into Taiwanese. (original 1931 & 1932, in [[Taioaan-guo kana|Taiwanese kana]]. Data in POJ. We added MLT annotations)
 
We also support searching other websites with conversion to POJ/TL:
* Lim (2019): ''[[Tai-Nichi Daijiten]]''
* ''[[Taibuun/Hoabuun Svoarterng Sutiern]]''
 
== Technical notes ==
* [[SQLite]]: [https://sqlite.org/fts3.html FTS4] for full-text search
* Token prefix queries: use the asterisk ('*') at the end. Similar to {{w|wildcard character}} in [[zokgiap hexthorng|operating systems]] (normal wildcard search not currently supported by FTS)
** Example: {{x|Taioa*}}, {{x|臺*}}
* Specify a column-name followed by a colon (':')
** Example: {{x|thj:頭*}} (returns entries where Taiwanese written with Harnji begins with character for [[thaau]])
* Add carat ^ before token to require token to be very first token in its column
** Example: {{x|^thaau}}
 
== See also ==
* [[Taiwanese-English dictionaries]]


== Acknowledgements ==
== Acknowledgements ==
* The MTL Toolbox uses data from the [[Maryknoll Taiwanese-English Dictionary]], published by the {{w|Maryknoll}} Language Service Center in 2001. They have generously released their [http://www.taiwanesedictionary.org/Mkdictionary.xls dictionary] to the public under a [https://creativecommons.org/licenses/by-nc-sa/3.0/ Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License].
* The MTL Toolbox uses data from the ''[[Maryknoll Taiwanese-English Dictionary]]'', which was generously released to the public under a [https://creativecommons.org/licenses/by-nc-sa/3.0/ Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License].
* Speech synthesis is provided by the {{w|National Museum of Taiwan Literature}}.


[[Category:POJ]]
[[Category:Peqoexji]]
[[Category:MTL]]
[[Category:Modern Literal Taiwanese]]
[[Category:Modern Taiwanese Language]]

Revision as of 16:45, 23 June 2024

MTL Toolbox (https://learntaiwanese.org/MTLtoolbox/about.html) is software and data to help work with written Taiwanese using the MTL writing system and other romanizations for Taiwanese.

Features

  • six Taiwanese dictionaries spanning from Japanese era to present day
  • full-text search engine accepts written Taiwanese as well as English, and Harnji
  • audio from government-compiled dictionary: DFT
  • word unjoiner to aid learning and searching at syllable level
  • Seven Tones soundboard: table of all MLT finals with examples

How to search

  • We describe use of "Taiwanese–English dictionaries full-text search"

Typical usage

  • input: Taiwanese word (often consists of two syllables joined together), for example:
  • the Toolbox "unjoins" words by database lookup
  • the results from historical works should be about the same as for:

Monosyllable

  • "Monosyllable mode" normally allows only monosyllable results. To see more entries with this syllable, click "Khahzøe"
  • if the syllable is a DFT monosyllable, a navigation bar displays adjacent DFT monosyllables in alphabetical order

Data

Local copies of:

We also support searching other websites with conversion to POJ/TL:

Technical notes

  • SQLite: FTS4 for full-text search
  • Token prefix queries: use the asterisk ('*') at the end. Similar to wildcard character in operating systems (normal wildcard search not currently supported by FTS)
  • Specify a column-name followed by a colon (':')
    • Example: thj:頭* (returns entries where Taiwanese written with Harnji begins with character for thaau)
  • Add carat ^ before token to require token to be very first token in its column

See also

Acknowledgements