MTL Toolbox: Difference between revisions

From Taioaan Wiki
Jump to navigation Jump to search
 
(6 intermediate revisions by the same user not shown)
Line 9: Line 9:


== How to search ==
== How to search ==
* try typing in or clicking these words ([[monosyllable]]s from [[Practical Taiwanese Conversation]]):
=== Typical usage ===
* input: Taiwanese word (often consists of two syllables [[tone sandhi|joined together]]), for example:
** {{x|køefcie}}
* the Toolbox "unjoins" words by database lookup
* the results from historical works should be about the same as for:
** {{x|køea cie}}
 
* more examples:
** {{x|chviafmng}}
** {{x|tøsia}}
** {{x|Taioaan}}
 
=== Monosyllable ===
* for a [[monosyllable]], exact matches are displayed by default, for example ([[Practical Taiwanese Conversation|PTC]]):
** {{x|goar}}
** {{x|goar}}
** {{x|lie}}
** {{x|lie}}
** {{x|ee}}
** {{x|ee}}


* for a monosyllable, exact matches are displayed by default
* to see more matching entries, click "Khahzøe"
* to see more matching entries, click "Khahzøe"
* try more than one syllable:
** {{x|chviar mng}}
** {{x|tøf sia}}
** {{x|køea cie}}
* try the following words (they consist of [[tone sandhi|joined syllables]]). results from non-MLT dictionaries should be about the same as respective unjoined syllables above
** {{x|chviafmng}}
** {{x|tøsia}}
** {{x|køefcie}}


== Data ==
== Data ==
Local copies of:
Local copies of:
* HTB: ''[[Hiexntai-buun Dictionary]]''
* HTB: ''[[Hiexntai-buun Dictionary]]''
* DFT: ''[[Dictionary of Frequently-Used Taiwan Minnan]]'' (originally [[TL]] with MTL annotations. We have annotated over 5800 definitions in English for monosyllables)
* DFT: ''[[Dictionary of Frequently-Used Taiwan Minnan]]'' (in [[TL]]. We added MTL annotations and annotated over 5800 definitions in English for monosyllables)
* MK: ''[[Maryknoll Taiwanese-English Dictionary]]'' (originally [[POJ]] with MTL annotations)
* MK: ''[[Maryknoll Taiwanese-English Dictionary]]'' (in [[POJ]]. We added MTL annotations)
* EDUTECH: [[Liim Keahioong]] (2001-2003) ''EDUTECH: Taiwanese-English Dictionary Searched with Concise Atonal Spelling'' (in [[MLT]] with [[Talk:Øe|unified spellings]] (øe))
* EDUTECH: [[Liim Keahioong]] (2001-2003) ''EDUTECH: Taiwanese-English Dictionary Searched with Concise Atonal Spelling'' (in [[MLT]] with [[Talk:Øe|unified spellings]] (øe))
* [[Bernard L.M. Embree|Embree, Bernard L. M.]] (1973). ''[[A Dictionary of Southern Min]]: based on current usage in Taiwan and checked against the earlier works of Carstairs Douglas, Thomas Barclay, and Ernest Tipson''. Hong Kong: Hong Kong Language Institute.
* [[Bernard L.M. Embree|Embree, Bernard L. M.]] (1973). ''[[A Dictionary of Southern Min]]: based on current usage in Taiwan and checked against the earlier works of Carstairs Douglas, Thomas Barclay, and Ernest Tipson''. Hong Kong: Hong Kong Language Institute. (in POJ. We added MLT annotations)
* Lim08: ''[[Tai-Nichi Daijiten]]'' translated into Taiwanese. (1931 & 1932, original uses Taiwanese kana, not POJ)
* Lim08: ''[[Tai-Nichi Daijiten]]'' translated into Taiwanese. (original 1931 & 1932, in [[Taioaan-guo kana|Taiwanese kana]]. Data in POJ. We added MLT annotations)


We also support searching other websites with conversion to POJ/TL:
We also support searching other websites with conversion to POJ/TL:
Line 40: Line 42:
* ''[[Taibuun/Hoabuun Svoarterng Sutiern]]''
* ''[[Taibuun/Hoabuun Svoarterng Sutiern]]''


== Technical Notes ==
== Technical notes ==
* [[SQLite]]: [https://sqlite.org/fts3.html FTS4] for full-text search
* [[SQLite]]: [https://sqlite.org/fts3.html FTS4] for full-text search
* Token prefix queries: use the asterisk ('*') at the end. Similar to {{w|wildcard character}} in [[zokgiap hexthorng|operating systems]] (normal wildcard search not currently supported by FTS)
* Token prefix queries: use the asterisk ('*') at the end. Similar to {{w|wildcard character}} in [[zokgiap hexthorng|operating systems]] (normal wildcard search not currently supported by FTS)
** Example: {{TE|Taioa*}}
** Example: {{x|Taioa*}}
* Specify a column-name followed by a colon (':')
* Specify a column-name followed by a colon (':')
** Example: {{dft|thj:頭*}} (you should see all DFT entries starting with this Harnji)
** Example: {{dft|thj:頭*}} (you should see all DFT entries starting with this Harnji)
Line 55: Line 57:
* The MTL Toolbox uses data from the ''[[Maryknoll Taiwanese-English Dictionary]]'', which was generously released to the public under a [https://creativecommons.org/licenses/by-nc-sa/3.0/ Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License].
* The MTL Toolbox uses data from the ''[[Maryknoll Taiwanese-English Dictionary]]'', which was generously released to the public under a [https://creativecommons.org/licenses/by-nc-sa/3.0/ Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License].


[[Category:POJ]]
[[Category:Peqoexji]]
[[Category:MTL]]
[[Category:Modern Literal Taiwanese]]
[[Category:Modern Taiwanese Language]]

Latest revision as of 16:02, 21 April 2024

MTL Toolbox (https://learntaiwanese.org/MTLtoolbox/about.html) is software and data to help work with written Taiwanese using the MTL writing system and other romanizations for Taiwanese.

Features

How to search

Typical usage

  • input: Taiwanese word (often consists of two syllables joined together), for example:
  • the Toolbox "unjoins" words by database lookup
  • the results from historical works should be about the same as for:

Monosyllable

  • to see more matching entries, click "Khahzøe"

Data

Local copies of:

We also support searching other websites with conversion to POJ/TL:

Technical notes

  • SQLite: FTS4 for full-text search
  • Token prefix queries: use the asterisk ('*') at the end. Similar to wildcard character in operating systems (normal wildcard search not currently supported by FTS)
  • Specify a column-name followed by a colon (':')
    • Example: thj:頭* (you should see all DFT entries starting with this Harnji)
  • Add carat ^ before token to require token to be very first token in its column

See also

Acknowledgements