MTL Toolbox

From Taioaan Wiki

Latest revision as of 17:45, 23 June 2024

MTL Toolbox (https://learntaiwanese.org/MTLtoolbox/about.html) is software and data to help work with written Taiwanese using the MTL writing system and other romanizations for Taiwanese.

Features

  • six Taiwanese dictionaries spanning from the Japanese era to the present day
  • full-text search engine that accepts written Taiwanese as well as English and Harnji
  • audio from the government-compiled dictionary DFT
  • word unjoiner to aid learning and searching at syllable level
  • Seven Tones soundboard: table of all MLT finals with examples

How to search

  • This section describes how to use the "Taiwanese–English dictionaries full-text search"

Typical usage

  • input: a Taiwanese word (often two syllables joined together), for example:
  • the Toolbox "unjoins" words by database lookup
  • the results from historical works should be about the same as for:

Monosyllable

  • "Monosyllable mode" normally allows only monosyllable results. To see more entries with this syllable, click "Khahzøe"
  • if the syllable is a DFT monosyllable, a navigation bar displays adjacent DFT monosyllables in alphabetical order

Data

Local copies of:

We also support searching other websites with conversion to POJ/TL:

Technical notes

  • SQLite: FTS4 for full-text search
  • Token prefix queries: put an asterisk ('*') at the end of the token. This is similar to the wildcard character in operating systems (general wildcard search is not currently supported by FTS)
    • Example: Taioa*, 臺*
  • Specify a column name followed by a colon (':') to restrict a token to that column
    • Example: thj:頭* (returns entries where Taiwanese written with Harnji begins with character for thaau)
  • Add a caret ('^') before a token to require that it be the very first token in its column
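
The query forms listed above can be tried against a small in-memory FTS4 table. The following is a minimal sketch, not the Toolbox's actual schema: only the column name thj comes from the notes above, while the table name entries, the columns mtl and english, and the sample rows are assumptions made for illustration. It also assumes that the SQLite library behind Python's sqlite3 module was built with FTS4 enabled, which is common but not guaranteed.

```python
import sqlite3

# Hypothetical schema: only the column name "thj" comes from the notes above;
# the table name, the other columns, and the sample rows are assumptions.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE VIRTUAL TABLE entries USING fts4(mtl, thj, english)")
conn.executemany(
    "INSERT INTO entries (mtl, thj, english) VALUES (?, ?, ?)",
    [
        ("thaukef", "頭家", "boss; employer"),
        ("Taioaan", "臺灣", "Taiwan"),
        ("ciøqthaau", "石頭", "stone; rock"),
    ],
)


def search(conn, query):
    """Return the mtl column of every row matching an FTS4 query string."""
    rows = conn.execute(
        "SELECT mtl FROM entries WHERE entries MATCH ? ORDER BY rowid",
        (query,),
    )
    return [row[0] for row in rows]


# Token prefix query: any token starting with "Taioa" (or with 臺).
print(search(conn, "Taioa*"))
print(search(conn, "臺*"))

# Column filter: prefix match restricted to the thj column.
print(search(conn, "thj:頭*"))

# Caret: the token must be the first token in its column.
print(search(conn, "^stone"))
print(search(conn, "^rock"))
```

In this sketch the table uses SQLite's default tokenizer, which indexes an unbroken run of Harnji characters as a single token. That is why thj:頭* finds 頭家 but not 石頭, and why a prefix query can only match from the start of a token.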

See also

Acknowledgements