MTL Toolbox

From Taioaan Wiki

Latest revision as of 11:26, 25 May 2024

MTL Toolbox (https://learntaiwanese.org/MTLtoolbox/about.html) is a collection of software and data for working with written Taiwanese in the MTL writing system and other Taiwanese romanizations.

Features

  • six Taiwanese dictionaries spanning from the Japanese era to the present day
  • full-text search engine that accepts written Taiwanese, English, and Harnji
  • audio from the government-compiled dictionary DFT (Dictionary of Frequently-Used Taiwan Minnan)
  • word unjoiner to aid learning and searching at syllable level
  • Seven Tones soundboard: table of all MLT finals with examples

How to search

Typical usage

  • input: a Taiwanese word (often two syllables joined together), for example:
  • the Toolbox "unjoins" words by database lookup
  • the results from historical works should be about the same as for:

Monosyllable

  • to see more matching entries, click "Khahzøe"

Data

Local copies of:

We also support searching other websites with conversion to POJ/TL:

Technical notes

  • Full-text search uses the SQLite FTS4 extension
  • Token prefix queries: append an asterisk ('*') to the token. This resembles the wildcard character in operating systems, but FTS matches prefixes only (general wildcard search is not currently supported by FTS)
    • Example: Taioa*
  • To restrict a token to one column, specify the column name followed by a colon (':')
    • Example: thj:頭* (returns entries whose Taiwanese written in Harnji begins with the character for thaau)
  • Add a caret (^) before a token to require that it be the very first token in its column
    • Example: ^thaau
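
The three query forms above can be tried against a small in-memory FTS4 table. The following is only a sketch using Python's built-in sqlite3 module, and it assumes the underlying SQLite library was built with FTS3/FTS4 support. The table name entries, the column names mlt and english, and the sample rows are hypothetical and are not taken from the Toolbox database; only the thj column name and the query strings come from the notes above.

```python
import sqlite3

# Made-up sample rows for illustration only; not taken from the Toolbox data.
# Columns: (mlt, thj, english). Only the column name "thj" appears in the notes;
# "mlt" and "english" are assumed names.
SAMPLE_ROWS = [
    ("thaau", "頭", "head"),
    ("Taioaan", "台灣", "Taiwan"),
    ("Taioaan thaau", "台灣頭", "placeholder entry"),
]


def build_demo_db():
    """Create an in-memory FTS4 table and load the sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE VIRTUAL TABLE entries USING fts4(mlt, thj, english)")
    conn.executemany(
        "INSERT INTO entries (mlt, thj, english) VALUES (?, ?, ?)",
        SAMPLE_ROWS,
    )
    return conn


def search(conn, query):
    """Return the rowids of entries matching an FTS4 query string."""
    rows = conn.execute(
        "SELECT rowid FROM entries WHERE entries MATCH ? ORDER BY rowid",
        (query,),
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = build_demo_db()
    # Prefix query: tokens that start with "Taioa" in any column.
    print(search(conn, "Taioa*"))
    # Column filter plus prefix: tokens in the thj column that start with 頭.
    print(search(conn, "thj:頭*"))
    # Plain token versus caret: the caret keeps only rows where the token
    # is the first token of its column.
    print(search(conn, "thaau"))
    print(search(conn, "^thaau"))
```

With the default tokenizer, a run of Harnji characters with no spaces is indexed as a single token, so thj:頭* matches the row whose thj value starts with 頭 but not the row where 頭 appears at the end of 台灣頭.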

See also

Acknowledgements