MTL Toolbox: Difference between revisions

Jump to navigation Jump to search
m
Line 56: Line 56:


== Technical notes ==
== Technical notes ==
* Our full-text search is provided by the [[SQLite]]: [https://sqlite.org/fts3.html FTS4] extension. We currently use the Standard Query Syntax.
Our full-text search is provided by the [[SQLite]] [https://sqlite.org/fts3.html FTS4] extension. We currently use the Standard Query Syntax. One of the three basic query types supported by FTS tables is "token or token prefix queries":
* Token prefix queries: use the asterisk ('*') at the end. Similar to {{w|wildcard character}} in [[zokgiap hexthorng|operating systems]] (normal wildcard search not currently supported by FTS)
* Specify a token prefix by appending an asterisk ('*') to the prefix. (While similar to {{w|wildcard character}} in [[zokgiap hexthorng|operating systems]], wildcard search is not currently supported by FTS)
** Example: {{TE|Taioa*}}, {{TE|臺*}}
** Example: {{TE|Taioa*}}, {{TE|臺*}}
* Specify a column-name followed by a colon (':')
* Specify a column-name followed by a colon (':')
** Example: {{TE|hj:頭*}} (returns entries where Taiwanese written with Harnji begins with character for [[thaau]])
** Example: {{TE|hj:頭*}} (returns entries where Taiwanese written with Harnji begins with character for [[thaau]])
* Add carat ^ before token to require token to be very first token in its column
* Prefix the token with carat ('^') to require token to be very first token in its column
** Example: {{TE|^thaau}}
** Example: {{TE|^thaau}}
* [[Ø]] is not folded to lower case by the tokenizer
 
Tokenizer: the default tokenizer ("simple") is used. It only does case folding of ASCII characters, so [[Ø]] is not folded to lower case.


== See also ==
== See also ==
46,004

edits

Navigation menu