Dictionary of Frequently-Used Taiwan Minnan/Monosyllables: Difference between revisions

→‎Method: update. 1,800 distinct sounds
(→‎See also: Twenty-seven Taiwanese words beginning with the backquote)
(→‎Method: update. 1,800 distinct sounds)
Line 2: Line 2:


== Method ==
We isolated 2,936 rows from the dictionary that are monosyllables and converted their TRS to MTL. We only considered words from the first section of the dictionary because they appear to be frequently used, and ignored the second section. Then we counted the frequency of each MTL with Python's collections.Counter, which tells the number of dictionary rows matching each MTL, and got 1,813 unique MTL. Then we used Counter again on those results and found:  
We isolated 2,936 rows from the dictionary that are monosyllables and converted their TRS to MTL. We only considered words from the first section of the dictionary because they appear to be frequently used, and ignored the second section. We folded in backquoted words, for example ''`lie'' was counted as ''lix''. Then we counted the frequency of each sound with Python's collections.Counter, which tells the number of homophonic dictionary rows, and got 1,800 distinct sounds. Then we used Counter again on those results and found:  
* 1103 MTL (61%) and 1103 rows (38%) uniquely match one-to-one
* 1853 rows (63%) are homophonic, 1083 rows (37%) are not
* as expected, most rows (1833 or 62%) have at least one homophone. Out of the corresponding 710 MTL (39%):  
* The most homophonic sounds are: ''{{x|lie}}'', ''{{x|ky}}'', and ''{{x|køf}}'', which match 7 rows each, followed by ''{{x|cie}}'', ''{{x|kafn}}'', ''{{x|kefng}}'', ''{{x|sefng}}'', ''{{x|kaf}}'', ''{{x|kaq}}'', ''{{x|zngf}}'', ''{{x|ti}}'', ''{{x|sw}}'', ''{{x|leeng}}'', and ''{{x|kerng}}'', which match 6 rows each
** ''{{x|lie}}'', ''{{x|ky}}'', and ''{{x|køf}}'' match the most (7 rows each), and ''{{x|cie}}'', ''{{x|kafn}}'', ''{{x|kefng}}'', ''{{x|sefng}}'', ''{{x|kaf}}'', ''{{x|kaq}}'', ''{{x|zngf}}'', ''{{x|ti}}'', ''{{x|sw}}'', ''{{x|leeng}}'', and ''{{x|kerng}}'' match 6 rows each
** most homophones cover two rows: 896 rows (31%), 448 distinct sounds (25%)
** most of the homophones cover two rows (443 MTL (24%), 886 rows (30%))
** some sounds cover three rows: 516 rows (18%), 172 sounds (10%)
** some MTL cover three rows (173 MTL (10%), 519 rows (18%))
** the rest match from four to seven rows: 441 rows (15%), 97 sounds (5%)
** a small fraction match up four to seven rows (94 MTL (5%), 428 rows (15%))
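
The two-pass count described above can be sketched as follows. This is a minimal illustration, not the script actually used: the function name <code>homophone_profile</code> and the toy <code>sample</code> list are assumptions, and the input is assumed to already hold one MTL spelling per dictionary row (with backquoted words folded in beforehand).

```python
from collections import Counter


def homophone_profile(sounds):
    """Summarise homophony for a list holding one MTL spelling per row.

    Hypothetical helper for illustration only.
    """
    # First pass: sound -> number of dictionary rows sharing that sound.
    rows_per_sound = Counter(sounds)
    # Second pass: match level (rows per sound) -> number of distinct sounds.
    sounds_per_level = Counter(rows_per_sound.values())
    # Rows at each match level = match level * number of sounds at that level.
    rows_per_level = {level: level * n for level, n in sounds_per_level.items()}
    return rows_per_sound, sounds_per_level, rows_per_level


# Toy input, not real dictionary data: 7 rows, 4 distinct sounds.
sample = ["lie", "lie", "lie", "ky", "ky", "køf", "sw"]
rows_per_sound, sounds_per_level, rows_per_level = homophone_profile(sample)

for level in sorted(sounds_per_level):
    print(level, sounds_per_level[level], rows_per_level[level])
```

Applied to the revised figures, the same bookkeeping is self-consistent: 448 + 172 + 97 = 717 sounds with homophones plus 1,083 unique sounds gives 1,800 distinct sounds, and 896 + 516 + 441 = 1,853 homophonic rows plus 1,083 unique rows gives 2,936 rows.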
 
[[File:rows and matching MTL vs. match level.png|thumb|none]]

