Dictionary of Frequently-Used Taiwan Minnan/Polysyllable homophones: Difference between revisions
(→Method) |
(lead, homophones) |
||
Line 1: | Line 1: | ||
'''Frequently-Used Taiwanese Polysyllable Homonyms''' were gathered by analyzing the ''[[Dictionary of Frequently-Used Taiwan Minnan]]'' (''MoeDict''). We found 292 words that are written the same in [[TRS]] but have different meaning. They are all two-syllables using 145 TRS ranging from ''{{x|anleeng}}'' to ''{{x|zuxsyn}}''. | '''Frequently-Used Taiwanese Polysyllable Homonyms''' were gathered by analyzing the ''[[Dictionary of Frequently-Used Taiwan Minnan]]'' (''MoeDict''). We found 292 words that are written the same in [[TRS]] but have different meaning. They are all two-syllables using 145 TRS ranging from ''{{x|anleeng}}'' to ''{{x|zuxsyn}}''. | ||
After conversion to [[MTL]], we identified 361 dictionary rows that are spelled alike and sound alike. | After conversion to [[MTL]], we identified 361 dictionary rows that are spelled alike and sound alike, and so are homophones. These 178 MTL, covering 214 TRS, are all two-syllable, usually matching two rows. Five MTL match three rows: ''{{x|hibang}}'', ''{{x|honghoea}}'', ''{{x|huiekhix}}'', ''{{x|kofkex}}'', ''{{x|teng'ar}}''. | ||
== Full list == | == Full list == |
Revision as of 02:44, 30 September 2018
Frequently-Used Taiwanese Polysyllable Homonyms were gathered by analyzing the Dictionary of Frequently-Used Taiwan Minnan (MoeDict). We found 292 words that are written the same in TRS but have different meaning. They are all two-syllables using 145 TRS ranging from anleeng to zuxsyn.
After conversion to MTL, we identified 361 dictionary rows that are spelled alike and sound alike, and so are homophones. These 178 MTL, covering 214 TRS, are all two-syllable, usually matching two rows. Five MTL match three rows: hibang, honghoea, huiekhix, kofkex, teng'ar.
Full list
anleeng, baxnban, befthaau, bunbeeng, byseg, chiaciorng, chiahø, chiethaxm, chimjip, chit'ar, ciernsu, cionglaai, cirntong, cysi, goanheeng, goanthaau, goaxheeng, goaxsiofng, gyteng, haglek, hengseg, hengthaix, hibang, hichix, hiernsyn, hiexnkym, hoafn'exng, hoafnzexng, hoan'ar, hoansyn, hoatkag, hoeathaau, hogsip, honghioxng, honghoad, honghoan, honghoea, hongseg, hongtiaau, hongtiin, hongzuie, huiekhix, hunky, iaqar, iensvoax, in'ieen, iofng'iok, iogbong, iøqzef, iugvar, iusexng, ixteng, jinsu, jinsym, kanghw, kaohwn, kauar, kaukied, kayciuo, kayseg, kear, kefngkaix, kefngtix, keh'ar, kekafng, keng'ar, khiesex, khoanhan, khoat'ham, khoghii, khuiky, khypoef, kielok, kikhar, kileeng, kim'ar, kinzad, kioghan, kisw, ko'ar, koanciaux, koanlieen, koeato, kofkex, kokkaix, kong'eeng, kong'iong, kongzuo, kørpiet, kutkeq, kvoaxzuie, lamhofng, lienlok, lixhai, lyheeng, lyiuu, niuar, oankafng, pauhaam, pekkin, pengheeng, penghøo, phangzuie, pheklek, pintø, pipee, pitsw, pørsiuu, putcie, pvoachiaf, pvoaoe, senghau, sengtiorng, serngte, siausid, sidbut, sidiong, sinkii, siongbu, siongkaf, siongseg, siongsex, sirnhø, sirnsid, siusip, siusog, siuzuie, subin, suhoad, sutiern, sutviuo, tai'ar, taixcioxng, taixcix, taixkofng, taixliok, taixsw, tanghofng, tantok, tauar, taxngchiuo, tear, tefsvoax, teng'ar, tengsuun, texzuo, thekøf, thesefng, thiauar, thienpeeng, thongsixn, tiesuu, tiexnkhix, tiexnsixn, tngto, tofngkii, tongsii, toxkoex, vear, zaizeeng, zaizuo, zeahiexn, zeaphirn, zeng'efng, zengbie, zengcixn, zenggi, zengkerng, zhah'oe, zhuotviuo, zhutkex, zhutthaau, zhvezhaix, zhwkerng, zoancid, zoansyn, zøxheeng, zuxsyn
Method
- 15,368 rows were identified as polysyllable by inspecting the TRS
- TRS was converted to MTL and Python collections.Counter() was used to count the frequency of MTL
- from 15,185 MTL, only 178 had homophones, affecting 361 rows (2.3%)