Dictionary of Frequently-Used Taiwan Minnan/Polysyllable homophones
Frequently-Used Taiwanese Polysyllable Homonyms were gathered by analyzing the Dictionary of Frequently-Used Taiwan Minnan (MoeDict). After we converted the original TRS to MTL, we found 361 words (rows) that are homophones. All are two-syllable, covering 178 MTL that usually match two rows each. Just five MTL match three rows each: hibang, honghoea, huiekhix, kofkex, teng'ar.
Full list
anleeng, baxnban, befthaau, bunbeeng, byseg, chiaciorng, chiahø, chiethaxm, chimjip, chit'ar, ciernsu, cionglaai, cirntong, cysi, goanheeng, goanthaau, goaxheeng, goaxsiofng, gyteng, haglek, hengseg, hengthaix, hibang, hichix, hiernsyn, hiexnkym, hoafn'exng, hoafnzexng, hoan'ar, hoansyn, hoatkag, hoeathaau, hogsip, honghioxng, honghoad, honghoan, honghoea, hongseg, hongtiaau, hongtiin, hongzuie, huiekhix, hunky, iaqar, iensvoax, in'ieen, iofng'iok, iogbong, iøqzef, iugvar, iusexng, ixteng, jinsu, jinsym, kanghw, kaohwn, kauar, kaukied, kayciuo, kayseg, kear, kefngkaix, kefngtix, keh'ar, kekafng, keng'ar, khiesex, khoanhan, khoat'ham, khoghii, khuiky, khypoef, kielok, kikhar, kileeng, kim'ar, kinzad, kioghan, kisw, ko'ar, koanciaux, koanlieen, koeato, kofkex, kokkaix, kong'eeng, kong'iong, kongzuo, kørpiet, kutkeq, kvoaxzuie, lamhofng, lienlok, lixhai, lyheeng, lyiuu, niuar, oankafng, pauhaam, pekkin, pengheeng, penghøo, phangzuie, pheklek, pintø, pipee, pitsw, pørsiuu, putcie, pvoachiaf, pvoaoe, senghau, sengtiorng, serngte, siausid, sidbut, sidiong, sinkii, siongbu, siongkaf, siongseg, siongsex, sirnhø, sirnsid, siusip, siusog, siuzuie, subin, suhoad, sutiern, sutviuo, tai'ar, taixcioxng, taixcix, taixkofng, taixliok, taixsw, tanghofng, tantok, tauar, taxngchiuo, tear, tefsvoax, teng'ar, tengsuun, texzuo, thekøf, thesefng, thiauar, thienpeeng, thongsixn, tiesuu, tiexnkhix, tiexnsixn, tngto, tofngkii, tongsii, toxkoex, vear, zaizeeng, zaizuo, zeahiexn, zeaphirn, zeng'efng, zengbie, zengcixn, zenggi, zengkerng, zhah'oe, zhuotviuo, zhutkex, zhutthaau, zhvezhaix, zhwkerng, zoancid, zoansyn, zøxheeng, zuxsyn
Method
- 14,980 rows of dictionary sections 1 and 5 were identified as polysyllable by TRS: 12,447 duosyllable, 2,105 trisyllable, 426 four-syllable, and just 2 five-syllable. Analysis was done in Python using collections.Counter()
- Only some of the duosyllables are written the same in TRS: 292 words overlap on 145 distinct TRS
- TRS was converted to MTL to reveal the homophones. A total of 14,797 distinct MTL were counted
- The homophones are 361 words (2.3%) that overlap on 178 MTL (1.2%). Those rows cover 214 TRS.