Dictionary of Frequently-Used Taiwan Minnan/Polysyllable homophones: Difference between revisions

From Taioaan Wiki
Jump to navigation Jump to search
Line 8: Line 8:
== Method ==
== Method ==
* 14,980 rows of dictionary sections 1 and 5 were identified as polysyllable by TRS
* 14,980 rows of dictionary sections 1 and 5 were identified as polysyllable by TRS
* TRS was converted to MTL and Python collections.Counter() was used to count the frequency of MTL. A total of 14,797 distinct MTL words were found, suggesting several homophones exist
* TRS was converted to MTL and Python collections.Counter() was used to count the frequency of MTL. A total of 14,797 distinct MTL words were counted, with 14,619 MTL (almost 99%) counted only once
* 361 rows (2.3%) are homophones, written with 178 distinct MTL and 214 distinct TRS
* The remaining 178 MTL (1.2%) correspond to 361 rows (2.3%). They are written with 214 distinct TRS


== See also ==
== See also ==
* [[MTL words with double reverse mapping]]
* [[MTL words with double reverse mapping]]
* [[Frequently-Used Taiwanese Monosyllables]]
* [[Frequently-Used Taiwanese Monosyllables]]

Revision as of 01:56, 10 October 2018

Frequently-Used Taiwanese Polysyllable Homonyms were gathered by analyzing the Dictionary of Frequently-Used Taiwan Minnan (MoeDict). We found 292 words that are written the same in TRS but have different meaning. They are all two-syllables using 145 TRS ranging from anleeng to zuxsyn.

After conversion to MTL, we identified 361 dictionary rows that are spelled alike and sound alike, and so are homophones. These 178 MTL, covering 214 TRS, are all two-syllable, usually matching two rows. Five MTL match three rows: hibang, honghoea, huiekhix, kofkex, teng'ar.

Full list

anleeng, baxnban, befthaau, bunbeeng, byseg, chiaciorng, chiahø, chiethaxm, chimjip, chit'ar, ciernsu, cionglaai, cirntong, cysi, goanheeng, goanthaau, goaxheeng, goaxsiofng, gyteng, haglek, hengseg, hengthaix, hibang, hichix, hiernsyn, hiexnkym, hoafn'exng, hoafnzexng, hoan'ar, hoansyn, hoatkag, hoeathaau, hogsip, honghioxng, honghoad, honghoan, honghoea, hongseg, hongtiaau, hongtiin, hongzuie, huiekhix, hunky, iaqar, iensvoax, in'ieen, iofng'iok, iogbong, iøqzef, iugvar, iusexng, ixteng, jinsu, jinsym, kanghw, kaohwn, kauar, kaukied, kayciuo, kayseg, kear, kefngkaix, kefngtix, keh'ar, kekafng, keng'ar, khiesex, khoanhan, khoat'ham, khoghii, khuiky, khypoef, kielok, kikhar, kileeng, kim'ar, kinzad, kioghan, kisw, ko'ar, koanciaux, koanlieen, koeato, kofkex, kokkaix, kong'eeng, kong'iong, kongzuo, kørpiet, kutkeq, kvoaxzuie, lamhofng, lienlok, lixhai, lyheeng, lyiuu, niuar, oankafng, pauhaam, pekkin, pengheeng, penghøo, phangzuie, pheklek, pintø, pipee, pitsw, pørsiuu, putcie, pvoachiaf, pvoaoe, senghau, sengtiorng, serngte, siausid, sidbut, sidiong, sinkii, siongbu, siongkaf, siongseg, siongsex, sirnhø, sirnsid, siusip, siusog, siuzuie, subin, suhoad, sutiern, sutviuo, tai'ar, taixcioxng, taixcix, taixkofng, taixliok, taixsw, tanghofng, tantok, tauar, taxngchiuo, tear, tefsvoax, teng'ar, tengsuun, texzuo, thekøf, thesefng, thiauar, thienpeeng, thongsixn, tiesuu, tiexnkhix, tiexnsixn, tngto, tofngkii, tongsii, toxkoex, vear, zaizeeng, zaizuo, zeahiexn, zeaphirn, zeng'efng, zengbie, zengcixn, zenggi, zengkerng, zhah'oe, zhuotviuo, zhutkex, zhutthaau, zhvezhaix, zhwkerng, zoancid, zoansyn, zøxheeng, zuxsyn

Method

  • 14,980 rows of dictionary sections 1 and 5 were identified as polysyllable by TRS
  • TRS was converted to MTL and Python collections.Counter() was used to count the frequency of MTL. A total of 14,797 distinct MTL words were counted, with 14,619 MTL (almost 99%) counted only once
  • The remaining 178 MTL (1.2%) correspond to 361 rows (2.3%). They are written with 214 distinct TRS

See also