Dictionary of Frequently-Used Taiwan Minnan/Polysyllable homophones: Difference between revisions

From Taioaan Wiki
Jump to navigation Jump to search
mNo edit summary
 
(16 intermediate revisions by the same user not shown)
Line 1: Line 1:
'''Frequently-Used Taiwanese Polysyllable Homonyms''' were gathered by analyzing the ''[[Dictionary of Frequently-Used Taiwan Minnan]]'' (''MoeDict''). We found 292 words that are written the same in [[TRS]] but have different meaning. They are all two-syllables using 145 TRS ranging from ''{{x|anleeng}}'' to ''{{x|zuxsyn}}''.
'''Frequently-Used Taiwanese Polysyllable Homophones''' were gathered by analyzing the ''[[Dictionary of Frequently-Used Taiwan Minnan]]'' (''MoeDict''). After we converted the original [[TL]] to [[MLT]] or MTL, we found 361 entries (rows) that are homophones. All are two-syllable, covering 178 MTL that usually match two rows each. Just five MTL match three rows each: ''{{x|hibang}}'', ''{{x|honghøea}}'', ''{{x|huiekhix}}'', ''{{x|kofkex}}'', ''{{x|teng'ar}}''.
 
After conversion to [[MTL]], we identified 361 dictionary rows that are spelled alike and sound alike, and so are homophones. These 178 MTL, covering 214 TRS, are all two-syllable, usually matching two rows. Five MTL match three rows: ''{{x|hibang}}'', ''{{x|honghoea}}'', ''{{x|huiekhix}}'', ''{{x|kofkex}}'', ''{{x|teng'ar}}''.


== Full list ==
== Full list ==
{{x|anleeng}}, {{x|baxnban}}, {{x|befthaau}}, {{x|bunbeeng}}, {{x|byseg}}, {{x|chiaciorng}}, {{x|chiahø}}, {{x|chiethaxm}}, {{x|chimjip}}, {{x|chit'ar}}, {{x|ciernsu}}, {{x|cionglaai}}, {{x|cirntong}}, {{x|cysi}}, {{x|goanheeng}}, {{x|goanthaau}}, {{x|goaxheeng}}, {{x|goaxsiofng}}, {{x|gyteng}}, {{x|haglek}}, {{x|hengseg}}, {{x|hengthaix}}, {{x|hibang}}, {{x|hichix}}, {{x|hiernsyn}}, {{x|hiexnkym}}, {{x|hoafn'exng}}, {{x|hoafnzexng}}, {{x|hoan'ar}}, {{x|hoansyn}}, {{x|hoatkag}}, {{x|hoeathaau}}, {{x|hogsip}}, {{x|honghioxng}}, {{x|honghoad}}, {{x|honghoan}}, {{x|honghoea}}, {{x|hongseg}}, {{x|hongtiaau}}, {{x|hongtiin}}, {{x|hongzuie}}, {{x|huiekhix}}, {{x|hunky}}, {{x|iaqar}}, {{x|iensvoax}}, {{x|in'ieen}}, {{x|iofng'iok}}, {{x|iogbong}}, {{x|iøqzef}}, {{x|iugvar}}, {{x|iusexng}}, {{x|ixteng}}, {{x|jinsu}}, {{x|jinsym}}, {{x|kanghw}}, {{x|kaohwn}}, {{x|kauar}}, {{x|kaukied}}, {{x|kayciuo}}, {{x|kayseg}}, {{x|kear}}, {{x|kefngkaix}}, {{x|kefngtix}}, {{x|keh'ar}}, {{x|kekafng}}, {{x|keng'ar}}, {{x|khiesex}}, {{x|khoanhan}}, {{x|khoat'ham}}, {{x|khoghii}}, {{x|khuiky}}, {{x|khypoef}}, {{x|kielok}}, {{x|kikhar}}, {{x|kileeng}}, {{x|kim'ar}}, {{x|kinzad}}, {{x|kioghan}}, {{x|kisw}}, {{x|ko'ar}}, {{x|koanciaux}}, {{x|koanlieen}}, {{x|koeato}}, {{x|kofkex}}, {{x|kokkaix}}, {{x|kong'eeng}}, {{x|kong'iong}}, {{x|kongzuo}}, {{x|kørpiet}}, {{x|kutkeq}}, {{x|kvoaxzuie}}, {{x|lamhofng}}, {{x|lienlok}}, {{x|lixhai}}, {{x|lyheeng}}, {{x|lyiuu}}, {{x|niuar}}, {{x|oankafng}}, {{x|pauhaam}}, {{x|pekkin}}, {{x|pengheeng}}, {{x|penghøo}}, {{x|phangzuie}}, {{x|pheklek}}, {{x|pintø}}, {{x|pipee}}, {{x|pitsw}}, {{x|pørsiuu}}, {{x|putcie}}, {{x|pvoachiaf}}, {{x|pvoaoe}}, {{x|senghau}}, {{x|sengtiorng}}, {{x|serngte}}, {{x|siausid}}, {{x|sidbut}}, {{x|sidiong}}, {{x|sinkii}}, {{x|siongbu}}, {{x|siongkaf}}, {{x|siongseg}}, {{x|siongsex}}, {{x|sirnhø}}, {{x|sirnsid}}, {{x|siusip}}, {{x|siusog}}, {{x|siuzuie}}, {{x|subin}}, {{x|suhoad}}, {{x|sutiern}}, {{x|sutviuo}}, {{x|tai'ar}}, {{x|taixcioxng}}, {{x|taixcix}}, {{x|taixkofng}}, {{x|taixliok}}, {{x|taixsw}}, {{x|tanghofng}}, {{x|tantok}}, {{x|tauar}}, {{x|taxngchiuo}}, {{x|tear}}, {{x|tefsvoax}}, {{x|teng'ar}}, {{x|tengsuun}}, {{x|texzuo}}, {{x|thekøf}}, {{x|thesefng}}, {{x|thiauar}}, {{x|thienpeeng}}, {{x|thongsixn}}, {{x|tiesuu}}, {{x|tiexnkhix}}, {{x|tiexnsixn}}, {{x|tngto}}, {{x|tofngkii}}, {{x|tongsii}}, {{x|toxkoex}}, {{x|vear}}, {{x|zaizeeng}}, {{x|zaizuo}}, {{x|zeahiexn}}, {{x|zeaphirn}}, {{x|zeng'efng}}, {{x|zengbie}}, {{x|zengcixn}}, {{x|zenggi}}, {{x|zengkerng}}, {{x|zhah'oe}}, {{x|zhuotviuo}}, {{x|zhutkex}}, {{x|zhutthaau}}, {{x|zhvezhaix}}, {{x|zhwkerng}}, {{x|zoancid}}, {{x|zoansyn}}, {{x|zøxheeng}}, {{x|zuxsyn}}
{{x|anleeng}}, {{x|baxnban}}, {{x|befthaau}}, {{x|bunbeeng}}, {{x|byseg}}, {{x|chiaciorng}}, {{x|chiahø}}, {{x|chiethaxm}}, {{x|chimjip}}, {{x|chit'ar}}, {{x|ciernsu}}, {{x|cionglaai}}, {{x|cirntong}}, {{x|cysi}}, {{x|goanheeng}}, {{x|goanthaau}}, {{x|goaxheeng}}, {{x|goaxsiofng}}, {{x|gyteng}}, {{x|haglek}}, {{x|hengseg}}, {{x|hengthaix}}, {{x|hibang}}, {{x|hichix}}, {{x|hiernsyn}}, {{x|hiexnkym}}, {{x|hoafn'exng}}, {{x|hoafnzexng}}, {{x|hoan'ar}}, {{x|hoansyn}}, {{x|hoatkag}}, {{x|høeathaau}}, {{x|hogsip}}, {{x|honghioxng}}, {{x|honghoad}}, {{x|honghoan}}, {{x|honghøea}}, {{x|hongseg}}, {{x|hongtiaau}}, {{x|hongtiin}}, {{x|hongzuie}}, {{x|huiekhix}}, {{x|hunky}}, {{x|iaqar}}, {{x|iensvoax}}, {{x|in'ieen}}, {{x|iofng'iok}}, {{x|iogbong}}, {{x|iøqzef}}, {{x|iugvar}}, {{x|iusexng}}, {{x|ixteng}}, {{x|jinsu}}, {{x|jinsym}}, {{x|kanghw}}, {{x|kaohwn}}, {{x|kauar}}, {{x|kaukied}}, {{x|kayciuo}}, {{x|kayseg}}, {{x|køear}}, {{x|kefngkaix}}, {{x|kefngtix}}, {{x|keh'ar}}, {{x|kekafng}}, {{x|keng'ar}}, {{x|khiesex}}, {{x|khoanhan}}, {{x|khoat'ham}}, {{x|khoghii}}, {{x|khuiky}}, {{x|khypoef}}, {{x|kielok}}, {{x|kikhar}}, {{x|kileeng}}, {{x|kim'ar}}, {{x|kinzad}}, {{x|kioghan}}, {{x|kisw}}, {{x|ko'ar}}, {{x|koanciaux}}, {{x|koanlieen}}, {{x|køeato}}, {{x|kofkex}}, {{x|kokkaix}}, {{x|kong'eeng}}, {{x|kong'iong}}, {{x|kongzuo}}, {{x|kørpiet}}, {{x|kutkeq}}, {{x|kvoaxzuie}}, {{x|lamhofng}}, {{x|lienlok}}, {{x|lixhai}}, {{x|lyheeng}}, {{x|lyiuu}}, {{x|niuar}}, {{x|oankafng}}, {{x|pauhaam}}, {{x|pekkin}}, {{x|pengheeng}}, {{x|penghøo}}, {{x|phangzuie}}, {{x|pheklek}}, {{x|pintø}}, {{x|pipee}}, {{x|pitsw}}, {{x|pørsiuu}}, {{x|putcie}}, {{x|pvoachiaf}}, {{x|pvoaoe}}, {{x|senghau}}, {{x|sengtiorng}}, {{x|serngtøe}}, {{x|siausid}}, {{x|sidbut}}, {{x|sidiong}}, {{x|sinkii}}, {{x|siongbu}}, {{x|siongkaf}}, {{x|siongseg}}, {{x|siongsex}}, {{x|sirnhø}}, {{x|sirnsid}}, {{x|siusip}}, {{x|siusog}}, {{x|siuzuie}}, {{x|subin}}, {{x|suhoad}}, {{x|sutiern}}, {{x|sutviuo}}, {{x|tai'ar}}, {{x|taixcioxng}}, {{x|taixcix}}, {{x|taixkofng}}, {{x|taixliok}}, {{x|taixsw}}, {{x|tanghofng}}, {{x|tantok}}, {{x|tauar}}, {{x|taxngchiuo}}, {{x|te'ar}}, {{x|tefsvoax}}, {{x|teng'ar}}, {{x|tengsuun}}, {{x|texzuo}}, {{x|thekøf}}, {{x|thesefng}}, {{x|thiauar}}, {{x|thienpeeng}}, {{x|thongsixn}}, {{x|tiesuu}}, {{x|tiexnkhix}}, {{x|tiexnsixn}}, {{x|tngto}}, {{x|tofngkii}}, {{x|tongsii}}, {{x|toxkoex}}, {{x|vear}}, {{x|zaizeeng}}, {{x|zaizuo}}, {{x|zeahiexn}}, {{x|zeaphirn}}, {{x|zeng'efng}}, {{x|zengbie}}, {{x|zengcixn}}, {{x|zenggi}}, {{x|zengkerng}}, {{x|zhah'oe}}, {{x|zhuotviuo}}, {{x|zhutkex}}, {{x|zhutthaau}}, {{x|zhvezhaix}}, {{x|zhwkerng}}, {{x|zoancid}}, {{x|zoansyn}}, {{x|zøxheeng}}, {{x|zuxsyn}}


== Method ==
== Method ==
* 14,980 rows of dictionary sections 1 and 5 were identified as polysyllable by TRS
* 14,980 rows of dictionary sections 1 and 5 were identified as polysyllable by TL: 12,447 duosyllable, 2,105 trisyllable, 426 four-syllable, and just 2 five-syllable. Analysis was done in [[Python]] using collections.Counter()
* TRS was converted to MTL and Python collections.Counter() was used to count the frequency of MTL. A total of 14,797 distinct MTL words were counted, with 14,619 MTL (almost 99%) counted only once
* Only some of the duosyllables are written the same in TL: 292 words overlap on 145 distinct TL
* The remaining 178 MTL (1.2%) correspond to 361 rows (2.3%). They are written with 214 distinct TRS
* TL was converted to MTL to reveal the homophones. A total of 14,797 distinct MTL were counted
* The homophones are 361 rows (2.3%) that overlap on 178 MTL (1.2%). Those rows cover 214 TL.


== See also ==
== See also ==
* [[MTL words with double reverse mapping]]
* [[MTL words with double reverse mapping]]
* [[Frequently-Used Taiwanese Monosyllables]]
* [[Dictionary of Frequently-Used Taiwan Minnan/Monosyllables|DFT monosyllables]]

Latest revision as of 08:53, 2 July 2024

Frequently-Used Taiwanese Polysyllable Homophones were gathered by analyzing the Dictionary of Frequently-Used Taiwan Minnan (MoeDict). After we converted the original TL to MLT or MTL, we found 361 entries (rows) that are homophones. All are two-syllable, covering 178 MTL that usually match two rows each. Just five MTL match three rows each: hibang, honghøea, huiekhix, kofkex, teng'ar.

Full list

anleeng, baxnban, befthaau, bunbeeng, byseg, chiaciorng, chiahø, chiethaxm, chimjip, chit'ar, ciernsu, cionglaai, cirntong, cysi, goanheeng, goanthaau, goaxheeng, goaxsiofng, gyteng, haglek, hengseg, hengthaix, hibang, hichix, hiernsyn, hiexnkym, hoafn'exng, hoafnzexng, hoan'ar, hoansyn, hoatkag, høeathaau, hogsip, honghioxng, honghoad, honghoan, honghøea, hongseg, hongtiaau, hongtiin, hongzuie, huiekhix, hunky, iaqar, iensvoax, in'ieen, iofng'iok, iogbong, iøqzef, iugvar, iusexng, ixteng, jinsu, jinsym, kanghw, kaohwn, kauar, kaukied, kayciuo, kayseg, køear, kefngkaix, kefngtix, keh'ar, kekafng, keng'ar, khiesex, khoanhan, khoat'ham, khoghii, khuiky, khypoef, kielok, kikhar, kileeng, kim'ar, kinzad, kioghan, kisw, ko'ar, koanciaux, koanlieen, køeato, kofkex, kokkaix, kong'eeng, kong'iong, kongzuo, kørpiet, kutkeq, kvoaxzuie, lamhofng, lienlok, lixhai, lyheeng, lyiuu, niuar, oankafng, pauhaam, pekkin, pengheeng, penghøo, phangzuie, pheklek, pintø, pipee, pitsw, pørsiuu, putcie, pvoachiaf, pvoaoe, senghau, sengtiorng, serngtøe, siausid, sidbut, sidiong, sinkii, siongbu, siongkaf, siongseg, siongsex, sirnhø, sirnsid, siusip, siusog, siuzuie, subin, suhoad, sutiern, sutviuo, tai'ar, taixcioxng, taixcix, taixkofng, taixliok, taixsw, tanghofng, tantok, tauar, taxngchiuo, te'ar, tefsvoax, teng'ar, tengsuun, texzuo, thekøf, thesefng, thiauar, thienpeeng, thongsixn, tiesuu, tiexnkhix, tiexnsixn, tngto, tofngkii, tongsii, toxkoex, vear, zaizeeng, zaizuo, zeahiexn, zeaphirn, zeng'efng, zengbie, zengcixn, zenggi, zengkerng, zhah'oe, zhuotviuo, zhutkex, zhutthaau, zhvezhaix, zhwkerng, zoancid, zoansyn, zøxheeng, zuxsyn

Method

  • 14,980 rows of dictionary sections 1 and 5 were identified as polysyllable by TL: 12,447 duosyllable, 2,105 trisyllable, 426 four-syllable, and just 2 five-syllable. Analysis was done in Python using collections.Counter()
  • Only some of the duosyllables are written the same in TL: 292 words overlap on 145 distinct TL
  • TL was converted to MTL to reveal the homophones. A total of 14,797 distinct MTL were counted
  • The homophones are 361 rows (2.3%) that overlap on 178 MTL (1.2%). Those rows cover 214 TL.

See also