BERT-assisted behavioral profiling of polysemy: contrastive analysis of HONG in Chinese and RED in English

  • Yiming Song and Deliang Wang
Published/Copyright: January 15, 2025

Abstract

This study conducts a corpus-based contrastive analysis of the basic color term “red” in Chinese (HONG) and English (RED), employing the AI model BERT to triangulate, validate, and enrich the findings of the behavioral profile (BP) analysis. The results show that HONG has a more varied semasiological structure than RED, and that each term exhibits unique usage patterns alongside shared ones. Triangulation with BERT embeddings visualizes the performance of each subjectively annotated BP variable and further demonstrates the process of semantic extension in both HONG and RED. Finally, the underlying cognitive and socio-cultural factors are discussed. Theoretically, this study bridges frequency-based AI models and usage-based Cognitive Linguistics by simulating human cognitive processes to abstract linguistic knowledge from extensive data, advancing our understanding of language through AI technology. Methodologically, this study is the first to integrate AI models into BP analysis, with BERT embeddings offering objective corroboration of, and deeper insight into, conclusions drawn from subjectively labeled BP data.


Corresponding author: Deliang Wang, School of Foreign Languages and Literature, Beijing Normal University, Beijing, 100875, China

Acknowledgments

We are grateful to the two anonymous reviewers whose comments have significantly contributed to improving the quality of the article.

  1. Research funding: This work was supported by the Fundamental Research Funds for the Central Universities, Beijing Normal University (No. 1243200008).

Received: 2024-03-13
Accepted: 2024-12-09
Published Online: 2025-01-15

© 2024 Walter de Gruyter GmbH, Berlin/Boston

Downloaded on 7.9.2025 from https://www.degruyterbrill.com/document/doi/10.1515/cllt-2024-0029/html?lang=en&srsltid=AfmBOoq6Tz_eKxAjSN7bToaiqBuZb__eSN0iHWD44fSS2rHqpyQT_WWU