Abstract
This study conducts a corpus-based contrastive analysis of the basic color term “red” in Chinese (HONG) and English (RED), innovatively employing the AI model BERT to triangulate, validate, and enrich the findings of the behavioral profile (BP) analysis. Results illustrate HONG’s more variational semasiological structure than RED, with both exhibiting unique usage patterns among similarities. The triangulation with BERT embeddings visualizes the performance of each subjectively annotated BP variable, and further demonstrates the semantic extending process in both HONG and RED. The underlying cognitive and socio-cultural factors are lastly discussed. Theoretically, this study bridges frequency-based AI models and usage-based Cognitive Linguistics by simulating human cognitive processes to abstract linguistic knowledge from extensive data, advancing our understanding of language through AI technology. Methodologically, this study is the first to integrate AI models in assisting BP analysis, with BERT embeddings offering objective corroborations and deeper insights for conclusions drawn from subjectively labeled BP data.
Funding source: the Fundamental Research Funds for the Central Universities
Award Identifier / Grant number: 1243200008
Acknowledgments
We are grateful to the two anonymous reviewers whose comments have significantly contributed to improving the quality of the article.
-
Research funding: This work was supported by the Fundamental Research Funds for the Central Universities, Beijing Normal University (No. 1243200008).
References
Alkhayrat, Maha, Mohamad Aljnidi & Kadan Aljoumaa. 2020. A comparative dimensionality reduction study in telecom customer segmentation using deep learning and PCA. Journal of Big Data 7(9). 1–23. https://doi.org/10.1186/s40537-020-0286-0.Search in Google Scholar
Berlin, Brent & Paul Kay. 1969. Basic color terms: Their university and evolution. California: Berkeley University of California Press.Search in Google Scholar
Brugman, Claudia & George Lakoff. 1988. Cognitive topology and lexical network. In Steven L. Small, Garrison W. Cottrell & Michael K. Tanenhaus (eds.), Lexical ambiguity resolution: Perspectives from psycholinguistics, neuropsychology and artificial intelligence, 477–508. San Mateo: Morgan Kaufmann.10.1016/B978-0-08-051013-2.50022-7Search in Google Scholar
Cantor, Stephen J. 1963. Centenary of the red cross. Medical Journal of Australia 2. 724–725. https://doi.org/10.5694/j.1326-5377.1963.tb18018.x.Search in Google Scholar
Cao, Xiaoping. 2008. Yinghan jiben yanseci “hong” de yinyu renzhi duibi fenxi [A comparative analysis of metaphorical cognition of the basic color term “red” in English and Chinese]. Jiang Xi xingzheng xueyuan xuebao [Journal of Jiangxi Administration Institute] 10(2). 79–80.Search in Google Scholar
Chen, Lili. 1996. Shixi Hanyu he yingyu yanseci de shehui wenhua chayi [A comparative analysis of sociocultural differences in color terms in Chinese and English]. Jiefangjun waiyu xueyuan xuebao [Journal of PLA University of Foreign Languages] (2). 40–44.Search in Google Scholar
Chen, Jiaxu. 2003. Yinghan jiben yanseci de yinyu renzhi duibi [A comparative analysis of metaphorical cognition of basic color terms in Chinese and English]. Xinan minzu daxue xuebao (renwen sheke ban) [Journal of Southwest University for Nationalities (Humanities and Social Science Edition)] 24(12). 283–286.Search in Google Scholar
Chen, Jiaxu & Lei Qin. 2003. Hanyu jiben yanse de fanchouhua ji yinyuhua renzhi [The categorization and metaphorical cognition of basic colors in Chinese]. Henan shifan daxue xuebao (zhexue shehui kexue ban) [Journal of Henan Normal University (Philosophy and Social Sciences Edition)] 30(2). 75–77.Search in Google Scholar
Croft, William & D. Alan Cruse. 2004. Cognitive linguistics. Cambridge: Cambridge University Press.10.1017/CBO9780511803864Search in Google Scholar
Cruse, D. Alan. 1986. Semantics. Cambridge: Cambridge University Press.Search in Google Scholar
Deerwester, Scott, Susan T. Dumais, George W. Furnas, Thomas K. Landauer & Richard Harshman. 1990. Indexing by latent semantic analysis. Journal of the American Society for Information Science 41(6). 391–407. https://doi.org/10.1002/(sici)1097-4571(199009)41:6<391::aid-asi1>3.0.co;2-9.10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9Search in Google Scholar
Devlin, Jacob, Ming-Wei Chang, Kenton Lee & Kristina Toutanova. 2019. BERT: Pretraining of deep bidirectional transformers for language understanding. In Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: Human language technologies, vol. 1, 4171–4186.Search in Google Scholar
Dou, Jinmeng & Meichun Liu. 2023. Exploring color metaphor with behavioral profiles: A usage-based analysis on the metaphorical meanings of the Chinese color term bái “white”. Lingua 289. 103539. https://doi.org/10.1016/j.lingua.2023.103539.Search in Google Scholar
Dreiskaemper, Dennis, Bernd Strauss, Norbert Hagemann & Dirk Büsch. 2013. Influence of red Jersey color on physical parameters in combat sports. Journal of Sport & Exercise Psychology 35. 44–49. https://doi.org/10.1123/jsep.35.1.44.Search in Google Scholar
English Standard Version Bible. 2001. ESV online. Available at: https://esv.literalword.com.Search in Google Scholar
Firth, John. 1957. Papers in linguistics, 1934–1951. London: Oxford University Press.Search in Google Scholar
Geeraerts, Dirk. 1993. Vagueness’s puzzles, polysemy’s vagaries. Cognitive Linguistics 4. 223–272. https://doi.org/10.1515/cogl.1993.4.3.223.Search in Google Scholar
Glynn, Dylan. 2014a. Polysemy and synonymy: Cognitive theory and corpus method. In Dylan Glynn & Justyna Robinson (eds.), Corpus methods for semantics: Quantitative studies in polysemy and synonymy, 7–38. Amsterdam: John Benjamins.10.1075/hcp.43.01glySearch in Google Scholar
Glynn, Dylan. 2014b. The many uses of run: Corpus methods and socio-cognitive semantics. In Dylan Glynn & Justyna Robinson (eds.), Corpus methods for semantics: Quantitative studies in polysemy and synonymy, 117–144. Amsterdam: John Benjamins.10.1075/hcp.43.05glySearch in Google Scholar
Glynn, Dylan. 2014c. Correspondence analysis: Exploring data and identifying patterns. In Dylan Glynn & Justyna Robinson (eds.), Corpus methods for semantics: Quantitative studies in polysemy and synonymy, 443–486. Amsterdam: John Benjamins.10.1075/hcp.43.17glySearch in Google Scholar
Greenacre, Michael. 2007. Correspondence analysis in practice. London: Chapman & Hall.10.1201/9781420011234Search in Google Scholar
Gries, Stefan Th. 2003. Multifactorial analysis in corpus linguistics: A study of particle placement. London & New York: Continuum Press.Search in Google Scholar
Gries, Stefan Th. 2006. Corpus-based methods and cognitive semantics: The many meanings of to run. In Stefan Th. Gries & Anatol Stefanowitsch (eds.), Corpora in cognitive linguistics: Corpus-based approaches to syntax and lexis, 57–99. Berlin and New York: Mouton de Gruyter.10.1515/9783110197709.57Search in Google Scholar
Gries, Stefan Th. 2010. Behavioral profiles: A fine-grained and quantitative approach in corpus-based lexical semantics. The Mental Lexicon 5(3). 323–346. https://doi.org/10.1075/ml.5.3.04gri.Search in Google Scholar
Gries, Stefan Th. & Naoki Otani. 2010. Behavioral profiles: A corpus-based perspective on synonymy and antonymy. ICAME Journal 34. 121–150.Search in Google Scholar
He, Xiufeng. 2007. Cong “hong” yu “red” de yuyi duibi kan duiwai hanyu cihui jiaoxue [On vocabulary teaching in TCFL: A semantic comparison between the Chinese “hong” and the English “red”]. Journal of Yunnan Normal University (Teaching Chinese as a Foreign Language Edition) 5(3). 80–84.Search in Google Scholar
Hill, Russell A. & Robert A. Barton. 2005. Red enhances human performance in contests. Nature 435. 293. https://doi.org/10.1038/435293a.Search in Google Scholar
Hornby, Albert Sydney. 2018. Oxford advanced learner’s English-Chinese dictionary, 9th edn. Beijing: The Commercial Press.Search in Google Scholar
Hotelling, Harold. 1933. Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology 24. 417–441. https://doi.org/10.1037/h0070888.Search in Google Scholar
Huang, Borong & Xudong Liao. 2017. Xiandai Hanyu: Xia Ce [Modern Chinese: Volume 2], 6th revised edn. Beijing: Higher Education Press.Search in Google Scholar
Institute of Linguistics CASS (ed.). 2016. The contemporary Chinese dictionary, 7th edn. Beijing: The Commercial Press.Search in Google Scholar
Jansegers, Marlies & Stefan Th. Gries. 2020. Towards a dynamic behavioral profile: A diachronic study of polysemous sentir in Spanish. Corpus Linguistics and Linguistic Theory 16(1). 145–187. https://doi.org/10.1515/cllt-2016-0080.Search in Google Scholar
Ji, Xiaojing. 2003. Hanyingyu zhong “hong” zhi yinyu duibi fenxi. [A comparative analysis of the metaphor of “red” in Chinese and English]. Dangdai Xiuci Xue [Contemporary Rhetoric] (4). 48–49.Search in Google Scholar
Jiao, Tongmei. 2009. Cong yanseci de yuyi tezheng kan zhongxi wenhua chayi. [On the cultural difference between China and western countries from the connotation of color words]. Henan Ligong Daxue Xuebao (Shehui Kexue Ban) [Journal of Henan Polytechnic University (Social Sciences)] 10(3). 433–436.Search in Google Scholar
Jones, Karen Spärck. 1972. A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation 28(1). 11–21. https://doi.org/10.1108/eb026526.Search in Google Scholar
Katz, Jerrold J. & Jerry A. Fodor. 1963. The structure of a semantic theory. Language 39(2). 170–210. https://doi.org/10.2307/411200.Search in Google Scholar
Lenci, Alessandro. 2018. Distributional models of word meaning. Annual Review of Linguistics (4). 151–171. https://doi.org/10.1146/annurev-linguistics-030514-125254.Search in Google Scholar
Lenci, Alessandro & Sebastian Padó. 2022. Editorial: Perspectives for natural language processing between AI, linguistics and cognitive science. Frontiers in Artificial Intelligence 5. 1059998. https://doi.org/10.3389/frai.2022.1059998.Search in Google Scholar
Levshina, Natalia. 2015. How to do linguistics with R: Data exploration and statistical analysis. Amsterdam: John Benjamins.10.1075/z.195Search in Google Scholar
Lê, Sebastien, Julie Josse & Husson Francois. 2008. FactoMineR: A package for multivariate analysis. Journal of Statistical Software 25(1). 1–18. https://doi.org/10.18637/jss.v025.i01.Search in Google Scholar
Li, Xin. 2016. Hongloumeng zhong “hong” zi de yinyu chuangyi ji yizhe de celüe shiying [The metaphorical creativity of the character “red” in Dream of the Red Chamber and the translator’s strategy adaptation]. Mudanjiang Daxue Xuebao [Journal of Mudanjiang University] (5). 128–130.Search in Google Scholar
Li, Yuanyuan. 2023. Cultural differences between Chinese and English in color words and their translation: Taking red as an example. Modern Linguistics 11(1). 165–169. https://doi.org/10.12677/ml.2023.111023.Search in Google Scholar
Liao, Zhenggang & Zhong Yang. 2011. Yinghan jiben yanseci kua yufa fanchou de duibi yanjiu [A contrastive study of the grammatical transcategorization between English and Chinese basic color words]. Waiyu Xuekan [Foreign Language Research] (6). 21–24.Search in Google Scholar
Lichtenfeld, Stephanie, Markus A. Maier, Andrew J. Elliot & Reinhard Pekrun. 2009. The semantic red effect: Processing the word red undermines intellectual performance. Journal of Experimental Social Psychology 45. 1273–1276. https://doi.org/10.1016/j.jesp.2009.06.003.Search in Google Scholar
Liu, Meili. 2023. Towards a dynamic behavioral profile of the Mandarin Chinese temperature term re. Corpus Linguistics and Linguistic Theory 19(2). 289–321. https://doi.org/10.1515/cllt-2021-0046.Search in Google Scholar
Liu, Meichun & Jinmeng Dou. 2024. Metaphorical polysemy of the Chinese color term hēi “black”: A corpus-based cognitive semantic analysis with behavioral profiles. International Journal of Corpus Linguistics 29(1). 1–33. https://doi.org/10.1075/ijcl.21067.liu.Search in Google Scholar
McCann, Bryan, James Bradbury, Caiming Xiong & Richard Socher. 2017. Learned in translation: Contextualized word vectors. In Proceedings of the 31st international conference on neural information processing systems, 6297–6308.Search in Google Scholar
Mikolov, Tomas, Kai Chen, Greg Corrado & Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv. 1301.3781. https://doi.org/10.48550/arXiv.1301.3781.Search in Google Scholar
Peters, Matthew E., Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark, Kenton Lee & Luke Zettlemoyer. 2018. Deep contextualized word representations. arXiv. 1802.05365. https://doi.org/10.48550/arXiv.1802.05365.Search in Google Scholar
Qi, Chun. 2023. Hanyu yanseci “hong” yuyi yanbian de lishi xingwei tezheng fenxi [Towards the semantic evolution of Chinese color term hong: A diachronic behavioral profile]. Shandong: Shandong University MA thesis.Search in Google Scholar
Quirk, Randolph, Sidney Greenbaum, Geoffrey Leech & Jan Svartvik. 1985. A comprehensive grammar of the English language. New York: Longman.Search in Google Scholar
Radford, Alec, Karthik Narasimhan, Tim Salimans & Ilya Sutskever. 2018. Improving language understanding by generative pre-training. OpenAI Technical Report. 1–12.Search in Google Scholar
Rosch, Eleanor H. 1973. Natural categories. Cognitive Psychology (3). 328–350. https://doi.org/10.1016/0010-0285(73)90017-0.Search in Google Scholar
Salton, Gerard, Anita Wong & Chung-Shu Yang. 1975. A vector space model for automatic indexing. Communications of the ACM 18(11). 613–620. https://doi.org/10.1145/361219.361220.Search in Google Scholar
Schmid, Hans-Jörg. 2000. English abstract nouns as conceptual shells: From corpus to cognition. Berlin & New York: Mouton de Gruyter.10.1515/9783110808704Search in Google Scholar
Shen, Jiaxuan. 2009. Wo kan hanyu de cilei [My view on Chinese word classes]. Yuyan kexue [Linguistic Sciences] 8(1). 1–12.Search in Google Scholar
Taylor, John. 2003. Linguistic categorization, 3rd edn. Oxford: Oxford University Press.10.1093/oso/9780199266647.001.0001Search in Google Scholar
Tyler, Andrea & Vyvyan Evans. 2003. The semantics of English prepositions: Spatial scenes, embodied meaning and cognition. Cambridge: Cambridge University Press.10.1017/CBO9780511486517Search in Google Scholar
Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser & Illia Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems. 5998–6008.Search in Google Scholar
Wang, Siying. 2017. Yinghan yanseci “hong” de gainian yinyu duibi fenxi [Comparative analysis of the conceptual metaphors of the color term “red” in English and Chinese]. Xiandai yuwen [Modern Chinese] (8). 150–151.Search in Google Scholar
Wu, Tieping. 1986. Lun yanseci jiqi moqi xingzhi [On color terms and their metaphorical nature]. Yuyan jiaoxue yu yanjiu [Language Teaching and Research] 2. 88–105.Search in Google Scholar
Wu, Shuqiong & Dilin Liu. 2020. Cihui yuyi de yuliaoku lianghua yanjiu: Xingwei tezheng fenxi fa [Quantitative corpus methods for lexical semantic studies: Behavioral profile analysis]. Yingyu yanjiu [The Journal of English Studies] (1). 153–164.Search in Google Scholar
Xun, Endong, Gaoqi Rao, Xiaoyue Xiao & Jiaojiao Zang. 2016. Da shuju beijing xia BCC yuliaoku de yanzhi [The construction of the BCC Corpus in the age of Big Data]. Yuliaoku Yuyanxue [Corpus Linguistics] 3(1). 93–109+118.Search in Google Scholar
© 2024 Walter de Gruyter GmbH, Berlin/Boston