Skip to main content
Article Open Access

The generation of tourism knowledge by language model-based AI: A call for a more critical perspective

  • Thomas Bausch is a Professor in the Faculty of Tourism at the Munich University of Applied Sciences. From 2019 to 2024, he served as the Director of the Competence Centre for Tourism and Mobility (CCTM) at the Free University of Bozen-Bolzano. Prior to that, he spent many years as the Director of the Alpine Research Institute in Bavaria. His research interests focus on destination management, sustainable tourism, regional development, and mobility.

    EMAIL logo
Published/Copyright: January 28, 2026

Abstract

This opinion paper critically examines the potential and limitations of language model-based AI, with a focus on ChatGPT, in the context of tourism research. While recent studies emphasize the use of generative AI in education and service applications, little attention has been paid to its validity, reliability, and objectivity for scientific knowledge creation in tourism. Through empirical tests involving text analysis of tourism abstracts and image interpretation, the study reveals significant shortcomings in consistency, reliability, and methodological rigor. The findings underscore the risks of overreliance on AI-generated outputs without critical verification and highlight the necessity of maintaining established scientific standards. The paper advocates a cautious integration of AI as a supportive tool rather than a substitute for rigorous research.

Zusammenfassung

Dieser Diskussionsbeitrag untersucht kritisch die Potenziale und Grenzen sprachmodellbasierter KI, insbesondere ChatGPT, im Kontext der Tourismusforschung. Während aktuelle Studien vor allem den Einsatz generativer KI in der Lehre sowie in Serviceanwendungen betonen, wurde deren Validität, Reliabilität und Objektivität für die wissenschaftliche Erkenntnisgewinnung im Tourismus bislang kaum Beachtung geschenkt. Durch empirische Tests, die Textanalysen touristischer Abstracts sowie Bildinterpretationen umfassen, deckt die Studie erhebliche Mängel in Bezug auf Konsistenz, Zuverlässigkeit und methodische Strenge auf. Die Ergebnisse unterstreichen die Risiken einer leichtfertigen Übernahme von KI-generierten Ergebnissen ohne kritische Überprüfung und betonen die Notwendigkeit, etablierte wissenschaftliche Standards beizubehalten. Der Beitrag plädiert für eine vorsichtige Integration von KI als unterstützendes Werkzeug und spricht sich gegen deren Einsatz als Ersatz für fundierte, eigenständige Forschung aus.

1 Introduction

ChatGPT is currently the most popular AI application in the category of Generative Pre-trained Transformers (GPT) for ‘text-to-text’ tasks (Gozalo-Brizuela and Garrido-Merchan 2023). This AI solution generates high-quality natural language responses to nearly all types of text-based queries. Recently, numerous publications have explored the use of ChatGPT in academic settings, particularly in education (e. g., Cotton et al. 2024; Rudolph et al. 2023). Additionally, there is a rapidly growing body of literature examining AI-based applications in the tourism and service industries (Carvalho and Ivanov 2023; Tuo et al. 2024), as well as in specific areas such as consumer experience creation (Ghesh et al. 2024; Sharma and Aggarwal 2024), hospitality industry (Yin et al. 2023) or cruise tourism-related services (Gonzáles-Santiago et al. 2024).

However, there remains a scarcity of literature on the application of GPT models in the social sciences research process, particularly in tourism research, which often relies on text-based empirical studies. Nautiyal et al. (2023) discuss issues such as truth, plagiarism, publishing ethics, interdisciplinarity, and equity, calling for a broader conversation beyond the issue of plagiarism. Skavronskaya et al. (2023) examine aspects of academic integrity and ethics, particularly AI-related plagiarism, and advocate for adapting AI-integrated pedagogies in tourism education. Ivanov and Soliman (2023) highlight the benefits of ChatGPT in tourism education and scientific writing, using examples from publications on robots in tourism. Shin and Kang (2023) take this a step further in the same journal, conducting an ‘interview’ with ChatGPT consisting of 15 questions on the evolution, current state, and future directions of tourism research. They conclude that ChatGPT offers promising topics for future research, such as health, social justice, climate change, and governance/policy issues. They also note that ChatGPT proposed novel methodologies, such as participatory action research and mixed-method systematic reviews, which are not commonly used in current tourism research. Rigin et al. (2025) examine the implications of using generative artificial intelligence (AI) models, such as ChatGPT and Bard, in social science research and conclude: “generative AI can enhance data collection and analysis, offering alternative approaches to traditional research methodologies” (p. 284). This raises the question if these positive research support assumptions are based on evident tests or just assumptions expressing the hope for reducing the amount of research work needed when following traditional research practices?

2 Analytic approach to analyze ChatGPT’s research capabilities

A common theme across these publications discussing the potential of AI and more specifically ChatGPT is the absence of doubts regarding the ability of GPT applications, to meet the fundamental requirements of the scientific community, as these are validity, reliability, and objectivity, that have been demanded of researchers for centuries. This paper aims to analyze and discuss the potential of ChatGPT as a tool for research. To this end, the author tested ChatGPT on several typical research-relevant topics commonly encountered in tourism research using the releases 3.5 (database 09/2022), GPT-4o (10/2023) and finally GPT-5 (10/2024). First, the completeness of resources of the knowledge base of ChatGPT was investigated. Afterward, the reliability of text analysis based on 31 paper abstracts was tested. Finally, the quality of picture content interpretation and picture creation using DALL-E were checked.

3 The role of scientific publications in the ChatGPT knowledge base

In a first step we asked ChatGPT about the role of scientific publications for tourism research when creating its latest release of the knowledge base (detailed question and answer by ChatGPT see Appendix).

Despite the awareness that the responses generated by ChatGPT are the product of a language model, clear limitations became evident. OpenAI’s developers are fully aware of the current constraints of ChatGPT, and more broadly, of any AI-based Generative Pre-trained Transformer (GPT) concerning both the knowledge base and the algorithmic training. The effectiveness of the knowledge base and the performance of the language model are contingent upon the quality of the data collected and the training process employed.

OpenAI’s GPT solutions are built on a three-step procedure (OpenAI 2022): In the first step, relevant data that is not yet part of the knowledge base is selected, and supervised training is conducted to produce the desired output. In the second step, similar data from the same subject area is used to generate responses based on the initial training. The generated responses are then compared, and their quality is ranked by a ‘labeler.’ This quality ranking informs a second round of training. The third step involves a continuous automated loop governed by a Proximal Policy Optimization (PPO) algorithm, a reinforcement learning technique used for optimizing decision-making and control policies in machine learning. It’s important to note that PPO is unrelated to the architecture or version of ChatGPT itself. These algorithms require specific parameters, such as threshold values, to initiate the feedback loop. A critical issue in this process is the definition of penalties for offensive or toxic language, repetition, or overly sentimental responses.

This three-step procedure necessitates human interaction with the AI. Initially, a human operator must define the ‘desired output behavior,’ which requires deep knowledge of the topic described by the input data. This introduces the potential for bias due to the operator’s knowledge gaps—the so-called ‘labeler’ effect. Acerbi and Stubbersfield (2023) described an additional risk of bias arising from the preselection of training sources and the cultural background of the training personnel, which can lead to human-like content biases, especially in sociological or behavioral contexts. Their research demonstrated that ChatGPT reproduces gender stereotypes analogous to those found in human societies, indicating that the concept of a ‘desired output’ during training may have been biased by the personal positions of the trainers. It is well established that, for many topics, there is no universally accepted normative stance; the desired output often reflects cultural or political positions. Given the cultural value differences observed across regions and countries (Schwartz and Cieciuch 2021), it is likely that the cultural background of the ‘labelers’ influences the model’s output. Moreover, as topics become more specialized, the need for extensive background knowledge among those training the system becomes more pronounced. There are clear limitations to this without the use of secondary and particularly scientific sources. The closer a topic is to ongoing research or recently published findings, the less likely it is that the ‘labeler’ can objectively assess the ‘desirability’ of an answer without access to scientific publications.

In the second step, an individual must evaluate the quality of the system’s alternative outputs, assessing both linguistic quality and context-related content quality. This judgment is subject to the same biases as those encountered in the first step. Finally, in the third step, the PPO algorithm requires parameter settings to calculate the reward or penalty values that define the loop for improving output quality. This step also requires input from a person programming the algorithm—the ‘labeler’.

Examining these three steps, it becomes clear that each is governed by human intervention. This introduces various correlations between the output of the GPT-system and the characteristics of the individuals involved in its training, such as their professional background, political affiliation, religion, culture, and gender. Expanding the knowledge base to include more complex topics, as is common in research, necessitates a higher level of expertise from those participating in the training process.

4 Findings of text and picture analysis tests

4.1 Text based research

While the limitations of Generative Artificial Intelligence, exemplified by ChatGPT, are clear, it is important to analyze the potential of these software solutions as research tools. The discussion should not be framed as a simple ‘pro’ or ‘con’ debate about AI in research. Rather, the focus should be on identifying specific steps in the research process where an AI tool could be effectively used to generate knowledge or replace less efficient methods. Meyer et al. (2023, 1) state “ChatGPT is powered by a Large Language Model (LLM), a type of deep learning model trained on massive amounts of publicly available text data, such as books, articles, and webpages, to generate human-like responses in conversations”. This suggests its use in research applications involving the analysis and structuring of large volumes of text. To test this potential, the author provided ChatGPT with the abstracts of the 33 papers discussed by (Bausch et al. 2024) about the neologism ‘overtourism’. 31 of these papers provided a comprehensive abstract and were copied into two separate word files. In the first file the abstracts were sorted by first author name from A to Z, in the second from Z to A. The test was performed on two different computers C1 and C2 logged in two different networks. As first test, ChatGPT was asked:

Task: This file contains 31 abstracts from scientific journal articles. Analyze first all concerning the key topics, methodology and described results. After write a summary along these categories with about 200 words in a scientific language style.

Even though the wording must not be expected to be identical following the scientific principle of reliability at least a largely consistent result in terms of content should be provided. It is remarkable how strong the two responses differ from each other. While on computer C2 ChatGPT follows the instruction exactly, on C1 it skips the precise presentation of key topics, methods, and results before providing the summary. The results also differ significantly in terms of level of detail. Neither response explicitly refers again to the numbered abstracts, which makes an in-depth analysis impossible.

In a second test after starting a new chat and using the same two files on two different computers, ChatGPT was given a more specific task for text analysis:

Task: This file contains abstracts from scientific journal publications dealing with overtourism. Some of them focus on overtourism in cities. First, analyze all the abstracts, and then provide a summary of those discussing overtourism in cities. Include a complete list of the cities discussed in the summary.

The short summaries of the 31 abstracts generated by ChatGPT on C1 are now more detailed than on C2. In addition, ChatGPT unexpectedly provides a precise reference (abstract number) on the first Computer without this having been explicitly requested. It is also noticeable that the short summaries often differ greatly. Moreover, the answer on C2 makes inaccurate generalizations. For example, it states about Barcelona that “several studies examine tourism degrowth discourse,” whereas in fact this topic appears in only one of the abstracts. Both responses share the problem of being incomplete: on C1 ChatGPT identifies twelve, C2 identifies only 10 of the total 14 cities mentioned in the abstracts. Such incomplete lists, despite the explicit request for a “complete list”, are unacceptable in the context of academic work.

In a final test these publications were grouped according to the methodologies used, the key topics discussed, and the main results mentioned in the abstracts. When comparing the AI-generated summary results for groups of 3–5 publications of similar type with those from a traditional content analysis conducted in parallel, a high level of coherence was observed. However, this requires the pre-structuring of the abstracts by the researcher and therefore makes the careful reading and study of each abstract indispensable. Therefore, the reduction in workload through the use of AI is negligible. In fact, it can be assumed that the time and effort required for a careful follow-up check will be higher than that of a traditional text analysis by a researcher.

4.2 Picture based research

Another test involved used ChatGPT for image analysis. ChatGPT was asked to analyze and compare images from the publication by Ortanderl and Bausch (2023), which examined the perception of nature photography in tourism. Specifically, ChatGPT was asked to assess the level of crowding depicted in the images. Table 1 presents two of the tested images along with ChatGPT’s responses to the prompt: “I uploaded several pictures showing situations in tourism destinations. Please describe the situations in general and, more specifically, evaluate the level of crowding considering the size of the area.”

Table 1:

Two illustrative photos of typical touristic situations and description/interpretation of ChatGPT

Picture

Description/interpretation by ChatGPT

General Description: The image shows a mountain trail with a large number of hikers, most of whom are dressed in outdoor clothing and carrying backpacks. The trail appears to be in a natural setting with trees and rocky terrain. The hikers are moving in a dense group, indicating a popular hiking spot.

Crowding Evaluation: The area seems relatively narrow, and the high concentration of people creates a crowded atmosphere. Given the size of the trail and the number of hikers, this can be considered a high level of crowding.

General Description: This image shows a ski resort with several buildings and ski lifts on a snowy mountain. Numerous people are visible on the snow-covered ground, some walking, while others may be skiing or waiting near the lifts. The overall scene suggests an active winter tourism destination.

Crowding Evaluation: The area is quite expansive, and while there are many people visible, they are spread out over a large area. The level of crowding is relatively low considering the open space available. The crowding could be considered light to moderate depending on the specific activities taking place.

This example demonstrates the current capabilities of ChatGPT in photo interpretation, providing a high-quality foundation for further text-based research. The assessment of crowding is appropriate for the images and situations depicted, but it is important to ask the question precisely, including phrases such as “considering the size of the area” to ensure accurate interpretation. E.g., asking for photo 2 “Please analyze the crowding level of the situation in the uploaded picture” ChatGPT answers: “I would classify the overall crowding level as moderate-to-high, with localized peaks of very high density around the lift and restaurant hub. This is typical for ski areas at main junction points where multiple lifts meet and services are concentrated”. This again highlights the issue of limited reliability.

low

medium

high

Figure 1:

Pictures generated by DALL-E using ChatGPT 5

Additionally, a test was conducted using ChatGPT 5 in conjunction with the DALL-E image generation software to create three images of a Mediterranean bay, each with different levels of crowding. These images were intended for use in a discrete choice modeling study. In this experiment, we uploaded an initial photo and requested the generation of three variations showing low, medium, and high levels of crowding. Figure 1 illustrates the results of this experiment.

Although the three images depict a Mediterranean bay with varying levels of crowding, the situations are not directly comparable due to differences in perspective and elements within each image. In the highly crowded image, DALL-E even added some hotels. These inconsistencies make the images unsuitable for a discrete choice modeling experiment, as the differing perspectives and additional elements could influence the overall perception. To accurately measure the effect of crowding, it is essential to have the exact same bay depicted with varying levels of crowding across all three situations. For this specific task, the current version of ChatGPT/DALL-E is inadequate. The underlying issue is that the three images are generated consecutively rather than simultaneously. When asked to produce the exact same background image, the response received was:

ChatGPT: For precise research needs, you may consider using a different method to obtain separate images, such as editing existing photos manually or using advanced image manipulation software. free to ask.

5 Discussion

The results of the conducted tests indicate that, at present, ChatGPT does not produce reliable, complete, and therefore valid outcomes when performing two relatively simple tasks involving text analysis. This underscores that such AI tools cannot replace the careful and thorough work of researchers. The incomplete analysis of texts when searching for a clearly defined object category, such as geographic locations mentioned in connection with overtourism, clearly demonstrates the limitations of the tool’s usability. In scientific work, a reliable and complete analysis of texts or data is essential as the basis for interpretation. If this basis is incomplete, any interpretation in relation to a research question will necessarily remain incomplete as well.

With the enhanced individualization of dialogue management in versions 4o and 5, there is a considerable risk that responses will diverge significantly between two researchers who interact with ChatGPT in different communication styles. From the perspective of result reliability, this represents a step backward. It remains unclear to what extent the individualization of AI responses, in combination with its ability to recognize personal background characteristics of researchers such as geographic origin, gender, academic training, and the capacity to formulate precise research questions, may influence the principle of objectivity. When AI begins to exert suggestive influence on the researcher in an effort to establish a personal relationship through dialogic interaction, and in doing so appears to objectify the researcher’s personal views, there is a risk that the research process may lose its foundation in evidence-based reasoning and thereby compromise its objectivity.

Even more concerning are recent journal publications that discuss the potential for plagiarism or the use of ChatGPT to structure future research proposals. As ChatGPT reports itself its knowledge base is neither up-to-date nor grounded in original academic sources. As a result, much of the existing scholarly work in the tourism knowledge system, as described by Tribe and Liburd (2016), is either ignored or only superficially incorporated through citations from unclear and frequently secondary sources. If ChatGPT’s claim that it has rarely used scientific publications to create its knowledge base is true, then plagiarism in the strict sense is not possible.

However, the responses generated by ChatGPT, particularly regarding methodological deficits and improvement options, should be viewed with even greater skepticism. For example, a statement about the methodologies found in the analyzed abstracts like “participatory action research and mixed-method systematic reviews, which are not commonly used in current tourism research,” is a prime example of AI hallucination, as the knowledge base lacks any substantive insight into the methods used in tourism research. The judgment “not commonly used” maybe was found in a blog or a discussion on Trinet, but it cannot be the result of a comprehensive analysis of thousands of scientific articles.

6 Conclusions

In general, progress in science is a gradual process. The generation of tourism knowledge by researchers typically begins with the observation of phenomena in the real world of tourism that demand explanation. Tourism, being an interdisciplinary field, draws from various disciplines, which is why Tribe (1997) referred to it as the “indiscipline of tourism”. The tourism knowledge system integrates multiple fields, such as social sciences, business studies, humanities, arts, and others (Tribe and Liburd 2016). Scholars from different disciplines, each with its own terminology and research traditions, must collaborate to generate different types of knowledge, such as problem-centered or value-based knowledge. This knowledge is eventually represented, disseminated, and stored through various types of publications: conference papers, books, and journal articles.

The exchange of knowledge among researchers, particularly those from different disciplines, requires precise and comprehensible descriptions of research, necessitating a common cross-disciplinary understanding of the terminology used (Belhassen and Caton 2009). Therefore, it is a standard practice in research to apply precise citations of reviewed publications that represent proven knowledge relevant to a particular problem. The traceability of knowledge evolution, along with the context, discipline, location, timing, and authorship of its generation, are key principles in research. This is what differentiates a text presenting mere assertions from one that is grounded in a consistent, yet not always traceable, set of knowledge-based arguments. Moreover, contributions to the tourism knowledge system require critical discourse and independent validation, including the possibility of falsification by other researchers. I advocate for a more critical and realistic reflection on what ChatGPT specifically, and Generative AI models in general, can contribute to tourism research.

Based on this analysis, it is crucial to distinguish between either using ChatGPT as a tool supporting the researcher or relying on it to generate new results for the tourism knowledge system. The use of AI as a tool can be seen as an opportunity in highly standardized tasks that are detached from the substantive engagement with the research question can offer support and potentially accelerate time-consuming steps. Examples include scripts for data preparation, implementation and programming of complex statistical analyses, formatting output tables in a specific style for publication, and checking texts for correct spelling and grammar. However, effective interaction with AI requires a sound understanding of all work steps, as precise use is otherwise not possible. The example of programming scripts for statistical analyses in R clearly illustrates the need for strong knowledge of R command syntax. Only if the researcher provides the AI with the correct variable names for the analysis and is able to specify which variables are dependent and which are independent in multivariate analyses can the AI be used effectively.

Moreover, only the researcher can provide the substantive interpretation of the results in light of the underlying research questions and hypotheses. The risks associated with the use of AI in tourism research lie primarily in the area of wrong or incomplete interpreting data or research findings in relation to specific research questions. On the one hand, the tests have shown that there is no guarantee of completeness or reliability. In addition, the knowledge base regarding existing research findings is incomplete due to the training techniques used in AI development. And finally, the new highly individualized dialogue system increases the risk of a lack of objectivity of researchers following the suggestions of the AI.

While AI-systems are undeniably fascinating, as researchers, we must maintain a critical distance and first ask, ‘What is behind this, and how does it work?’ Just as we approach statistical analysis with a careful selection of the most appropriate methods based on the research question and data type, we must apply the same rigor to AI. First, we need to understand how AI works and assess the adequacy of its knowledge base before deciding to use it for research purposes. Understanding that the current knowledge base of ChatGPT is widely disconnected from scientific publications should prevent any researcher from using it to generate outputs related to the research meta-level, as its output is merely a hallucination.

7 Limitations

This methodological approach has limitations, as the results are based on a specific set of questions and tasks asked to ChatGPT. A different way of asking and related individualized dialogues might lead to other results. This also applies to the selected set of abstracts as basis for text analysis. In another field of research, using different text, other outcomes might be observed. The fact that results may vary in quality and type depending on the questions and input data already highlights the general challenge researchers face when considering validity, reliability, and objectivity. The issues discussed in this paper raise a more fundamental question regarding the development of quality criteria for scientific work involving AI. It is not yet clear what such criteria should entail or how they could be implemented within AI models. Addressing this challenge will require future research and development efforts. This aspect currently remains an open and unresolved issue that deserves further exploration.

About the author

Prof. Dr. Thomas Bausch

Thomas Bausch is a Professor in the Faculty of Tourism at the Munich University of Applied Sciences. From 2019 to 2024, he served as the Director of the Competence Centre for Tourism and Mobility (CCTM) at the Free University of Bozen-Bolzano. Prior to that, he spent many years as the Director of the Alpine Research Institute in Bavaria. His research interests focus on destination management, sustainable tourism, regional development, and mobility.

References

Acerbi, Alberto, and Joseph M. Stubbersfield. 2023. “Large Language Models Show Human-Like Content Biases in Transmission Chain Experiments.” Proceedings of the National Academy of Sciences 120 (44): e2313790120. https://doi.org/10.1073/pnas.2313790120.10.1073/pnas.2313790120Search in Google Scholar

Bausch, Thomas, Tilman Schröder, and Verena Tauber. 2024. “What Is to Be Sustained? The Polysemy of Sustainability and Sustainable Tourism across Languages and Cultures.” Journal of Sustainable Tourism 32 (1): 108–31. https://doi.org/10.1080/09669582.2022.2124260.10.1080/09669582.2022.2124260Search in Google Scholar

Belhassen, Yaniv, and Kellee Caton. 2009. “Advancing Understandings: A Linguistic Approach to Tourism Epistemology.” Annals of Tourism Research 36 (2): 335–52. https://doi.org/10.1016/j.annals.2009.01.006.10.1016/j.annals.2009.01.006Search in Google Scholar

Carvalho, Inês, and Stanislav Ivanov. Forthcoming. “ChatGPT for Tourism: Applications, Benefits and Risks.” Tourism Review 79 (2): 290–303. https://doi.org/10.1108/TR-02-2023-0088.10.1108/TR-02-2023-0088Search in Google Scholar

Cotton, Debby R. E., Peter A. Cotton, and J. Reuben Shipway. 2024. “Chatting and Cheating: Ensuring Academic Integrity in the Era of ChatGPT.” Innovations in Education and Teaching International 61 (2): 228–39. https://doi.org/10.1080/14703297.2023.2190148.10.1080/14703297.2023.2190148Search in Google Scholar

Ghesh, Nada, Matthew Alexander, and Andrew Davis. 2024. “The Artificial Intelligence-Enabled Customer Experience in Tourism: A Systematic Literature Review.” Tourism Review 79 (5): 1017–37. https://doi.org/10.1108/TR-04-2023-0255.10.1108/TR-04-2023-0255Search in Google Scholar

González-Santiago, Marcya Stefany, Sandra Maria Correira Loureiro, Daniela Langaro, and Faizan Ali. 2024. “Adoption of Smart Technologies in the Cruise Tourism Services: A Systematic Review and Future Research Agenda.” Journal of Hospitality and Tourism Technology 15 (2): 285–308. https://doi.org/10.1108/JHTT-06-2022-0159.10.1108/JHTT-06-2022-0159Search in Google Scholar

Gozalo-Brizuela, Roberto, and Eduardo C. Garrido-Merchan. 2023. “ChatGPT Is Not All You Need: A State-of-the-Art Review of Large Generative AI Models.” arXiv preprint arXiv:2301.04655. https://doi.org/10.48550/arXiv.2301.04655.Search in Google Scholar

Ivanov, Stanislav, and Mohammad Soliman. 2023. “Game of Algorithms: ChatGPT Implications for the Future of Tourism Education and Research.” Journal of Tourism Futures 9 (2): 214–21. https://doi.org/10.1108/JTF-02-2023-0038.10.1108/JTF-02-2023-0038Search in Google Scholar

Meyer, Jesse G., Ryan J. Urbanowicz, Patrick C. N. Martin, Karen O’Connor, Ruowang Li, Pei-Chen Peng, Tiffani J. Bright, Nicholas Tatonetti, Kyoung Jae Won, Graciela Gonzalez-Hernandez, and Jason H. Moore. 2023. “ChatGPT and Large Language Models in Academia: Opportunities and Challenges.” BioData Mining 16 (1): 20. https://doi.org/10.1186/s13040-023-00339-9.10.1186/s13040-023-00339-9Search in Google Scholar

Nautiyal, Rajesh, Julia N. Albrecht, and Ashish Nautiyal. 2023. “ChatGPT and Tourism Academia.” Annals of Tourism Research 99: 103544. https://doi.org/10.1016/j.annals.2023.103544.10.1016/j.annals.2023.103544Search in Google Scholar

OpenAI. 2022. “Introducing ChatGPT.” ChatGPT (blog), November 30, 2022. https://openai.com/blog/chatgpt#OpenAI (accessed December 15, 2024).Search in Google Scholar

Ortanderl, Florian, and Thomas Bausch. 2023. “Wish You Were Here? Tourists’ Perceptions of Nature-Based Destination Photographs.” Journal of Destination Marketing & Management 29: 100799. https://doi.org/10.1016/j.jdmm.2023.100799.10.1016/j.jdmm.2023.100799Search in Google Scholar

Rigin, Sebastian, Noufal Naheem Kottekkadan, Toney K. Thomas, and Mohammad Niyas KK. 2025. “Generative AI Tools (ChatGPT*) in Social Science Research.” Journal of Information, Communication and Ethics in Society 23 (2): 284–90. https://doi.org/10.1108/JICES-10-2024-0145.10.1108/JICES-10-2024-0145Search in Google Scholar

Rudolph, Jürgen, Samson Tan, and Shannon Tan. 2023. “ChatGPT: Bullshit Spewer or the End of Traditional Assessments in Higher Education?” Journal of Applied Learning and Teaching 6 (1): Article 1. https://doi.org/10.37074/jalt.2023.6.1.9.10.37074/jalt.2023.6.1.9Search in Google Scholar

Schwartz, Shalom H., and Jan Cieciuch. 2021. “Measuring the Refined Theory of Individual Values in 49 Cultural Groups: Psychometrics of the Revised Portrait Value Questionnaire.” Assessment 29 (5): 1005–1019. https://doi.org/10.1177/1073191121998760.10.1177/1073191121998760Search in Google Scholar

Sharma, Ishani, and Arun Aggarwal. 2024. “Revolutionizing Creative Tourism: Integrating AI in Marketing for Immersive Travel Experiences.” In Innovative Technologies for Increasing Service Productivity, edited by Sridhar Manohar, et al., 179–197. Hershey, PA: IGI Global. https://doi.org/10.4018/979-8-3693-2019-8.ch011.10.4018/979-8-3693-2019-8.ch011Search in Google Scholar

Shin, Hakseung, and Juhyun Kang. 2023. “Bridging the Gap of Bibliometric Analysis: The Evolution, Current State, and Future Directions of Tourism Research Using ChatGPT.” Journal of Hospitality and Tourism Management 57: 40–47. https://doi.org/10.1016/j.jhtm.2023.09.001.10.1016/j.jhtm.2023.09.001Search in Google Scholar

Skavronskaya, Liubov, Arghavan (Hana) Hadinejad, and Debbie Cotterell. 2023. “Reversing the Threat of Artificial Intelligence to Opportunity: A Discussion of ChatGPT in Tourism Education.” Journal of Teaching in Travel & Tourism 23 (2): 253–58. https://doi.org/10.1080/15313220.2023.2196658.10.1080/15313220.2023.2196658Search in Google Scholar

Tribe, John. 1997. “The Indiscipline of Tourism.” Annals of Tourism Research 24 (3): 638–57. https://doi.org/10.1016/S0160-7383(97)00020-0.10.1016/S0160-7383(97)00020-0Search in Google Scholar

Tribe, John, and Janne J. Liburd. 2016. “The Tourism Knowledge System.” Annals of Tourism Research 57: 44–61. https://doi.org/10.1016/j.annals.2015.11.011.10.1016/j.annals.2015.11.011Search in Google Scholar

Tuo, Yanzheng, Jiankai Wu, Jingke Zhao, and Xuyang Si. 2024. “Artificial Intelligence in Tourism: Insights and Future Research Agenda.” Tourism Review 80 (4): 793–812. https://doi.org/10.1108/TR-03-2024-0180.10.1108/TR-03-2024-0180Search in Google Scholar

Yin, Dexiang, Minglong Li, Hailian Qiu, Billy Bai, and Lili Zhou. 2023. “When the Servicescape Becomes Intelligent: Conceptualization, Assessment, and Implications for Hospitableness.” Journal of Hospitality and Tourism Management 54: 290–99. https://doi.org/10.1016/j.jhtm.2023.01.001.10.1016/j.jhtm.2023.01.001Search in Google Scholar

Author information

Appendix

A1: Dialogue Documentation “The role of scientific publications in the ChatGPT knowledge base”

Question: How does ChatGPT use scientific publications in its knowledge base related to tourism research? Can you briefly summarize the models’ limitations to support scientific paper writing.

Answer ChatGPT:

  1. Knowledge Cutoff & Freshness: My training data only goes up to October 2024. I cannot automatically access or integrate the latest publications unless they are open-access online or provided by the user.

  2. Access to Sources: I do not have direct access to paywalled scientific databases (e.g., Elsevier, Springer, Sage, Taylor & Francis). This means I cannot retrieve or quote full-text articles unless they are uploaded or openly available.

  3. Accuracy & Verification: While I can summarize and synthesize well, I may generate plausible but unverified claims (sometimes called “hallucinations”). References and citations must always be checked against the original sources.

  4. Methodological Insight: I can describe common methods and frameworks, but I lack the capacity to design or conduct empirical research. My support is strongest in structuring, summarizing, and drafting text, not in replacing critical methodological reasoning.

  5. Writing Style & Nuance: I produce clear, fluent English but may lack the disciplinary nuance and rhetorical style expected in specific journals. Human revision is essential to align with field-specific terminology and argumentation norms.

  6. Ethical & Transparency Concerns: Overreliance on AI may obscure authorship and originality. Good practice requires transparency about AI assistance in academic writing, according to most journals’ guidelines.

A2: Tables

Table A1:

Summary test for 31 identical abstracts sorted by author name A–Z and Z–A

Summary of Key Topics, Methodology, and Results (A–Z) Computer 1 (C1)

Summary of Key Topics, Methodology, and Results (Z–A) Computer 2 (C2)

Summary

The reviewed abstracts collectively address overtourism as a pressing challenge in contemporary tourism studies, particularly in urban and heritage destinations. Key topics include the drivers of overtourism (low-cost carriers, digital platforms, cruise tourism, policy failures), its socio-economic and environmental impacts (resident displacement, gentrification, environmental degradation, community resistance), and potential governance and policy responses. A recurrent theme is the tension between residents’ rights and tourists’ access, often framed within debates on sustainability and degrowth.

Key Topics:

The central theme across the abstracts is overtourism and its consequences for urban, coastal, and heritage destinations. Recurring topics include social carrying capacity, resident attitudes, governance challenges, degrowth discourses, sustainability, and resilience. Case studies span European cities such as Barcelona, Venice, Dubrovnik, Budapest, Munich, Hamburg, Porto, Ljubljana, and Krakow, but also extend to Hong Kong, Ontario, and Australia’s Shipwreck Coast. Several papers emphasize conflicts between residents and tourists, the rise of short-term rentals and cruise tourism, and policy responses. Conceptual contributions include degrowth frameworks, overtourism typologies, monitoring models, and discourse analyses.

Methodologically, the studies employ a diverse range of approaches. Quantitative methods include resident and tourist surveys (e.g., Porto, Munich, Besalú, Krakow), statistical modeling (Hong Kong hysteresis model), and big data sentiment analysis (Barcelona). Qualitative methods encompass interviews with stakeholders, ethnographic research (Barcelona), discourse and policy analysis (Balearic Islands, Venice), and conceptual or integrative reviews. Mixed-methods and interdisciplinary approaches are also visible, particularly in case studies addressing governance, planning, and social mobilizations.

Methodologies:

A wide methodological spectrum is evident. Many studies rely on quantitative surveys (e.g., residents in Munich, Besalú, Porto, Majorca, Norway, Ontario, Ljubljana), while others use qualitative approaches such as discourse analysis, ethnography, interviews, participant observation, or critical literature reviews. Several papers propose conceptual models (e.g., conflict mechanisms in Hamburg, urban overtourism typology, monitoring models). A few employ big data and sentiment analysis (social media in Barcelona) or statistical models (hysteresis in Hong Kong). Comparative case studies and integrative reviews are also frequent.

Results consistently indicate that overtourism undermines community well-being, reduces resident support for tourism, and accelerates socio-ecological vulnerabilities. However, evidence also shows that stakeholder dialogue, improved governance, sustainable destination management, and alternative paradigms such as degrowth can mitigate negative impacts. Overall, the research highlights overtourism as both a manifestation of unchecked growth and an opportunity to reimagine tourism futures.

Results:

Findings highlight that overtourism produces social, cultural, and environmental degradation, rising costs of living, and resident opposition. Negative perceptions are often linked to housing market pressures, crowding, and nighttime economies. However, tourism also generates economic benefits, creating ambivalence among residents. Studies show that greater knowledge of tourism activities often heightens perceived costs. Solutions proposed include stakeholder dialogue, better governance, activity diversification, resilience-building, stricter regulation (especially for cruises and rentals), and degrowth-inspired strategies. Conceptual analyses stress that overtourism is multidimensional, socially constructed, and closely tied to global capitalism and growth-oriented tourism policies

200-word scientific summary:

The reviewed body of research collectively addresses the phenomenon of overtourism as a pressing challenge for global tourism systems. Empirical studies and conceptual contributions converge on the finding that excessive visitation exerts profound socio-cultural, economic, and environmental impacts, particularly in urban and heritage contexts. Key topics include social carrying capacity, resident–tourist conflicts, housing market distortions, tourism gentrification, and the sustainability of destination management strategies. Methodologically, the studies employ quantitative surveys, qualitative ethnographies, discourse and policy analysis, big data approaches, and conceptual modelling. This diversity of methods underscores the multidimensionality of overtourism and the heterogeneity of stakeholder perceptions. Results consistently reveal that while tourism fosters economic growth and cultural exchange, it simultaneously generates

resistance when thresholds of tolerance are exceeded, manifesting in protest movements and degrowth discourses. Case studies from European cities (Barcelona, Venice, Dubrovnik, Budapest, Munich) and beyond (Hong Kong, Ontario, Australia) illustrate both global commonalities and local specificities of overtourism dynamics. Policy recommendations include enhanced governance frameworks, stakeholder dialogue, diversification of activities, and the adoption of sustainable and degrowth-oriented strategies. Overall, the literature positions overtourism as a symptom of growth-dependent tourism development, demanding systemic change toward resilience and socio-ecological equity.

Table A2:

City test for 31 identical abstracts sorted by author name A–Z and Z–A

Cities discussed (A–Z) Computer 1

Cities discussed (Z–A) Computer 2

Barcelona (Spain): Case studies on the role of social media in attracting Chinese tourists (Abstract 1). Resident rejection linked to vacation rentals and rising housing prices (Abstract 16). Ethnographic research on protests, degrowth movements, and resistance to tourism monoculture (Abstract 17).

Porto (Portugal): Survey of 140 residents: overall positive perception of tourism but concerns about future problems (Abstract 4).

Hong Kong (China, SAR): Overtourism linked to “multiple-entry permit” policy, leading to conflicts with residents (Abstract 6).

Ljubljana (Slovenia): Rapidly growing destination, potential overtourism issues; monitoring model tested with resident attitudes (Abstract 15).

Munich (Germany): Survey of residents on different forms of urban tourism; multiple forms of overtourism identified (Abstract 19).

Dubrovnik (Croatia): Focus on planning interventions to mitigate overtourism, reclaiming spaces for locals (Abstract 23).

Venice is highlighted as a critical case, especially due to the impacts of cruise ship tourism, which threatens cultural heritage, the lagoon environment, and residents’ quality of life.

Krakow is studied through surveys of residents and service providers, showing mixed attitudes toward tourism growth, gentrification, and changes to the historic cityscape.

Budapest appears in multiple papers: one discusses resident resistance to developments and nightlife-related issues, another focuses on the “party quarter” (District VII), documenting resident complaints about noise, litter, and declining quality of life.

Hamburg is analyzed through a conflict model between residents and visitors, emphasizing crowding, Airbnb expansion, and inappropriate visitor behavior.

Budapest (Hungary): Overtourism in the “party quarter” (District VII), with complaints about noise, dirt, and crime (Abstract 25). Wider resistance movements in the city (Olympic bid rejection, city park protests, nightlife issues) (Abstract 27).

Hamburg (Germany): Conceptual model of resident–visitor conflicts applied to Hamburg using hotel/Airbnb data (Abstract 26).

Krakow (Poland): Research on resident and industry attitudes towards growing tourism and gentrification of historical districts (Abstract 28).

Venice (Italy): Tourism pressure from cruise ships; UNESCO warning; heritage and environmental degradation (Abstract 29).

Besalú (Spain): Heritage town with high day-tripper pressure; study on social carrying capacity (Abstract 18).

Majorca / Balearic Islands (Spain): Overtourism linked to vacation rentals and residents’ perceptions (Abstract 11). Tourism degrowth debate in Balearic Islands (Abstract 30).

Dubrovnik is studied from an urban planning perspective, with suggestions to reclaim tourist spaces for locals and integrate planning to mitigate segregation.

Munich is examined via a survey of residents’ perceptions, highlighting the complexity of different forms of overtourism within one city.

Barcelona receives significant attention: several studies examine tourism degrowth discourse, housing market effects of vacation rentals, grassroots activism, and the role of social networks in reinforcing visitor flows.

Ljubljana is presented as a fast-growing destination facing overtourism risks, with resident attitudes influencing management strategies.

Hong Kong is discussed in relation to cross-border day-trippers caused by a multiple-entry visa policy, which strained resident–visitor relations.

Porto is surveyed to assess residents’ views, revealing perceived economic benefits but also concerns about future overtourism.

Published Online: 2026-01-28
Published in Print: 2026-03-03

© 2025 the author(s), published by Walter de Gruyter GmbH, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 International License.

Downloaded on 29.4.2026 from https://www.degruyterbrill.com/document/doi/10.1515/tw-2025-0028/html?lang=en
Scroll to top button