Abstract
Purpose
This paper aims to analyze how vloggers use interactions of different modalities in transnational travel vlogs on YouTube to draw audiences’ attentions, as well as the impact of the socio-cultural context on the discourse of these vlogs and the represented city images.
Design/methodology/approach
This research project adopts a multimodal approach by using a new analytical framework based on provenance symbols in semiotics, appraisal systems, and visual grammar in linguistics as theoretical foundations.
Findings
Our research findings show that there are four dominant themes, including nature enjoyment, landmark visiting and street wandering, food tasting, and infrastructure evaluation in these vlogs. North American full-time vloggers with advanced skills and rich experience are the main contributors, which may lead to a fixed and biased representation of city images. The mechanism of city image formation, from cities’ physical existence to foreign viewers’ perception via transnational travel vlogs, has also been illustrated.
Implications
The city image exists in a dynamic landscape where online opinion leaders also hold significant discourse power. Though becoming more visible, cities are facing challenges related to the increasing homogenization of mediated images. The research findings also have practical implications for redefining approaches to the international communication of city images to diverse stakeholders including city administrators and travel vloggers.
Originality/value
This paper expands the scope of research on urban communication by adopting a comparative and global perspective. The new analytical framework can serve as a reference for future studies on new media content. Furthermore, the systematic mechanism of city image formation presented contributes to a deeper understanding of the role of new media in the international communication of city images.
1 Introduction
The rapid technological development in human society has allowed us to experience exponential changes, transitioning from a text-dominated life to an era exploding with information and replete with multitudinous images. Short videos are now popping up and being consumed and shared by ordinary Internet users worldwide. These short videos, along with their carriers, are reforming, reflecting, and recreating almost every aspect of social life. It appears that René Descartes’s famous phrase “Je pense, donc je suis” – I think, therefore I am (Descartes 2006), is evolving into “I film, therefore I am.”
One of the most important characteristics of short videos is their elasticity, as they are continually evolving. They serve as platforms to express emotions and follow trends, particularly among younger generations. Simultaneously, there is a significant emphasis on the quality of short videos, which encompass a wide spectrum of forms and contents. Conducting a research project on short videos is essential for understanding social life, youth culture, and the international communication of Chinese cities.
A vlog (video blog) is a form of blog in which the medium is video. According to Sanchez (2023), vlogging has risen almost simultaneously with YouTube after 2004, often referred to as the year of the vlog. Short videos are popular on various social platforms such as Facebook and TikTok and are utilized in different situations. These include information vlogs, fitness vlogs, and travel vlogs, among others. In the realm of travel vlogs, numerous researchers, especially in the field of tourism management, have explored their impact on usersʼ travel intentions (Cheng et al 2023; Le and Hancer 2021; Xu et al. 2021) and destination marketing (Abad and Borbon 2021; Peralta 2019). From a communication perspective, a travel vlog is a form of short video and image carrier that awaits further exploration.
According to data from the United Nations (2018), it is projected that 68 % of the world’s population will be living in urban areas by 2050. Modern society is increasingly characterized by a global network of cities. Travel vlogs predominantly focus on city images, capturing a wealth of vibrant details. As globalization persists, international travelers emerge as a significant group of vlog producers, contributing to the promotion of the international communication of city images. Urban administrators cannot overlook the influential role of vlogs in showcasing cities, both locally and globally. This paper holds significant scholarly value due to its innovative perspective in examining city images in transnational travel vlogs through a multimodal approach.
In this research, a new framework has been developed for the systematic analysis of all dimensions of modalities and their interactions in the sampled 60 travel vlogs of port cities. Provenance symbols in semiotics, which consist of three main types – natural objects, man-made objects, and pure symbols – are employed to comprehend the graphics within the vlogs. Appraisal systems, particularly the affiliated attitude system, focus on the verbal and paraverbal dimensions of these videos. Additionally, visual grammar in linguistics serves as a theoretical reference to understand kinesics and the editing of vlogs. After elucidating how vloggers utilize interactions of modalities to establish dominant themes, capture viewers’ attention, and shape city images, the paper also explores the socio-cultural context in which the vloggers are immersed. It seeks to evaluate the influence this context has on the discourse of vlogs and the construction of city images. At last, the mechanism of city image formation from cities’ physical existence to foreign viewers’ perception through transnational travel vlogs is illustrated. The findings inspire new perspectives on interpreting the international communication of city images, considering various actors such as urban administrators and travel vloggers. However, there remains a critical task for future research, which involves analyzing the audience and delving into a broader socio-cultural context.
2 Literature review
The review of existing literature has yielded three main themes: the ascent of vlogs, the portrayal of city images in media, and the exploration of provenance symbols, appraisal systems, and visual grammar. These themes collectively offer theoretical guidance for the current research.
2.1 The rise of vlogs
The emergence of vlogs can be traced back to the year 2010 when Adam Kontras posted a video of his journey to Los Angeles on his personal blog (Sanchez 2023). Subsequently, vlogs made a significant impact on media consumption and garnered widespread attention across various demographics. This form of video blogging is considered a noteworthy development following webcasting and short videos (Wu and Liu 2018), evolving into a new avenue for content marketing (Qu and Zhang 2019). Presently, vlogs have become an innovative means to shape mainstream discourse and document people’s lives.
The first-person narration and immersive experience offered by the vlog+news format are particularly popular among young audiences, enhancing mainstream media’s influence on significant occasions (Zhan and Li 2020). On platforms like Bilibili, a Shanghai-based video-sharing website, values such as non-fiction, personification, loose-circle socialization, and “life myth-making” in food and travel vlogs contribute to their core competitiveness and appeal (Zheng 2019). Additionally, during public emergencies such as the COVID-19 pandemic, vlogs have constructed a multi-dimensional narrative space (physical, psychological, and social spaces) that helps alleviate negative emotions and generate and sublimate meanings of life (Yu 2020). In terms of public service, vlogs have demonstrated mobilization effects by setting examples of ordinary people and disseminating relevant knowledge (Yan and Gao 2023). In summary, vlogs are increasingly becoming a crucial medium for individuals, especially the younger generation, to experience and construct reality, share and convey emotions, and activate and promote social practices.
Travel vlogs establish a virtual community that facilitates the construction of users’ visual images, garnering feedback and emotional support from the viewership. This innovative video format has undergone systematic analysis in tourism management studies, revealing that travel vlog marketing significantly influences viewers’ travel and shopping intentions. Moreover, a positive relationship exists between destination image, travel convenience, and travel intentions (Irfan et al. 2022). These vlogs typically involve a four-step process encompassing designing, shooting, editing, and publishing, aiming to achieve feelings of self-expression, documentation, ritual, and enjoyment (He et al. 2022). According to a comprehensive study by Cheng et al. (2020), the factors influencing travel vlogs in stimulating viewers’ travel intention operate primarily at cognitive and emotional levels. Source credibility, filming techniques, framing aesthetics, and on-site experiences enhance the videos’ attractiveness. Simultaneously, humorous narratives and interactive fan engagement contribute to increased audience participation, breaking down traditional boundaries between transmission and reception (Xu et al. 2021).
2.2 City image in media
City image is shaped through a combination of mass media, personal experiences, interpersonal communication, and memory, among other factors (Mumford 1961). A widely used term in marketing and tourism management, city/destination image, seen as a continuous mental process (Milman 2012), comprises three distinct yet hierarchically interrelated components: cognitive, affective, and overall image. Existing frameworks on image formation are also linked to agents that manipulate the image (Kim and Chen 2016). Illustrated by Gartner’s Image Formation Figure (1994) and Beerli and Martin’s Model of the Formation of Destination Image (2004), various image change agents, often mediated by diverse media, serve as sources of information. While the crucial role of media in the practice of city image formation has been acknowledged (Mercille 2005), there remains a dearth of research, particularly empirical studies, investigating the role of media discourses in destination image formation. Furthermore, the role of 21st-century media in this context is not fully understood (Khodadadi 2013).
Human civilization is intrinsically linked to media civilization, especially in modern times where cities and mass media exhibit a symbiotic relationship. In the era of Internet 3.0, diverse users are now engaging with smart media on a regional level, thereby shaping the image of cities in novel ways. These new media, characterized by their cross-media nature, interactivity, supra-temporality, and autonomy of choices, have brought about significant changes in communication ecology and cultural landscapes. Simultaneously, they have unleashed infinite possibilities for the international communication of city images.
A literature review of recent studies on the theme of city image in media (Drule et al. 2020; Li 2023; Li et al. 2020) reveals that this area of research has not garnered the wide scholarly attention it deserves. Existing literature on the subject tends to be primarily driven by practical considerations, aiming to explore how to effectively use video to promote city images and boost tourism in a given area. Drule et al. (2020) conducted a content analysis of short videos from the top 100 most popular cities worldwide, aiming to extract tourism promotion strategies for different regions. In terms of research methodology, many scholars have employed content analysis, a research method widely used in communication studies, to code and classify videos. Content analysis is preferred due to its ability to handle large-scale data processing, allowing for a combination of quantitative and qualitative analysis. For example, Li et al. (2020) utilized content analysis and data mining of comments on short videos on TikTok to uncover factors attracting viewers when watching food videos. In terms of research subjects, social media platforms such as TikTok and YouTube have been the primary sources of analysis. Most of the aforementioned studies share a commonality in focusing on a specific city (Li 2023). Despite the limited number of studies by communication scholars that consider the city as the unit of analysis, even fewer studies concentrate on non-Western cities (Meng and Rantanen 2016). This gap underscores the need for more comprehensive research in the field, particularly with a focus on diverse geographical and cultural contexts.
The present paper focuses on travel vlogs of international port cities as its subject of study, addressing a research gap in the existing literature characterized by a lack of a comparative and global perspective. Notably, the selected data in this research pertain to city images constructed and presented by foreign visitors, i.e., outsiders. This stands in contrast to images created from a local or official perspective, thereby rendering this paper distinct from its counterparts. Furthermore, vlogs, as a new form of content creation, have fundamentally altered the traditional pursuit of grand and sensational narratives about city images. Instead, they delve into the humanistic features of cities through microscopic but detailed personal experiences, replacing propaganda with dialogue and retelling with storytelling (Chu and Luo 2021). While the production mode and practical characteristics of travel vlogs, along with the new opportunities they offer for city image promotion, have been relatively clarified, the interaction between transnational travel vlogs produced by international travelers and the cross-cultural communication of city images remains a new research topic to be explored in the current study.
2.3 Theoretical perspectives for multimodal discourse analysis
2.3.1 The provenance symbols
Communication is defined as “a systemic process in which individuals interact with and through symbols to create and interpret meanings” (Wood 1997: 14). Throughout this process, reality is generated, maintained, revised, and transformed (Martin and Nakayama 2021). Video can be perceived as a text composed of symbols, with visual symbols being the most prominent and dominant symbol resources. Semiotics is the systematic study of sign processes and meaning-making (Chandler 2022). Provenance symbols are divided into two categories: natural objects and man-made objects (Deng and Tang 2022). Man-made objects, due to different manufacturing intentions, can be further subdivided into utilitarian or pure symbols. When these three types of symbols are incorporated into the urban landscape of the “microscopic human society,” an extensive array of counterparts can be identified. Every flower, blade of grass, and tree in the city represents the natural landscape of natural objects. Specific cultural symbols, such as landmarks and specialty products, often derive from man-made buildings and delicacies. This illustrates a certain relationship between objects and symbols, fostering a two-way mutually interactive dynamic (Zhao 2022). Therefore, a profound understanding of symbol values stems from focusing on the exchanges and connections between different symbols, rather than examining their respective roles and functions in isolation.
2.3.2 Appraisal systems
The appraisal systems by James R. Martin (2000) are rooted in systemic functional linguistics, which identifies three main meta-functions of language as conceptual, interpersonal, and textual functions. In the context of the interpersonal function of language, speakers utilize it to express identity, status, attitudes, motivations, evaluations, and judgments of people and things. Martin and his colleagues (Martin and Rose 2003; Martin and White 2005) extended this system by proposing a comprehensive evaluation system. As noted by Wang (2001), the appraisal system considers both the rhetorical function of evaluative words as the text unfolds and how individuals perceive the interpersonal meaning and social relations, particularly the solidarity created by affective empathy and genealogical ties.
Appraisal systems, derived from systemic functional linguistics, specifically focus on the interpersonal meaning of texts. This system elucidates the character and behavior of people, the nature of things, and the value of objects through evaluation resources, capturing the evaluator’s attitude towards people, things, and objects (Wang and Wu 2020). The appraisal system comprises three major subsystems: attitude, engagement, and graduation. Within the attitude system, affect, judgment, and appreciation are subdivided. Affect is typically expressed using words such as “happiness,” “sadness,” and “joy”; judgment involves evaluating human behaviors from the standpoint of morality, ethics, laws, and regulations; and appreciation can be directed toward either a person or a thing. The systems also encompass the source of appraisal, including self-reporting and reports by others. In simple terms, self-reporting originates from the speaker, while reporting by others involves citing or reflecting on a discourse, viewpoint, or attitude from someone else. The graduation system assesses the strength and clarity of attitudes. In discourse analysis, appraisal systems frequently serve as an analytical framework for scholars seeking to identify attitudes in a particular discourse.
2.3.3 Visual grammar
Visual grammar, introduced in Reading Images: The Grammar of Visual Design (2020) by Gunther Kress and Theo Van Leeuwen, is frequently employed in multimodal discourse analysis due to its systematic and effective approach to interpreting visual images. Drawing inspiration from Halliday and Matthiessen’s concept of the triple metafunctions of language (2004) – ideational, interpersonal, and textual metafunctions – Kress and Van Leeuwen have established three main components in the visual grammar system: representational, interactive, and compositional meaning. Given that image-based communication through social networks aims to establish and maintain an interactive relationship between content creators and viewers (Qian 2022), the analytical dimensions of interactive meaning hold great importance. Factors such as contact (gaze or gesture of characters), social distance (size of frame), attitude (point of view), and modality (color saturation) are crucial considerations.
This paper addresses a gap in the existing literature by concentrating on the intersection between symbolic meanings, appraisal systems, visual grammar, and city images through vlogs. Two research questions are consequently posed.
RQ1:
How do vloggers use interactions of different modalities in travel vlogs to draw audiences’ attentions and shape cities’ images?
RQ2:
To what extent does socio-cultural context have impact on travel vlogs’ discourse and the represented city images?
3 Data collection and analysis
YouTube is selected for the current research project because this social media platform boasts a broad audience and facilitates various forms of interpersonal communication efficiently (Hou 2019). Users on YouTube can not only upload videos but also engage with other users simultaneously or at different times through the comment function (Chovanec and Dynel 2015). From a semiotic perspective, the YouTube platform is replete with symbols such as words, images, and sounds, a phenomenon also known as Polylogue by scholars like Bou-Franch et al. (2012). This concept is employed to explore questions such as how users utilize individual symbols, how they interconnect multiple symbols, and how effective the communication of these uses is.
Research data for this study are gathered from 60 vlogs featuring 20 port cities globally, available on the YouTube platform. The selection of port cities as research objects is justified for several reasons. Firstly, port cities present a more specific and categorically distinct topic compared to cities without such classification. Additionally, examining port cities allows researchers to explore potential connections between the distinctive natural and artificial characteristics of these cities and how they are represented in media, such as the appearance and portrayal of ports in the videos. The choice of port cities is strategic in providing a unique perspective for conducting research on cities. While many formal studies focus on single cases or opt for Western developed cities without a global perspective, port cities, being a common city type worldwide, offer an opportunity to fill this gap. Moreover, the term “port cities” is relatively more objective than other definitions such as intercultural or global cities.
In this research, the Global Cities Index, Global City Competitiveness Index Maps, and the Globalization and World Cities Research Network, recognized as the three most well-known and authoritative city ranking lists globally, are considered. The six continents – Asia, Europe, Africa, Oceania, North America, and South America – are utilized as geographical categories to select port cities based on their rankings in these three lists. Due to differences in economic development levels and list quota limitations, the study sampled five cities in Asia, North America, and Europe, three cities in Oceania, and one city each in South America and Africa (see Table 1). Subsequently, researchers employed the search term “city name + travel vlog” with subject relevance as the search filter on YouTube to obtain samples. After collecting all available travel vlogs, the three most-viewed vlogs were selected as best practices for a double-check. It is important to note that the chosen travel vlogs had to be created by international travelers from another country who were undertaking their first transnational travel to the destination, aligning with the research questions. Additionally, all selected videos were in English (see Appendix A for all links to the sample videos). To gain a deeper understanding of how vloggers themselves can influence the content of these transnational travel vlogs, demographic information about the producers, including nationality, occupation, and their videosʼ theme orientation (whether focused solely on travel or not), was recorded for further analysis.
List for selected port-cities worldwide.
| Continents | Port cities (Countries) |
|---|---|
| Asia | Tokyo (Japan), Singapore (Singapore), Hong Kong (China), Shanghai (China) and Dubai (UAE) |
| Europe | London (UK), Frankfurt (Germany), Amsterdam (Netherlands), Stockholm (Sweden), Barcelona (Spain) |
| North America | New York (U.S.), Los Angeles (U.S.), San Francisco (U.S.), Toronto (Canada), Boston (U.S.) |
| Oceania | Sydney (Australia), Melbourne (Australia), Auckland (New Zealand) |
| South America | Buenos Aires (Argentina) |
| Africa | Johannesburg (South Africa) |
Modes are semiotic resources that bear the imprint of culture and, at the same time, are shaped by people’s social practices (Kress 2010; Sammut and Bauer 2021). A multimodal discourse is one that makes use of all semiotic resources, such as words, layouts, images, music, audio tapes, and more, to construct meaning. This concept emphasizes the complex interaction and relations between different modes. Roux (2019) summarizes multimodality with a focus on: (1) semiotics rather than language; (2) the relations between semiotics and their division of labor; (3) describing and explaining the misplacement of certain semiotics from a phylogenetic perspective; and (4) explaining how they affect communication and subjectivity through their potential functions. As this approach gains popularity due to technological development, numerous methodological frameworks that include multimodality as a central component have been created to address specific research questions (Martinec and Salway 2005; Zhang 2009).
Vloggers consistently engage in active communication with their audiences through various means, including words and expressions, images and pictures, sound effects, and editing skills. Given the high interactivity of videos on YouTube, particularly travel vlogs that are rich in semiotics, this paper employs a comprehensive and systematic multimodal approach to analyze the content of transnational travel vlogs created by vloggers on the platform.
As illustrated earlier, the adoption of a multimodal analytical framework provides a robust structure for dissecting videos across various dimensions. This meticulously constructed framework encapsulates the richness of visual, verbal, paraverbal, kinesics, editing, and auditory elements inherent in video content (see Table 2). The analytical framework of this study includes: 1) graphics: objects and scenes; 2) verbal: titles and descriptions of video; spoken language; captions and comments; 3) paraverbal: prosody; 4) kinesics: facial expressions; gestures and body movements; 5) editing: cover and closeup; colors; slow release; time stamps; and 6) acoustics: background music (BGM).
Dimensions of modalities in travel vlogs and theoretical perspectives.
| Modalities | Analytical objects | Theories |
|---|---|---|
| Graphics | Objects and scenes in videos | Provenance symbols |
| Verbal | Titles and descriptions of videos, spoken language of vloggers, captions, audiences’ comments | Appraisal systems |
| Paraverbal | Prosody | |
| Kinesics | Facial expressions, gestures and other body movements | Visual grammar |
| Editing | Cover; closeup; colors; slow release; time stamps | |
| Acoustics | BGM | Meanings of music |
The first five dimensions of modalities will be analyzed with the three theories mentioned above. Much of music’s power lies in its use as multimodal communication (Way and McKerrell 2017). From a social semiotic perspective, music can be analyzed as a part of a bigger construction, sharing the task of making meaning with other modes (verbal, visual, etc.) (Forte 2023). Specifically, background music’s role in multimodal environments and contexts, such as pictures (Klein et al. 2021), television commercials (Park and Young 1986) and digital stories (Requejo 2016), has been demonstrated. As the most frequently applied theory in the literature, meanings of music consists of both embodied and referential one (Zhu and Meyers-Levy 2005). The former is context-independent, eliciting positive or negative feelings in the audience with the evaluative function, and the latter concerns much about semantic associations. Margulis et al. (2019) consider that people experience music not just as an abstract sequence of sounds, but as episodes in an unfolding narrative. Taking BGM as one of the analytical objects in the narration of the sample vlogs, we try to identify the meaning of background music, either embodied or referential, in the videos and also to explore the cross-modal interactions achieved between it and other modalities, as well as its effects.
4 Results and discussion
Through a systematic analysis of all the travel vlogs, several dominant themes have been identified as the main components or detailed topics shared by the sample videos. These include nature enjoyment, landmark visiting and street wandering, food tasting, and infrastructure evaluation. While it takes all the modalities and their interactions to shape city images, it is evident that travel vlogs provide graphic-based or centered representations, with other modalities functioning and interacting as reinforcing supplements (Zhang 2009).
4.1 Dominant themes and modality interactions
4.1.1 Nature enjoyment
Nature has increasingly become a “refuge” for residents seeking respite from the fast-paced and high-pressured life of modern cities. For some vlog consumers who are temporarily unable to escape the “steel forest” of cities and face harsh realities, travel videos from other bloggers serve as a “substitute presence,” satisfying their desire for a dialogue with nature. From the perspective of provenance symbols, natural elements, including beaches, bay loops, and parks, are observed in the vlogs. Considering the two-way mutual relationship between different types of symbols, it is noteworthy that the relationship between plants and cities is represented through symbols such as cherry blossoms in Tokyo and tulips in Amsterdam. This reveals that seemingly simple natural symbols can embody multiple meanings, activating corresponding cultural schemas for video viewers.
To allow viewers to “experience” nature themselves, the presence of vloggers in natural scenes in travel vlogs is comparatively not as strong as in other settings. Taking the video Spending 2 PERFECT days in Auckland! produced by Travel For Phoebe as an example, the author primarily employs film techniques to showcase eye-opening distant views of Karekare waterfall and close-ups of wide ducks to illustrate the good ecological condition there. Soft and slow songs are simultaneously inserted as background music, creating a calming atmosphere. Meanwhile, the vlogger doesn’t appear in the video but expresses her direct feelings through narration. From the perspective of appraisal systems, adjectives like “beautiful” and “stunning” reflect the author’s appreciation of the natural spot. Sentences such as “It takes your breath away” and “And you just think, oh, this is what life is all about” describe the vlogger’s inner emotional state, making it easier for viewers to gain a feeling of participation. There are occasions when the vlogger does appear in the picture but maintains a relatively long “social distance” from the viewers without any eye contact. Shots from an oblique position or behind the character allow the audience to focus on the great nature and imply how the viewers could be integrated into the scene and “cured” by nature, just like the vlogger (see Figure 1).

Screenshot from the travel vlog Spending 2 PERFECT days in Auckland!
4.1.2 Landmark visiting and street wandering
Buildings with diverse or unified styles are one of the most direct ways to present the urban landscape. In travel vlogs, the relationship between bloggers and architecture can be both close and loose. For stores or residents along the street, these man-made symbols have practical functions, yet they have become a significant means of representing the image and culture of the city. For vloggers and viewers, it is a cross-cultural implicit experience. Some symbols that belong to this category include exquisite bookstores and coffee shops in downtown Tokyo, the streets of London, and the old houses of Spanish style in Buenos Aires, etc.
On the verbal level, narrations of bloggers reinforce the existing stereotypes about a site. One example is from a blogger who encountered a sudden rainfall in a London park and said, “this is really in London,” while reminding the viewers to always prepare an umbrella in London. Walter Lippmann (2004) defines stereotypes as a distorted picture or image in a person’s mind, not based on personal experience, but derived culturally. Stereotypes enable cognitive thinking while also enhancing a sense of group belonging. Travel vloggers are not just following stereotypes but also communicating about them.
Graphics play a pivotal role in enhancing the visual representation of landmarks. Some vloggers opt for detailed close-ups, employing overlays and annotations to provide viewers with an in-depth exploration of architectural marvels. This approach not only serves an informative purpose but also establishes the landmarks as central characters in the vlog narrative, contributing to the overall appreciation of the city’s cultural and historical richness. In the example above (Figure 2), a panoramic view of the Shard is presented in the video. This vlogger is a filmmaker, so his videos display more professionalism and skillfulness. This scene, which arouses direct visual perception, and his quotation, which gives the audience concrete imagination of the height, work together to deliver a marvelous picture of this building.

Screenshot from the travel vlog Travelling to LONDON!
However, a distinctive pattern emerges as some vloggers choose a different modality, positioning themselves as the focal point against the backdrop of city landmarks. In these instances, landmarks serve as atmospheric scenery, framing the vlogger’s personal narrative rather than taking center stage.
4.1.3 Food tasting
Food, as a man-made symbol, combines functionality and symbolism, with the latter often representing local cuisines. In the eyes of some travel vloggers, the experience of the city almost equals the exploration of good food, which is a cultural entity that can be directly experienced and felt. Constructed as a verbal modality, the titles of many travel vlogs contain food-related expressions like “street food tour” and “what/where to eat,” some even with capital letters for stress: “try DELICIOUS FOODS.” “First Impressions of Stockholm | Swedish Meatballs & Fika!” explicitly shows the important role typical local food plays culturally in city image perception. In addition, food and dining scenes are always highlighted on covers and in time stamps of the video descriptions for interested viewers to quickly dive in.
Through the lens of travel vloggers, a new “charm” is created with filming and editing skills. Before tasting, detailed close-ups of food with attached names and high color saturation (high sensory modality) can be interpreted as attention-catching exaggeration in terms of rhetoric. In the process of eating, food, together with the vlogger’s mouth, is placed in the center of the whole picture to emphasize how it has been enjoyed, interacting with narrations of appreciation such as “so good” and also intertextually with audience comments like “It looks so delicious, I want to eat it too” (see Figure 3). The vlogger’s attempt in eating and evaluating the food has satisfied viewers’ curiosity and aroused sensations of the audiences at the same time, forming a virtual interaction ritual chain to build up understanding of city image and creating the notion of “trying one food and knowing a city.”

Screenshot from the travel vlog JAPAN IS OPEN!! Tokyo Food Tour vlog| HIDDEN GEM best ramen, Michelin Star Ramen, Sushi Omakase.
4.1.4 Infrastructure evaluation
A prevalent theme observed in vlogs is the meticulous portrayal of a city’s transportation infrastructure and accommodation facilities. Vloggers often employ verbal narration to provide insights into the efficiency of public transportation systems (e.g., bicycles in Holland; taxi driver detouring in the streets of Dubai; a NYC street littered with garbage), highlighting convenience and accessibility. Paraverbal elements, such as tone, further accentuate their experiences, influencing the audience’s emotional engagement. Intriguingly, kinesics plays a crucial role as vloggers visually interact with transportation modes, showcasing firsthand experiences of navigating through the city. The deliberate inclusion of graphics, such as route maps and transportation statistics, enhances the informative aspect, creating a comprehensive visual narrative. Also, vloggers strategically use music to complement these visuals, creating a harmonious blend that enhances the overall viewing experience. The screenshot extracted from the video “Stockholm Travel Guide” shows a traffic scene of Stockholm in slow motion (see Figure 4), which gives the impression of orderliness on the road. The soft magical music adds to this effect of presenting a dynamic and organized city. In addition, the vlogger’s comment that he and his companion can walk anywhere on foot precisely points out the traffic convenience.

Screenshot from the travel vlog Stockholm Travel Guide.
Moreover, the focus extends to the city’s hospitality sector, with vloggers featuring various hotels and accommodations. This incorporates visuals of hotel interiors, verbal commentary on amenities, and paraverbal expressions of satisfaction or surprise. The integration of hyperlinks in the video description further demonstrates a strategic use of technology to directly connect viewers with relevant public services, offering an interactive and immersive experience. Furthermore, it is found that sponsored content often takes center stage in some vlogs, influencing the narrative structure and emphasis within the vlog. Vloggers, acting as brand ambassadors, meticulously showcase sponsored hotels, offering a detailed glimpse into the amenities, services, and overall ambiance. As vlogging continues to evolve, the symbiotic relationship between content creators and local businesses reshapes the landscape of city representation. Sponsored narratives not only influence the emphasis on specific public services but also redefine the vlogger’s role within the narrative. From sponsored hotel showcases to personal beauty escapades, the multimodal symphony of vlogging encompasses a complex dance of promotional agendas, personal branding, and the multifaceted exploration of cityscapes.
Furthermore, as all the sample vlogs were about travel experiences in port cities, it was assumed that images specifically related to ports, such as harbor views, would be easy to identify. However, our qualitative analysis revealed that ports and their features were relatively under-represented. Vloggers who filmed near ports either used the location as a background or briefly skimmed it. In contrast, visiting landmarks and tasting food, as much more popular routines, have shown a tendency towards homogenization of mediated representations of city images.
4.1.5 Regional differences
Concerning these four common themes, there are also notable differences between different regions. Take these travel vlogs from two cities, Buenos Aires in South America and Johannesburg in Africa, for example. While food tasting remains one of the dominant themes, infrastructure evaluations, such as detailed hotel tours (recommendation) or compliments on local transport, are rare. In addition, due to the underdeveloped status of these two cities, the vloggers’ zone of activity mainly covers the streets of the city center. Comparatively fewer filming skills are used, as these vloggers simply hold cameras to their own faces as they walk and talk freely. It is also worth noting that some locals have been invited by the vloggers to provide information about these two cities. For example, in the travel vlog 24 hours in JOHANNESBURG, South Africa, the local driver, April, who is facing the camera, not only shows the vlogger around but also tells stories and relevant culture on-site. This can be explained by the vloggers’ lack of knowledge of these cities, using locals as intermediaries. On the other hand, the presence and introduction of the locals also make the content more “authentic” and increase the credibility of these travel vlogs.
4.2 Socio-cultural context, vloggers and city image formation
Multimodal interactions in travel vlogs show dominant themes such as nature and food, shaping city images from various perspectives. However, the representation of these images is mediated and selective, highlighting the significance of addressing RQ2 (Qian 2022; Qin and Li 2021).
The mentioned practice involves travel vloggers, who utilize their physical attractiveness, social appeal, and credibility to positively impact audiences’ wishful identifications (Le and Hancer 2021). Simultaneously, vloggers are influenced by the broader socio-cultural context, which shapes their conduct in significant ways. From the recorded demographic information of vloggers and systematic analysis of the dominant themes, the following findings are to be pointed out and deserve special attention.
According to introductions on personal homepages of vloggers, the majority of video producers in this research are full-time regularly-updating professional travel vloggers with advanced filming equipment, as many viewers ask in comments about what cameras they are using, and rich experience, as sample videos of two cities are occasionally created by the same vlogger. This group of vloggers, over half of which come from the USA or Canada, knows the mechanism of travel vlogs, no matter if these videos are skill-showing, tour-guiding, or specific-topic oriented. Comparatively, chances are rare for those vlogs in freestyle to go viral. This has resulted in the representation of city images being mainly constructed by pro-vloggers, with a fixed, unchanged, and one-sided portrayal due to the homogenization of displayed symbols and superficial rhetorical expressions.
Professional vloggers effectively create an appealing and inviting virtual community in a participatory culture by utilizing various modalities to construct representative scenes. These scenes cater to the diverse needs of their audience, be it for relaxation, emotional expression, or a sense of belonging. By making city images easily perceivable and shortening the psychological distance between the viewers and the cities, it becomes apparent that a combination of self-presentation and platform logic had a significant impact on these vloggers. The vloggers’ figures are consistently positioned in the center of the picture and interact with verbal output and viewers’ compliments regarding their appearance, demonstrating notable discipline. Hyperlinks to vloggers’ social media accounts, such as Instagram and others, as well as links to recommended hotels, restaurants, and filming equipment, highlight how content creators harness and benefit from network traffic.
The analysis of vlogging discourse (multimodal interactions) and also vloggers’ constructive practice is a necessary and significant step to understand city image formation through transnational travel vlogs as a new form of media from a macro and systematic perspective, as there are different stages when a city’s physical existence is represented and disseminated to viewers on the cloud, who could then form an overall image of a specific foreign city. These vloggers are first organic agents (Gartner 1994), because they have done actual visitation to these port cities and own personal experience on the ground. As a result, their travel vlogs have a relatively higher level of credibility, which can then contribute to the formation of a parasocial relationship (Dewantara et al. 2023) between them and viewers and could make their videos more attractive.
Using vlogs as a new medium and its affordances of multimodality, vloggers as gatekeepers and opinion leaders selectively and strategically encode the physical existence of foreign port cities into several dominant themes and form mediated representations of these cities. These themes include not only representative symbols such as landmarks that help activate viewers’ stereotypes and cultural schemas to form or recall their cognitive image of a city, but also healing natural landscapes and eye-catching local foods that serve as motivations for travel and form viewers’ affective image of a city. Vloggers are a crucial factor, as a solid parasocial relationship is a prerequisite for viewers’ video consumption and city image formation. With advanced filming and editing skills, rich experience in vlogging and the YouTube platform, and also language advantage, North American vloggers are much more successful in shaping foreign viewers’ city image worldwide as solicited organic image formation agents (Gartner 1994).
5 Conclusions
This paper has analyzed 60 transnational travel vlogs on YouTube from 20 port cities worldwide from the perspective of multimodal discourse analysis. An analytical framework, based on theories of provenance symbols in semiotics and appraisal systems, visual grammar in linguistics, has been structured to systematically clarify the multimodal interaction practices in the videos. It finds that these vlogs mainly demonstrated four dominant mediated themes set by the vloggers, including nature, landmark and street, food, and infrastructure. Several regional differences between two specific port cities, Buenos Aires and Johannesburg, and other cities have also been illustrated, such as less hotel tours, fewer filming skills, and locals as intermediaries.
Furthermore, based on the results of RQ1 and the demographic information of the vloggers, this research explores the possible impact that the socio-cultural context can have on the discourse of the vlogs and the represented city images. Knowing how it works and disciplined by the platform logic, professional full-time travel vloggers from North America have almost dominated these popular videos, which may leave the mediated city images unchanged or biased. In addition, based on the above findings, this paper also systematically presents the mechanism of how the physical existence of these cities was represented with dominant themes and disseminated through vloggers with a double identity (as organic/solicited organic image formation agents) and their vlogs’ multimodal discourses to online foreign viewers, shaping their city image cognitively and also affectively.
This paper expands the scope of research on urban communication by adopting a comparative and global perspective. Focusing on port cities, it incorporates non-Western samples. The study employs theories from both semiotics and linguistics to establish an innovative analytical framework for investigating multimodal interactions and the dominant themes represented in transnational travel vlogs. This framework serves as a valuable reference for future studies on new media content. Additionally, considering the socio-cultural context, the paper underscores the role of vloggers as crucial contributors to vlog discourse and the formation of city images. Furthermore, by integrating the image concept from tourism marketing, the systematic mechanism presented for city image formation – from the physical existence of cities to the perception of foreign online viewers through transnational travel vlogs – contributes to a deeper understanding of the role of new media in the international communication of city images.
The research findings have practical implications for redefining approaches to the international communication of city images to diverse stakeholders. City administrators, particularly those engaged in tourism marketing, must comprehend the mechanism underlying city image formation through transnational travel vlogs. Recognizing vlogs as crucial and complementary channels for information dissemination, administrators can leverage the dominant themes identified in the videos and viewer comments as guiding references. This insight aids in the specific adaptation and optimization of services, such as enhancing food offerings for restaurants and incorporating detailed hotel tours for accommodations.
Given that a city’s image is multi-dimensional, encompassing both cognitive and affective aspects, administrators should deploy a variety of strategies. For instance, emphasizing representative landmarks and local cuisine aims to trigger memories and recognition among viewers, while showcasing the city’s natural beauty seeks to emotionally motivate viewers to visit in person and fully appreciate the experience. To achieve these goals, collaboration between city governments and professional travel vloggers, possessing rich experience and advanced filming and editing skills, is crucial. Inviting such vloggers to visit cities and produce high-quality, well-designed vlogs can significantly expand the cities’ global audience.
From the vloggers’ standpoint, travel vlogs serve as a medium for establishing emotional connections with their audiences and deriving financial benefits. By studying top examples, vloggers can enhance the attractiveness of their content and reap rewards from city authorities. In the era of smart media, the city image exists in a dynamic landscape where not only officials but all online participants, especially opinion leaders, wield significant discourse power. This duality manifests as cities becoming more visible through vlogs, yet concurrently facing challenges related to the increasing homogenization and one-sidedness of mediated city images.
Limitations remain to be further explored in future research. The current research focuses on the vlogger but have hardly considered the viewers’ comments from a quantitative way. Additional research could have been done to provide more insights by focusing on the interaction between vloggers and viewers to clarify how the represented city images have been consumed and reproduced circularly. Also, the socio-cultural context in which the travel vlogs are embodied needs to be further explored. Besides the vloggers as mediators, the economic climate (symbolic economy of urban development) and the practice of cultural exchange (intercultural communication) are also influential factors. We expect the current research findings to reach a wider audience for a better understanding and communication of city images around the globe.
Sample travel vlogs and links.
| Number | City | Title and link |
|---|---|---|
| 1 | Singapore | Singapore Travel Guide – City of the Future https://www.youtube.com/watch?v=ncg5piJD_GM |
| 2 | Singapore | SINGAPORE Tour/Vlog Oct 2022 | PART 1 | OUR FIRST TIME WATCHING F1 https://www.youtube.com/watch?v=gKpa0YaQoUk |
| 3 | Singapore | Exploring SINGAPORE! (Travel Vlog!) | Ranz and Niana https://www.youtube.com/watch?v=FyWEbgmgciI |
| 4 | Hong Kong | TOP 10 THINGS TO DO IN HONG KONG https://www.youtube.com/watch?v=eXBr3VZRK6M |
| 5 | Hong kong | Hong Kong Street Food Tour & Travel Guide https://www.youtube.com/watch?v=wnTyX02JeHc |
| 6 | Hong kong | HONG KONG TRAVEL VLOG | first time in HK, exploring the city, best places to eat, & meeting friends! https://www.youtube.com/watch?v=1r9q012K3jc |
| 7 | Tokyo | We went to JAPAN! TOKYO TRAVEL VLOG https://www.youtube.com/watch?v=2BzoQ31IPhk |
| 8 | Tokyo | JAPAN IS OPEN!! Tokyo Food Tour vlog| HIDDEN GEM best ramen, Michelin Star Ramen, Sushi Omakase https://www.youtube.com/watch?v=7duyfTnRDKA |
| 9 | Tokyo | I Went to Japan for the First Time | Tokyo Travel Vlog (Pt.1) https://www.youtube.com/watch?v=QPcwfuStFnU |
| 10 | Shanghai | SHANGHAI Travelogue https://www.youtube.com/watch?v=-1BFsPicHWk |
| 11 | Shanghai | My Travel Diaries | SHANGHAI https://www.youtube.com/watch?v=0T6Q0MRIoo8 |
| 12 | Shanghai | Magnificent Shanghai https://www.youtube.com/watch?v=vFQQNPT2wdY |
| 13 | Dubai | Our First Trip To Dubai! (Whole Family!!!) | Ranz and Niana https://www.youtube.com/watch?v=IZ9ZcmtjS6Q |
| 14 | Dubai | DUBAI TRIP | ZEINAB HARAKE https://www.youtube.com/watch?v=gXrEouRu3d0 |
| 15 | Dubai | DUBAI TRIP with the fam | Chelseah Hilary https://www.youtube.com/watch?v=l5ErrZuFQsM |
| 16 | Sydney | Sydney in 5 Days – VLOG | Nicole Andersson https://www.youtube.com/watch?v=hulQTtyw-Yo |
| 17 | Sydney | What To Do And Eat In Sydney, Australia https://www.youtube.com/watch?v=H4y5ydsnMuk |
| 18 | Sydney | Explore Sydney: Bondi Beach, Hidden Gems and Unforgettable Experiences | 4K Travel Video Blog https://www.youtube.com/watch?v=u-nEMNGp6F8 |
| 19 | Melbourne | HOW TO TRAVEL MELBOURNE (Best Destinations and Prices) https://www.youtube.com/watch?v=ZjqBTeowZGk |
| 20 | Melbourne | Best Things To Do In Melbourne in 2023! (Food, Activities, Markets, Sights & More) https://www.youtube.com/watch?v=_CAE82h10sI |
| 21 | Melbourne | First Time in Victoriaʼs Capital – MELBOURNE https://www.youtube.com/watch?v=85c5P9VXBg8 |
| 22 | Auckland | Auckland Food Tour – Trying DELICIOUS New Zealand FOODS! https://www.youtube.com/watch?v=Ei3atq9lAls |
| 23 | Auckland | Spending 2 PERFECT days in Auckland! https://www.youtube.com/watch?v=R8bNdnN7yU8 |
| 24 | Auckland | 3 Days in Auckland | New Zealand vlog, Hobbiton, Mt Eden, Auckland Domain, exploring the city, food https://www.youtube.com/watch?v=IiyEEh7cuko |
| 25 | Johannesburg | 24 hours in JOHANNESBURG, South Africa https://www.youtube.com/watch?v=iUWPoyu4jfE |
| 26 | Johannesburg | My First Day in Johannesburg South Africa Was Unforgettable! South Africa 2022 https://www.youtube.com/watch?v=t_E8YQRJlV0 |
| 27 | Johannesburg | MOST DANGEROUS Part of Johannesburg, South Africa https://www.youtube.com/watch?v=egpUZHgRPQM |
| 28 | Buenos Aires | FIRST IMPRESSIONS OF ARGENTINA! BUENOS AIRES https://www.youtube.com/watch?v=55U4zM4uXDU |
| 29 | Buenos Aires | What To Expect – Buenos Aires, Argentina (Our First Trip) https://www.youtube.com/watch?v=Za08jzqn_Gs |
| 30 | Buenos Aires | What to do in Buenos Aires, Argentina | San Telmo & La Boca City Guide https://www.youtube.com/watch?v=tmRApRkJ5sw |
| 31 | London | Traveling to LONDON! https://www.youtube.com/watch?v=fabOfUKc4l0 |
| 32 | London | VLOG | BEST FRIENDʼS DREAM BIRTHDAY TRIP TO LONDON https://www.youtube.com/watch?v=BfNRs492cqc |
| 33 | London | My First Solo Trip to London GB 5 Days in UK VLOG https://www.youtube.com/watch?v=fw-pQAhs_XY |
| 34 | Frankfurt | Layover in Frankfurt – GERMAN FOOD, Doner Kebab, and Sightseeing in Frankfurt, Germany! https://www.youtube.com/watch?v=7HstJqjGzFg |
| 35 | Frankfurt | Best things to do, eat, and see in Frankfurt | MY FAVORITE GERMAN CITY! https://www.youtube.com/watch?v=n-ycBNAjw0g |
| 36 | Frankfurt | Frankfurt Germany Travel Vlog | Frankfurt City Tour | Europe Trip EP-22 https://www.youtube.com/watch?v=b5KfadfFgfU |
| 37 | Amsterdam | Welcome to Amsterdam! | Dhruv Rathee Vlogs https://www.youtube.com/watch?v=bYZRhK1rlUQ |
| 38 | Amsterdam | TOP 20 Things to Do in AMSTERDAM Netherlands 2023 | Travel Guide https://www.youtube.com/watch?v=miOJ1fA4nz0 |
| 39 | Amsterdam | HOW TO TRAVEL AMSTERDAM in 2019 https://www.youtube.com/watch?v=YIUs1pagLnM |
| 40 | Stockholm | Stockholm Travel Guide https://www.youtube.com/watch?v=DS_6M3Nqh7k |
| 41 | Stockholm | First Impressions of Stockholm | Swedish Meatballs & Fika! https://www.youtube.com/watch?v=VMmBCN2pp4c |
| 42 | Stockholm | Free things to do in STOCKHOLM, SWEDEN! https://www.youtube.com/watch?v=B2kH-we97E4 |
| 43 | New York | The Dark Side of New York | Hollywood Movies Never Show You This! https://www.youtube.com/watch?v=hy01GPP2n6w |
| 44 | New York | OUR DREAM TRIP | 7 Day New York City Travel Guide https://www.youtube.com/watch?v=Y7HVDYGSJsI&t=18s |
| 45 | New York | Travelling to Worlds Most Famous City! https://www.youtube.com/watch?v=PpXMYCLXQ5s |
| 46 | Los Angles | Exploring San Francisco! https://www.youtube.com/watch?v=0YLdjj-xVTU |
| 47 | Los Angles | Los Angeles, California! USA Part 1! | Gabbi Garcia https://www.youtube.com/watch?v=eBfybLMIebs |
| 48 | Los Angles | LOS ANGELES TRAVEL VLOG | Top Things To Do In LA https://www.youtube.com/watch?v=gxuDsPaHsA8 |
| 49 | San Francisco | 48 HOURS in SAN FRANCISCO!! Travel vlog, Golden Gate Bridge, good food & lots of laughs https://www.youtube.com/watch?v=aspFtX85_sM |
| 50 | San Francisco | The best weekend in San Francisco | where to eat in SF, vlog https://www.youtube.com/watch?v=wlGVzBYjv14 |
| 51 | San Francisco | Exploring San Francisco! https://www.youtube.com/watch?v=0YLdjj-xVTU&t=8s |
| 52 | Toronto | VLOG: SOLO TRIP TO TORONTO! shopping, eating, exploring in Canada (alone) https://www.youtube.com/watch?v=uQ4P2gBem44 |
| 53 | Toronto | VLOG | Exploring Toronto: Shopping, markets, going out, dressing up, yacht, Chinatown, food, etc! https://www.youtube.com/watch?v=Q8GU_fGAJjk |
| 54 | Toronto | Toronto, Ontario vlog | First time here, Exploring downtown, Shopping, Eating | vlog https://www.youtube.com/watch?v=y1Xgwv76O04 |
| 55 | Boston | FIRST TIME TRAVELLING TO BOSTON, MASSACHUSETTS // 3-Day Travel Vlog in the United States https://www.youtube.com/watch?v=_A9R3dlxh_o |
| 56 | Boston | travel vlog | first solo trip, spending more time alone, brand trip, events, exploring Boston https://www.youtube.com/watch?v=JoIpugt4Ask |
| 57 | Boston | BOSTON Travel Guide 2023 | Top things to do in Boston in 3 days! https://www.youtube.com/watch?v=zckpf6kRhuo |
| 58 | Barcelona | How To Travel BARCELONA| The ONLY Guide Youʼll Need in 2023 |SPAIN https://www.youtube.com/watch?v=C2buQ_isjXk |
| 59 | Barcelona | BARCELONA TRAVEL VLOG | girls trip, exploring the city, beach time & good vibes https://www.youtube.com/watch?v=F5895OJG_No&t=12s |
| 60 | Barcelona | BARCELONA TRAVEL VLOG | exploring the city, beach days, & good eats https://www.youtube.com/watch?v=xc-lCmJMJh4&t=392s |
References
Abad, Princess Elisha S. & Noelah Mae D. Borbon. 2021. Influence of travel vlog: Inputs for destination marketing model. International Journal of Research 9(3). 47–66. https://doi.org/10.5861/ijrsm.2021.m7729.Search in Google Scholar
Beerli, Asuncion & Josefa D. Martin. 2004. Factors influencing destination image. Annals of Tourism Research 31(3). 657–681. https://doi.org/10.1016/j.annals.2004.01.010.Search in Google Scholar
Bou-Franch, Patricia, Nuria Lorenzo-Dus & Pilar Garcés-Conejos Blitvich. 2012. Social interaction in YouTube text-based polylogues: A study of coherence. Journal of Computer-Mediated Communication 17(4). 501–521. https://doi.org/10.1111/j.1083-6101.2012.01579.x.Search in Google Scholar
Chandler, Daniel. 2022. Semiotics: The basics, 4th edn. London: Routledge.10.4324/9781003155744Search in Google Scholar
Chovanec, Jan & Marta Dynel. 2015. Researching interactional forms and participant structures in public and social media. Participation in Public & Social Media Interactions 256. 1.10.1075/pbns.256.01choSearch in Google Scholar
Chu, Xiaohui & Yuan Luo. 2021. 群体传播时代Vlog在城市形象传播中的应用 [The application of Vlog in city image communication in the era of mass communication]. Journalism Lover 2021(02). 85–87.Search in Google Scholar
Cheng, Wenyi, Ruwen Tian & Dickson K. W. Chiu. 2023. Travel vlogs influencing tourist decisions: Information preferences and gender differences. Aslib Journal of Information Management (ahead of print). https://doi.org/10.1108/ajim-05-2022-0261.Search in Google Scholar
Cheng, Yusi, Wei Wei & Lu Zhang. 2020. Seeing destinations through vlogs: Implications for leveraging customer engagement behavior to increase travel intention. International Journal of Contemporary Hospitality Management 32(10). 3227–3248. https://doi.org/10.1108/ijchm-04-2020-0319.Search in Google Scholar
Descartes, René. 2006. A Discourse on the method. Oxford: OUP Oxford.Search in Google Scholar
Deng, Xiujun & Siqi Tang. 2022. 跨文化传播视阈下城市形象短视频的符号整合与文本重构 [Symbolic integration and textual reconstruction of city image short video in the threshold of intercultural communication]. China Television 2022(09). 79–83.Search in Google Scholar
Dewantara, Made Handijiaya, Xin Jin & Sarah Gardiner. 2023. What makes a travel vlog attractive? Parasocial interactions between travel vloggers and viewers. Journal of Vacation Marketing. https://doi.org/10.1177/13567667231186554.Search in Google Scholar
Drule, Alexandra M., Mihai F. Băcilă, Luiza M. Souca & Raluca Ciornea. 2020. Projected destination image: A content analysis of promotional videos for city-level tourism destination. Marketing-from Information to Decision Journal 3(1). 45–57. https://doi.org/10.2478/midj-2020-0004.Search in Google Scholar
Forte, Diego L. 2023. Music and discourse: A systemic-functional approach for music analysis in multimodal contexts. Multimodality & Society 3(1). 69–81. https://doi.org/10.1177/26349795231153963.Search in Google Scholar
Gartner, William C. 1994. Image formation process. Journal of Travel & Tourism Marketing 2(2–3). 191–216. https://doi.org/10.1300/j073v02n02_12.Search in Google Scholar
Gunther Kress & Theo Van Leeuwen. 2020. Reading images: The grammar of visual design. New York: Routledge.10.4324/9781003099857Search in Google Scholar
Halliday, Michael & Christian Matthiessen. 2004. An introduction to functional grammar, 3rd edn. London: Hodder Arnold.Search in Google Scholar
Hou, Mingyi. 2019. Social media celebrity and the institutionalization of YouTube. Convergence 25(3). 534–553. https://doi.org/10.1177/1354856517750368.Search in Google Scholar
He, Jihong, Ding Xu & Tingzhen Chen. 2022. Travel vlogging practice and its impacts on tourist experiences. Current Issues in Tourism 25(15). 2518–2533. https://doi.org/10.1080/13683500.2021.1971166.Search in Google Scholar
Irfan, Muhammad, Muhammad Shaukat Malik & Syeda Khadija Zubair. 2002. Impact of vlog marketing on consumer travel intent and consumer purchase intent with the moderating role of destination image and ease of travel. SAGE Open 12(2). 21582440221099522.10.1177/21582440221099522Search in Google Scholar
Khodadadi, Masood. 2013. The role of media in the formation of destination images. In Metin Kozak & Nazmi Kozak (eds.), Aspects of tourist behavior. Newcastle: Cambridge Scholars Publishing.Search in Google Scholar
Kress, Gunther. 2010. Multimodality: A social semiotic approach to contemporary communication. London: Routledge.Search in Google Scholar
Kim, Hyangmi & Joseph S. Chen. 2016. Destination image formation process: A holistic model. Journal of Vacation Marketing 22(2). 154–166. https://doi.org/10.1177/1356766715591870.Search in Google Scholar
Klein, Kristina, Valentyna Melnyk & Franziska Voelckner. 2021. Effects of background music on evaluations of visual images. Psychology & Marketing 38(12). 2240–2246. https://doi.org/10.1002/mar.21588.Search in Google Scholar
Li, Yixuan. 2023. A study on the success of short videos themed on Beijing cuisines and its influence on the city’s image. In SHS web of conferences, vol. 155, 02010. Les Ulis: EDP Sciences.10.1051/shsconf/202315502010Search in Google Scholar
Lippmann, Walter. 2004. Public opinion. Vol.1. Piscataway: Transaction Publishers.Search in Google Scholar
Le, Linh Ha & Murat Hancer. 2021. Using social learning theory in examining YouTube viewers’ desire to imitate travel vloggers. Journal of Hospitality & Tourism Technology 12(3). 512–532. https://doi.org/10.1108/jhtt-08-2020-0200.Search in Google Scholar
Li, Yi, Xiuxiu Xu, Bo Song & He Hong. 2020. Impact of short food videos on the tourist destination image—Take Chengdu as an example. Sustainability 12(17). 6739. https://doi.org/10.3390/su12176739.Search in Google Scholar
Martin, James R. 2000. Beyond exchange: Appraisal systems in English. In: Susan Hunston and Geoffrey Thompson (eds.), Evaluation in Text: Authorial stance and the construction of discourse. 142–175. Oxford: Oxford University Press.10.1093/oso/9780198238546.003.0008Search in Google Scholar
Mercille, Julien. 2005. Media effects on image: The case of Tibet. Annals of Tourism Research 32(4). 1039–1055. https://doi.org/10.1016/j.annals.2005.02.001.Search in Google Scholar
Milman, Ady. 2012. Postcards as representation of a destination image: The case of Berlin. Journal of Vacation Marketing 18(2). 157–170. https://doi.org/10.1177/1356766711435975.Search in Google Scholar
Mumford, Lewis. 1961. The city in history: Its origins, its transformations, and its prospects, vol. 67. Boston: Houghton Mifflin Harcourt.10.2307/1846267Search in Google Scholar
Martin, James R. & David Rose. 2003. Working with discourse: Meaning beyond the clause. New York: Continuum International Publishing Group.Search in Google Scholar
Martin, James R. & Peter R. White. 2005. The language of evaluation: Appraisal in English. London/New York: Palgrave/Macmillan.Search in Google Scholar
Martin, Judith N. & Thomas K. Nakayama. 2021. Intercultural communication in contexts, 8th edn. New York: McGraw Hill.Search in Google Scholar
Martinec, Radan & Andrew Salway. 2005. A system for image — Text relations in new (and old) media. Visual Communication 4(3). 337–371. https://doi.org/10.1177/1470357205055928.Search in Google Scholar
Meng, Bingchun & Terhi Rantanen. 2016. The worlding of St. Petersburg and Shanghai: Comparing cultures of comunication in two cities before and after revolutions. Communication, Culture & Critique 9(3). 323–340. https://doi.org/10.1111/cccr.12116.Search in Google Scholar
Margulis, Elizabeth Hellmuth, Patrick C. M. Wong, Rhimmon Simchy-Gross & McAuley J. Devin. 2019. What the music said: Narrative listening across cultures. Palgrave Communications 5(1). 1–8. https://doi.org/10.1057/s41599-019-0363-1.Search in Google Scholar
Peralta, Rachel Luna. 2019. How vlogging promotes a destination image: A narrative analysis of popular travel vlogs about the Philippines. Place Branding & Public Diplomacy 15. 244–256. https://doi.org/10.1057/s41254-019-00134-6.Search in Google Scholar
Park, C. Whan & S. Mark Young. 1986. Consumer response to television commercials: The impact of involvement and background music on brand attitude formation. Journal of Marketing Research 23(1). 11–24. https://doi.org/10.2307/3151772.Search in Google Scholar
Qian, Xiao. 2022. Three narrative patterns of the city image visually presented on Instagram under the influence of self-presentation. Media, Culture & Society 44(6). 1149–1165. https://doi.org/10.1177/01634437211069968.Search in Google Scholar
Qin, Zongcai & Xinjie Li. 2021. 城市形象塑造的短视频符号修辞研究 [A study on the symbolic rhetoric of short videos for city image construction]. Modern Communication (Journal of Communication University of China) 43(04). 93–97.Search in Google Scholar
Qu, Min & Xiang Zhang. 2019. Vlog一种内容营销的新形态 [Vlog: A new form of content marketing]. Youth Journalist 2019(24). 81–82.Search in Google Scholar
Requejo, M. Dolores Porto. 2016. Music in multimodal narratives: The role of the soundtrack in digital stories. Audionarratology: Interfaces of Sound & Narrative 52. 29.10.1515/9783110472752-003Search in Google Scholar
Roux, Shanleigh Dannica. 2019. A social semiotic approach to multimodality in the Vagina Varsity YouTube campaign series. Cape Town: Univ. of the Western Cape.Search in Google Scholar
Sammut, Gordon & Martin W. Bauer. 2021. The psychology of social influence: Modes and modalities of shifting common sense. London: Cambridge University Press.10.1017/9781108236423Search in Google Scholar
Sanchez, Dan. 2023. History of vlogging, the first vlogger, & how vlogging evolved. Available at: https://danchez.com/history-of-vlogging/.Search in Google Scholar
United Nations. 2018. 68% of the world population projected to live in urban areas by 2050, says UN. Available at: https://www.un.org/development/desa/en/news/population/2018-revision-of-world-urbanization-prospects.Search in Google Scholar
Wang, Zhenhua. 2001. 评价系统及其运作——系统功能语言学的新发展 [Appraisal systems and their operation: A new development in the systemic functional linguistics]. Journal of Foreign Languages 2001(06). 13–20.Search in Google Scholar
Wood, Julia. 1997. Communication theories in action: An introduction. Belmont: Wadsworth.Search in Google Scholar
Wang, Zhenhua & Qijing Wu. 2020. 元话语和评价系统在人际意义研究上的互补 [The complementarities of metadiscourse and the appraisal system in the study of interpersonal meaning]. Contemporary Rhetoric 2020(03). 51–60.Search in Google Scholar
Way, Lyndon C. S. & Simon McKerrell (eds.). 2017. Music as multimodal discourse: Semiotics, power and protest. Oxford: Bloomsbury Publishing.Search in Google Scholar
Wu, Hongli & Mengjiao Liu. 2018. Vlog:短视频下一个爆发点—基于B站的热门Vlog视频内容生产策略分析 [Vlog: Next breakout point of video-production strategy analysis of popular vlog video based on Bilibili website]. Journal of Hubei University of Education 2018(06). 129–132.Search in Google Scholar
Xu, Ding, Tingzhen Chen, John Pearce, Zohre Mohammadi & Philip L. Pearce. 2021. Reaching audiences through travel vlogs: The perspective of involvement. Tourism Management 86. 104326. https://doi.org/10.1016/j.tourman.2021.104326.Search in Google Scholar
Yu, Ning. 2020. 抗“疫”Vlog:第一人称纪录片的叙事探析 [Vlog fighting epidemic: Exploration of narration of first-person documentary]. TV Research 2020(05). 73–76.Search in Google Scholar
Yan, Daocheng & Ziye Gao. 2023. 新媒体时代公益传播的创新转型——以B站公益Vlog为例 [The innovative transformation of public welfare communication in the new media era:The case of Bilibili public welfare vlog]. Youth Journalist 2023(02). 74–76.Search in Google Scholar
Zhang, Delu. 2009. 多模态话语分析综合理论框架探索 [On a synthetic theoretic framework for multimodal discourse analysis]. Foreign Languages in China 2009(01). 24–30.Search in Google Scholar
Zhao, Yiheng. 2022. “三联滑动”与“三性共存”——艺术与产业融合的理论基础 [Partial sliding between the three-phases of semiosis-the semio-aesthetic foundation]. Tianjin Social Sciences 2022(06). 81–87.Search in Google Scholar
Zheng, Manning. 2019. 短视频时代Vlog的价值、困境与创新 [The value, dilemma and innovation of vlog in the short video era]. China Publishing Journal 2019(19). 59–62.Search in Google Scholar
Zhan, Xuwu & Ke Li. 2020. Vlog+新闻:主流话语的传播创新路径——以“康辉Vlog”为例 [Vlog+news: The innovative path of communication of mainstream discourse – Take vlogs of Kanghui as an example]. News & Writing 2020(03). 98–102.Search in Google Scholar
Zhu, Rui & Joan Meyers-Levy. 2005. Distinguishing between the meanings of music: When background music affects product perceptions. Journal of Marketing Research 42(3). 333–345. https://doi.org/10.1509/jmkr.2005.42.3.333.Search in Google Scholar
© 2024 the author(s), published by De Gruyter, Berlin/Boston
This work is licensed under the Creative Commons Attribution 4.0 International License.
Articles in the same Issue
- Frontmatter
- Research Articles
- Victims or villains? How editorial cartoons depict the 2023 Israel – Palestine war
- Affect, credibility, and solidarity: strategic narratives of NGOs’ relief and advocacy efforts for Gaza
- “Tears have never won anyone freedom:” a multimodal discourse analysis of Ukraine’s use of memes in a propaganda war of global scale
- City images in transnational travel vlogs from a multimodal perspective: an investigation of 20 port cities worldwide
- Mobile app adoption comparison between U.S. and Chinese college students: information processing style and use frequency after download
- Featured Translated Research Outside the Anglosphere
- Automation of visual communication and aesthetic construction of national image: a computational aesthetic analysis of social bots on Twitter
Articles in the same Issue
- Frontmatter
- Research Articles
- Victims or villains? How editorial cartoons depict the 2023 Israel – Palestine war
- Affect, credibility, and solidarity: strategic narratives of NGOs’ relief and advocacy efforts for Gaza
- “Tears have never won anyone freedom:” a multimodal discourse analysis of Ukraine’s use of memes in a propaganda war of global scale
- City images in transnational travel vlogs from a multimodal perspective: an investigation of 20 port cities worldwide
- Mobile app adoption comparison between U.S. and Chinese college students: information processing style and use frequency after download
- Featured Translated Research Outside the Anglosphere
- Automation of visual communication and aesthetic construction of national image: a computational aesthetic analysis of social bots on Twitter