Startseite Linguistik & Semiotik Questions in the TED-Multilingual Discourse Bank and the development of an annotation scheme
Artikel Open Access

Questions in the TED-Multilingual Discourse Bank and the development of an annotation scheme

  • Deniz Zeyrek EMAIL logo und Amália Mendes
Veröffentlicht/Copyright: 10. Juni 2025
Linguistics Vanguard
Aus der Zeitschrift Linguistics Vanguard

Abstract

In this paper, we analyze question-answer pairs and stand-alone questions within the spoken discourse of TED Talks, specifically focusing on the TED-Multilingual Discourse Bank. Our aim is to reveal various characteristics of questions through an annotation approach. We have developed a taxonomy, referred to as the TAQ-TED, to categorize types of questions and examine their information transfer and dialogue control functions, drawing on the Dynamic Interpretation Theory++ taxonomy of dialogue acts and their attribution in accordance with the Penn Discourse Treebank annotation guidelines. We outline the taxonomy, present our annotation results, and provide a preliminary cross-linguistic analysis comparing English questions with their Turkish and Portuguese translations. The TAQ-TED represents a promising initial framework for annotating questions in monologic discourse across multiple languages.

1 Introduction

Question-answer pairs are canonical examples of adjacency pairs that have been extensively analyzed since the research by Schegloff and Sacks (1973) and Sacks et al. (1974). They are integral to understanding dialogue structure (Afantenos et al. 2012; Asher and Lascarides 1998; Stolcke et al. 2000; among others) and can also appear in monologues. The paper explores questions – both in question-answer pairs and when they appear alone – within the specific type of spoken discourse found in TED Talks by developing a taxonomy based on the Dynamic Interpretation Theory++ (DIT++) framework and the Penn Discourse Treebank (PDTB) framework. While we mainly focus on English, the ultimate goal is to apply the taxonomy to multiple languages to enhance the empirical basis for our understanding of questions and their functions, and hence their contributions to monologic discourse.

TED is an acronym that stands for Technology, Entertainment, and Design. The TED organization features presentations delivered in English by speakers from various fields, covering a wide range of topics for a live audience. The videos of TED Talks are published on TED’s website, together with English transcripts and translations into multiple languages, making them accessible to interested viewers and researchers.

The data for the current work are extracted from the TED-Multilingual Discourse Bank (TED-MDB; Zeyrek et al. 2020), a corpus of six TED Talks in English with translations into multiple languages that have been reliably annotated for discourse relations by teams of native speakers following the PDTB guidelines (Prasad et al. 2007; Webber et al. 2019). It is important to note that the TED-MDB does not aim to annotate questions, though it does include among its annotation categories a specific type of question-answer pair known as hypophora (cf. Section 2.1.1). In the annotation process, questions and the answers that depend on them were consistently sought within adjacent text spans. When identified, they have been annotated as alternative lexicalizations of a discourse relation (AltLex) with the wh-word (in wh-questions) or the auxiliary (in yes/no questions) as the lexical anchors of the relation. Hypophora has been annotated as an implicit relation in the case of declarative questions found in the Portuguese subcorpus, that is, questions that have the syntactic structure of a statement and rely solely on prosody to indicate their interrogative function. In the current work, the targeted questions encompass the questioning part of hypophora and all the remaining questions in the TED-MDB.

The taxonomy, which we call the Taxonomy for Annotating Questions in TED Talks (TAQ-TED), has been developed through a detailed study of questions in the English section of the TED-MDB. It integrates relevant functions from the DIT++ taxonomy of dialogue acts (Bunt 2009, 2012; Bunt et al. 2017a, 2020; Prasad and Bunt 2015) and the source and type features from the attribution annotation in the PDTB 2.0 (Prasad et al. 2007).

Considering the specific properties of TED Talks associated with monologic discourse, we explore questions beginning with hypophora already annotated in the TED-MDB and then expand the analysis to include stand-alone questions in the corpus data, such as check questions and rhetorical questions. Thus, one of the outcomes of this study is a small dataset of English questions annotated according to the TAQ-TED. The study culminates in a preliminary crosslinguistic analysis of English questions and their Turkish and Portuguese translations annotated with the TAQ-TED.

1.1 The role of the DIT++ taxonomy of dialogue acts and the PDTB in the current work

In the DIT++ framework, a dialogue act is a “communicative activity of a dialogue participant, interpreted as having a certain communicative function and semantic content” (Bunt et al. 2017b: 110). The semantic content specifies the objects, propositions, events, and so on that the dialogue act pertains to, and the communicative function shows how an addressee should use the semantic content to update their information state. Communication involves performing several activities in parallel (e.g., pursuing a specific task or activity, providing and eliciting feedback, taking turns), each of which is referred to as a dimension. Two classes of communicative functions are distinguished: general-purpose information transfer and action discussion functions that can apply to any semantic content, and dialogue control functions that apply to acts in a specific dimension, such as Take Turn and Turn Release in the Turn Management dimension (Bunt et al. 2012). The DIT++ taxonomy has evolved into a comprehensive, multidimensional framework for capturing dialogue acts and their functional aspects through a series of releases. It currently includes a set of hierarchically organized general-purpose communicative functions, dimension-specific functions (release 5.1), and a core set of discourse relations (release 5.2). This framework serves as the foundation for ISO standards related to dialogue annotation.

The functions we find most suitable for analyzing questions in TED Talks include (a) general-purpose information-seeking and information-providing functions and (b) dialogue control functions involving dimension-specific allo-feedback and discourse-structuring functions. These functions, along with the PDTB’s attribution annotation, are at the core of the current analysis.

The PDTB framework offers a lexically grounded approach to discourse relations (DRels), which are semantic relations that hold between two adjacent text segments with an “abstract object” interpretation, such as eventualities, propositions, conditions, and facts (Asher 1993), referred to as Argument1 and Argument2. A lexically grounded approach means identifying the DRel as, for instance, a temporal, contingency, or comparison relation based on explicit markers, known as discourse connectives, such as then, because, and nonetheless. Even without an explicit cue, annotators identify the implicit DRel based on the adjacency of discourse segments and insert a connective that best represents the underlying semantic relation. It should be highlighted here that there is an important distinction between the DIT++ taxonomy and the PDTB framework. While the PDTB focuses on identifying the binary arguments of DRels, which serve as the basis for inferring DRels, the DIT++ scheme assigns functions to arguments rather than the relations themselves. This paper excludes DRel annotation from its scope, as any relations between a question and the adjacent text segment have already been annotated in the TED-MDB, and focuses on the functions of questions.

The PDTB annotates attribution by identifying the agent to whom a DRel or its arguments are attributed. In the PDTB framework, “attribution is a relation of ‘ownership’ between abstract objects and individuals or agents” (Prasad et al. 2007: 40). A DRel may hold between the attributions (along with the agents responsible for them) or between the abstract object arguments of the attribution. In (1), for example, the temporal relation instantiated by when holds between the eventuality of “Mr. Green winning the verdict” and “the judge giving him an additional award”, while in (2), the contrast relation conveyed by while holds between the agent arguments of the attribution relation. Thus, attribution is part of the contrast relation. Both examples are from the PDTB. In these examples, italic fonts indicate Argument1 and bold fonts indicate Argument2 of the DRel. Underlined words show discourse connectives.

(1)
When Mr. Green won a $240,000 verdict in a land condemnation case against the state in June 1983, he says Judge O’Kicki unexpectedly awarded him an additional $100,000.
(2)
Advocates said the 90-cent-an-hour rise, to $4.25 an hour by April 1991, is too small for the working poor, while opponents argued that the increase would still hurt small businesses and cost many thousands of jobs.

In this study, we annotate attribution by applying the PDTB 2.0 guidelines to question-answer pairs as well as stand-alone questions in the corpus data.

1.2 Comparison of our approach with existing approaches

Our methodology focuses on explicitly posed questions within the local discourse context. It diverges from existing approaches, such as that of Westera et al. (2020), who annotate “evoked” questions after every sentence based on the TED-MDB within the Question Under Discussion (QUD) theories (Roberts 2012). While Westera et al. (2020) develop a methodology to elicit actual and potential QUDs (from nonexpert annotators), we leave implicit (pragmatically retrieved) questions out of scope.

Our approach also differs from that of Silvano et al. (2022) and Silvano et al. (2025), who annotate TED Talks in multiple languages according to ISO standards. While the annotation categories, such as question type and the communicative function of feedback elicitation, are similar to theirs, our focus differs. For instance, we concentrate on questions but also take into account the subsequent discourse segment to distinguish between stand-alone questions and those followed by an answer (given by the speaker or another agent). In addition, we address the questions’ general-purpose and dimension-specific functions, as outlined in the DIT++ framework.

The rest of the paper proceeds as follows: Section 2 describes the types of questions found within the data, examines their functions according to the DIT++ framework, and introduces how attribution is identified according to the PDTB framework. Section 3 outlines the methodology, while Section 4 presents the data analysis, along with illustrative examples and a brief comparison of English questions with their translations into Portuguese and Turkish in the TED-MDB, to understand whether the approach developed here can reveal similarities or differences in non-English languages. Finally, Section 5 offers the conclusion.

2 Dissecting questions: their types, functions, and attribution

Questions are speech acts that express a goal and direct the addressee to provide an answer within cooperative dialogue (Asher and Lascarides 1998; Searle 1969). But not all questions are asked to be answered, and not all responses fulfill the property of answerhood. Since we not only deal with questions that have a corresponding answer, but also those that stand alone, we address the notion of answerhood, by drawing on Ginzburg’s (1995a, 1995b) seminal work, which identifies three properties of answers: truth, resolvedness, and aboutness. Truth pertains to the semantic content of the answer; resolvedness ensures that the answer is fully informative within the given context, and aboutness shows that the answer provides the required information. These properties clearly define which question-answer pairs are co-dependent, namely, hypophora, in the TED-MDB.

Our scheme also takes into consideration the type of question. Questions may take the form of a propositional question (i.e., a yes/no question), a set question (i.e., a wh-question), or a choice question (i.e., given a list of alternative propositions, the speaker wants to know which one is true). Yes/no questions can be expressed as declarative questions that rely solely on prosody.

2.1 Questions with and without answers

This section details how we deal with questions with an answer and stand-alone questions in the data. In the rest of the paper, each question type is given initial capitals to denote that it is included in the TAQ-TED as an annotation category. In the examples, bold fonts represent the question, and italics indicate the answer. The file number in the TED-MBD of the TED Talk that each question comes from is shown at the end of each example (in parentheses), together with the DRel label assigned to the question and the adjacent segment; while the category of the question in the TAQ-TED is indicated in an introductory line.

2.1.1 Hypophora

In Zeyrek et al. (2020), the figure of speech called hypophora (Lanham 1991) is used to describe questions and answers where the speaker, using their voice, assumes the roles of both questioner and answerer to ask questions that motivate the listener, attract their attention, or persuade them to think in a specific way. The speaker always provides the answer themselves. The label Hypophora was used to address question-answer pairs in TED Talks, based on the implicit assumption that TED speakers mimic real verbal exchanges in communication. Hypophora can be considered among the linguistic manifestations of a “fictive verbal interaction” (see Pascual 2006, 2014), as the questions serve other functions in the discourse rather than inviting a response from the addressee.

Hypophora denotes a specific question-answer pair where the speaker is the source of the question and the answer, and the answer possesses the fundamental properties of answerhood. Consider the question-answer pair in example (3), annotated as Hypophora on these criteria, and example (4), which is not marked as Hypophora in the annotation process since the textual segment that follows the question fails to meet the criterion of resolvedness. In the TED-MDB, (4) is annotated as a NoRel (no relation) instance, referring to a weak coherence link between adjacent sentences. As explained below, the question part of (4) is treated in the present study as a Stand-Alone Question, more specifically, a Rhetorical Question.

(3)
Hypophora
Are investors, particularly institutional investors, engaged? Well, some are, and a few are really at the vanguard. (TED-MDB_1927|Hypophora)
(4)
Rhetorical Question
How many times have we designated something a classic, a masterpiece even, while its creator considers it hopelessly unfinished, riddled with difficulties and flaws, in other words, a near win? Elizabeth Murray surprised me with her admission about her earlier paintings. (TED-MDB_1978|NoRel)

2.1.2 Stand-Alone Questions

This category incorporates three types of questions in the data:

  1. Check Questions that do not receive an explicit answer

  2. Rhetorical Questions, where no answer is expected

  3. questions that remain unanswered but cannot be classified as either Rhetorical Questions or Check Questions

Stand-Alone Questions of type (c) present hypothetical future scenarios to the listener. This is regarded as an information-providing act rather than an information-seeking act, and the questions are referred to as Informative Questions. None of the above-listed Stand-Alone Questions were annotated as questions per se in the TED-MDB. However, questions categorized as (a) and (b) were labeled as an argument of a NoRel or EntRel (entity-based coherence) relation. The current work examines all these question types to capture their characteristics independently from the subsequent discourse segment, hence the term Stand-Alone Question.

Check Questions consist of declarative and interrogative parts, combining an assertion with a request for agreement or confirmation (Dayal 2016). In the DIT++ taxonomy, Check Questions come in negative and positive forms.

Rhetorical Questions are syntactically structured as interrogatives but are not intended to solicit information; instead, they function as assertions, either positive or negative (Dayal 2016).

Informative Questions, specifically those serving an information-providing function by providing hypothetical scenarios, involve cases like (5):[1]

(5)
Informative Question
What if there isn’t enough clean air and fresh water? Now a fair question might be, what if all this sustainability risk stuff is exaggerated, overstated, it’s not urgent, something for virtuous consumers or lifestyle choice? (TED-MDB_1927|Unannotated)

2.2 Functions of questions

As mentioned, the DIT++ scheme identifies two essential communicative functions relevant to the analysis of questions: general-purpose information transfer functions; and dialogue control functions that apply to acts in a specific dimension. In the TED-MDB, we focused on the hypophora type of question-answer pairs and treated the two segments as part of a single communicative intention; here, we observe the functions of the questioning part of Hypophora and Stand-Alone Questions with respect to both functions in the DIT taxonomy.

Let us examine how the general-purpose information transfer functions and the dimension-specific dialogue control functions we are concerned with apply to questions in the data.

2.2.1 Information transfer functions

Information-Seeking: The information-Seeking function is evident in the questioning parts of Hypophora. It is also applicable to Check Questions, as speakers often aim for nonverbal responses in TED Talks, such as laughter or applause, that imply affirmative answers. Even if such responses are not received, Check Questions should still be considered Information-Seeking because they are part of a fictive interactive setting, representing the simulated interaction between the speaker and the audience.

Information-Providing: Information-Providing applies to Rhetorical Questions, where the answer is either known to the speaker and audience or considered obvious (Dayal 2016). It also applies to Informative Questions.[2]

2.2.2 Dialogue control functions

Feedback Elicitation: In the DIT++ scheme, feedback-specific functions include auto- and allo-feedback dimensions. While auto-feedback relates to the attention and processing of a previous utterance by the speaker, allo-feedback pertains to the speaker’s beliefs about the addressee’s attention and processing. The communicative functions of allo-feedback, which include feedback-providing and feedback-elicitation functions (Bunt 2012), are very pertinent to TED Talks. Allo-feedback means that the speaker asks the addressee to provide information about how the addressee interpreted the speaker’s previous utterance without expecting verbal confirmation. For instance, in a manner that mirrors real dialogues, TED speakers can employ Check Questions as feedback elicitation mechanisms (specifically as an allo-feedback elicitation act) to check the addressee’s comprehension, thereby actively engaging the audience in the communication process.

Although the DIT++ taxonomy identifies several processing levels related to allo-feedback elicitation (e.g., attention, perception, interpretation), these are not considered in the annotation.

Discourse-Structuring: In our data, questions serve a discourse-structuring function, particularly when the speaker makes transitions in speech, such as when the speaker introduces a new idea unit or concludes a (sub)topic. Currently, it is used as a top-level category in the TAQ-TED, with further specifications, such as opening, topic introduction, and so on left for future research on additional TED Talks.

Attention-Grabbing: In line with the multidimensionality of the DIT++ taxonomy, another function related to interactional management is identified: maintaining audience interest. This function is not included in the DIT++ taxonomy. In the TAQ-TED, we refer to it as the Attention-Grabbing function of questions.

2.3 Attribution

We also identify attribution, which can apply to questions as well as answers. Attribution is not a new question type but rather an orthogonal feature that operates over both questions and answers.

Among the key properties of attribution identified by the PDTB framework, we chose to use source and type features. The source feature distinguishes between different types of agents:

  1. Writer/Speaker (Speaker in our case)

  2. some specific agents introduced in the text (Other agents)

  3. some arbitrary individual expressed by a nonspecific reference in the text (Arbitrary) indicated by adverbs like reportedly and allegedly

In addition, we introduce a new source for questions that we observe in the dataset: the Virtual speaker (Grésillon and Lebrave 1984). The Virtual speaker differs from an Arbitrary individual; it represents a non-specified agent’s perspective without explicit clues and potentially involves the addressee’s perspective as well. It is implicit attribution, inferred from the discourse context. It typically applies to the questioning part of Hypophora but can also apply to Stand-Alone Questions.

The type feature describes the relationship between abstract objects and agents, leading to varying inferences about the factuality of the abstract object. The PDTB recognizes four types of abstract objects (assertion propositions, belief propositions, facts, and eventualities) that correspond to attribution relations and the types of verbs that convey them: (a) verbs of communication (e.g., tell, say), (b) propositional attitude verbs (e.g., think, believe), (c) factive predicates (e.g., know, regret) and (d) control verbs (e.g., persuade, order).

The source of questions and answers and the types of attribution relations are captured in the TAQ-TED by annotating verbs of attribution along with their subject – the results are known as attribution spans. As in the PDTB, attribution spans serve as surface cues for the features associated with attribution.

3 Methodology

The methodology involved extracting questions from the English part of the TED-MDB, which contains 34 instances, and examining their empirically observed properties.

Once the questions and adjacent discourse segments were extracted, they were saved in a spreadsheet together with the DRel label assigned to each question and the adjacent discourse segment. The authors examined these questions and the subsequent discourse segments and annotated the functions of questions individually, considering the role of the adjacent segment and the broader context. This process led to the first version of the TAQ-TED and the annotation guidelines. Through several rounds of discussion between the authors, differences were reconciled, providing an understanding of the inter-annotator agreement, and the taxonomy, as well as the annotation guidelines, were updated. The annotations were reviewed several times, repeating the discussion-reconciliation-updating cycle until the version of the taxonomy presented here was reached.

Table 1 outlines the TAQ-TED, featuring three categories (Q/A, QTYPE, FUNCTION), their types, and their subtypes, where available. The label Q/A refers to the presence of a question, with or without an answer, dividing those without an answer (Stand-Alone Questions) into four subtypes. QTYPE identifies the type of all the questions in the data. FUNCTION defines the communicative roles of questions, dividing the communicative roles into those that represent general-purpose Information Transfer functions and those that represent dimension-specific Dialogue Control functions, each with their respective subtypes. In addition, the TAQ-TED includes a set of attribution features (see Table 2) where the default case for the source of questions and answers is Speaker, and the default verb type is Null.

Table 1:

Categories in the Taxonomy for Annotating Questions in TED talks (TAQ-TED).

Category Type Subtype
Q/A Hypophora (question with an answer)
Stand-Alone Question (question with no answer) Check Question (Positive)

Check Question (Negative)

Rhetorical Question

Informative Question

QTYPE Set Question (wh-question)
Propositional Question (yes/no question)
Declarative Question
Choice Question

FUNCTION Information Transfer Information-Providing
Information-Seeking
Dialogue Control Allo-Feedback Elicitation
Discourse-Structuring
Attention-Grabbing
Table 2:

Attribution features in the Taxonomy for Annotating Questions in TED talks (TAQ-TED) and the values assigned to them.

Features Values
Source Speaker
Other (Nonstandard)
Arbitrary (Nonstandard)
Virtual
Null

Type Communication verb
Control verb
Factive verb
Propositional attitude verb
Null

Figures 1 and 2 illustrate how we integrate our approach into the DIT++ scheme. The unused categories are grayed out, and the new categories are highlighted with dotted lines.

Figure 1: 
General-purpose communicative functions of the DIT++ taxonomy used in the TAQ-TED (grayed: unused; dotted: new).
Figure 1:

General-purpose communicative functions of the DIT++ taxonomy used in the TAQ-TED (grayed: unused; dotted: new).

Figure 2: 
The dimension-specific feedback functions of the DIT++ taxonomy used in the TAQ-TED.
Figure 2:

The dimension-specific feedback functions of the DIT++ taxonomy used in the TAQ-TED.

Furthermore, we implement the following revisions on question annotations in the data:

  1. In contrast to the TED-MDB, where Hypophora is marked as an AltLex relation (except for Declarative Questions in Portuguese), we consider Hypophora, particularly its questioning part, a dialogue act in this work.

  2. Unlike in the TED-MDB, question-answer pairs are identified in adjacent and nonadjacent spans.

  3. In the TED-MDB, question-answer pairs attributed to Other or Arbitrary agents are annotated as implicit DRels that hold between the binary arguments of the DRel – one argument being the question and its attribution, and the other being the answer and its attribution. In this work, we refer to these as Nonstandard cases and revise the annotations. That is, the attribution span is stripped away, and the source of attribution and the types of attribution relations are added, as demonstrated in Section 4.2.

4 Data analysis

This section presents the results from the annotated corpus, provides examples to demonstrate the methodology, and briefly compares the observed differences in two translated languages: Turkish and Portuguese. Given the low number of annotated instances (34 instances per language), the analysis of Turkish and Portuguese translations aims to highlight the trends that appear after a short analysis, rather than drawing definitive conclusions.

4.1 Results

Table 3 presents the distribution of Q/As in the English data according to their QTYPE and shows that while Choice Questions and Declarative Questions are not found, Stand-Alone Questions and Hypophora are equally prevalent.

Table 3:

Number of questions (Q/A) in the data and their distribution by QTYPE.

Q/A QTYPE Total
Yes/no question Wh-question Choice question Declarative question
Stand-Alone Question 6 8 0 0 14
Hypophora (question element) 3 11 0 0 14
Nonstandard question 1 5 0 0 6
Total 10 24 0 0 34

The questions observed in the data include 14 Stand-Alone Questions comprised of five Check Questions, five Rhetorical Questions, four Informative Questions. There are also 14 instances of the question element of Hypophora. In addition, the data includes six Nonstandard cases. Among these, three involve questions and answers with attribution, two involve only answers with attribution, and one involves only a question attributed to another agent.

Table 3 highlights a predominance of wh-questions over yes/no questions. According to Levinson (1983: 184), “Yes/no questions will generally have vacuous presuppositions, being the disjunction of their possible answers.” For instance, the question Are investors, particularly institutional investors, engaged? (from TED-MDB Talk no. 1927) presupposes that investors are engaged, or that investors are not engaged. The predominance of wh-questions implies that the TED speakers in the corpus prefer to avoid questions with vacuous presuppositions. This might be related to the Discourse-Structuring and the Attention-Grabbing functions: the speaker starts from a proposition for which a truth-value is provided, and the question will focus on one variable the listener is asked to fill in. For instance, in “why does X?”, X is taken as true, and the listener is asked to reflect on the missing variable (why) and provide an answer (Attention-Grabbing). The listener is no longer passively listening and is actively thinking about the content of the utterance.

Table 4 lists the FUNCTIONS associated with questions, showing that the questions in the data serve more than one function simultaneously, as envisioned by the DIT++ framework. For example, dimension-specific dialogue functions can occur with both Information-Seeking and Information-Providing functions. The five Stand-Alone Questions shown in the first row of the table have Information-Providing and Allo-Feedback Elicitation functions. Check Questions are associated with both Information-Seeking and Attention-Grabbing functions, and the question element of Hypophora is associated with Information-Seeking and Discourse-Structuring functions.

Table 4:

Distribution of FUNCTIONS in the data by question type (Q/A).

FUNCTION Q/A Hypophora Nonstandard questions Total
Stand-Alone Questions
Information-Providing & Allo-Feedback Elicitation 5 (1 Rhetorical Question; 4 Informative Questions) 0 0 5
Information-Providing & Discourse-Structuring 1 (1 Rhetorical Question) 0 2 3
Information-Providing & Attention-Grabbing 3 (3 Rhetorical Questions) 0 4 7
Information-Seeking & Allo-Feedback Elicitation 4 (4 Check Questions) 0 0 4
Information-Seeking & Discourse-Structuring 0 9 0 9
Information-Seeking & Attention-Grabbing 1 (1 Check Question) 5 0 6
Total 14 14 6 34

4.2 Examples from the annotated corpus

In this section, we demonstrate the methodology with examples from the annotated corpus. Square brackets denote the functional aspects of the questions annotated in the current work. The table following each example presents the attribution features (Source, Type).

Example (6) illustrates an instance of Hypophora, where the question is annotated with the Discourse-Structuring function. The annotation also captures the fact that the question is attributed to a Virtual speaker.

Question Answer
Source Virtual Nulla
Type Null Null
  1. aThe “Null” value signifies the absence of an attribution relation, and hence the lack of a verb that conveys such a relation.

(6)
Hypophora
So if sustainability matters financially today, and all signs indicate more tomorrow, is the private sector paying attention? Well, the really cool thing is that most CEOs are. (TED-MDB_1927|Hypophora) [Discourse-Structuring]

Example (7) demonstrates a (Positive) Check Question asked by the speaker after recounting a success story.[3] The Check Question is annotated as an Allo-Feedback Elicitation, and, simultaneously, it is attributed to the Speaker, the default scenario in most of the samples.[4]

Question Answer
Source Speaker Null
Type Null Null

(7)
Check Question (Positive)
That’s the equivalent of taking 21,000 cars off the road. So awesome, right? Another example is Pentair. (TED-MDB_1927|NoRel) [Allo-Feedback Elicitation]

Example (8), yet another Stand-Alone Question, is a Rhetorical Question posed by the speaker to discuss the mastery of craft as a process rather than an end point. The Rhetorical Question states that “many inventors and untold entrepreneurs live out this phenomenon”, and it functions as an Attention-Grabbing element. Moreover, the question is attributed to a Virtual speaker.

Question Answer
Source Virtual Null
Type Null Null

(8)
Rhetorical Question
Mastery is about sacrificing for your craft and not for the sake of crafting your career. How many inventors and untold entrepreneurs live out this phenomenon? We see it even in the life of the indomitable Arctic explorer Ben Saunder … (TED-MDB_1978|EntRel) [Attention-Grabbing]

We demonstrate how we annotate Nonstandard cases through examples (9) and (10). In the TED-MDB, example (9) is annotated with the implicit DRel Expansion:Conjunction that holds between the eventuality of the professor asking the speaker “if he knew how to solve this problem” (Argument1) and “the speaker saying ‘No, not yet, but I would love to figure it out’” (Argument2). This annotation has been revised by removing the attribution span from the DRel (indicated as boxed). The sources of both the question and the answer (Other and Speaker, respectively) are annotated, along with verbs of communication indicating the types of attribution relations. In addition, the function of the question is annotated (with the Discourse-Structuring function).

(9)
Nonstandard
Question Answer
Source Other Speaker
Type Communication Communication

Example (10) also involves a Nonstandard instance previously annotated in the TED-MDB as an implicit Contingency:Cause:Reason relation, linking the question in Argument1 to the answer in Argument2 along with its attribution. The updated annotation separates the attribution span and includes additional features, specifically, the source of the question and its answer (Virtual and Other, respectively) as well as the values indicating the types of attribution relations (Null and Propositional Attitude verb, respectively). Furthermore, the function of the question (Attention-Grabbing) is annotated.

(10)
Nonstandard
Question Answer
Source Virtual Other
Type Null Propositional attitude

4.3 Crosslinguistic explorations

The annotations of the questions in English were compared with those in the Portuguese and Turkish translations. The comparisons indicated that the translations are similar in many respects: the functions of questions and their distribution across texts are the same; the sources of questions (or answers), types of attribution, and the verbs that convey them show close similarities. Of course, the syntactic properties of the questions may vary across languages due to the specific grammatical rules governing question formation in those languages, such as declarative questions in Portuguese (see (11) and its translation to Portuguese in (12a)). Declarative questions are not observed in the English talks or their Turkish translation in the dataset, although both languages can ask questions in the same way.

(11)
Do companies that take sustainability into account really do well financially? The answer that may surprise you is yes. (TED-MDB, EN_1927) [Attention-Grabbing]
(12)
a.
As companhias que praticam a sustentabilidade estão mesmo bem financeiramente? (TED-MDB, PT_1927) [Attention-Grabbing]
b.
‘Companies that practice sustainability are really well off financially?’ (back translation of (12a))

However, there are differences between English and translated languages in presenting the source of attribution. For example, Portuguese texts may present the source of the answer differently, as shown in (13) and (14). In this excerpt, the English expression one example is the subject of the text containing the answer, while the source is not mentioned. In Portuguese, the corresponding noun phrase is preceded by a verb in the first person plural form, temos um exemplo ‘we have an example’, as demonstrated in the back translation in (14b). This first person plural verb specifies the source as the speaker and the addressees, in effect reducing the distance between the TED speaker and the addressees.

(13)
So, how are companies actually leveraging ESG to drive hard business results? One example is near and dear to our hearts. (TED-MDB_1927|Hypophora) [Discourse-Structuring]
(14)
a.
Então, como é que as empresas estão a promover o ASG para alcançarem resultados sólidos? Temos um exemplo bem perto que nos é muito querido. (TED-MDB_1927|Hypophora) [Discourse-Structuring]
b.
‘So how are companies leveraging ESG to obtain solid results? We have an example very close that is dear to our hearts.’ (back translation of (14a))

This observation is supported by an analysis by Celle (2009), who discusses different strategies for expressing questions in academic texts in French and English, using a sample of translated examples. Her findings indicate differences concerning attribution. She shows that in French academic discourse, questions are very frequent and involve the addressee as a fictitious anchor point. In English, the distance between the speaker and the addressee tends to be closer (as seen in the use of we, for instance), and consequently, questions are translated as embedded or supplemented structures with a definite source of attribution.

A similar difference is observed in the Turkish translations of questions. In (15), the question and its answer are direct quotes attributed to others, expressed through communicative verbs meaning ‘ask’ and ‘say’. In the Turkish translation in (16), although the question and the answer are attributed to specific sources as in the original language, the answer is presented differently, by the expression şu cevabı ‘this answer’ meaning ‘the following answer’ (see the back translation (16b)).

(15)
The Paris Review got it out of James Baldwin when they asked him, “What do you think increases with knowledge?” and he said, “You learn how little you know.” (TED-MDB, EN_1978|Expansion:Conjunction)
(16)
a.
Paris Review, James Baldwin’e sorduğunda şu cevabı alabilmişti, “Bilgiyle artan şey sizce nedir?” ve o da şöyle söyledi, “Ne kadar az bildiğini öğreniyorsun.” (TED-MDB, TR_1978| Expansion:Conjunction)
b.
‘When Paris Review interviewed James Baldwin, they got this answer: “What do you think increases with knowledge?” and he said, “You learn how little you know.”’ (back translation of (16a))

Turkish has three determiners and three demonstrative pronouns derived from the determiners – bu ‘this’, şu ‘that’, and o ‘that, yonder’ – referring to objects at varying distances from the speaker. However, a major difference between şu and the others is that while bu and o usually refer to entities mentioned before, şu is almost always used to draw the addressee’s attention to an entity for the first time (Göksel and Kerslake 2004; Küntay and Özyürek 2006). It typically occurs when the speaker and the addressee are in a close, face-to-face relationship. In discourse, it can be used cataphorically to direct the listener’s attention to the forthcoming discourse segment. Therefore, it can be inferred that the choice of the determiner şu in the translation of (15) serves as a tactic to direct the listeners’ focus toward the answer. This usage is addressee-oriented and thus has the potential to reduce the distance between the TED speaker and the audience.

These preliminary observations indicate that TED speakers and translators use tactics to reduce the distance between themselves and the audience. The range of linguistic means by which this is achieved requires further research in multilingual corpora.

5 Conclusions

This research examined the variety of questions found in the English section of the TED-MDB and identified their characteristics and functions, which enhances our understanding of questions in TED Talks. Using established frameworks, we developed a taxonomy that classifies question types, determines whether questions have answers that meet the criteria for answerhood or function as stand-alone questions, outlines their pragmatic roles, specifies to whom questions or question-answer pairs are attributed, and describes the type of the attribution relationships.

Going beyond questions asked and answered by the speaker – that is, Hypophora – already annotated in the TED-MDB, Stand-Alone Questions, including Check Questions, Rhetorical Questions, and Informative Questions, were identified in the data. These were kept distinct from Nonstandard cases that involved question-answer pairs attributed to others.

The Attention-Grabbing and Discourse-Structuring functions of the questioning component in Hypophora, along with the Allo-Feedback Elicitation and the Attention-Grabbing functions of Stand-Alone Questions lead us to conclude that in TED Talks, questions play an indispensable role. They emerge as a rhetorical tactic that invites the audience to pay attention, contemplate potential answers, and then evaluate the speaker’s actual response against these possible answers. This approach bridges the gap between speakers and the audience, both in the hall and among viewers of TED videos, breaking the monotony of possibly pre-scripted material, adding interest, and enhancing the overall effectiveness of the message. Questions are also used to orient the discourse toward a new topic or aspect of a current topic, serving a function at the discourse-structuring level. While all these characteristics apply in a variety of other genres, this work highlights them in an “emergent genre” (Ludewig 2017).

Our research primarily focuses on the findings from the English section of the corpus, though we also examined the Turkish and Portuguese parts. Preliminary observations revealed similarities among the languages. Notably, questions were consistently translated into non-English languages while maintaining the same communicative function. This observation may be attributed to the nature of translation in general and TED Talks in particular, where the primary goal is to preserve the original meaning and rhetorical strategies of the source text. Differences emerged in how the source of the question or answer was translated, and it was observed that translators tended to choose linguistic structures that reduced the distance between the speaker and the audience.

The approach adopted in devising the TAQ-TED aims to appeal to researchers investigating various aspects of questions, their pragmatic functions, and contributions to monologic discourse. It has the potential to be scaled up to annotate additional samples from TED Talks for applications in natural language processing.

However, this work is limited by the number and type of questions observed in the English section of the TED-MDB. Future research could expand the dataset with more TED Talks, enabling a more comprehensive analysis of questions and subsequent enhancement of the TAQ-TED. In addition, future studies could explore the role of prosody in questions. Finally, applying the proposed scheme for annotating questions in other types of monologic discourse would enhance the use and effectiveness of the current analysis.


Corresponding author: Deniz Zeyrek, Cognitive Science Department, Middle East Technical University, Ankara, Türkiye; and Department of Linguistics, Boğaziçi University, Istanbul, Türkiye, E-mail:

Acknowledgments

We would like to thank the anonymous reviewers for their helpful comments on earlier versions of this paper. We would also like to thank Mustafa Erolcan Er for automatically extracting the questions from the TED-MDB, and Alan Libert for proofreading the paper and for his insightful comments. Any remaining errors are our own. This work was partially supported by Fundação para a Ciência e a Tecnologia as part of the project of Centro de Linguística da Universidade de Lisboa (UIDB/00214/2020).

References

Afantenos, S., N. Asher, F. Benamara, A. Cadilhac, C. Dégremont, P. Denis, M. Guhe, S. Keizer, A. Lascarides, O. Lemon, P. Muller, S. Paul, V. Popescu, V. Rieser & L. Vieu. 2012. Modelling strategic conversation: Model, annotation design and corpus. In S. Brown-Schmidt, J. Ginzburg & S. Larsson (eds.), Proceedings of SemDial 2012: The 16th workshop on the semantics and pragmatics of dialogue, 167–168. Paris: Université Paris-Diderot (Paris 7).Suche in Google Scholar

Asher, Nicholas. 1993. Reference to abstract objects in discourse. Dordrecht: Kluwer Academic.10.1007/978-94-011-1715-9Suche in Google Scholar

Asher, Nicholas & Alex Lascarides. 1998. Questions in dialogue. Linguistics and Philosophy 21. 237–309.10.1023/A:1005364332007Suche in Google Scholar

Bondarenko, Antonina. 2025. Verbless questions in oral monologic discourse. Linguistics Vanguard.Suche in Google Scholar

Bunt, Harry. 2009. The DIT++ taxonomy for functional dialogue markup. In D. Heylen, C. Pelachaud, R. Catizone & D. Traum (eds.), Proceedings of AAMAS 2009 workshop, towards a standard markup language for embodied dialogue acts, 13–24. Budapest: International Foundation for Autonomous Agents and Multi-agent Systems.Suche in Google Scholar

Bunt, Harry. 2012. The semantics of feedback. In S. Brown-Schmidt, J. Ginzburg & S. Larsson (eds.), Proceedings of SemDial 2012: The 16th workshop on the semantics and pragmatics of dialogue, 118–127. Paris: Université Paris-Diderot (Paris 7).Suche in Google Scholar

Bunt, Harry, Jan Alexandersson, Jae-Woong Choe, Alex Chengyu Fang, Koiti Hasida, Volha Petukhova, Andrei Popescu-Belis & David Traum. 2012. ISO 24617-2: A semantically-based standard for dialogue annotation. In Proceedings of the eighth international conference on language resources and evaluation (LREC’12), 430–437. European Language Resources Association.Suche in Google Scholar

Bunt, Harry, Volha Petukhova & Alex Fang. 2017a. Revisiting the ISO standard for dialogue act annotation. In H. Bunt (ed.), Proceedings of the 13th joint ISO-ACL workshop on interoperable semantic annotation (ISA-13), 37–50. Tilburg: Tilburg University.Suche in Google Scholar

Bunt, Harry, Volha Petukhova, David Traum & Jan Alexandersson. 2017b. Dialogue act annotation with the ISO 24617-2 standard. In D. Dahl (ed.), Multimodal interaction with W3C standards: Toward natural user interfaces to everything, 109–135. Springer.10.1007/978-3-319-42816-1_6Suche in Google Scholar

Bunt, Harry, Volha Petukhova, Emer Gilmartin, Catherine Pelachaud, Alex Fang, Simon Keizer & Laurent Prévot. 2020. The ISO standard for dialogue act annotation. In Proceedings of the 12th edition of its language resources and evaluation conference (LREC 2020), 549–558. European Language Resources Association.Suche in Google Scholar

Cardo, Michèle & Agnès Celle. 2025. Question sequences in English TED Talks. Linguistics Vanguard.10.1515/lingvan-2025-0027Suche in Google Scholar

Celle, Agnès. 2009. Question, mise en question: La traduction de l’interrogation dans le discours théorique. Revue Française de Linguistique Appliquée 14(1). 39–52. https://doi.org/10.3917/rfla.141.0039.Suche in Google Scholar

Dayal, Veneeta. 2016. Questions. Oxford: Oxford University Press.Suche in Google Scholar

Ginzburg, Jonathan. 1995a. Resolving questions, I. Linguistics and Philosophy 18. 459–527. https://doi.org/10.1007/bf00985365.Suche in Google Scholar

Ginzburg, Jonathan. 1995b. Resolving questions, II. Linguistics and Philosophy 18. 567–609. https://doi.org/10.1007/bf00983299.Suche in Google Scholar

Göksel, Aslı & Celia Kerslake. 2004. Turkish: A comprehensive grammar. London and New York: Routledge.10.4324/9780203340769Suche in Google Scholar

Grésillon, Almuth & Jean-Louis Lebrave. 1984. Qui interroge qui et pourquoi? In A. Grésillon & J.-L. Lebrave (eds.), La langue au ras du texte, 57–72. Lille: Presses Universitaires de Lille.Suche in Google Scholar

Küntay, Aylin & Aslı Özyürek. 2006. Learning to use demonstratives in conversation: What do language specific strategies in Turkish reveal? Journal of Child Language 33. 303–320. https://doi.org/10.1017/S0305000906007380.Suche in Google Scholar

Lanham, R. 1991. A handlist of rhetorical terms. Berkeley: University of California Press.10.1525/9780520912045Suche in Google Scholar

Levinson, Stephen C. 1983. Pragmatics. Cambridge: Cambridge University Press.Suche in Google Scholar

Ludewig, Julia. 2017. TED Talks as an emergent genre. CLCWeb: Comparative Literature and Culture 19(1). https://doi.org/10.7771/1481-4374.2946.Suche in Google Scholar

Pascual, Esther. 2006. Fictive interaction within the sentence: A communicative type of fictivity in grammar. Cognitive Linguistics 17(2). 245–267. https://doi.org/10.1515/cog.2006.006.Suche in Google Scholar

Pascual, Esther. 2014. Fictive interaction: The conversation frame in thought, language, and discourse. Amsterdam: John Benjamins.10.1075/hcp.47Suche in Google Scholar

Prasad, Rashmi & Harry Bunt. 2015. Semantic relations in discourse: The current state of ISO 24617-8. In Proceedings of the 11th joint ACL-ISO workshop on interoperable semantic annotation (ISA-11). Available at: https://aclanthology.org/W15-0210/.Suche in Google Scholar

Prasad, Rashmi, Eleni Miltsakaki, Nikhil Dinesh, Alan Lee, Aravind Joshi, Livio Robaldo & Bonnie Webber. 2007. The Penn Discourse Treebank 2.0 annotation manual. 17 December 2007. https://www.cis.upenn.edu/∼elenimi/pdtb-manual.pdf (accessed 22 April 2025).Suche in Google Scholar

Roberts, Craige. 2012. Information structure: Towards an integrated formal theory of pragmatics. Semantics and Pragmatics 5(6). 1–69. https://doi.org/10.3765/sp.5.6.Suche in Google Scholar

Sacks, Harvey, Emanuel A. Schegloff & Gail Jefferson. 1974. A simplest systematics for the organization of turn-taking for conversation. Language 50(4). 696–735. https://doi.org/10.2307/412243.Suche in Google Scholar

Schegloff, Emanuel A. & Harvey Sacks. 1973. Opening up closings. Semiotica 8(4). 289–327. https://doi.org/10.1515/semi.1973.8.4.289.Suche in Google Scholar

Searle, John. 1969. Speech acts. Cambridge: Cambridge University Press.Suche in Google Scholar

Silvano, Purificação, Mariana Damova, Giedre V. Oleškeviciené, Chaya Liebeskind, Christian Chiarcos, Dimitar Trajanov, Ciprian-Octavian Truica, Elena-Simona Apostol & Anna Baczkowska. 2022. ISO-based annotated multilingual parallel corpus for discourse markers. In Proceedings of the 13th international conference on language resources and evaluation (LREC 2022), 2739–2749. European Language Resources Association.Suche in Google Scholar

Silvano, Purificação, María de los Ángeles Gómez González & João Cordeiro. 2025. Tag questions in English and Portuguese monologues: Types, features and functions. Linguistics Vanguard.Suche in Google Scholar

Stolcke, Andreas, Klaus Ries, Noah Coccaro, Elisabeth Shriberg, Rebecca Bates, Daniel Jurafsky, Paul Taylor, Rachel Martin, Carol Van Ess-Dykema & Marie Meteer. 2000. Dialogue act modeling for automatic tagging and recognition of conversational speech. Computational Linguistics 26(3). 339–373. https://doi.org/10.1162/089120100561737.Suche in Google Scholar

Webber, Bonnie, Rashmi Prasad, Alan Lee & Aravind Joshi. 2019. The Penn Discourse Treebank 3.0 annotation manual. Philadelphia: University of Pennsylvania.Suche in Google Scholar

Westera, Matthijs, Laia Mayol & Hannah Rohde. 2020. TED-Q: TED Talks and the questions they evoke. In Proceedings of LREC 2020 – 12th international conference on language resources and evaluation, 1118–1127. European Language Resources Association.Suche in Google Scholar

Zeyrek, Deniz, Amália Mendes, Yulia Grishina, Murathan Kurfalı, Samuel Gibbon & Maciej Ogrodniczuk. 2020. TED Multilingual Discourse Bank (TED-MDB): A parallel corpus annotated in the PDTB style. Language Resources and Evaluation 54(2). 587–613. https://doi.org/10.1007/s10579-019-09445-9.Suche in Google Scholar

Received: 2024-08-01
Accepted: 2025-03-21
Published Online: 2025-06-10

© 2025 the author(s), published by De Gruyter, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 International License.

Heruntergeladen am 8.12.2025 von https://www.degruyterbrill.com/document/doi/10.1515/lingvan-2025-0035/html?lang=de
Button zum nach oben scrollen