Home Linguistics & Semiotics Corpus of Polish spoken in Germany
Chapter
Licensed
Unlicensed Requires Authentication

Corpus of Polish spoken in Germany

Collecting and analysing written & spoken data for investigating contact-induced change
  • Agnieszka Czachór
View more publications by John Benjamins Publishing Company

Abstract

The present paper reflects on methodological aspects of the data gathering, analysis, and reuse and will present the practical experience from designing a test battery and selecting and approaching the participants, to conducting the experiments. Our project aims to provide a descriptive survey of contact-­induced change in two groups of bilingual (L1 Polish) speakers currently living in Germany. The corpus contains written and spoken, elicited and free data that complement each other with regard to investigating morphosyntactic phenomena. This paper gives a description of each part of our tests: selecting the topics, conducting the experiment, and choosing technical equipment needed for recording of the speech data; stimuli presentation and description of the software used for grammaticality judgments; construction of a gapped text, and finally the sociolinguistic questionnaire and self-evaluation tasks.

Abstract

The present paper reflects on methodological aspects of the data gathering, analysis, and reuse and will present the practical experience from designing a test battery and selecting and approaching the participants, to conducting the experiments. Our project aims to provide a descriptive survey of contact-­induced change in two groups of bilingual (L1 Polish) speakers currently living in Germany. The corpus contains written and spoken, elicited and free data that complement each other with regard to investigating morphosyntactic phenomena. This paper gives a description of each part of our tests: selecting the topics, conducting the experiment, and choosing technical equipment needed for recording of the speech data; stimuli presentation and description of the software used for grammaticality judgments; construction of a gapped text, and finally the sociolinguistic questionnaire and self-evaluation tasks.

Chapters in this book

  1. Prelim pages i
  2. Table of contents v
  3. Introduction xi
  4. Section 1. Learner and attrition corpora
  5. The LeaP corpus 3
  6. Technological and methodological challenges in creating, annotating and sharing a learner corpus of spoken German 25
  7. Creation and analysis of a reading comprehension exercise corpus 47
  8. The ALeSKo learner corpus 71
  9. Corpora of spoken Spanish by simultaneous and successive German-Spanish bilingual and Spanish monolingual children 97
  10. Monolingual and bilingual phonoprosodic corpora of child German and child Spanish 107
  11. Pragmatic corpus analysis, exemplified by Turkish-German bilingual and monolingual data 123
  12. Corpus of Polish spoken in Germany 153
  13. The HABLA-corpus (German-French and German-Italian) 163
  14. Section 2. Language contact corpora
  15. The Hamburg Corpus of Argentinean Spanish (HaCASpa) 183
  16. Ad hoc contact phenomena or established features of a contact variety? 199
  17. Phonoprosodic corpus of spoken Catalan (PhonCAT) 215
  18. Researching the intelligibility of a (German) dialect 231
  19. Annotating ambiguity 245
  20. Section 3. Interpreting corpora
  21. Sharing community interpreting corpora 275
  22. CoSi – A Corpus of Consecutive and Simultaneous Interpreting 295
  23. The corpus “Interpreting in Hospitals” 305
  24. Section 4. Comparable and parallel corpora
  25. The GeWiss corpus 319
  26. Korpus C4 339
  27. Treebanks in translation studies 347
  28. Section 5. Corpus tools
  29. Multilingual phonological corpus analysis 365
  30. Finding the balance between strict defaults and total openness 383
  31. General index 401
  32. Corpora index 405
  33. Language index 407
Downloaded on 29.9.2025 from https://www.degruyterbrill.com/document/doi/10.1075/hsm.14.10cza/html
Scroll to top button