John Benjamins Publishing Company
Seek&Hide
Abstract
This article presents the system Seek&Hide, a text message processing tool developed for the sud4science LR (http://www.sud4science.org/) project. It performs the anonymisation/de-identification of a corpus. At present, it has been used to anonymise the sud4science LR corpus of French text messages collected during the project. This is done in two phases. In the first phase, it automatically processes over 70% of the corpus. The rest of the corpus is processed in the second phase, aided by an expert annotator via a web interface specifically designed to simplify the task.
Chapters in this book
- Prelim pages
- Table of contents
- Acknowledgements
- Foreword
- Introduction
Articles
- Seek&Hide
- SMS experience and textisms in young adolescents
- Automatic or Controlled Writing?
- Development of SMS language from 2000 to 2010
- Texto4Science
- SMS communication as plurilingual communication
- French text messages
- A sociolinguistic analysis of transnational SMS practices
- Negation marking in French text messages
- “i didn’t spel that wrong did i. Oops”
- Lol, mdr and ptdr
- Index