Chapter
Licensed
Unlicensed
Requires Authentication
Notes
-
Vincent B. Y. Ooi
You are currently not able to access this content.
You are currently not able to access this content.
Chapters in this book
- Frontmatter i
- Contents v
- Acknowledgements ix
- Abbreviations xi
-
1 What is Computer Corpus Lexicography (CCL)?
- 1.0 Introduction 1
- 1.1 Developing notions of the lexicon 3
- 1.2 Lexical content 11
- 1.3 Lexical form 16
- 1.4 Conclusion and the organisation of this book 19
- 1.5 Study questions 20
- 1.6 Further reading 21
- Notes 21
-
2 The lexicon in Computational Linguistics (CLl), Computational Lexicography (CL2), and Corpus Linguistics (CL3)
- 2.0 Introduction 23
- 2.1 What is Computational Linguistics ( CL1)? 23
- 2.2 What is Computational Lexicography (CL2)? 29
- 2.3 What is Corpus Linguistics (CL3)? 34
- 2.4 The interdependence and convergence ofCL1, CL2, and CL3 37
- 2.5 The TEl for the emergence of textual and lexical standards 39
- 2.6 The notion of the reusability oflexical resources 40
- 2.7 The core-reusable multifunctional lexicon 42
- 2.8 The polytheoreticallexicon 43
- 2. 9 Conclusion 44
- 2.10 Study questions 45
- 2.11 Further reading 45
- Notes 45
-
3 Corpus evidence and lexicon-based language modelling
- 3.0 Introduction 47
- 3.1 Types oflexicographic and lexical evidence 47
- 3.2 The corpus as a lexical resource: the issue of representativeness 52
- 3.3 Corpus size and the 'Bank of English' 55
- 3.4 The search for units of meaning 57
- 3.5 Conclusion 63
- 3.6 Study questions 63
- 3. 7 Further reading 65
- Notes 65
-
4 Methods oflexical acquisition: 'Learning' a lexicon for real texts from real text
- 4.0 Introduction 67
- 4.1 The relation between the lexicon and the corpus 67
- 4.2 Manual lexical acquisition 73
- 4.3 Automatic and semi-automatic lexical acquisition 75
- 4.4 The lexicographer/linguist's workbench for lexical acquisition 80
- 4.5 A framework for lexical analysis 86
- 4.6 Conclusion 87
- 4. 7 Study questions 88
- 4. 8 Further reading 89
- Notes 89
-
5 Computational storage of the lexicon
- 5.0 Introduction 91
- 5.1 The question of formalism 91
- 5.2 Lexical database vs lexical knowledge base? 94
- 5.3 The notion of inheritance 95
- 5.4 The DATR lexical knowledge representation language 97
- 5.5 The ACQUILEX lexical knowledge base 100
- 5.6 The PROLEX Prolog lexical database 101
- 5.7 The WORDNET lexical database of English 104
- 5.8 The CELEX lexical database 106
- 5.9 Using the World Wide Web for lexical research 118
- 5.10 Conclusion 119
- 5.11 Study questions 119
- 5.12 Further reading 121
- Notes 121
-
6 A case study: Applying the LFA framework to two corpora of business English
- 6.0 Introduction 123
- 6.1 The notions of sublanguage, genre, and register 123
- 6.2 The PROLEX corpus 125
- 6.3 The PROCOMPARE corpus 131
- 6.4 Corpus tagging 135
- 6.5 Corpus syntactic parsing 137
- 6.6 Corpus word-extraction 143
- 6.7 Corpus collocation 143
- 6.8 Corpus semantic tagging and parsing 145
- 6. 9 Corpus pragmatics 145
- 6.10 Structuring lexical entries from the PROLEX and PROCOMPARE corpora 149
- 6.11 A discussion of the PRO LEX and PROCOMPARE specimen lexical entries 161
- 6.12 Conclusion 171
- 6.13 Study questions 172
- 6.14 Further reading 172
- Notes 172
- 7 Conclusion: Towards a more adequate concept of the lexicon 173
- Appendix A: Specimen lexical entries 177
- Appendix B: Some World Wide Web sites for CL 1, CL2, and CL3 211
- Appendix C: Suggested solutions to exercises 215
- Bibliography 219
- Index 241
Chapters in this book
- Frontmatter i
- Contents v
- Acknowledgements ix
- Abbreviations xi
-
1 What is Computer Corpus Lexicography (CCL)?
- 1.0 Introduction 1
- 1.1 Developing notions of the lexicon 3
- 1.2 Lexical content 11
- 1.3 Lexical form 16
- 1.4 Conclusion and the organisation of this book 19
- 1.5 Study questions 20
- 1.6 Further reading 21
- Notes 21
-
2 The lexicon in Computational Linguistics (CLl), Computational Lexicography (CL2), and Corpus Linguistics (CL3)
- 2.0 Introduction 23
- 2.1 What is Computational Linguistics ( CL1)? 23
- 2.2 What is Computational Lexicography (CL2)? 29
- 2.3 What is Corpus Linguistics (CL3)? 34
- 2.4 The interdependence and convergence ofCL1, CL2, and CL3 37
- 2.5 The TEl for the emergence of textual and lexical standards 39
- 2.6 The notion of the reusability oflexical resources 40
- 2.7 The core-reusable multifunctional lexicon 42
- 2.8 The polytheoreticallexicon 43
- 2. 9 Conclusion 44
- 2.10 Study questions 45
- 2.11 Further reading 45
- Notes 45
-
3 Corpus evidence and lexicon-based language modelling
- 3.0 Introduction 47
- 3.1 Types oflexicographic and lexical evidence 47
- 3.2 The corpus as a lexical resource: the issue of representativeness 52
- 3.3 Corpus size and the 'Bank of English' 55
- 3.4 The search for units of meaning 57
- 3.5 Conclusion 63
- 3.6 Study questions 63
- 3. 7 Further reading 65
- Notes 65
-
4 Methods oflexical acquisition: 'Learning' a lexicon for real texts from real text
- 4.0 Introduction 67
- 4.1 The relation between the lexicon and the corpus 67
- 4.2 Manual lexical acquisition 73
- 4.3 Automatic and semi-automatic lexical acquisition 75
- 4.4 The lexicographer/linguist's workbench for lexical acquisition 80
- 4.5 A framework for lexical analysis 86
- 4.6 Conclusion 87
- 4. 7 Study questions 88
- 4. 8 Further reading 89
- Notes 89
-
5 Computational storage of the lexicon
- 5.0 Introduction 91
- 5.1 The question of formalism 91
- 5.2 Lexical database vs lexical knowledge base? 94
- 5.3 The notion of inheritance 95
- 5.4 The DATR lexical knowledge representation language 97
- 5.5 The ACQUILEX lexical knowledge base 100
- 5.6 The PROLEX Prolog lexical database 101
- 5.7 The WORDNET lexical database of English 104
- 5.8 The CELEX lexical database 106
- 5.9 Using the World Wide Web for lexical research 118
- 5.10 Conclusion 119
- 5.11 Study questions 119
- 5.12 Further reading 121
- Notes 121
-
6 A case study: Applying the LFA framework to two corpora of business English
- 6.0 Introduction 123
- 6.1 The notions of sublanguage, genre, and register 123
- 6.2 The PROLEX corpus 125
- 6.3 The PROCOMPARE corpus 131
- 6.4 Corpus tagging 135
- 6.5 Corpus syntactic parsing 137
- 6.6 Corpus word-extraction 143
- 6.7 Corpus collocation 143
- 6.8 Corpus semantic tagging and parsing 145
- 6. 9 Corpus pragmatics 145
- 6.10 Structuring lexical entries from the PROLEX and PROCOMPARE corpora 149
- 6.11 A discussion of the PRO LEX and PROCOMPARE specimen lexical entries 161
- 6.12 Conclusion 171
- 6.13 Study questions 172
- 6.14 Further reading 172
- Notes 172
- 7 Conclusion: Towards a more adequate concept of the lexicon 173
- Appendix A: Specimen lexical entries 177
- Appendix B: Some World Wide Web sites for CL 1, CL2, and CL3 211
- Appendix C: Suggested solutions to exercises 215
- Bibliography 219
- Index 241