This publication is presented to you through Paradigm Publishing Services
Edinburgh University Press
Chapter
Licensed
Unlicensed
Requires Authentication
2.8 The polytheoreticallexicon
You are currently not able to access this content.
You are currently not able to access this content.
Chapters in this book
- Frontmatter i
- Contents v
- Acknowledgements ix
- Abbreviations xi
-
1 What is Computer Corpus Lexicography (CCL)?
- 1.0 Introduction 1
- 1.1 Developing notions of the lexicon 3
- 1.2 Lexical content 11
- 1.3 Lexical form 16
- 1.4 Conclusion and the organisation of this book 19
- 1.5 Study questions 20
- 1.6 Further reading 21
- Notes 21
-
2 The lexicon in Computational Linguistics (CLl), Computational Lexicography (CL2), and Corpus Linguistics (CL3)
- 2.0 Introduction 23
- 2.1 What is Computational Linguistics ( CL1)? 23
- 2.2 What is Computational Lexicography (CL2)? 29
- 2.3 What is Corpus Linguistics (CL3)? 34
- 2.4 The interdependence and convergence ofCL1, CL2, and CL3 37
- 2.5 The TEl for the emergence of textual and lexical standards 39
- 2.6 The notion of the reusability oflexical resources 40
- 2.7 The core-reusable multifunctional lexicon 42
- 2.8 The polytheoreticallexicon 43
- 2. 9 Conclusion 44
- 2.10 Study questions 45
- 2.11 Further reading 45
- Notes 45
-
3 Corpus evidence and lexicon-based language modelling
- 3.0 Introduction 47
- 3.1 Types oflexicographic and lexical evidence 47
- 3.2 The corpus as a lexical resource: the issue of representativeness 52
- 3.3 Corpus size and the 'Bank of English' 55
- 3.4 The search for units of meaning 57
- 3.5 Conclusion 63
- 3.6 Study questions 63
- 3. 7 Further reading 65
- Notes 65
-
4 Methods oflexical acquisition: 'Learning' a lexicon for real texts from real text
- 4.0 Introduction 67
- 4.1 The relation between the lexicon and the corpus 67
- 4.2 Manual lexical acquisition 73
- 4.3 Automatic and semi-automatic lexical acquisition 75
- 4.4 The lexicographer/linguist's workbench for lexical acquisition 80
- 4.5 A framework for lexical analysis 86
- 4.6 Conclusion 87
- 4. 7 Study questions 88
- 4. 8 Further reading 89
- Notes 89
-
5 Computational storage of the lexicon
- 5.0 Introduction 91
- 5.1 The question of formalism 91
- 5.2 Lexical database vs lexical knowledge base? 94
- 5.3 The notion of inheritance 95
- 5.4 The DATR lexical knowledge representation language 97
- 5.5 The ACQUILEX lexical knowledge base 100
- 5.6 The PROLEX Prolog lexical database 101
- 5.7 The WORDNET lexical database of English 104
- 5.8 The CELEX lexical database 106
- 5.9 Using the World Wide Web for lexical research 118
- 5.10 Conclusion 119
- 5.11 Study questions 119
- 5.12 Further reading 121
- Notes 121
-
6 A case study: Applying the LFA framework to two corpora of business English
- 6.0 Introduction 123
- 6.1 The notions of sublanguage, genre, and register 123
- 6.2 The PROLEX corpus 125
- 6.3 The PROCOMPARE corpus 131
- 6.4 Corpus tagging 135
- 6.5 Corpus syntactic parsing 137
- 6.6 Corpus word-extraction 143
- 6.7 Corpus collocation 143
- 6.8 Corpus semantic tagging and parsing 145
- 6. 9 Corpus pragmatics 145
- 6.10 Structuring lexical entries from the PROLEX and PROCOMPARE corpora 149
- 6.11 A discussion of the PRO LEX and PROCOMPARE specimen lexical entries 161
- 6.12 Conclusion 171
- 6.13 Study questions 172
- 6.14 Further reading 172
- Notes 172
- 7 Conclusion: Towards a more adequate concept of the lexicon 173
- Appendix A: Specimen lexical entries 177
- Appendix B: Some World Wide Web sites for CL 1, CL2, and CL3 211
- Appendix C: Suggested solutions to exercises 215
- Bibliography 219
- Index 241
Chapters in this book
- Frontmatter i
- Contents v
- Acknowledgements ix
- Abbreviations xi
-
1 What is Computer Corpus Lexicography (CCL)?
- 1.0 Introduction 1
- 1.1 Developing notions of the lexicon 3
- 1.2 Lexical content 11
- 1.3 Lexical form 16
- 1.4 Conclusion and the organisation of this book 19
- 1.5 Study questions 20
- 1.6 Further reading 21
- Notes 21
-
2 The lexicon in Computational Linguistics (CLl), Computational Lexicography (CL2), and Corpus Linguistics (CL3)
- 2.0 Introduction 23
- 2.1 What is Computational Linguistics ( CL1)? 23
- 2.2 What is Computational Lexicography (CL2)? 29
- 2.3 What is Corpus Linguistics (CL3)? 34
- 2.4 The interdependence and convergence ofCL1, CL2, and CL3 37
- 2.5 The TEl for the emergence of textual and lexical standards 39
- 2.6 The notion of the reusability oflexical resources 40
- 2.7 The core-reusable multifunctional lexicon 42
- 2.8 The polytheoreticallexicon 43
- 2. 9 Conclusion 44
- 2.10 Study questions 45
- 2.11 Further reading 45
- Notes 45
-
3 Corpus evidence and lexicon-based language modelling
- 3.0 Introduction 47
- 3.1 Types oflexicographic and lexical evidence 47
- 3.2 The corpus as a lexical resource: the issue of representativeness 52
- 3.3 Corpus size and the 'Bank of English' 55
- 3.4 The search for units of meaning 57
- 3.5 Conclusion 63
- 3.6 Study questions 63
- 3. 7 Further reading 65
- Notes 65
-
4 Methods oflexical acquisition: 'Learning' a lexicon for real texts from real text
- 4.0 Introduction 67
- 4.1 The relation between the lexicon and the corpus 67
- 4.2 Manual lexical acquisition 73
- 4.3 Automatic and semi-automatic lexical acquisition 75
- 4.4 The lexicographer/linguist's workbench for lexical acquisition 80
- 4.5 A framework for lexical analysis 86
- 4.6 Conclusion 87
- 4. 7 Study questions 88
- 4. 8 Further reading 89
- Notes 89
-
5 Computational storage of the lexicon
- 5.0 Introduction 91
- 5.1 The question of formalism 91
- 5.2 Lexical database vs lexical knowledge base? 94
- 5.3 The notion of inheritance 95
- 5.4 The DATR lexical knowledge representation language 97
- 5.5 The ACQUILEX lexical knowledge base 100
- 5.6 The PROLEX Prolog lexical database 101
- 5.7 The WORDNET lexical database of English 104
- 5.8 The CELEX lexical database 106
- 5.9 Using the World Wide Web for lexical research 118
- 5.10 Conclusion 119
- 5.11 Study questions 119
- 5.12 Further reading 121
- Notes 121
-
6 A case study: Applying the LFA framework to two corpora of business English
- 6.0 Introduction 123
- 6.1 The notions of sublanguage, genre, and register 123
- 6.2 The PROLEX corpus 125
- 6.3 The PROCOMPARE corpus 131
- 6.4 Corpus tagging 135
- 6.5 Corpus syntactic parsing 137
- 6.6 Corpus word-extraction 143
- 6.7 Corpus collocation 143
- 6.8 Corpus semantic tagging and parsing 145
- 6. 9 Corpus pragmatics 145
- 6.10 Structuring lexical entries from the PROLEX and PROCOMPARE corpora 149
- 6.11 A discussion of the PRO LEX and PROCOMPARE specimen lexical entries 161
- 6.12 Conclusion 171
- 6.13 Study questions 172
- 6.14 Further reading 172
- Notes 172
- 7 Conclusion: Towards a more adequate concept of the lexicon 173
- Appendix A: Specimen lexical entries 177
- Appendix B: Some World Wide Web sites for CL 1, CL2, and CL3 211
- Appendix C: Suggested solutions to exercises 215
- Bibliography 219
- Index 241