Testing search engine frequencies: Patterns of inconsistency
-
Jinseung Eu
Abstract
This article addresses the problem of the inconsistency of the frequencies reported by the commercial search engines Google and Yahoo, assessing their reliability for academic use and seeking ways to use them more safely for language research and learning. The main part of the article is the presentation of three tests on the self-consistency and relational consistency of search engine frequencies and their results showing some important patterns. It is found that despite occasional frequency shifts of large magnitude, the engines (especially Yahoo) generally maintain a reasonable degree of self-consistency. However, some of the Google frequencies were apparently incorrect as indicated by their relational inconsistency. The tests are preceded by an overview of search engine uses and problems and a discussion of ‘query cache’, which is an essential component of the frequency mechanism providing a possible explanation of search engine behaviors. In conclusion, a search strategy and ways of carrying this work further to develop a model of search engine use are proposed.
© 2008 by Walter de Gruyter GmbH & Co. KG, D-10785 Berlin
Articles in the same Issue
- Semantic preference and semantic prosody re-examined
- Cross-linguistic comparisons of the market metaphors
- Testing search engine frequencies: Patterns of inconsistency
- Discourse and metaphor: A corpus-driven inquiry
- Agile corpus creation
- On the computation of collostruction strength: Testing measures of association as expressions of lexical bias
- NTU corpus of Formosan languages: A state-of-the-art report
- Contents Volume 4 (2008)
Articles in the same Issue
- Semantic preference and semantic prosody re-examined
- Cross-linguistic comparisons of the market metaphors
- Testing search engine frequencies: Patterns of inconsistency
- Discourse and metaphor: A corpus-driven inquiry
- Agile corpus creation
- On the computation of collostruction strength: Testing measures of association as expressions of lexical bias
- NTU corpus of Formosan languages: A state-of-the-art report
- Contents Volume 4 (2008)