Abstract
This paper analyzes gender differences in student performance in Multiple-Choice Tests (MCT). We report evidence from a field experiment suggesting that, when MCT use a correction for guessing formula to obtain test scores, on average women tend to omit more items, get less correct answers and lower grades than men. We find that the gender difference in average test scores is concentrated at the upper tail of the distribution of scores. In addition, gender differences strongly depend on the framing of the scoring rule.
Funding source: Ministerio de Economía / Fondo Europeo Desarrollo Regional
Award Identifier / Grant number: ECO2015-64467-R
Funding source: Instituto de la Mujer, Gobierno de España / Fondo Social Europeo
Award Identifier / Grant number: 2011-0004-INV-00081
Funding source: Eusko Jaurlaritza / Basque Government
Award Identifier / Grant number: IT783-13
Award Identifier / Grant number: IT1336-19
Funding: Maria Paz Espinosa and Javier Gardeazabal received funding from MINECO/FEDER (ECO2015-64467-R), Instituto de la Mujer (2011-0004-INV-00081) and Eusko Jaurlaritza (IT783-13 and IT1336-19) and it is gratefully acknowledged.
References
Baldiga, K. 2014. “Gender Differences in Willingness to Guess.” Management Science 60 (2): 434–48, https://doi.org/10.1287/mnsc.2013.1776.Search in Google Scholar
Bar-Hillel, M., Budescu, D., and Attali, Y. 2005. “Scoring and Keying Multiple Choice Tests: A Case Study in Irrationality.” Mind & Society 4 (1): 3–12, https://doi.org/10.1007/s11299-005-0001-z.Search in Google Scholar
Barber, B.M., Odean, T. 2001. Boys will be Boys: Gender, Overconfidence, and Common Stock Investment. The Quarterly Journal of Economics 116 (1), 261–92. https://doi.org/10.1162/003355301556400.Search in Google Scholar
Ben-Shakhar, G., and Sinai, Y. 1991. “Gender Differences in Multiple-choice Tests: The Role of Differential Guessing Tendencies.” Journal of Educational Measurement 28 (1): 23–35, https://doi.org/10.1111/j.1745-3984.1991.tb00341.x.Search in Google Scholar
Bereby-Meyer, Y., Meyer, J., and Budescu, D. V. 2003. “Decision Making Under Internal Uncertainty: The Case of Multiple-choice Tests with Different Scoring Rules.” Acta Psychologica 112 (2): 207–20, https://doi.org/10.1016/s0001-6918(02)00085-9.Search in Google Scholar
Betts, L. R., Elder, T. J., Hartley, J., and Trueman, M. 2009. “Does Correction for Guessing Reduce Students' Performance on Multiple-choice Examinations? Yes? No? Sometimes?.” Assessment & Evaluation in Higher Education 34 (1): 1–15, https://doi.org/10.1080/02602930701773091.Search in Google Scholar
Bliss, L. B. 1980. “A Test of Lord's Assumption Regarding Examinee Guessing Behavior on Multiple-choice Tests Using Elementary School Students.” Journal of Educational Measurement 17 (2): 147–52, https://doi.org/10.1111/j.1745-3984.1980.tb00823.x.Search in Google Scholar
Bolger, N., and Kellaghan, T. 1990. “Method of Measurement and Gender Differences in Scholastic Achievement.” Journal of Educational Measurement 27 (2): 165–74, https://doi.org/10.1111/j.1745-3984.1990.tb00740.x.Search in Google Scholar
Borghans, L., Heckman, J. J., Golsteyn, B. H. H., and Meijers, H. 2009. “Gender Differences in Risk Aversion and Ambiguity Aversion.” Journal of the European Economic Association 7 (2–3): 649–58, https://doi.org/10.1162/jeea.2009.7.2-3.649.Search in Google Scholar
Budescu, D., and Bar-Hillel, M. 1993. “To Guess or not to Guess: A Decision-theoretic View of Formula Scoring.” Journal of Educational Measurement 30 (4): 277–91, https://doi.org/10.1111/j.1745-3984.1993.tb00427.x.Search in Google Scholar
Budescu, D. V., and Bo, Y. 2015. “Analyzing Test-taking Behavior: Decision Theory Meets Psychometric Theory.” Psychometrika 80 (4): 1105–22, https://doi.org/10.1007/s11336-014-9425-x.Search in Google Scholar
Collet, L. S. 1971. “Elimination Scoring: An Empirical Evaluation.” Journal of Educational Measurement 8 (3): 209–14, https://doi.org/10.1111/j.1745-3984.1971.tb00927.x.Search in Google Scholar
Croson, R., and Gneezy, U. 2009. “Gender Differences in Preferences.” Journal of Economic Literature 47 (2): 448–74, https://doi.org/10.1257/jel.47.2.448.Search in Google Scholar
Direr, A. 2020. Efficient Scoring of Multiple-choice Tests. Available at SSRN: https://ssrn.com/abstract=3546770 or https://doi.org/10.2139/ssrn.3546770.Search in Google Scholar
Espinosa, M. P., and Gardeazabal, J. 2010. “Optimal Correction for Guessing in Multiple-choice Tests.” Journal of Mathematical Psychology 54 (5): 415–25, https://doi.org/10.1016/j.jmp.2010.06.001.Search in Google Scholar
Espinosa, M. P., and Gardeazabal, J. 2013. “Do Students Behave Rationally in multiple choice tests? Evidence from a Field Experiment.” Journal of Economics and Management 9 (2): 107–35, https://doi.org/10.2139/ssrn.878548.Search in Google Scholar
Huang, J., and Kisgen, D. J. 2013. “Gender and Corporate Finance: Are Male Executives Overconfident Relative to Female Executives?” Journal of Financial Economics 108 (3): 822–39, https://doi.org/10.1016/j.jfineco.2012.12.005.Search in Google Scholar
Iriberri, N., Rey-Biel, P. 2019a. Brave Boys and Play-it-safe Girls: Gender Differences in Willingness to Guess in a Large Scale Natural Field Experiment. CEPR Discussion Paper No. DP13541.10.1016/j.euroecorev.2020.103603Search in Google Scholar
Iriberri, N., and Rey-Biel, P. 2019b. “Competitive Pressure Widens the Gender Gap in Performance: Evidence from a Two-stage Competition in Mathematics.” The Economic Journal 129 (620): 1863–93, https://doi.org/10.1111/ecoj.12617.Search in Google Scholar
Lundeberg, M. A., Fox, P. W., and Punćcohaŕ, J. 1994. “Highly Confident but Wrong: Gender Differences and Similarities in Confidence Judgments.” Journal of Educational Psychology 86 (1): 114–21, https://doi.org/10.1037/0022-0663.86.1.114.Search in Google Scholar
Mazzeo, J., Schmitt, A. P., and Bleistein, C. A. 1993. Sex-related Performance Differences on Constructed Response and Multiple Choice Sections of Advanced Placement Examinations. College Board Report 92-7. New York: College Entrance Examination Board.10.1002/j.2333-8504.1993.tb01516.xSearch in Google Scholar
Pekkarinen, T. 2015. “Gender Differences in Behaviour Under Competitive Pressure: Evidence on Omission Patterns in University Entrance Examinations.” Journal of Economic Behavior & Organization 115: 94–110, https://doi.org/10.1016/j.jebo.2014.08.007.Search in Google Scholar
Pratt, J.W., Gibbons, J.D., 1981. Concepts of Nonparametric Theory. Springer-Verlag, New York Heidelberg Berlin.10.1007/978-1-4612-5931-2Search in Google Scholar
Prieto, G., and Delgado, A. R. 1999. “The Role of Instructions in the Variability of Sex-Related Differences in Multiple-Choice Tests.” Personality and Individual Differences 27 (6): 1067–77, https://doi.org/10.1016/s0191-8869(99)00049-5.Search in Google Scholar
Reardon, S. F., Kalogrides, D., Fahle, E. M., Podolsky, A., and Zárate, R. C. 2018. “The Relationship Between Test Item Format and Gender Achievement Gaps on Math and Ela Tests in Fourth and Eighth Grades.” Educational Researcher 47 (5): 284–94, https://doi.org/10.3102/0013189x18762105.Search in Google Scholar
Ring, P., Neyse, L., David-Barett, T., and Schmidt, U. 2016. “Gender Differences in Performance Predictions: Evidence from the Cognitive Reflection Test.” Frontiers in Psychology 7: 1680, https://doi.org/10.3389/fpsyg.2016.01680.Search in Google Scholar
Romm, A. T., Schoer, V., and Kika, J. C. 2019. “A Test Taker's Gamble: The Effect of Average Grade to Date on Guessing Behaviour in a Multiple Choice Test with a Negative Marking Rule.” South African Journal of Economic and Management Sciences 22: 1–12, https://doi.org/10.4102/sajems.v22i1.2542.Search in Google Scholar
Scotchmer, S., 2008. Risk Taking and Gender in Hierarchies. Theoretical Economics 3 (4), 499–524. https://econtheory.org/ojs/index.php/te/article/viewFile/20080499/2132/115.10.3386/w14464Search in Google Scholar
Vanderoost, J., Janssen, R., Eggermont, J., Callens, R., and De Laet, T. 2018. “Elimination Testing with Adapted Scoring Reduces Guessing and Anxiety in Multiple-choice Assessments, but Does not Increase Grade Average in Comparison with Negative Marking.” PLoS One 13 (10): e0203931, https://doi.org/10.1371/journal.pone.0203931.Search in Google Scholar
Wu, Q., De Laet, T., and Janssen, R. 2018. “Elimination Scoring Versus Correction for Guessing: A Simulation Study.” In Quantitative Psychology. IMPS 2017., volume 233 of Springer Proceedings in Mathematics & Statistics, edited by M.Wiberg, S.Culpepper, R.Janssen, and D.MolenaarCham: Springer.10.1007/978-3-319-77249-3_16Search in Google Scholar
Zapechelnyuk, A. 2015. “An Axiomatization of Multiple-choice Test Scoring.” Economics Letters 132: 24–27, https://doi.org/10.1016/j.econlet.2015.03.042.Search in Google Scholar
© 2020 Walter de Gruyter GmbH, Berlin/Boston
Articles in the same Issue
- Research Articles
- The Gender-bias Effect of Test Scoring and Framing: A Concern for Personnel Selection and College Admission
- Gender and Educational Variations in the Earnings Premiums of Occupational Credentials
- Microfinance and Moneylenders: Long-run Effects of MFIs on Informal Credit Market in Bangladesh
- Movie Variety and the City
- Hi-tech Sexism? Evidence from Bangladesh
- Microfinance and Vulnerability to Seasonal Famine in a Rural Economy: Evidence from Monga in Bangladesh
- Immigration Relief and Insurance Coverage: Evidence from Deferred Action for Childhood Arrivals
- Complementarities between Labour Market Institutions and their Causal Impact on Youth Labour Market Outcomes
- Corporate Social Responsibility and Wage Inequality
- Letter
- The Optimal Earned Income Tax Credit (EITC) Schedule: A Trapezoid or a Triangle?
Articles in the same Issue
- Research Articles
- The Gender-bias Effect of Test Scoring and Framing: A Concern for Personnel Selection and College Admission
- Gender and Educational Variations in the Earnings Premiums of Occupational Credentials
- Microfinance and Moneylenders: Long-run Effects of MFIs on Informal Credit Market in Bangladesh
- Movie Variety and the City
- Hi-tech Sexism? Evidence from Bangladesh
- Microfinance and Vulnerability to Seasonal Famine in a Rural Economy: Evidence from Monga in Bangladesh
- Immigration Relief and Insurance Coverage: Evidence from Deferred Action for Childhood Arrivals
- Complementarities between Labour Market Institutions and their Causal Impact on Youth Labour Market Outcomes
- Corporate Social Responsibility and Wage Inequality
- Letter
- The Optimal Earned Income Tax Credit (EITC) Schedule: A Trapezoid or a Triangle?