2018

Troubling Trends in Machine Learning Scholarship

Zachary C. Lipton, Jacob Steinhardt

citations

Cite Score

20

AI summary

This paper identifies and discusses four troubling trends in machine learning scholarship: failure to distinguish between explanation and speculation, failure to identify sources of empirical gains, mathiness, and misuse of language, providing examples and speculative suggestions for combating these trends.

Main Contributions

  • Identifies the failure to distinguish between explanation and speculation in ML papers.
  • Highlights the failure to identify sources of empirical gains, leading to misleading conclusions.
  • Describes 'mathiness': the use of mathematics that obfuscates or impresses rather than clarifies.
  • Points out the misuse of language, including suggestive definitions and overloaded terminology.
  • Offers suggestions for authors, publishers, and reviewers to improve the quality of ML scholarship.

Abstract

Collectively, machine learning (ML) researchers are engaged in the creation and dissemination of knowledge about data-driven algorithms. In a given paper, researchers might aspire to any subset of the following goals, among others: to theoretically characterize what is learnable, to obtain understanding through empirically rigorous experiments, or to build a working system that has high predictive accuracy. While determining which knowledge warrants inquiry may be subjective, once the topic is fixed, papers are most valuable to the community when they act in service of the reader, creating foundational knowledge and communicating as clearly as possible.

What sort of papers best serve their readers? We can enumerate desirable characteristics: these papers should (i) provide intuition to aid the reader's understanding, but clearly distinguish it from stronger conclusions supported by evidence; (ii) describe empirical investigations that consider and rule out alternative hypotheses [62]; (iii) make clear the relationship between theoretical analysis and intuitive or empirical claims [64]; and (iv) use language to empower the reader, choosing terminology to avoid misleading or unproven connotations, collisions with other definitions, or conflation with other related but distinct concepts [56].

Recent progress in machine learning comes despite frequent departures from these ideals. In this paper, we focus on the following four patterns that appear to us to be trending in ML scholarship:

  1. Failure to distinguish between explanation and speculation.
  2. Failure to identify the sources of empirical gains, e.g. emphasizing unnecessary modifications to neural architectures when gains actually stem from hyper-parameter tuning.
  3. Mathiness: the use of mathematics that obfuscates or impresses rather than clarifies, e.g. by confusing technical and non-technical concepts.
  4. Misuse of language, e.g. by choosing terms of art with colloquial connotations or by overloading established technical terms.

While the causes behind these patterns are uncertain, possibilities include the rapid expansion of the community, the consequent thinness of the reviewer pool, and the often-misaligned incentives between scholarship and short-term measures of success (e.g. bibliometrics, attention, and entrepreneurial opportunity). While each pattern offers a corresponding remedy (don't do it), we also discuss some speculative suggestions for how the community might combat these trends. As the impact of machine learning widens, and the audience for research papers increasingly includes students, journalists, and policy-makers, these considerations apply to this wider audience.

References [83]

D. P. Kingma, Jimmy Lei Ba - 2014

49 papers in library cite

Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012

71 papers in library cite

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

S. Ioffe, Christian Szegedy - 2015

18 papers in library cite

J. Long, E. Shelhamer, Trevor Darrell - 2015

7 papers in library cite

N. Srivastava, Geoffrey E. Hinton, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2014

20 papers in library cite

V. Mnih - 2015

9 papers in library cite

Yoshua Bengio - 2010

20 papers in library cite

K. He, X. Zhang, S. Ren, Jian Sun - 2015

10 papers in library cite

Matthew D. Zeiler, Rob Fergus - 2014

15 papers in library cite

Rob Fergus - 2014

7 papers in library cite

James Bergstra, Yoshua Bengio - 2012

7 papers in library cite

John Duchi, Elad Hazan, Yoram Singer - 2011

19 papers in library cite

Y. Taigman, Michael Yang, Marc'aurelio Ranzato, Lior Wolf - 2014

5 papers in library cite

Zachary C. Lipton - 2016

1 paper in library cites

Chiyuan Zhang, Samy Bengio, Moritz Hardt, Benjamin Recht, Oriol Vinyals - 2016

2 papers in library cite

K. M. Hermann, T. Kocisky, Edward Grefenstette, L. Espeholt, W. Kay, M. Suleyman, Phil Blunsom - 2015

31 papers in library cite

Dumitru Erhan, Yoshua Bengio, Aaron Courville, Pierre Antoine Manzagol, Pascal Vincent, Samy Bengio - 2010

12 papers in library cite

R. Kiros, Yuxuan Zhu, Ruslan Salakhutdinov, Richard S. Zemel, R. Urtasun, Antonio Torralba, Sanja Fidler - 2015

23 papers in library cite

K. Jarrett, Koray Kavukcuoglu, Marc'aurelio Ranzato, Yann Lecun - 2009

20 papers in library cite

Matthew D. Zeiler, Dilip Krishnan, Graham W. Taylor, Rob Fergus - 2010

3 papers in library cite

Deli Chen, J. Bolton, Christopher D. Manning - 2016

9 papers in library cite

Alex Wiltschko - 2018

1 paper in library cites

K. Chatfield, K. Simonyan, A. Vedaldi, Andrew Zisserman - 2014

5 papers in library cite

Yoshua Bengio - 2012

3 papers in library cite

S. J. Reddi, S. Kale, S. Kumar - 2018

2 papers in library cite

P. Henderson, R. Islam, P. Bachman, J. Pineau, D. Precup, D. Meger - 2017

2 papers in library cite

S. Santurkar, D. Tsipras, A. Ilyas, A. Madry - 2018

1 paper in library cites

M. Lucic, K. Kurach, M. Michalski, Sylvain Gelly, O. Bousquet - 2017

2 papers in library cite

Leon Bottou, J. Peters, J. Q. Candela, D. X. Charles, D. M. Chickering, E. Portugaly, D. Ray, Patrice Simard, E. Snelson - 2013

2 papers in library cite

G. Melis, C. Dyer, Phil Blunsom - 2018

6 papers in library cite

Yann N. Dauphin, Razvan Pascanu, C. G. Gulcehre, Kyunghyun Cho, Surya Ganguli, Yoshua Bengio - 2014

4 papers in library cite

A. Choromanska, M. Henaff, M. Mathieu, G. B. Arous, Yann Lecun - 2015

4 papers in library cite

Y. Z. Zhang, Victor Zhong, Deli Chen, G. Angeli, Christopher D. Manning - 2017

3 papers in library cite

Rowan Zellers, M. Yatskar, S. Thomson, Yejin Choi - 2018

2 papers in library cite

N. Bostrom - 2017

2 papers in library cite

Leon Bottou, O. Bousquet - 2008

2 papers in library cite

Y. Freund, R. E. Schapire - 1997

1 paper in library cites

Zoubin Ghahramani - missing year

1 paper in library cites

S. Zagoruyko, Adam Lerer, T. Y. Lin, P. O. Pinheiro, S. Gross, S. Chintala, Piotr Dollar - 2016

1 paper in library cites

Jürgen Schmidhuber - 1991

1 paper in library cites

D. Danks, A. J. London - 2017

1 paper in library cites

R. Cotterell, S. J. Mielke, J. Eisner, B. Roark - 2018

1 paper in library cites

D. Mcdermott - 1976

1 paper in library cites

Jacob Steinhardt, P. W. Koh, P. S. Liang - 2017

1 paper in library cites

Missing author list - missing year

1 paper in library cites

Zachary C. Lipton, Jianfeng Gao, Lei Li, Jixuan Chen, L. Deng - 2016

1 paper in library cites

A. Gretton, A. J. Smola, J. Huang, M. Schmittfull, K. M. Borgwardt, B. Scholkopf - 2009

1 paper in library cites

C. Hazirbas, L. L. Taixe, D. Cremers - 2017

1 paper in library cites

A. Esteva, B. Kuprel, R. A. Novoa, J. Ko, S. M. Swetter, H. M. Blau, Sebastian Thrun - 2017

1 paper in library cites

R. Korf - 1997

1 paper in library cites

S. S. Ziv, A. Bermano, I. Naeh, G. Chechik, Yoshua Bengio, Moritz Hardt, D. Reichman, Oriol Vinyals - 2017

1 paper in library cites

Zachary C. Lipton, A. Chouldechova, J. Mcauley - 2017

1 paper in library cites

Missing author list - 2015

1 paper in library cites

P. Stock, M. Cisse - 2017

1 paper in library cites

John Langford - missing year

1 paper in library cites

Zachary C. Lipton, S. Vikram, J. Mcauley - 2015

1 paper in library cites

P. R. Cohen, A. E. Howe - 1988

1 paper in library cites

B. M. Lake, Ruslan Salakhutdinov, Joshua B. Tenenbaum - 2015

1 paper in library cites

T. G. Armstrong, A. Moffat, W. Webber, J. Zobel - 2009

1 paper in library cites

H. Noh, S. Hong, B. Han - 2015

1 paper in library cites

Jacob Steinhardt, Percy Liang - 2015

1 paper in library cites

S. Mohamed, B. Lakshminarayanan - 2016

1 paper in library cites

D. E. Knuth, T. Larrabee, P. M. Roberts - 1987

1 paper in library cites

P. M. Romer - 2015

1 paper in library cites

M. J. Nye - 1980

1 paper in library cites

J. Yin, Xu Jiang, Z. L. Lu, L. Shang, H. Li, Xiang Lisa Li - 2015

1 paper in library cites

C. Liang, Jonathan Berant, Quoc Le, K. D. Forbus, N. Lao - 2017

1 paper in library cites

Yann Lecun - missing year

1 paper in library cites

Ian J. Goodfellow, Oriol Vinyals, Andrew M. Saxe - 2015

1 paper in library cites

Jacob Steinhardt, Percy Liang - 2015

1 paper in library cites

J. Markoff - 2014

1 paper in library cites

T. Kwiatkowski, E. Choi, Y. Artzi, Luke Zettlemoyer - 2013

1 paper in library cites

A. J. Bray, D. S. Dean - 2007

1 paper in library cites

J. R. Platt - 1964

1 paper in library cites

Yoshua Bengio - 2017

1 paper in library cites

D. Gershgorn - 2017

1 paper in library cites

P. Langley, D. Kibler - 1991

1 paper in library cites

C. Metz - 2014

1 paper in library cites

Cited by

1

papers in your library

Cites

31

papers in your library

Read

on November 14, 2025
