Cite Score
91
AI summary
This paper introduces a CNN model for sentence classification using pre-trained word vectors, achieving state-of-the-art results on multiple benchmarks including sentiment analysis and question classification. The model uses static and task-specific vectors and a simple modification to use both types of vectors.
Main Contributions
Abstract
We report on a series of experiments with convolutional neural networks (CNN) trained on top of pre-trained word vectors for sentence-level classification tasks. We show that a simple CNN with little hyperparameter tuning and static vectors achieves excellent results on multiple benchmarks. Learning task-specific vectors through fine-tuning offers further gains in performance. We additionally propose a simple modification to the architecture to allow for the use of both task-specific and static vectors. The CNN models discussed herein improve upon the state of the art on 4 out of 7 tasks, which include sentiment analysis and question classification.
Citation Graph
References [31]
Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012
71 papers in library cite
Yann Lecun, Leon Bottou, Yoshua Bengio, Patrick Haffner - 1998
62 papers in library cite
Tomas Mikolov, Ilya Sutskever, K. Chen, G. S. Corrado, Jeffrey Dean - 2013
32 papers in library cite
John Duchi, Elad Hazan, Yoram Singer - 2011
19 papers in library cite
Quoc Le, Tomas Mikolov - 2014
13 papers in library cite
Geoffrey Hinton - 2013
13 papers in library cite
Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001
62 papers in library cite
Geoffrey E. Hinton, N. Srivastava, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2012
25 papers in library cite
Richard Socher, A. Perelygin, Jeffrey Wu, J. Chuang, C. Manning, A. Ng, Christopher Potts - 2013
24 papers in library cite
Ronan Collobert, Jason Weston, Leon Bottou, M. Karlen, Koray Kavukcuoglu, P. P. Kuksa - 2011
23 papers in library cite
Matthew D. Zeiler - 2012
13 papers in library cite
Phil Blunsom, Edward Grefenstette, N. Kalchbrenner - 2014
7 papers in library cite
Richard Socher, Jeffrey Pennington, Eric H. Huang, Andrew Y. Ng, Christopher D. Manning - 2011
10 papers in library cite
A. Razavian, H. Azizpour, J. Sullivan, S. Carlsson - 2014
6 papers in library cite
Bo Pang, L. Lee - 2005
13 papers in library cite
Bo Pang, L. A. Lee, L. Lillian - 2004
8 papers in library cite
J. Wiebe, T. Wilson, T. Theresa, C. A. Cardie, C. Claire - 2005
7 papers in library cite
Shijie Wang, Manning, C. Christopher - 2012
7 papers in library cite
Richard Socher, B. Huval, Christopher D. Manning, Andrew Y. Ng - 2012
7 papers in library cite
M. Hu, B. A. Liu, B. Bing - 2004
6 papers in library cite
Shijie Wang, C. Manning - 2013
4 papers in library cite
T. Nakagawa, K. Inui, S. Kurohashi - 2010
3 papers in library cite
J. P. C. G. D. Silva, L. Coheur, A. C. Mendes, A. Wichert - 2011
3 papers in library cite
Xiang Lisa Li, Dan Roth - 2002
3 papers in library cite
K. Hermann, Phil Blunsom - 2013
3 papers in library cite
W. Yih, X. He, C. Meek - 2014
2 papers in library cite
L. Dong, F. Wei, Shuming Liu, M. Zhou, K. Xu - 2014
1 paper in library cites
B. Yang, C. Cardie - 2014
1 paper in library cites
W. Yih, Kristina Toutanova, J. Platt, C. Meek - 2011
1 paper in library cites
Y. Shen, X. He, Jianfeng Gao, L. Deng, G. Mesnil - 2014
1 paper in library cites
M. Iyyer, P. Enns, J. B. Graber, P. Resnik - 2014
1 paper in library cites
Cited by
8
papers in your library
Cites
14
papers in your library
Read
on October 20, 2025
Your review
Tags
Paper Aliases
No aliases