2010

Word Representations: A Simple and General Method for Semi-Supervised Learning

J. Turian, L. Ratinov, Yoshua Bengio

citations

Cite Score

63

AI summary

This paper introduces a simple way to improve NLP accuracy by using unsupervised word representations (Brown clusters, Collobert & Weston embeddings, and HLBL embeddings) as extra word features, achieving improved results on NER and chunking tasks.

Main Contributions

  • Demonstrates that unsupervised word representations can improve the accuracy of existing supervised NLP systems.
  • Evaluates Brown clusters, Collobert and Weston embeddings, and HLBL embeddings on NER and chunking tasks.
  • Shows that combining different word representations can lead to further improvements in accuracy.
  • Provides a method for scaling word embeddings for off-the-shelf use as word features.
  • Offers a default method for setting the scaling parameter for word embeddings

Abstract

If we take an existing supervised NLP system, a simple and general way to improve accuracy is to use unsupervised word representations as extra word features. We evaluate Brown clusters, Collobert and Weston (2008) embeddings, and HLBL (Mnih & Hinton, 2009) embeddings of words on both NER and chunking. We use near state-of-the-art supervised baselines, and find that each of the three word representations improves the accuracy of these baselines. We find further improvements by combining different word representations. You can download our word features, for off-the-shelf use in existing NLP systems, as well as our code, here: http://metaoptimize.com/projects/wordreprs/

Citation Graph

Loading graph...

References [50]

Sort:
Filter:

Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001

62 papers in library cite

Ronan Collobert, Jason Weston - 2008

32 papers in library cite

Yoshua Bengio, J. Louradour, Ronan Collobert, Jason Weston - 2009

6 papers in library cite

Jeffrey L. Elman - 1993

5 papers in library cite

F. Morin, Yoshua Bengio - 2005

19 papers in library cite

A. Mnih, Geoffrey E. Hinton - 2009

16 papers in library cite

A. Mnih, Geoffrey Hinton - 2007

12 papers in library cite

Yoshua Bengio, Jean Sebastien Senecal - 2003

11 papers in library cite

Holger Schwenk, Jean Luc Gauvain - 2002

14 papers in library cite

Reference title contains 'et al'

D. M. Blei, Andrew Y. Ng, Michael I. Jordan - 2003

10 papers in library cite

P. F. Brown, P. V. Desouza, R. L. Mercer, Vincent J. Della Pietra, J. C. Lai - 1992

12 papers in library cite

P. D. Turney, P. Pantel - 2010

6 papers in library cite

Rie Kubota Ando, Tong Zhang - 2005

4 papers in library cite

Fernando Pereira, N. Tishby, L. Lee - 1993

4 papers in library cite

J. Suzuki, H. Isozaki - 2008

4 papers in library cite

L. A. Ratinov, Dan Roth - 2009

3 papers in library cite

K. Lund, C. Burgess - 1996

3 papers in library cite

Tong Zhang, D. Johnson - 2003

2 papers in library cite

F. Huang, A. Yates - 2009

2 papers in library cite

E. F. T. K. Sang, S. Buchholz - 2000

2 papers in library cite

S. Miller, J. Guinness, A. Zamanian - 2004

2 papers in library cite

Yoshua Bengio - 2008

2 papers in library cite

D. Lin, Xiaobao Wu - 2009

2 papers in library cite

Wentao Li, Andrew Mccallum - 2005

2 papers in library cite

F. Sha, Fernando Pereira - 2003

2 papers in library cite

T. Koo, X. Carreras, Michael Collins - 2008

2 papers in library cite

J. Turian, L. Ratinov, Yoshua Bengio, Dan Roth - 2009

1 paper in library cites

S. Martin, J. Liermann, Hermann Ney - 1998

1 paper in library cites

V. Krishnan, Christopher D. Manning - 2006

1 paper in library cites

J. Suzuki, H. Isozaki, X. Carreras, Michael Collins - 2009

1 paper in library cites

T. K. Landauer, P. W. Foltz, D. Laham - 1998

1 paper in library cites

M. Sahlgren - 2005

1 paper in library cites

J. Vayrynen, T. Honkela - 2005

1 paper in library cites

T. Honkela, V. Pulkki, T. Kohonen - 1995

1 paper in library cites

Y. Goldberg, R. Tsarfaty, M. Adler, M. Elhadad - 2009

1 paper in library cites

V. Spitkovsky, H. Alshawi, Dan Jurafsky - 2010

1 paper in library cites

A. Ushioda - 1996

1 paper in library cites

M. Candito, B. Crabbe - 2009

1 paper in library cites

H. Zhao, Weizhu Chen, C. Kit, G. Zhou - 2009

1 paper in library cites

T. Honkela - 1997

1 paper in library cites

H. Ritter, T. Kohonen - 1989

1 paper in library cites

K. Lund, C. Burgess, R. A. Atchley - 1995

1 paper in library cites

K. Deschacht, M. F. Moens - 2009

1 paper in library cites

R. Rehurek, P. Sojka - 2010

1 paper in library cites

J. J. Vayrynen, T. Honkela, L. Lindqvist - 2007

1 paper in library cites

S. T. Dumais, G. W. Furnas, T. K. Landauer, S. T. Deerwester, R. Harshman - 1988

1 paper in library cites

M. Sahlgren - 2001

1 paper in library cites

J. J. Vayrynen, T. Honkela - 2004

1 paper in library cites

Cited by

17

papers in your library

Cites

10

papers in your library

Read

on March 20, 2025

Your review

Tags

Paper Aliases

No aliases