| Learning random walk models for inducing word dependency distributions |
| Full text |
Pdf
(177 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 69
archive
Proceedings of the twenty-first international conference on Machine learning
table of contents
Banff, Alberta, Canada
Page: 103
Year of Publication: 2004
ISBN:1-58113-828-5
|
|
Authors
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 5, Downloads (12 Months): 47, Citation Count: 12
|
|
|
ABSTRACT
Many NLP tasks rely on accurately estimating word dependency probabilities P(ω1|ω2), where the words w1 and w2 have a particular relationship (such as verb-object). Because of the sparseness of counts of such dependencies, smoothing and the ability to use multiple sources of knowledge are important challenges. For example, if the probability P(N|V) of noun N being the subject of verb V is high, and V takes similar objects to V', and V' is synonymous to V", then we want to conclude that P(N|V") should also be reasonably high---even when those words did not cooccur in the training data.To capture these higher order relationships, we propose a Markov chain model, whose stationary distribution is used to give word probability estimates. Unlike the manually defined random walks used in some link analysis algorithms, we show how to automatically learn a rich set of parameters for the Markov chain's transition probabilities. We apply this model to the task of prepositional phrase attachment, obtaining an accuracy of 87.54%.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Bikel, D. M. (2003). Intricacies of Collins' parsing model (Technical Report TR No. MS-CIS-03-11). University of Pennsylvania.
|
| |
2
|
Brémaud, P. (1999). Markov chains: Gibbs fields, monte carlo simulation, and queues. Springer-Verlag.
|
| |
3
|
|
| |
4
|
|
| |
5
|
Charniak, E. (1997). Statistical parsing with a context-free grammar and word statistics. Proc. 14th National Conference on Artificial Intelligence (pp. 598--603).
|
| |
6
|
|
| |
7
|
|
| |
8
|
Collins, M., & Brooks, J. (1995). Prepositional attachment through a backed-off model. Proceedings of the Third Workshop on Very Large Corpora (pp. 27--38).
|
| |
9
|
|
| |
10
|
Essen, U., & Steinbiss, V. (1992). Cooccurrence smoothing for stochastic language modeling. ICASSP (pp. 161--164).
|
| |
11
|
Goodman, J. T. (2001). A bit of progress in language modeling. MSR Technical Report MSR-TR-2001-72.
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
|
 |
16
|
John Lafferty , Chengxiang Zhai, Document language models, query models, and risk minimization for information retrieval, Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, p.111-119, September 2001, New Orleans, Louisiana, United States
[doi> 10.1145/383952.383970]
|
| |
17
|
|
| |
18
|
Ng, A. Y., Zheng, A. X., & Jordan, M. (2001). Link analysis, eigenvectors, and stability. Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (IJCAI-01).
|
| |
19
|
|
| |
20
|
Rao, R. C. (1982). Diversity: Its measurement, decomposition, aportionment and analysis. The Indian Journal of Statistics, 44, 1--22.
|
| |
21
|
|
| |
22
|
Stetina, J., & Nagao, M. (1997). Corpus based PP attachment ambiguity resolution with a semantic dictionary. Proc. 5th Workshop on Very Large Corpora (pp. 66--80).
|
CITED BY 12
|
|
Dragomir R. Radev , Güneş Erkan , Anthony Fader , Patrick Jordan , Siwei Shen , James P. Sweeney, LexNet: a graphical environment for graph-based NLP, Proceedings of the COLING/ACL on Interactive presentation sessions, p.45-48, July 17-18, 2006, Sydney, Australia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|