保存时间:2026/4/2 04:51:32
["the dog chased the cat", "the cat chased the mouse"] ,通过实际统计来计算 chased 分别作为目标词和上下文词时相关向量的值,在 GloVe 模型中,我们首先要明确向量并不是直接通过简单统计得到的,而是通过后续基于共现矩阵的优化过程来不断调整得到合适的向量表示。但我们可以先详细梳理一下共现矩阵等相关统计信息,为后续优化计算做准备:["the", "dog", "chased", "cat", "mouse"],词汇表大小 ∣V∣=5。[1] Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. Efficient estimation of word representations in vector space. ICLR Workshop, 2013.
[2] Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, and Jeff Dean. Distributed representations of words and phrases and their compositionality. In NIPS, 2013.
[3] Jeffrey Pennington, Richard Socher, and Christopher D. Manning. GloVe: Global Vectors for Word Representation. In EMNLP, 2014.
[4] Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, and Richard Harshman. Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41(6):391–407, 1990.
[5] Steffen Ihler, Alexander Fisher, and William Stewart. Efficient methods for topic models. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 621–629. ACM, 2008.
[6] Tom M. Mitchell, Maria M. Lapata, and others. Predicting semantic relations from unsupervised text. In ACL, volume 4, page 29, 2008.
[7] Anna Rafferty, Stephen Clark, and Mark Steedman. Semantic compositionality and lexical substitution. In ACL, 2011.
[8] Yihong Gong, Li Deng, and Alex Acero. Exploiting global word co-occurrence statistics for statistical language learning. In ICASSP, 2011.
[9] Aria Haghighi and Dan Klein. Convolutional neural tensor networks for relation extraction with multi-way semantic compositionality. In EMNLP, 2014.
[10] Tomas Mikolov, Wen-tau Yih, and Geoffrey Zweig. Linguistic regularities in continuous space word representations. In HLT-NAACL, 2013.
[11] Antoine Bordes, Jason Weston, Ronan Collobert, and Yoshua Bengio. Learning structured embeddings of knowledge bases. In AAAI, 2013.
[12] Samy Bengio, Gabor Mesnard, and Yoshua Bengio. Learning word representations for machine translation. In EMNLP, 2009.
[13] Omer Levy and Yoav Goldberg. Neural word embedding as implicit matrix factorization. In NIPS, 2014.
[14] Gábor Recski. Matrices and vectors in combinatorial optimization. Akadémiai Kiadó, 1989.
[15] Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze. Introduction to Information Retrieval. Cambridge University Press, 2008.
[16] Hinrich Schütze. Automatic word sense discrimination. Computational Linguistics, 24(1):97–124, 1998.
[17] John R. Quinlan. Induction of decision trees. Machine learning, 1(1):81–106, 1986.
[18] Thorsten Joachims. Text categorization with support vector machines: Learning with many relevant features. In ECML, volume 1398, pages 137–142. Springer, 1998.
[19] David M. Blei, Andrew Y. Ng, and Michael I. Jordan. Latent Dirichlet allocation. Journal of machine Learning research, 3(Jan):993–1022, 2003.
[20] Christopher D. Manning and Hinrich Schütze. Foundations of statistical natural language processing. MIT press, 1999.