Development of a semantic and syntactic model of natural language by means of non-negative matrix and tensor factorization

O.O. Marchenko

Abstract


A method of developing a structural model of natural language syntax and semantics is proposed. Syntactic and semantic relations between parts of a sentence are presented in a form of a recursive structure called a control space. Numerical characteristics of these data are stored in multidimensional arrays. After factorization, the arrays serve as the basis for the development of procedures for natural language semantic and syntactic analyses.

 Prombles in programming 2014; 2-3: 263-272

References


Deerwester S., Susan T. Dumais, George W. Furnas, Thomas K. Landauer, Richard Harshman. Indexing by Latent Semantic Analysis. // In Journal of the American Society for Information Science. – 1990. – P. 391–407.

Tim Van de Cruys. A Non-negative Tensor Factorization Model for Selectional Preference Induction // In Journal of Natural Language Engineering. – 2010. 16(4):417–437.

Tim Van de Cruys, Laura Rimell, Thierry Poibeau, and Anna Korhonen Multi-way Tensor Factorization for Unsupervised Lexical Acquisition // In Proceedings of COLING – 2012. – P. 2703–2720.

Cohen S.B., Michael Collins. Tensor Decomposition for Fast Parsing with Latent-Variable PCFGs // In NIPS. – 2012. – P. 2528–2536.

Peng Wei, Li Tao. On the equivalence between nonnegative tensor factorization and tensorial probabilistic latent semantic analysis // Applied Intelligence, Springer Journals. – 2011. October, Vol. 35, Issue 2, P. 285–295.

Anisimov A.V. Control space of syntactic structures of natural language // Cybernetics. – 1990. – N 3, P. 11–17.

Miller G.A., Beckwith R., Fellbaum C.D., Gross D., Miller K. WordNet: An online lexical database // Int. J. Lexicograph. – 1990. – 3, 4. –

P. 235–244.

Нариньяни А.С. Формальная модель: общая схема и выбор адекватных средств. Препр. № 400/ВЦ СО АН СССР. – Новосибирск, 1978. – 19 с.

Гладкий А.В. Синтаксические структуры естественного языка в автоматизированных системах общения. – М.: Наука, 1985. – 144 с.

Klein D. and Manning C.D. Accurate Unlexicalized Parsing // In Proceedings of ACL. – 2003. – P. 423–430.

Marie-Catherine de Marneffe, Bill MacCartney and Christopher D. Manning. Generating Typed Dependency Parses from Phrase Structure

Parses // In Proceedings of LREC. – 2006.

Lee D.D. and Seung H.S. Algorithms for Non-Negative Matrix Factorization // In Proceedings of NIPS. – 2000.– P. 556–562

Cichocki A., Zdunek R., Phan A.-H., Amari S.-I. Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-way Data

Analysis and Blind Source Separation // J. Wiley & Sons, Chichester. – 2009.

Kasami T. An efficient recognition and syntax-analysis algorithm for context-free languages // Scientific report AFCRL-65-758, Air Force

Cambridge Research Lab, Bedford, MA. –1965.

Cocke J. and Jacob T. Schwartz Programming languages and their compilers: Preliminary notes // Technical report, Courant Institute of Mathematical Sciences, New York University, 1970

Younger D.H. Recognition and parsing of context-free languages in time n3 // In Information and Control – 1967. 10(2). – P. 189–208.

Марченко О.О. Алгоритм конвертації дерева залежностей у керуючий простір синтаксичної структури речення // Вісник Київського національного університету імені Тараса Шевченка. Серія: фізико-математичні науки. – 2013. – № 5.

Antikainen J., Havel J., Josth R., Herout A., Zemcík P., Hauta-Kasari M., Zemcík P. Nonnegative Tensor Factorization Accelerated Using

GPGPU // In TPDS. – 2011. – P. 1135–1141.

Kysenko V., Rupp K., Marchenko O., Selberherr S., Anisimov A. GPU-Accelerated Non-negative Matrix Factorization for Text Mining // In Lecture Notes in Computer Science. – 2012. – Vol. 7337. – P. 158–163.

Ponzetto S.P., Navigli R. Knowledge-rich Word Sense Disambiguation rivaling supervised systems // In Proceedings of ACL. – 2010. – P. 1522–1531.

Ponzetto S.P., Navigli R. Large-Scale Taxonomy Mapping for Restructuring and Integrating Wikipedia // In Proceedings of IJCAI. – 2009. – P. 2083–2088.

Ponzetto S.P., Navigli R. BabelNet: Building a Very Large Multilingual Semantic Network // In Proceedings of ACL. – 2010. – P. 216–225.

Ruiz-Casado M. Enrique Alfonseca and Pablo Castells // Automatic assignment of Wikipedia encyclopedic entries to WordNet synsets. In

Proceedings of AWIC. – 2005.


Refbacks

  • There are currently no refbacks.