A three-dimensional model of semantic search: queries, resources, and results

J.V. Rogushina

Abstract


We propose three-dimensional model of semantic search that analyzes search requests, information resources (IRs) and search results. This model is proposed as an additional tool for describing and comparing information retrieval systems (IRSs) that use various elements of artificial intelligence and knowledge management for more effective and relevant satisfaction of user information needs. In this work we analyze existing approaches to the semanticization of search queries and the use of external knowledge sources for retrieval process.
The values of parameters analyzed by this model are not mutually exclusive, that is, the same IRS can support several search options. More over, the representation means of queries and resources are not always comparable.
The model makes it possible to identify IRSs with intersected triads «request-IR-result» and to perform their comparison precisely on these subclasses of search problems. This approach allows to select search algorithms that are more pertinent for specific user tasks and to choose on base of this selection appropriate retrieval services that provide information for further processing. An important feature of the proposed model is that it uses only those IRS characteristics that can be directly evaluated by retrieval users.

Prombles in programming 2023; 4: 39-55


Keywords


semantic search; ontology; search query

References


Rogushina, J. (2015) The Web semantic ontology-based search: development of models, tools and methods - Melitopol, 291 p. (in Ukrainian)

Bast, H., Buchhold, B, Haussmann, E. (2016) Semantic search on text and knowledge bases. Foundations and Trends in Information Retrieval 10(2-3): 119-271. CrossRef

Mangold, C. (2007) A survey and classification of semantic search ap-proaches. Metadata Semantic Ontolo-gies 2(l):23-34. CrossRef

Manning, C. (2011) Part-of-speech tagging from 97% to 100%: Is it time for some linguistics? Gelbukh AF (Computational Linguistics and Intelli-gent Text Processing, Springer Berlin Heidelberg, 171-189. CrossRef

Hua, W., Wang, Z., Wang, H, Zheng, K, Zhou, X (2015) Short text under-standing through lexical-semantic analysis. In: 2015 IEEE 31st International Conference on Data Engineering, 495-506. CrossRef

Fellbaum, C. (2010). WordNet. In: Theory and applications of ontology: computer applications, 231-243. CrossRef

Pehcevski, J., Vercoustre, A., Thom, J. (2008) Exploiting locality of Wikipe-dia links in entity ranking. In: Advances in Information Retrieval, Springer Berlin Heidelberg, 258-269. CrossRef

Kaptein, R., Serdyukov, P., de Vries A., Kamps, J. (2010) Entity ranking using wikipedia as a pivot. In: Proc. of the 19th ACM Conference on Information and Knowledge Management, CIKM 2010, 69-78. CrossRef

Schuhmacher, M., Dietz, L., Ponzetto S (2015) Ranking entities for web queries through text and knowledge. In: Proc. of the 24th ACM International on Conference on Information and Knowledge Management, 1461-1470. CrossRef

Tran, T., Cimiano, P., Rudolph, S., Studer, R. (2007) Ontology-based in-terpretation of keywords for semantic search. In: Proc. of the 6th Interna-tional The Semantic Web and 2Nd Asian Conference on Asian Semantic Web Conference, ISWC'07/ASWC'07, 523-536. CrossRef

Schuhmacher, M., Ponzetto, S.P. (2013) Exploiting dbpedia for web search results clustering. In: Proc. of the 2013 Workshop on Automated Knowledge Base Construction, ACM, DOI 10.1145/2509558. 2509574. CrossRef

Mikolov, T., Chen, K., Corrado, G., Dean, J. (2013) Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781.

Zou, X. (2020). A survey on application of knowledge graph. In: Journal of Physics: Conference Series Vol. 1487, No. 1, 012-016. CrossRef

Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P. N., Bizer, C. (2015). Dbpedia-a large-scale, multilingual knowledge base extracted from wikipedia. Semantic web, 6(2), 167-195. CrossRef

Vrandečić, D., Krötzsch, M. (2014). Wikidata: a free collaborative knowledgebase. Communications of the ACM, 57(10), 78-85. CrossRef

Horrocks, I., Tessaris, S. (2002) Que-rying the semantic web: A formal ap-proach. In: Horrocks I., Hendler J. (eds) The Semantic Web, ISWC 2002, 177-191. CrossRef

Stojanovic, N., Studer, R., Stojanovic, L. (2003). An approach for the ranking of query results in the semantic web. In: The Semantic Web-ISWC 2003: Second International Semantic Web Conference, . Proc. 2, 500-516. CrossRef

Maedche, A., Motik, B., Stojanovic, L., Studer, R., Volz, R. (2003). An infrastructure for searching, reusing and evolving distributed ontologies. In: Proc. of the 12th international conference on World Wide Web, 439-448. CrossRef

Tonon, A., Demartini, G., Cudrd-Mauroux, P. (2012) Combining inverted indices and structured search for ad-hoc object retrieval. In: Proc. of the 35th International ACM SIGIR Con-ference on Research and Development in Information Retrieval, SIGIR '12,125-134. CrossRef

Pound, J., Mika, P., Zaragoza, H. (2010). Ad-hoc object retrieval in the web of data. In: Proc. of the 19th in-ternational conference on World Wide Web, 771-780. CrossRef

Rocha, C., Schwabe, D., Aragao, M. P. (2004). A hybrid approach for searching in the semantic web. In Proc. of the 13th international conference on World Wide Web, 374-383). CrossRef

Zhang, L., Yu, Y., Zhou, J., Lin, C., & Yang, Y. (2005). An enhanced model for searching in semantic portals. In Proc. of the 14th international conference on World Wide Web, 453-462). CrossRef

Wang, Q., Mao, Z., Wang, B., Guo, L. (2017) Knowledge graph embedding: A survey of approaches and applica-tions. In: IEEE Transactions on Knowledge and Data Engineering 29(12):2724-2743. CrossRef

Ding, L., Finin, T., Joshi, A., Pan, R., Cost, R. S., Peng, Y., Sachs, J. (2004). Swoogle: a search and metadata engine for the semantic web. In: Proc. of the thirteenth ACM international conference on Information and knowledge management, 652-659. CrossRef

Hogan, A., Harth, A., Umbrich, J., Kinsella, S., Polleres, A., Decker, S. (2011) Searching and browsing linked data with swse: The semantic web search engine. In: Web Semantics: Science, Services and Agents on the World Wide Web 9(4):365-401. CrossRef

Lei, Y., Uren, V.S., Motta, E. (2006) Sem-search: A search engine for the semantic web. In: Managing Knowledge in a World of Networks, 15th International Conference EKAW-2006, 238-245. CrossRef

Oren, E., Delbru, R., Catasta, M., Cy-ganiak, R., Stenzhom, H., Tumma-rello, G. (2008) Sindice.com: a docu-ment-oriented lookup index for open linked data. In: IJMSO 3(1):37-52. CrossRef

d'Aquin, M., Motta, E. (2011) Watson, more than a semantic web search engine. In: Semantic web 2(l):55-63. CrossRef

Cudré-Mauroux, P. (2019). Semantic Search. https://exascale.info/assets/pdf/cudre2018abigdata.pdf.

Raza, M. A., Mokhtar, R., Ahmad, N., Pasha, M., Pasha, U. (2019). A taxonomy and survey of semantic approaches for query expansion. In: IEEE Access, 7, 17823-17833. CrossRef

Gabrilovich, E., Broder, A., Fontoura, M., Joshi, A., Josifovski, V., Riedel, L., Zhang, T. (2009). Classifying search queries using the web as a source of knowledge. In: ACM Transactions on the Web (TWEB), 3(2), 1-28. CrossRef

Wu, J., Ilyas, I., Weddell, G. (2011). A study of ontology-based query expansion. In: Technical report CS-2011-04. https://cs.uwaterloo.ca/research/tr/2011/CS-2011-04.pdf.

Qiu, Y., & Frei, H. P. (1993). Concept based query expansion. In: Proc. of the 16th annual international ACM SIGIR conference on Research and development in information retrieval, 160-169. CrossRef

Duggan, G. B., Payne, S. J. (2008). Knowledge in the head and on the web: Using topic expertise to aid search. In: Proc. of the SIGCHI conference on Human factors in computing systems, 39-48. CrossRef

Wildemuth, B. M. (2004). The effects of domain knowledge on search tactic formulation. In: Journal of the american society for information science and technology, 55(3), 246-258. CrossRef

Loukachevitch, N. V., Dobrov, B. V. (2004). Development of Ontologies with Minimal Set of Conceptual Relations. In: LREC.

Navigli, R., Velardi, P. (2004). Learning domain ontologies from document warehouses and dedicated web sites. In: Computational Linguistics, 30(2), 151-179. CrossRef

Liu, S., Liu, F., Yu, C., Meng, W. (2004). An effective approach to doc-ument retrieval via utilizing WordNet and recognizing phrases. In: Proc. of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, 266-272. CrossRef

Moreau, F., Claveau, V., Sébillot, P. (2007). Automatic morphological que-ry expansion using analogy-based ma-chine learning. In: Advances in Infor-mation Retrieval: 29th European Con-ference on IR Research, ECIR 2007, Proc. 29, 222-233). CrossRef

Best, B. J., Gerhart, N., Lebiere, C. (2010). Extracting the ontological structure of OpenCyc for reuse and portability of cognitive models. In: Proc. of the 17th Conference on Behavioral Representation in Modeling and Simulation.

Suchanek, F. M., Kasneci, G., Weikum, G. (2008). Yago: A large ontology from wikipedia and wordnet. In: Journal of Web Semantics, 6(3), 203-217. CrossRef

Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P. N., Bizer, C. (2015). Dbpedia - a large-scale, multilingual knowledge base extracted from wikipedia. In: Semantic web, 6(2), 167-195. CrossRef

Kalender, M., Dang, J., Uskudarli, S. (2010). Unipedia: A unified ontological knowledge platform for semantic content tagging and search. In: 2010 IEEE Fourth International Conference on Semantic Computing, 293-298. CrossRef

Aggarwal, N., Buitelaar, P. (2012,). Query Expansion Using Wikipedia and Dbpedia. In: CLEF (Online Working Notes/Labs/Workshop).

Zhou, D., Wu, X., Zhao, W., Lawless, S., Liu, J. (2017). Query expansion with enriched user profiles for personalized search utilizing folksonomy data. In: IEEE Transactions on Knowledge and Data Engineering, 29(7), 1536-1548. CrossRef

Ray, S. K., Singh, S., Joshi, B. P. (2009). Exploring multiple ontologies and WordNet framework to expand query for question answering system. In: Proc. of the First International Conference on Intelligent Human Computer Interaction: (IHCI 2009), 296-305). CrossRef

Deutch, D., Frost, N., & Gilad, A. (2017). Provenance for natural language queries. In: Proc. of the VLDB Endowment, 10(5), 577-588. CrossRef

Unni, M., Baskaran, K. (2011). Overview of approaches to semantic web search. In: International Journal of Computer Science and Communication (IJCSC), 2, 345-349.

Sudeepthi, G., Anuradha, G., Babu, M. S. P. (2012). A survey on semantic web search engine. In: International Journal of Computer Science Issues (IJCSI), 9(2), 241-245.

Cimiano, P., Haase, P., Heizmann, J., Mantel, M., Studer, R. (2008). Towards portable natural language interfaces to knowledge bases- The case of the ORAKEL system. In: Data & Knowledge Engineering, 65(2), 325-354. CrossRef

Croft, W. B., Turtle, H. R., Lewis, D. D. (1991). The use of phrases and structured queries in information retrieval. In: Proc. of the 14th annual international ACM SIGIR conference on Research and development in information retrieval, 32-45. CrossRef

Teletska, A. O., Zagnitko, A. P., Nadutenko, M. V. (2018). Classification of information search languages. History, philosophy, law, 120. (in Ukrainian)

Chowdhury G. G. (2010) Information Retrieval, 3rd edition. London: Facet Publishing, 488 р.

Serbin, O. (2008). Representation of information search languages in web-oriented systems. In: Scientific works of the V.I. Vernadskyi National Library of Ukraine, (20), 176-184. (in Ukrainian)

Rogushina, J. V. (2019). Means and methods of the unstructured data analysis. In: Problems in programming, (1), 57-77. CrossRef

Pérez, J., Arenas, M., Gutierrez, C. (2009). Semantics and complexity of SPARQL. In: ACM Transactions on Database Systems (TODS), 34(3), 1-45. CrossRef

Weiand, K., Hartl, A., Hausmann, S., Bry, F., Furche, T. (2012). Keyword-Based Search over Semantic Data. Semantic Search over the Web, 159-192. CrossRef

Bao, J., Ding, L., Hendler, J. (2008). Knowledge representation and query in semantic MediaWiki: a formal study. Tetherless World Constellation (RPI) Technical Report. DOI 10.1.1.187.4263.

Rogushina, J., Priyma, S., Strokan, O. (2017) Creating and Use of Semantic Wiki Resources: A Study Guide. - Melitopol, 169 p. (in Ukrainian)

Rogushina, J., Grishanova, I. (2022) Semantic Information Resources with a Complex Structure: Knowledge Representation, Scaling and Search Problems. In: UkrPROG, CEUR Vol-3501, 158-171. CrossRef

Pryima, S., Rogushina, J., Strokan, O. (2018). Use of semantic technologies in the process of recognizing the outcomes of non-formal and informal learning. In: CEUR Workshop Proceedings, 226-235. CrossRef

The FAIR Guiding Principles for scientific data management and steward-ship. Available from: https://www.nature.com/articles/sdata201618.

Rogushina, J., Grishanova, I. (2022). Study of principles, models and meth-ods of FAIR paradigm of scientific data management for analysis for BIG data metadata. In: Problems in programming, (4), 26-35. CrossRef

Rogushina, J. (2023). Development of intelligent information analytical webportals based on semantic Wiki technologies: problems and challenges. In: Problems in programming, (3), 66-80. CrossRef




DOI: https://doi.org/10.15407/pp2023.04.039

Refbacks

  • There are currently no refbacks.