DSSim
   HOME

TheInfoList



OR:

DSSim is an ontology mapping system, that has been conceived to achieve a certain level of the envisioned
machine intelligence Artificial intelligence (AI) is the capability of computer, computational systems to perform tasks typically associated with human intelligence, such as learning, reasoning, problem-solving, perception, and decision-making. It is a field of re ...
on the
Semantic Web The Semantic Web, sometimes known as Web 3.0, is an extension of the World Wide Web through standards set by the World Wide Web Consortium (W3C). The goal of the Semantic Web is to make Internet data machine-readable. To enable the encoding o ...
. The main driving factors behind its development was to provide an alternative to the existing heuristics or
machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...
based approaches with a multi-agent approach that makes use of uncertain reasoning. The system provides a possible approach to establish machine understanding over Semantic Web data through multi-agent beliefs and conflict resolution.


Theoretical background

The DSSim framework for ontology mapping was introduced in 2005 by Miklos Nagy and Maria Vargas-Vera at the
Open University The Open University (OU) is a Public university, public research university and the largest university in the United Kingdom by List of universities in the United Kingdom by enrolment, number of students. The majority of the OU's undergraduate ...
(OU). DSSim addresses three challenges of the Semantic Web: * Uncertainty: Ontology mapping agents adopt the
Dempster–Shafer theory The theory of belief functions, also referred to as evidence theory or Dempster–Shafer theory (DST), is a general framework for reasoning with uncertainty, with understood connections to other frameworks such as probability, possibility and ...
for creating beliefs over mapping hypothesis. Based on evidences of similarity the mapping agents combine their beliefs in order to provide a coherent view on the mappings. The system is based on a theoretical mental model for software agents to represent beliefs over similarities of different terms in different ontologies. Through these beliefs that are derived using different similarity measure and background knowledge, each agent can establish certain understanding of the terms and their context. * Inconsistency: Conflicts in belief are resolved using fuzzy voting mechanism. Processing data on the Semantic Web produces scenarios where the different agents has conflicting beliefs over a particular solution. In these situations the agents need to resolve their conflicts in order to choose the best possible solution e.g. in our case mapping. Mapping agents use fuzzy voting to determine the best decision for agent society but in case voters make mistakes in their judgments, then the majority alternative (if it exists) is statistically most likely to be the best choice. The application of voting for mapping agents is a possible way to make systems more intelligent i.e. mimic the decision making how humans reach the decision on a problematic issue. * Vastness:
Genetic algorithm In computer science and operations research, a genetic algorithm (GA) is a metaheuristic inspired by the process of natural selection that belongs to the larger class of evolutionary algorithms (EA). Genetic algorithms are commonly used to g ...
s based optimisations techniques are used in order to provide a reasonable time frame for belief combination using large ontologies. One of the main disadvantages of using Dempster-Shafer theory for uncertain reasoning is the computational complexity of the belief combination. DSSim resolves the problem by using genetic algorithm for creating the graphical structure that is used to compute the belief combination efficiently in the ontology mapping context. DSSim uses novel 3D visualisation techniques of both mapping and reasoning results. The main purpose of the reasoning storage and visualisation is to retain the reasoning states, in order to visualise it later to the end users. The main objective is to show to the end users why the system has selected a mapping candidate from two different ontologies.


Evaluation of the system

The evaluation of the system was carried out in the Ontology Alignment Evaluation Initiative (OAEI). DSSim has participated in 2006, 2007, 2008 and 2009 achieving gradually improved results. The following sections present the result of two tracks out of 8 from the OAEI 2008.


Library track at OAEI 2008

According to the original task definition provided by the organizers of the OAEI 2008, the library track involved the alignment of two Dutch thesauri. These Dutch thesauri are used to index books from two collections held by the National Library of the Netherlands (KB). KB maintains two big collections: the Deposit Collection, containing all the Dutch printed publications (one million items), and the Scientific Collection, with about 1.4 million books mainly about the history, language and culture of the Netherlands. Each collection is described according to its own indexing system and conceptual vocabulary. On the one hand, the Scientific Collection was described using the GTT, a huge vocabulary containing 35,000 general concepts ranging from Wolkenkrabbers (Sky-scrapers) to Verzorging (Care). On the other hand, the books contained in the Deposit Collection are mainly indexed against the Brinkman thesaurus, containing a large set of headings (more than 5,000) that were expected to serve as global subjects of books. For each concept, the thesauri provided the usual lexical and semantic information: preferred labels, synonyms and notes, broader and related concepts, etc. The language of both thesauri was Dutch, but a quite substantial part of Brinkman concepts (around 60%) come with English labels. The library track was difficult partly because of its relative large size and because of its multilingual representation. Nevertheless in the library track DSSim has performed the best out of the 3 participating systems. However these ontologies contain related and broader terms therefore the mapping can be carried out without consulting multi-lingual background knowledge.


Directory track at OAEI 2008

As stated by the original task definition provided by the organizers of the OAEI 2008, this track is designed to evaluate mapping quality in a real world taxonomy integration scenario. The main objective is to measure whether
ontology alignment Ontology alignment, or ontology matching, is the process of determining correspondences between concepts in ontologies. A set of correspondences is also called an alignment. The phrase takes on a slightly different meaning, in computer science, c ...
tools can effectively be applied to integration of "shallow ontologies". The evaluation dataset was extracted from
Google Google LLC (, ) is an American multinational corporation and technology company focusing on online advertising, search engine technology, cloud computing, computer software, quantum computing, e-commerce, consumer electronics, and artificial ...
,
Yahoo! Yahoo (, styled yahoo''!'' in its logo) is an American web portal that provides the search engine Yahoo Search and related services including My Yahoo, Yahoo Mail, Yahoo News, Yahoo Finance, Yahoo Sports, y!entertainment, yahoo!life, and its a ...
and
Looksmart LookSmart is an American search advertising, content management, online media, and technology company. It provides search, machine learning and chatbot technologies as well as pay-per-click and contextual advertising services. LookSmart a ...
web directories A web directory or link directory is an online list or catalog of websites. That is, it is a directory on the World Wide Web of (all or part of) the World Wide Web. Historically, directories typically listed entries on people or businesses, and the ...
. The way these ontology pairs were created was to rely on a reference interpretation for nodes, constructed by looking at their use. The assumption was that the semantics of nodes could have been derived from their
pragmatics In linguistics and the philosophy of language, pragmatics is the study of how Context (linguistics), context contributes to meaning. The field of study evaluates how human language is utilized in social interactions, as well as the relationship ...
, namely from analysing, which documents were
classified Classified may refer to: General *Classified information, material that a government body deems to be sensitive *Classified advertising or "classifieds" Music *Classified (rapper) (born 1977), Canadian rapper * The Classified, a 1980s American ro ...
under which nodes. The basic idea was therefore to compute the relationship hypotheses based on the
co-occurrence In linguistics, co-occurrence or cooccurrence is an above-chance frequency of ordered occurrence of two adjacent terms in a text corpus. Co-occurrence in this linguistic sense can be interpreted as an indicator of semantic proximity or an idio ...
of documents. The specific characteristics of the dataset were: * More than 4500 of node matching tasks, where each node matching task is composed from the paths to root of the nodes in the web directories. * Expert mappings for all the matching tasks. * Simple relationships. Basically web directories contain only one type of relationship the so called "classification relation". * Vague terminology and modeling principles: The matching tasks incorporate the typical "real world" modeling and terminological errors. In the directory track only 6 systems have participated in 2008. In terms of F-value DSSim has performed the best however the difference was marginal compared to the CIDER or Lily systems.


References

{{Reflist, refs= {{cite journal , last1=Nagy , first1=Miklos , last2=Vargas-Vera , first2=Maria , title=Multiagent Ontology Mapping Framework for the Semantic Web , journal=IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans , publisher=Institute of Electrical and Electronics Engineers (IEEE) , volume=41 , issue=4 , year=2011 , issn=1083-4427 , doi=10.1109/tsmca.2011.2132704 , pages=693–704 , s2cid=206792175 Miklos Nagy and Maria Vargas-Vera. Towards an Automatic Semantic Data Integration: Multi-Agent Framework Approach. Semantic Web. Gang Wu (ed), Chapter 7, pp. 107-134; In-Tech Education and Publishing KG; 2010, {{ISBN, 978-953-7619-54-1. Miklos Nagy, Maria Vargas-Vera and Enrico Motta. Multi-agent Ontology Mapping Framework in the AQUA Question Answering System. the Fourth International Mexican Conference on Artificial Intelligence (MICAI-2005), ''Lecture Notes in Artificial Intelligence LNAI 3789'', Gelbukh, A de Albornoz and H. Terashima (Eds), pp. 70-79, Monterrey Mexico, November 14–18, 2005. Jérôme Euzenat, Malgorzata Mochol, Pavel Shvaiko, Heiner Stuckenschmidt, Ondrej Sváb, Vojtech Svátek, Willem Robert van Hage, Mikalai Yatskevich: Results of the Ontology Alignment Evaluation Initiative 2006. ''In Proceedings of the 1st International Workshop on Ontology Matching, ISWC 2006'' {{cite journal, author=Jérôme Euzenat, Antoine Isaac, Christian Meilicke, Pavel Shvaiko, Heiner Stuckenschmidt, Ondřej Šváb, Vojtěch Svátek, Willem Robert van Hage, Mikalai Yatskevich, title=Results of the Ontology Alignment Evaluation Initiative 2007, journal=Proceedings of the 2nd International Workshop on Ontology Matching, ISWC 2007, year=2007, url=http://disi.unitn.it/~p2p/OM-2007/0-o-oaei2007.pdf Caterina Caracciolo, Jérôme Euzenat, Laura Hollink, Ryutaro Ichise, Antoine Isaac, Véronique Malaisé, Christian Meilicke, Juan Pane, Pavel Shvaiko, Heiner Stuckenschmidt, Ondřej Šváb-Zamazal, and Vojtěch Svátek: Results of the Ontology Alignment Evaluation Initiative 2008. ''In Proceedings of the 3rd International Workshop on Ontology Matching, ISWC 2008'' Jérôme Euzenat, Alfio Ferrara, Laura Hollink, Antoine Isaac, Cliff Joslyn, Véronique Malaisé, Christian Meilicke, Andriy Nikolov, Juan Pane, Marta Sabou, François Scharffe, Pavel Shvaiko, Vassilis Spiliopoulos, Heiner Stuckenschmidt, Ondřej Šváb-Zamazal, Vojtěch Svátek, Cássia Trojahn dos Santos, George Vouros and Shenghui Wang: Results of the Ontology Alignment Evaluation Initiative 2007. ''In Proceedings of the 4th International Workshop on Ontology Matching, ISWC 2009'' Ontology Alignment Evaluation Initiative::Library Track
/ref> Ontology Alignment Evaluation Initiative::Track name
/ref>


External links


DSSim demo
Semantic Web