TOWARDS A COLLABORATIVE KNOWLEDGE DISCOVERY SYSTEM FOR ENRICHING SEMANTIC INFORMATION ABOUT RISKS OF GEOSPATIAL DATA
Keywords: Geospatial database design, Collaborative approach, Preventive risk analysis, Geospatial data misuse
Abstract. The aim of this research is to design and implement a knowledge discovery system that facilitates, using a web 2.0 collaborative approach, the identification of new risks of geospatial data misuse based on a contributed knowledge repository fed by application domain experts. [Context/Motivation] This research is motivated by the irregularity of risk analysis efforts and the poor semantic of the collected information about risks. In the context of risk analysis during geospatial database design, the knowledge about risks of geospatial data misuse is typically held by domain application experts. The collection and record of that knowledge are usually considered as optional activities. It is usually performed through face-to-face risk assessment meetings and reports. Such techniques end up by restricting the scope of risk analysis to a set of obvious risks usually already identified. Besides, little consideration is devoted to the storage of risk information in an appropriate format for automatic reasoning and new risk information discovery. As a consequence, many foreseeable risky aspects inherent to the data remain overlooked leading to ill-defined specification and faulty decisions. [Principal ideas/results] In this paper, we present a contributed knowledge discovery system that aims at enriching the semantic information about risks of geospatial data misuse in order to identify foreseeable risks. The proposed web-based system relies on a systematic and more active involvement of users in risk analysis. The approach consists of 1) providing an overview of the related work in the domains of risk analysis within the context of geospatial database design, 2) presenting an ontology-based knowledge discovery system that helps experts in risks identification based on an upper-level risk ontology and on a structured representation of the domain-specific knowledge and, 3) presenting the components of the proposed system architecture and how it may be implemented and used in practice, and finally 4) we conclude by discussing the approach. [Contribution] A major outcome is that the proposed platform can help discovering implicit domain knowledge, and facilitating the identification of foreseeable risks of geospatial data misuse in a way to preventively improve the resulting fitness-for-use.