GEOSPATIAL BIG DATA ANALYTICS FOR SUSTAINABLE SMART CITIES
Keywords: Geospatial, Big Data, Parallel Computing, Smart City, Smart Environment, Smart Infrastructure, Green Energy
Abstract. Growing urbanization cause environmental problems such as vast amount of carbon emissions and pollution all over the world.
Smart Infrastructure and Smart Environment are two significant components of the smart city paradigm that can create opportunities for ensuring energy conservation, preventing ecological degradation, and using renewable energy sources. Since a great portion of the data contains location information, geospatial intelligence is a key technology for sustainable smart cities. We need a holistic framework for the smart governance of cities by utilizing key technological drivers such as big data, Geographic Information Systems (GIS), cloud computing, Internet of Things (IoT). Geospatial Big Data applications offer predictive data science tools such as grid computing and parallel computing for efficient and fast processing to build a sustainable smart city ecosystem. Effective management of big data in storage, visualization, analytics, and analysis stages can foster green building, green energy, and net zero targets of countries. Parallel computing systems have the ability to scale up analysis on geospatial big data platforms which is key for ocean, atmosphere, land, and climate applications. In this study, it is aimed to create the necessary technical infrastructure for smart city applications with a holistic big data management approach. Thus, a smart city model framework is developed for Smart Environment and Smart Governance components and performance comparison of Dask-GeoPandas and Apache Sedona parallel processing systems are carried out. Apache Sedona performed better on the performance test during read, write, join and clustering operations.