RESEARCH ON A HADOOP-BASED GIS PLATFORM FOR MASSIVE TRAFFIC DATA ANALYSIS
Keywords: Traffic data, GIS, Hadoop, MapReduce
Abstract. To address the storage and computation limits that traditional GIS platforms face when handling massive traffic data, this paper proposes a Hadoop-based GIS platform for massive traffic data analysis. The platform uses MapReduce as its distributed programming model to analyze massive data in support of urban traffic decision-making, and uses the HDFS distributed file system to store and manage traffic data at the TB or even PB scale. The results are displayed using GIS spatial visualization, and the effects of data volume and of the number of cluster nodes on computation time are analyzed and compared. The experimental results show that a distributed multi-node cluster effectively improves the storage and computing efficiency for massive traffic data and greatly reduces the total task scheduling time.
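To make the MapReduce programming model concrete, the following is a minimal single-process sketch in Python, not the platform's actual implementation. It assumes a hypothetical record format of `road_id,timestamp,speed` and computes the mean speed per road segment; the function names (`map_record`, `shuffle`, `reduce_road`, `run_job`) are illustrative only. On a real Hadoop cluster the map and reduce steps would run in parallel on different nodes over blocks read from HDFS.

```python
from collections import defaultdict
from itertools import chain


def map_record(line):
    """Map step: emit (road_id, (speed, 1)) for one CSV record.

    Assumed (hypothetical) record format: road_id,timestamp,speed
    """
    road_id, _timestamp, speed = line.strip().split(",")
    yield road_id, (float(speed), 1)


def shuffle(pairs):
    """Shuffle step: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_road(road_id, values):
    """Reduce step: combine partial sums and counts into a mean speed."""
    total = sum(speed for speed, _ in values)
    count = sum(n for _, n in values)
    return road_id, total / count


def run_job(lines):
    """Run map, shuffle, and reduce sequentially in one process."""
    pairs = chain.from_iterable(map_record(line) for line in lines)
    return dict(reduce_road(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    sample = [
        "R1,2020-01-01T08:00,40",
        "R1,2020-01-01T08:05,60",
        "R2,2020-01-01T08:00,30",
    ]
    print(run_job(sample))
```

Emitting a (sum, count) pair rather than a bare speed keeps the reduce step associative, so the same function could also serve as a combiner that pre-aggregates map output before it is sent across the network.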