GEOMATICS APPROACH FOR ASSESSMENT OF RESPIRATORY DISEASE MAPPING

Air quality is a subject of pressing relevance because air is a prime resource for sustaining life and, in particular, human health. Technological advances now generate large volumes of ambient air quality data that characterize the air environment and indicate how good or bad the air is. This paper presents a method for assessing the Air Quality Index (AQI) using fuzzy logic. The fuzzy logic model is designed to predict an AQI that reports monthly air quality. With the AQI we can evaluate how suitable the environment of an area is for human health. To appraise human health status in an industrial area, we use information from a health survey questionnaire to obtain a respiratory risk map by applying IDW and Getis statistical techniques. The Getis statistical techniques identify different spatial clustering patterns, such as hot spots, high-risk areas and cold spots, over the entire study area with statistical significance.


INTRODUCTION
Air pollution has become a major global concern, driven by technological and scientific innovation in many countries and by the diverse activities people undertake to raise their standard of living. In recent years, urban and non-urban pollution problems have grown considerably and taken on a very serious dimension. Unplanned growth and rapid industrialization have degraded ambient air quality. Anthropogenic air pollution continues to be regarded as an environmental and public health problem. Its seriousness lies in the fact that elevated pollutant levels occur in environments where harm to human health and welfare is more probable. Ambient air quality in our communities, in the countryside and even in remote locations changes from hour to hour, from day to day and over longer time scales. Air quality is considered poor when pollutants reduce visibility, soil building surfaces, damage materials, damage crops and other plants, or cause adverse health effects. Awareness of air quality levels is important to those who suffer from illnesses caused by air pollution. Members of the general public are usually not satisfied with raw data, time series plots, statistical analyses and other complex findings on air quality. For analyzing and representing air quality status uniformly, the air quality index (AQI) is therefore an important tool: it assesses the relative change in the concentrations of groups of pollutants with respect to their respective standards. The index should be based on guidelines and take into account every aspect of air pollution (Chelani et al., 2002).

Geomatics is the science and technology of gathering, storing, analyzing, interpreting, modelling, distributing and using georeferenced information. It is multidisciplinary and comprises a broad range of disciplines, including surveying and mapping, remote sensing, geographical information systems (GIS) and the global positioning system (GPS) (Boulos et al., 2001). One of the most important of these is GIS technology, which allows the organization, manipulation, analysis and visualization of spatial data, often uncovering relationships, patterns and trends (Scott et al., 2010). A GIS can be described as an automated system for the capture, storage, retrieval, analysis and display of spatial data. It is a computer database management and mapping program that organizes, stores and displays large amounts of multipurpose information. Applied to epidemiological data, GIS can be used to construct disease maps that show populations at risk and to analyze trends over space and time.

The aim of this paper is to develop, implement and evaluate methods and tools for visualizing and analyzing geospatial health data and to enhance public health research. Disease maps can be used to assess the need for geographical variation in health resource allocation, or in research studies of the relation of incidence to explanatory variables. Disease mapping usually selects a spatial interpolation method and then creates a continuous surface of disease distribution from geographically distributed disease sampling data. Available spatial interpolation methods include Inverse Distance Weighted (IDW), global polynomial, local polynomial and kriging.

STUDY AREA AND DATA COLLECTION
Sonebhadra is the second-largest district of Uttar Pradesh, India. The district covers an area of 6788 km² and has a population of 1,463,468, with a population density of 216 persons per km². This study considers two locations in Sonebhadra district: Anpara and Renusagar. The first location, Anpara, covers an expanse of nearly 6 km², has a population of 22,385 and is located at latitude 24.2060° N and longitude 82.7650° E. It hosts a thermal power station with a total installed generation capacity of 2830 MW, built beside Govind Ballabh Pant Sagar Lake and the Rihand River. Renusagar is a town in Sonebhadra district and is the home of Renupower, a 700 MW captive power plant that supplies electricity to Hindalco Industries' operations in Renukoot. This location covers an expanse of nearly 5 km², has a population of approximately 20,000 and is located at latitude 24.180° N and longitude 82.790° E.

AIR POLLUTANT DATA COLLECTION
The present work was carried out during 2008-2011. The concentrations of four air pollutants, viz. SO₂, NO₂, RSPM and SPM, were monitored in ambient air at two monitoring stations set up by UPPCB, one in Anpara Colony, Anpara, and one in Renusagar Colony, Renusagar. The data were collected by UPPCB, Lucknow.

AGGREGATION OF HEALTH SURVEY DATA
This study uses data from an epidemiological survey based on a standardized interviewer-administered questionnaire covering respiratory symptoms and diseases, lifestyle and personal habits. The questionnaire records the presence or absence of nine symptoms/diseases: Asthma, Bronchitis, Cough, Tuberculosis, Sinusitis, Rhinitis, Dermatitis, Eye Redness and Heart Problem.

AIR QUALITY INDEX
An air quality index is one of the important tools available for analyzing and representing air quality status uniformly, so the air quality index (AQI) can be used as a measure of the relative change in the concentrations of groups of pollutants. The AQI is defined as an overall scheme that transforms the weighted values of individual air pollution parameters (e.g. SO₂, NO₂, SPM and RSPM) into a single number or set of numbers. The result is a set of rules (for example, an equation) that translates parameter values into a more parsimonious form by means of numerical manipulation (Sharma et al., 2003). To describe the status of air quality and its effects on human health, Joshi et al. (2011) categorize the ranges of index values as good, moderate, poor, very poor and severe. The AQI scale is thus divided into five categories, each describing a range of air quality and its associated potential health effects. The index uses health-based descriptions to provide meaningful information to the public.

A FUZZY INFERENCE SYSTEM BASED APPROACH FOR CALCULATING AQI
Fuzzy set theory was introduced in 1965 by Lotfi A. Zadeh. It facilitates the control of a complicated system without knowledge of its mathematical description, and it provides a precise problem-solving methodology that can handle numerical data and linguistic knowledge simultaneously.

GEOMATICS APPROACH FOR RESPIRATORY DISEASE RISK MAP
Geomatics offers a digital lens for exploring the dynamic links between people, their health and well-being, and changing physical and social environments. GIS helps us break down and address public health problems. In this paper we used GIS to map and analyze the geographical distributions of populations at risk, health outcomes and risk factors, to explore associations between risk factors and health outcomes, and to address health problems. GIS makes it easier to search and analyze large databases of health events at a high degree of spatial disaggregation, and to link data from surveillance systems to other information about the environment, including information on the distribution of risk factors.

USING STATISTICAL TECHNIQUES
GIS has a vital role in the surveillance and control of vector-borne diseases, as it makes it possible to scrutinize factors associated with a disease through geocoding (Bhunia et al., 2013). In the present study, we used several techniques under the umbrella of GIS. GIS combined with spatial statistics, including spatial autocorrelation and cluster analysis, has also been applied to other diseases, where it is often used to investigate and more clearly exhibit the spatial patterns of disease (Bhunia et al., 2013). The aim of this investigation is therefore to detect and quantify spatial heterogeneity in disease prevalence across a geographical area, identifying high-prevalence or high-risk areas. Spatial statistics tools allow us to describe and analyze how various geographical events occur.

SPATIAL AUTOCORRELATION
Spatial autocorrelation analysis was performed on the incidence rates of respiratory disease to test whether the cases were distributed randomly over space and, if not, to evaluate any identified spatial disease clusters for statistical significance (Kulldorff et al., 1997). Global autocorrelation tests measure the tendency, across all data points, for higher (or lower) values to lie closer together in space to other higher (or lower) values than would be expected if the data points were drawn from a random distribution. Several tests of global autocorrelation are available, with Moran's I being the most common (Jerrett et al., 2010).
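As an illustration of the global test described above, the following minimal sketch computes Moran's I and a permutation-based pseudo p-value with NumPy. It is not the implementation used in this study (the software is not specified here); the function names, the dense weights matrix and the one-sided permutation test are assumptions made for the example.

```python
import numpy as np


def morans_i(values, weights):
    """Global Moran's I for values x and a spatial weights matrix w (w[i, i] = 0)."""
    x = np.asarray(values, dtype=float)
    w = np.asarray(weights, dtype=float)
    z = x - x.mean()                      # deviations from the mean
    cross = (w * np.outer(z, z)).sum()    # sum_ij w_ij * z_i * z_j
    return (x.size / w.sum()) * cross / (z ** 2).sum()


def morans_i_pseudo_p(values, weights, n_perm=999, seed=0):
    """One-sided pseudo p-value for positive autocorrelation (assumed test design)."""
    rng = np.random.default_rng(seed)
    x = np.asarray(values, dtype=float)
    observed = morans_i(x, weights)
    # Count random relabellings whose I is at least as large as the observed I.
    hits = sum(morans_i(rng.permutation(x), weights) >= observed
               for _ in range(n_perm))
    return (hits + 1) / (n_perm + 1)


# Hypothetical example: four sites on a line, neighbours share a border.
chain = np.zeros((4, 4))
for i in range(3):
    chain[i, i + 1] = chain[i + 1, i] = 1.0

rates = [1.0, 1.0, 5.0, 5.0]              # illustrative incidence rates
print(morans_i(rates, chain), morans_i_pseudo_p(rates, chain))
```

A value of I above its expectation under randomness suggests that similar incidence rates cluster in space; a negative value suggests that neighbouring sites tend to have dissimilar rates.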

CLUSTER-OUTLIER ANALYSIS
Global Moran's I can only detect the presence of clustering of similar values; it does not show where the clusters or outliers are located. In cluster-outlier analysis, the cluster-outlier type field distinguishes between a statistically significant (p < 0.01) cluster of high values (High-High), a cluster of low values (Low-Low), an outlier in which a high value is surrounded primarily by low values (High-Low), and an outlier in which a low value is surrounded primarily by high values (Low-High). A positive value of the local statistic I indicates that the feature is surrounded by features with similar values; such a feature is part of a cluster. A negative value of I indicates that the feature is surrounded by features with dissimilar values; such a feature is an outlier.
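The cluster-outlier types above can be illustrated with a local Moran's I computation. The sketch below is an assumption-laden example rather than the procedure used in this study: it uses row-standardized weights, a hypothetical function name, and it labels every feature by quadrant without the significance test (p < 0.01) that a full analysis would apply before reporting a type.

```python
import numpy as np


def local_morans_i(values, weights):
    """Local Moran's I per feature, plus its cluster-outlier quadrant label."""
    x = np.asarray(values, dtype=float)
    w = np.asarray(weights, dtype=float)
    # Row-standardize so each feature's neighbour weights sum to 1.
    row_sums = w.sum(axis=1, keepdims=True)
    w = np.divide(w, row_sums, out=np.zeros_like(w), where=row_sums > 0)

    z = x - x.mean()
    m2 = (z ** 2).sum() / x.size          # variance-like scaling term
    lag = w @ z                           # mean deviation of the neighbours
    local_i = z * lag / m2

    labels = []
    for zi, li in zip(z, lag):
        if zi > 0 and li > 0:
            labels.append("High-High")    # high value among high neighbours
        elif zi < 0 and li < 0:
            labels.append("Low-Low")      # low value among low neighbours
        elif zi > 0 and li < 0:
            labels.append("High-Low")     # high outlier among low neighbours
        elif zi < 0 and li > 0:
            labels.append("Low-High")     # low outlier among high neighbours
        else:
            labels.append("Undefined")    # value or lag exactly at the mean
    return local_i, labels


# Hypothetical example: five sites on a line with shared-border neighbours.
chain = np.zeros((5, 5))
for i in range(4):
    chain[i, i + 1] = chain[i + 1, i] = 1.0

local_i, labels = local_morans_i([9.0, 10.0, 1.0, 1.0, 2.0], chain)
print(labels)
```

Positive local values correspond to the High-High and Low-Low clusters, and negative values to the High-Low and Low-High outliers, matching the interpretation given in the text.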

HOTSPOT DETECTION AND ANALYSIS
A hotspot is defined as a condition indicating some form of clustering in a spatial distribution. To detect hotspots we use the Getis-Ord Gi*(d) statistic, which separates clusters of high values from clusters of low values and is useful for determining the spatial dependence of neighbouring observations. The result is expressed as a Z-score and p-value of the calculated Gi*(d), which represent the statistical significance of the spatial clustering of values, given the conceptualization of spatial relationships and the chosen scale (Li et al., 2014).

A common issue in spatial interpolation is the incorporation of data measured at various scales and over different spatial supports. This situation is frequently encountered in health studies, where data are typically available over a wide range of scales (Goovaerts, 2012). Disease mapping usually selects a spatial interpolation method, such as Inverse Distance Weighted (IDW), global polynomial, local polynomial or kriging, and then creates a continuous surface of disease distribution from geographically distributed disease sampling data (Zhong et al., 2005). IDW interpolation explicitly implements the assumption that things that are close to one another are more alike than those that are farther apart. To predict a value for any unmeasured location, IDW uses the measured values surrounding the prediction location. Measured values closest to the prediction location have more influence on the predicted value than those farther away: closer points receive greater weights, hence the name inverse distance weighted. Li et al. (2014) focus on one of the deterministic models, IDW (inverse distance weighting) interpolation.

Figures 9 and 10 show the spatial clusters of respiratory diseases, which cover specific locations. Each hotspot analysis of the respiratory incidence rate showed statistically significant hot spots (p < 0.01). The larger the Z-score, the more intense the clustering of high values (hot spot); the smaller the Z-score, the more intense the clustering of low values (cold spot). In these maps, darker areas indicate statistically significant hot spots, while light areas represent significant cold spots. The maps show clear spatial patterns of respiratory disease. Several authors suggest that kriging performs better than IDW in interpolating and predicting distribution patterns. For this kind of mapping, however, IDW interpolation is most appropriate because it is simple and intuitive and the interpolated values are fast to compute. In these maps, IDW interpolation predicts values at unmeasured locations from the surrounding measured values, with closer measurements weighted more heavily.
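Both techniques in this section can be sketched compactly. The example below is illustrative only and is not the implementation behind Figures 9 and 10: the function names, the fixed distance band used as the conceptualization of spatial relationships, the binary weights and the IDW power of 2 are all assumptions. The Gi* function returns the Z-score form of the statistic, with each feature counted as its own neighbour.

```python
import numpy as np


def idw_predict(sample_xy, sample_values, query_xy, power=2.0):
    """Inverse distance weighted prediction at each query location."""
    sample_xy = np.asarray(sample_xy, dtype=float)
    sample_values = np.asarray(sample_values, dtype=float)
    query_xy = np.atleast_2d(np.asarray(query_xy, dtype=float))
    dist = np.linalg.norm(query_xy[:, None, :] - sample_xy[None, :, :], axis=2)

    predictions = np.empty(len(query_xy))
    for k, row in enumerate(dist):
        exact = row == 0
        if exact.any():
            # Query coincides with a sample: return the measured value.
            predictions[k] = sample_values[exact].mean()
        else:
            w = 1.0 / row ** power        # closer samples get larger weights
            predictions[k] = (w * sample_values).sum() / w.sum()
    return predictions


def getis_ord_gi_star(values, coords, distance_band):
    """Gi* Z-scores using binary weights within a fixed distance band."""
    x = np.asarray(values, dtype=float)
    xy = np.asarray(coords, dtype=float)
    n = x.size
    dist = np.linalg.norm(xy[:, None, :] - xy[None, :, :], axis=2)
    w = (dist <= distance_band).astype(float)   # includes the feature itself

    mean = x.mean()
    s = np.sqrt((x ** 2).mean() - mean ** 2)
    w_sum = w.sum(axis=1)
    w_sq = (w ** 2).sum(axis=1)
    numerator = w @ x - mean * w_sum
    # The denominator is zero if a band covers every feature or all values are equal.
    denominator = s * np.sqrt((n * w_sq - w_sum ** 2) / (n - 1))
    return numerator / denominator


# Hypothetical survey sites along a line with illustrative incidence rates.
sites = [(0, 0), (1, 0), (2, 0), (3, 0), (4, 0), (5, 0)]
rates = [10.0, 9.0, 8.0, 1.0, 2.0, 1.0]
print(getis_ord_gi_star(rates, sites, distance_band=1.0))
print(idw_predict(sites, rates, [(0.5, 0.0), (4.5, 0.0)]))
```

Large positive Z-scores mark candidate hot spots and large negative Z-scores mark candidate cold spots; the IDW surface can then be evaluated on a regular grid of query points to draw a continuous risk map.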

CONCLUSION
Air pollution in industrial areas is currently an issue of great concern because of its harmful impact on human health, agricultural productivity, forestry and so on. In this study, the air quality of an industrial area was assessed from four pollutant parameters (SPM, RSPM, SO₂, NO₂) using fuzzy logic concepts to calculate the Air Quality Index (AQI). With the AQI standards prescribed by the Central Pollution Control Board of India (CPCB) we can assess the status of the area: the higher the AQI value, the greater the level of air pollution and the greater the damage to health. The index thus uses health-based descriptions to provide meaningful information to the public. We used epidemiological sample data collected from a socioeconomic questionnaire-based survey to derive an environmental indicator of respiratory health. We applied spatial statistical tools to produce distribution maps of respiratory symptoms; these interpolated maps may be considered a good representation of the distribution of respiratory symptoms and diseases over the area. From an environmental point of view, the study suggests the need for a multilayer analysis that considers all possible risk factors for population health status, with a focus on the presence of different kinds of pollution sources.

Figure 1: Flow Chart for Mapping of Health Risk
The five levels of AQI are given below.

Figure 3: Graph of Air Quality Index of Anpara

Figure 4: Graph of Air Quality Index of Renusagar

Figure 5: Spatial Autocorrelation of Anpara

The output of the Gi*(d) statistic identifies spatial clusters of high values (hot spots) and spatial clusters of low values (cold spots).

Figure 9: Health Risk map for Anpara area

Figure 10: Health Risk map for Renusagar

Fuzzy logic is useful for addressing real-world problems; it uses linguistic variables such as low, medium and high in place of true/false or yes/no variables. Hajek et al. (2009) describe the general structure of an FIS, which comprises fuzzification of the input variables by membership functions, design of a base of IF-THEN rules (BRs) or automatic extraction of IF-THEN rules from input data, application of operators (AND, OR, NOT) in the rules, implication and aggregation within these rules, and defuzzification of the resulting values into crisp values. A fuzzy inference system is the actual process of mapping a given input to an output using fuzzy logic. Kumaravel et al. (2012) describe two types of fuzzy inference method, Mamdani and Sugeno, which apply different types of fuzzy reasoning and different forms of fuzzy IF-THEN rules. Takagi and Sugeno proposed the Sugeno fuzzy model as a systematic approach to generating fuzzy rules from given input-output data. These models are built with IF-THEN rules that have fuzzy antecedents and functional consequents, whereas the Mamdani fuzzy model is based on collections of IF-THEN rules with both fuzzy antecedents and fuzzy consequents. In this paper we used a Mamdani fuzzy expert system with the four most significant input variables, SO₂, NO₂, RSPM and SPM, to design a fuzzy inference system that estimates the AQI. The methodology for developing the Fuzzy Inference System (FIS) based Air Quality Index (AQI) model involves the following steps, which compute the output of the FIS from the inputs:

(a) Fuzzification of input and output variables.
(b) Choice of membership functions for input and output variables.
(c) Design of the application rule base.
(d) Defuzzification of IAQI.

Fuzzification is the process of transforming crisp values into grades of membership for linguistic terms of fuzzy sets. The membership function is used to associate a grade with each linguistic term (Upadhayay et al., 2011). The air quality index predicted for the two locations in Sonebhadra district, U.P. (India), as per the Indian air quality criteria discussed above, is given in Tables 2 and 3, which present the simulation results.
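The Mamdani steps listed above (fuzzification, rule evaluation, aggregation and defuzzification) can be sketched as follows. This is a toy example, not the model developed in this study: the membership breakpoints, the 0-500 output axis, the three output terms and the three-rule base are all hypothetical, and the same input terms are reused for all four pollutants purely for brevity.

```python
import numpy as np

# HYPOTHETICAL piecewise-linear membership functions (breakpoints, grades).
# np.interp holds the end grades constant outside the breakpoints, which
# gives the shoulder shapes for "low"/"high" and "good"/"severe".
IN_TERMS = {
    "low": ([0.0, 60.0], [1.0, 0.0]),
    "medium": ([40.0, 80.0, 120.0], [0.0, 1.0, 0.0]),
    "high": ([100.0, 160.0], [0.0, 1.0]),
}
OUT_TERMS = {
    "good": ([0.0, 200.0], [1.0, 0.0]),
    "poor": ([100.0, 250.0, 400.0], [0.0, 1.0, 0.0]),
    "severe": ([300.0, 500.0], [0.0, 1.0]),
}
AQI_AXIS = np.linspace(0.0, 500.0, 501)   # assumed output universe


def grade(x, term, table):
    """Grade of membership of x in the named linguistic term."""
    breakpoints, grades = table[term]
    return np.interp(x, breakpoints, grades)


def mamdani_aqi(so2, no2, rspm, spm):
    """Crisp index from four pollutant concentrations (illustrative rules)."""
    inputs = [so2, no2, rspm, spm]
    # (a)/(b) Fuzzification of each crisp input.
    low = [float(grade(v, "low", IN_TERMS)) for v in inputs]
    medium = [float(grade(v, "medium", IN_TERMS)) for v in inputs]
    high = [float(grade(v, "high", IN_TERMS)) for v in inputs]

    # (c) Hypothetical rule base: AND is min, OR is max.
    strengths = {
        "good": min(low),        # IF all pollutants are low THEN good
        "poor": max(medium),     # IF any pollutant is medium THEN poor
        "severe": max(high),     # IF any pollutant is high THEN severe
    }

    # Implication (clip each output set) and aggregation (pointwise max).
    aggregated = np.zeros_like(AQI_AXIS)
    for term, strength in strengths.items():
        clipped = np.minimum(strength, grade(AQI_AXIS, term, OUT_TERMS))
        aggregated = np.maximum(aggregated, clipped)

    # (d) Centroid defuzzification to a crisp value.
    if aggregated.sum() == 0:
        raise ValueError("no rule fired for these inputs")
    return float((AQI_AXIS * aggregated).sum() / aggregated.sum())


print(mamdani_aqi(10.0, 10.0, 10.0, 10.0))      # illustrative clean month
print(mamdani_aqi(30.0, 50.0, 150.0, 150.0))    # illustrative polluted month
```

Min and max are the usual Mamdani choices for AND/OR and for implication and aggregation; a real model would use separate, standards-based membership functions per pollutant and a much larger rule base.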

Table: Fuzzy logic based model for determination of Air Quality Index