Ejercicio_3
Ejercicio_3
Introduction
Statistical cluster analysis can help you minimize the subjectivity in your maps by identifying meaningful clusters in your data. The Hot Spot Analysis
and Outlier Analysis tools use statistics to detect spatial patterns in your data, but each provides slightly different information about these patterns.
The Hot Spot Analysis tool uses the Getis-Ord Gi* statistic to identify statistically significant spatial clusters of high values (hot spots) and low values
(cold spots).
The Outlier Analysis tool uses the Anselin Local Moran's I statistic to identify statistically significant clusters of high and low values and to detect
spatial outliers, or features with values that are significantly dissimilar from their neighbors.
ArcGIS provides traditional and optimized statistical cluster analysis tools. The optimized tools interrogate your data to provide smart default values,
optimizing the analysis workflow. The traditional tools allow you more flexibility in defining the spatial relationships in your data, giving you more
control over your analysis. In this exercise, you will use the optimized statistical cluster analysis tools to explore spatial patterns in data.
Scenario
The Supplemental Nutrition Assistance Program (SNAP) is a federal program in the United States that helps families buy nutritional food to maintain
their health and well-being. In this exercise, you will complete a hot spot analysis and an outlier analysis to find meaningful patterns of high and low
SNAP participation. This information can help decision makers distribute resources more efficiently and equitably, ensuring that healthy food is
accessible to all SNAP recipients.
Note: The exercises in this course include View Result links. Click these links to confirm that your
results match what is expected.
In this step, you will open the ArcGIS Pro project that you downloaded.
Note: If you have configured ArcGIS Pro to start without a project template or with a default
project, you will not see the Start page. On the Project tab, click Open, and then click
Open Another Project.
d In the Open Project dialog box, browse to the PatternDetection_SpaceTime folder that you saved on your computer.
Your ArcGIS Pro project includes a map of the counties in the contiguous United States. Each county is symbolized by the rate of the
population that participated in SNAP during 2019.
The statistical cluster analysis tools analyze your data to detect patterns of high and low values. In this analysis, you will analyze the
distribution of a value that is named SNAPRate, which is the percentage of eligible people who participate in the SNAP program in each
county.
a In the Geoprocessing pane, under the search field, click the Toolboxes tab.
Note: If you closed the Geoprocessing pane, from the Analysis tab, in the Geoprocessing group,
click Tools.
The Optimized Hot Spot Analysis tool opens in the Geoprocessing pane.
The Optimized Hot Spot Analysis tool uses a fixed-distance band to define each feature's neighborhood. If you do not specify a distance, the
tool uses incremental spatial autocorrelation to determine whether there is a scale, or distance, at which clustering across the dataset is most
pronounced. If the Optimized Hot Spot Analysis tool cannot identify a distance using this method, it will compute the average distance that
would yield 30 neighbors for each feature. To learn more about the Incremental Spatial Autocorrelation tool, go to ArcGIS Pro
Help: Incremental Spatial Autocorrelation (Spatial Statistics).
The fixed-distance band method can be used to define feature neighborhoods. Other methods are available with the traditional Hot Spot
Analysis (Getis-Ord Gi*) tool. To learn more about this tool, go to ArcGIS Pro Help: Hot Spot Analysis (Getis-Ord Gi*) (Spatial Statistics).
f Click Run.
Step 3f***: Run a hot spot analysis.
The result of your analysis is a layer displaying hot spots in three shades of red and cold spots in three shades of blue. The varying shades
correspond to three confidence intervals, indicating how confident you can be that these patterns are not the result of random chance.
Next, you will review the analysis details to ensure that the parameters were appropriate for your analysis of SNAP participation.
The Optimized Hot Spot Analysis tool message window appears, listing the tool's geoprocessing steps in detail.
Tip: If you choose the default values for the Optimized Hot Spot Analysis tool, review the
geoprocessing details to identify the default parameter values. Ensure that these values
are appropriate for the scale of your analysis.
b Click Messages, and then review the geoprocessing details to answer the following question.
- Answer
151156 meters
The tool chose a default distance band of approximately 150 kilometers based on the average distance to 30 nearest neighbors. This default
value is a good place to start exploring your data, but it may not represent the scale at which you want to analyze patterns in your dataset. In
this example, a 150-kilometer distance band is too large because you want to analyze more local patterns in SNAP participation. You will
reduce the distance band to 75 kilometers to detect more local patterns in this county-level dataset.
f In the field to the right of 75, click the down arrow and choose Kilometers.
g Click Run.
Step 4g***: Review the analysis parameters.
Reducing the size of the distance band identified more detailed patterns. This scale is more appropriate for this particular analysis.
In this step, you will interpret the results of your statistical analysis.
? What statistically significant spatial patterns can you detect from this analysis?
- Answer
Generally, the southeastern areas of the contiguous United States have statistically
significantly high SNAP participation, and the north-central areas of the contiguous
United States have statistically significantly low SNAP participation.
The results of this statistical analysis provide a measure of confidence that can help you identify areas with clusters of high SNAP participation.
You can use this information to investigate these areas and their access to stores that accept SNAP and carry healthy foods.
Completing an outlier analysis will help you identify features that have values that are statistically significantly different from neighbors' values.
This analysis will provide additional insight into the spatial patterns of the data.
a In the top-left corner of the Geoprocessing pane, click the Back button , and then search for outlier.
Similar to the Optimized Hot Spot Analysis tool, the Optimized Outlier Analysis tool will interrogate your data to determine an appropriate
neighborhood distance for your analysis. The traditional outlier analysis tool, Cluster And Outlier Analysis (Anselin Local Moran's I), gives you
more control over the analysis parameters. To learn more about the Cluster And Outlier Analysis (Anselin Local Moran's I) tool, go to ArcGIS
Pro Help: Cluster and Outlier Analysis (Anselin Local Moran's I) (Spatial Statistics).
Tip: When you compare the results of a hot spot analysis and an outlier analysis, use the same
distance band in the analysis.
f Click Run.
Note: The permutations in the Optimized Outlier Analysis tool compare your data values to a set
of randomly generated values. Therefore, your results may vary slightly from the View
Result graphic.
The bright red and blue features represent spatial outliers. Features with high values surrounded by areas with low values are called High-Low
outliers and are displayed in red. Features with low values surrounded by areas with high values are called Low-High outliers and are
displayed in dark blue. The pink and light blue colors indicate clusters of features with statistically significantly high values (pink) and
statistically significantly low values (light blue). These clusters typically align with the hot spots and cold spots from the Optimized Hot Spot
Analysis tool.
You have completed two different methods of pattern detection—a hot spot analysis and an outlier analysis. You may be wondering which
method is more appropriate to use. The answer is often both. Each method answers a different question and is valuable in understanding the
patterns in your data. You can compare these analyses to gain additional insight into the spatial patterns of your data.
c From the Feature Layer tab, in the Compare group, click Swipe.
e Click the map, and then drag your pointer to the left, to the right, or up and down to compare the results from the Optimized Hot Spot
Analysis tool and the Optimized Outlier Analysis tool.
Step 7e***: Compare the results of analyses.
Using a hot spot analysis and an outlier analysis, you located statistically significant clusters of both high and low SNAP participation. This
information can help in the allocation of SNAP resources to areas of higher food insecurities and also help identify areas where eligible
residents may be underutilizing or not enrolling in the SNAP program. The results can help drive the decision to distribute resources more
efficiently and equitably.
f At the top of the map view, next to Pattern Detection, click the X to close the map.
Another way to run a hot spot analysis in ArcGIS Pro is by using R. The R-ArcGIS Bridge offers you the ability to tap directly into R from your
current ArcGIS Pro project. You can then use the R-ArcGIS Bridge to combine the power of R and ArcGIS to solve spatial problems and for
spatial data access and visualization. This stretch goal introduces you to the R-ArcGIS Bridge. You will use R to run a hot spot analysis and an
outlier analysis.
Note: To complete the stretch goal, you must install the following software:
b Use the Lesson Forum to post your questions and observations. Be sure to include the #stretch hashtag in the posting title.
c When you are finished, save the project and exit ArcGIS Pro.