BA_Project_Report_Format (1)
BA_Project_Report_Format (1)
Sahil Poonatar
2. The Data
2.1 Where did we get the data?
In conducting this analysis, a comprehensive dataset was utilized, incorporating various socioeconomic
indicators obtained from reputable sources. The dataset comprises the following variables: Forest Area, GDP
per Capita, GNI per Capita, Inflation, Population, Human Development Index, Unemployment, Infant
Mortality, Income Classification, and Development Status. The data sources for the majority of these variables
were the World Bank (worldbank.org), the United Nations Development Programme (undp.org), and
UNICEF (unicef.org). The Income Classification dataset was web scraped using the BeautifulSoup Python
library, but the remaining datasets were downloaded directly from their respective sites. Subsequently, the
collected data underwent thorough data wrangling procedures facilitated by the Pandas library. Specifically,
the pd.melt and pd.merge functions were employed to consolidate and organize the data, enabling a coherent
and integrated dataset for analysis. Furthermore, extensive data cleaning procedures were implemented to
ensure the dataset's integrity and usability. This involved eliminating extraneous whitespace, formatting
inconsistencies, and handling missing values. By meticulously addressing these data quality concerns, the
resulting dataset was prepared for in-depth analysis and interpretation.
3.1 Dashboard 1 - Choropleth World Map for every factor along with each trend.
Figure 1—Dashboard 1: Choropleth with current forest cover and trend lines
depicting the change in forest cover versus the selected socio -economic factors
The first dashboard presents a visually engaging representation of current forest cover across different
countries through a choropleth map. This map offers an overview of the distribution and extent of forested
areas, providing valuable insights into the global forest landscape. Based on the selected filters of continent or
country, the choropleth map dynamically adjusts to display only the specific region of interest. This
interactive feature allows users to focus their analysis on a particular continent or country, enabling a more
targeted exploration of forest cover within a specific geographic context.
Below the map, a line graph illustrates the historical trend of forest cover from 1989 to 2017. This graph allows
users to observe the changes and fluctuations in forest cover over time. Additionally, the dashboard
incorporates a dynamic feature where users can select specific socioeconomic factors of interest using filters.
When selected, these factors are overlaid on the line graph, displaying their corresponding trend lines
alongside the forest cover trend. This interactive functionality enables users to explore potential correlations
or patterns between forest cover and selected socioeconomic factors, aiding in the identification of potential
relationships and informing further analysis. Overall, the dashboard offers a comprehensive visual analysis of
forest cover, its historical trends, and its potential associations with socioeconomic factors, promoting a
deeper understanding of the complex interplay between forests and socioeconomic development.
3.2 Dashboard 2 - Choropleth Map with Predicted forest cover in 2025
The second dashboard features a choropleth map displaying the estimated forest cover in 2025. This
prediction was generated using Holt Winter's Forecasting, a time series forecasting technique. The choropleth
map's appearance is dynamic and adjusts according to selected filters such as continent, country, status, and
income. This interactive functionality enables users to focus their analysis on specific regions or categories,
providing a more tailored view of the projected forest cover.
Adjacent to the choropleth map, on the right side of the dashboard, there is a table presenting relevant forest
cover data. This table includes the current forest cover percentage for each country, the predicted percentage
for 2025, and the corresponding change between these two periods. Like the choropleth map, this table also
adjusts based on the selected filters. Users can explore the forest cover statistics for specific continents,
countries, development status, and income groups, allowing for a more detailed examination of forest cover
dynamics within desired subsets.
The integration of the choropleth map and the corresponding table provides a comprehensive visual analysis
of both the projected forest cover and the associated statistical information. Users can easily compare current
and predicted forest cover percentages, assess the extent of change, and identify regions or categories
experiencing significant shifts in forest cover over time. This dashboard empowers users to gain insights into
future forest cover trends while considering various contextual factors, facilitating informed decision-making
and further exploration of the relationship between forest cover and the selected filters.