0% found this document useful (0 votes)
14 views

BA_Project_Report_Format (1)

Ba project Report

Uploaded by

kjagtap798
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views

BA_Project_Report_Format (1)

Ba project Report

Uploaded by

kjagtap798
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Business Analytics Project Report

Sahil Poonatar

1. Link to the Dashboards


https://public.tableau.com/app/profile/subh6914/viz/GreenAnalytics_16375909366540/Dashboard1

2. The Data
2.1 Where did we get the data?
In conducting this analysis, a comprehensive dataset was utilized, incorporating various socioeconomic
indicators obtained from reputable sources. The dataset comprises the following variables: Forest Area, GDP
per Capita, GNI per Capita, Inflation, Population, Human Development Index, Unemployment, Infant
Mortality, Income Classification, and Development Status. The data sources for the majority of these variables
were the World Bank (worldbank.org), the United Nations Development Programme (undp.org), and
UNICEF (unicef.org). The Income Classification dataset was web scraped using the BeautifulSoup Python
library, but the remaining datasets were downloaded directly from their respective sites. Subsequently, the
collected data underwent thorough data wrangling procedures facilitated by the Pandas library. Specifically,
the pd.melt and pd.merge functions were employed to consolidate and organize the data, enabling a coherent
and integrated dataset for analysis. Furthermore, extensive data cleaning procedures were implemented to
ensure the dataset's integrity and usability. This involved eliminating extraneous whitespace, formatting
inconsistencies, and handling missing values. By meticulously addressing these data quality concerns, the
resulting dataset was prepared for in-depth analysis and interpretation.

2.2 What does the data consist of?


The final dataset encompasses socioeconomic data for the years 1980 to 2017, capturing a comprehensive and
extensive timeframe. It includes information on multiple indicators, covering various aspects of a country's
development and well-being. The dataset consists of key variables such as Forest Area, GDP per Capita, GNI
per Capita, Inflation, Population, Human Development Index, Unemployment, Infant Mortality, Income
Classification, and Development Status for all countries. With this extensive temporal and geographic
coverage, the dataset provides a rich and holistic representation of socioeconomic conditions across the world.

2.3 What makes this data so special?


This dataset holds a special place due to its unique characteristics, making it a valuable resource for
understanding global socioeconomic dynamics. Its standout feature lies in its comprehensive coverage, as it
incorporates a wide array of socioeconomic factors such as Forest Area, GDP per Capita, GNI per Capita,
Inflation, Population, Human Development Index, Unemployment, Infant Mortality, Income Classification,
and Development Status. By including these indicators, it offers a holistic perspective on multiple dimensions
of development, allowing for a more nuanced analysis. Another remarkable aspect of this dataset is its
longitudinal time frame, spanning from 1989 to 2017. This extensive temporal coverage enables researchers to
explore long-term trends, identify shifts and patterns, and evaluate the impact of policies and interventions
over time. Such insights are crucial for understanding the dynamics of socioeconomic development and
formulating effective strategies for the future. The global scope of the dataset is another distinguishing factor.
With data available for all countries worldwide, it facilitates cross-country comparisons and global analyses.
This broad geographical coverage enables researchers to identify disparities, similarities, and unique
characteristics across different regions and countries, contributing to a comprehensive understanding of the
global socioeconomic landscape. Moreover, the dataset's reliability is ensured by sourcing the data from
reputable organizations such as the World Bank, the United Nations Development Programme (UNDP), and
UNICEF. These organizations adhere to stringent data collection methodologies, ensuring the accuracy,
consistency, and credibility of the dataset. In summary, the dataset's comprehensive coverage, longitudinal
time frame, global scope, and reliability make it an exceptional resource for researchers and analysts seeking
to explore and understand the complexities of socioeconomic development on a global scale.
3. What did we do?
The visualization subsystem consists of a couple interactive Tableau dashboards.

3.1 Dashboard 1 - Choropleth World Map for every factor along with each trend.

Figure 1—Dashboard 1: Choropleth with current forest cover and trend lines
depicting the change in forest cover versus the selected socio -economic factors

The first dashboard presents a visually engaging representation of current forest cover across different
countries through a choropleth map. This map offers an overview of the distribution and extent of forested
areas, providing valuable insights into the global forest landscape. Based on the selected filters of continent or
country, the choropleth map dynamically adjusts to display only the specific region of interest. This
interactive feature allows users to focus their analysis on a particular continent or country, enabling a more
targeted exploration of forest cover within a specific geographic context.

Below the map, a line graph illustrates the historical trend of forest cover from 1989 to 2017. This graph allows
users to observe the changes and fluctuations in forest cover over time. Additionally, the dashboard
incorporates a dynamic feature where users can select specific socioeconomic factors of interest using filters.
When selected, these factors are overlaid on the line graph, displaying their corresponding trend lines
alongside the forest cover trend. This interactive functionality enables users to explore potential correlations
or patterns between forest cover and selected socioeconomic factors, aiding in the identification of potential
relationships and informing further analysis. Overall, the dashboard offers a comprehensive visual analysis of
forest cover, its historical trends, and its potential associations with socioeconomic factors, promoting a
deeper understanding of the complex interplay between forests and socioeconomic development.
3.2 Dashboard 2 - Choropleth Map with Predicted forest cover in 2025

Figure 2—Dashboard 2: Choropleth with Predicted forest cover in 2025.

The second dashboard features a choropleth map displaying the estimated forest cover in 2025. This
prediction was generated using Holt Winter's Forecasting, a time series forecasting technique. The choropleth
map's appearance is dynamic and adjusts according to selected filters such as continent, country, status, and
income. This interactive functionality enables users to focus their analysis on specific regions or categories,
providing a more tailored view of the projected forest cover.

Adjacent to the choropleth map, on the right side of the dashboard, there is a table presenting relevant forest
cover data. This table includes the current forest cover percentage for each country, the predicted percentage
for 2025, and the corresponding change between these two periods. Like the choropleth map, this table also
adjusts based on the selected filters. Users can explore the forest cover statistics for specific continents,
countries, development status, and income groups, allowing for a more detailed examination of forest cover
dynamics within desired subsets.

The integration of the choropleth map and the corresponding table provides a comprehensive visual analysis
of both the projected forest cover and the associated statistical information. Users can easily compare current
and predicted forest cover percentages, assess the extent of change, and identify regions or categories
experiencing significant shifts in forest cover over time. This dashboard empowers users to gain insights into
future forest cover trends while considering various contextual factors, facilitating informed decision-making
and further exploration of the relationship between forest cover and the selected filters.

3. What are the insights we gleaned?


While these visualizations alone cannot provide definitive conclusions, they offer valuable insights that
prompt further exploration. The described visualizations provide several potential insights. First, they reveal
geographical patterns in current and projected forest cover, allowing identification of deforestation hotspots,
successful reforestation efforts, and areas with high or low forest cover. Second, the line graph of historical
forest cover trends offers insights into temporal changes, enabling the assessment of the effectiveness of
policies and external factors affecting forests. Third, overlaying trend lines of selected socioeconomic factors
on the forest cover graph facilitates the exploration of correlations and associations, generating hypotheses for
deeper analysis. Fourth, the estimated forest cover for 2025 provides a glimpse into potential future scenarios,
aiding in policy planning and conservation efforts. Lastly, the ability to adjust the visualizations based on
filters allows for targeted analyses, such as comparing forest cover trends across continents or evaluating
conservation efforts based on development status. It is crucial to conduct further investigations and employ
additional analytical methods to validate these insights and gain a comprehensive understanding of the
intricate relationships between forests and socioeconomic factors.

4. What is unique about this project?


Analysis of forest cover is a complex topic, and the causes of forest cover degradation are widely debated.
Green Analytics is a unique, creative and interactive platform to analyze, visualize and identify the
socio-economic factors that are most relevant to the changes of Forest cover rate. It is a user-friendly visual
tool that determines correlations against Forest cover changes and socio-economic factors, over the past 30
years. Most current studies are based on a single social or economic factor. Green Analytics brings multiple
factors together under one umbrella. Users can perform meaningful analysis of the net effect of these factors
on forest cover. Analysis can be done based on time, geography and income classification.

5. Who can benefit from using these dashboards?


This can be utilized by non-profit organizations, government agencies and independent researchers to drive
policy decisions.

You might also like