Framing an Analytics Problem - Case 2
Framing an Analytics Problem - Case 2
com
AQFVB5DZMK
Framing an Analytics
Problem
cost.
• The intent is to identify the avenues where there
is a scope of reducing the cost of the project.
[email protected]
AQFVB5DZMK
Approximately, $7.3 billion have been invested in the Solar Electric project since
the beginning of the incentive program.
[email protected]
AQFVB5DZMK
[email protected]
AQFVB5DZMK
Even though Staten Island has the most number of projects, the mean project cost is highest
in the city of Schenectady. This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
What is the current status of the projects? How many
projects are completed and up & running from each sector?
[email protected]
AQFVB5DZMK
Approximately 95% projects have been completed. Majority of the completed projects are from
Residential sector. This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
More the number of photovoltaic modules installed, more would be the DC current
generating capacity subsequently more would be the Expected annual electricity
production in kilowatt-hours (kWh). Use an appropriate plot to verify this claim.
[email protected]
AQFVB5DZMK
Plot 1
[email protected]
AQFVB5DZMK
Using a statistical test (ANOVA), this can further be verified by formulating a hypothesis as follows:
H0 (Null Hypothesis): correlation for plot 1 = correlation for plot 2 = correlation for plot 3
Ha (Alternative Hypothesis): correlation for plot 1 != correlation for plot 2 != correlation for plot 3
[email protected]
AQFVB5DZMK
Looking at the plot, we can see there seems to be a positive correlation between project cost and
Expected annual electricity production. So, quite clearly it looks like higher costing project might produce
more electricity. But, its also important to keep
Thisin
filemind
is meantthat “Correlation
for personal does not imply Causation”.
use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Amongst the completed projects, which project contractors seems
to have exceeded the deadline of 5 years to complete the project
since the date of application?
[email protected]
AQFVB5DZMK
[email protected]
AQFVB5DZMK
Contractor named "Solar Liberty Energy Systems, Inc." has taken the highest time(more than
6 years) to install the project and get it up & running. It takes just about 1 year to implement
similar projects. Whereas this project took about 6 years to complete.
This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
How much money was invested in similar projects on an
average?
[email protected]
AQFVB5DZMK
On an average, approximately 233 thousand dollars were invested in similar projects, but
about 320 thousand dollars were invested in this specific project which is more than the
average.
More details regarding the reasons for the delay in completion of the project as well as the
higher cost of the project should be discussed with the contractor.
[email protected]
AQFVB5DZMK
As we can see from the plots, the project cost is higher for "Commercial/
Industrial(MegaWatt Block)" program type which is quite obvious as it is expected to
generate the highest amount of annual electricity production.
This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Do incentives depend on the cost of the project?
[email protected]
AQFVB5DZMK
There seems to be a positive correlation between the project cost and the incentives. But
this might not actually be the only criteria for deciding the amount of incentive. Remember
the phrase, "Correlation does not imply causation". Its highly likely that there can be other
factors involved to decide the incentive amount.
This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Every city can have multiple electric utilities. How can we visualize
the contribution of different electric utilities to the cost of the
project for the top 10 cities we previously identified?
[email protected]
AQFVB5DZMK
From the above stacked bar plot, we can clearly see that most used electric utility is “Consolidated
Edison”. “Orange and Rockland Utilities” is the only electric utility available in Middletown. “National
Grid” is the only utility available in Schenectady and
This file Albany
is meant and ause
for personal majorly used utility in Buffalo.
by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
• This data can further be refined and used for prediction
of the project cost as well as the incentives.
• This dataset has a vast scope of drilling down to a specific
case/situation and analyzing the data revolving around a
certain situation, as we did in case of identifying the
project contractor who did not meet the 5 year
completion deadline.
[email protected]
AQFVB5DZMK
END!