Problemstatement 1
Problemstatement 1
Insignia Corporation are doing business of selling gifts online and they
have recently started their business.
Their Product,sales along with customer and employees and the geography at
which they have sales is sent to a Data processing team in a Table.
One day their CEO decided to hire a CDO to improve operational
efficiencies in the organization and modernize the capabilities within the
organization.
First recommendation which was given by CDO was to form a Team which can
load this csv data into a datawarehouse and create some tables out of the
single table.
Then CDO asked the Team lead to create a Data model out of the Table.
Dataset Details:
Note that you are not required to load the data in this step. You are just
required to create the data model.
Also note that you might not have the permission to create certain key
constraints such as foreign key constraints.
In that case, simply maintain the referential integrity without creating
the foreign key.
You, being the part of the BI /Data engineeering team has to identify all
the dimensions and create the dimensions with tracking columns by using
lineages .
The lineage table must have a load_id and this lineage load_id/lineage_id
would be used to identify each data load and number of rows in the source
and destination.
Lineage_Id bigint
Source_System Varchar(100)
Load_Stat_Datetime datetime
Load_EndDatetime datetime
Rows_at_Source int
Rows_at_destination_Fact int
Load_Status bit
The lineage_id columns must be used in all the tables of dimension and
fact to identify what data is loaded by the ETL,from which source, how
many rows were affected , at what time etl stated and ended and lastly
whether the etl succeeded or failed.
A date dimension table must also be implmented with the following fields:
DateKey
Date
Day_Number(Number of the month)
Month_Name
Short_Month(Short Month Name in three characters)
Calendar_Month_Number
Calendar_Year
Fiscal_Month_Number
Fiscal_Year
Week_Number
Datekey must be in integer format and must be used to map the single fact
table .
fiscal year of the company starts with July and completes in the month of
June so fiscal date fields must have right mapping to create the financial
reports .
Note: Dates in the Date dimension must be loaded from 2000 to 2023
Calendar Years.
You can name the Stored procedures and Tables as per your own accordance
but do keep the naming convention consistent and use the best practices
wherever required.
The Employee Dimension along with Customer Dimension must have SCD Type 2
implemented.
Late arriving dimension concept for all the dimensions must be implmented.
guidelines---sql server:
*/