The document outlines the assignment details for B.Tech VI Semester CSE(AI) students on Data Warehousing and Mining, with a submission deadline of February 28, 2025. It includes various tasks such as listing applications of data warehouses, discussing OLAP operations, and drawing schema diagrams for specific data warehouses. Additionally, it requires students to perform OLAP operations and write SQL queries based on given scenarios involving data warehouses for a hospital and an educational institution.
The document outlines the assignment details for B.Tech VI Semester CSE(AI) students on Data Warehousing and Mining, with a submission deadline of February 28, 2025. It includes various tasks such as listing applications of data warehouses, discussing OLAP operations, and drawing schema diagrams for specific data warehouses. Additionally, it requires students to perform OLAP operations and write SQL queries based on given scenarios involving data warehouses for a hospital and an educational institution.
2. What does the subject-oriented data warehouse signify? 3. What are the advantages of star schema? Explain. 4. What are the various star schema keys? Explain with the help of an example. 5. Discuss about the OLAP operations on multidimensional data with an example. 6. Which one is faster Multidimensional OLAP or Relational OLAP. Justify. 7. List and explain the activities of ETL process. 8. Explain in detail about the architecture and implementation of data warehouse. 9. Suppose that a data warehouse for Ramkrishna care hospital consists of three dimensions time, doctor, and patient, and the two measures count and charge, where charge is the fee that a doctor charges a patient for a visit. a. Draw a schema diagram for the above data warehouse using all of the schemas: [star, snowflake, fact constellation] b. Starting with the base cuboid [day, doctor, patient], what specific OLAP operations should be performed to list the total fee collected by each doctor in 2004? c. To obtain the same list, write an SQL query assuming the data are stored in a relational database with the schema fee (day, month, year, doctor, hospital, patient, count, charge) 10.Suppose that a data warehouse for SSIPMT consists of the following four dimensions: student, course, semester, and instructor, and two measures count and avg_grade. When at the lowest conceptual level (e.g., for a given student, course, semester, and instructor combination), the avg_grade measure stores the actual course grade of the student. At higher conceptual levels, avg_grade stores the average grade for the given combination. a. Draw a snowflake schema diagram for the data warehouse b. Starting with the base cuboid [student, course, semester, instructor], what specific OLAP operations (e.g., roll-up from semester to year) should one perform to list the average grade of CSE (AI)& AIML courses for each SSIPMT student.