CSE602 - Data Warehousing & Data Mining
Course Objectives:
To introduce the concepts of organizing a data warehouse and the data mining techniques used to derive useful information from
large volumes of data. With the rapid growth of data, it has become necessary to explore and mine data in order to uncover
hidden, useful information. This course exposes students to the process of extracting patterns and useful information from large
data sets by combining methods from data mining, statistics and artificial intelligence with database management. It also gives
students practice in data analysis using data mining tools. The course also covers some advanced topics in data mining, such as
opinion mining and web mining.
Pre-requisites: NIL
Course Contents/Syllabus:
Weightage (%)
Module I: Data Warehousing 20
• Data warehousing, characteristics and components of a data warehouse,
• ETL process,
• Data marts,
• Data warehouse logical design: star schemas, snowflake schemas, fact tables, dimensions, other schemas,
• Materialized views,
• Data warehouse physical design: hardware and I/O considerations,
• Parallelism, indexes.
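As an illustration of the star schema topic in Module I, the following is a minimal sketch, not part of the prescribed syllabus. It uses Python's standard sqlite3 module with an in-memory database; the table names (dim_product, dim_date, fact_sales), the column names and the sample rows are all hypothetical and chosen only for the example. One fact table references two dimension tables, and a typical warehouse query joins the fact table to a dimension and aggregates a measure.

```python
import sqlite3


def build_star_schema():
    """Create a tiny in-memory star schema (hypothetical tables and data)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE dim_product (
            product_id INTEGER PRIMARY KEY, name TEXT, category TEXT);
        CREATE TABLE dim_date (
            date_id INTEGER PRIMARY KEY, year INTEGER, month INTEGER);
        -- The fact table holds measures plus one foreign key per dimension.
        CREATE TABLE fact_sales (
            product_id INTEGER REFERENCES dim_product(product_id),
            date_id    INTEGER REFERENCES dim_date(date_id),
            units      INTEGER,
            revenue    REAL);
    """)
    conn.executemany(
        "INSERT INTO dim_product VALUES (?, ?, ?)",
        [(1, "Pen", "Stationery"), (2, "Notebook", "Stationery"),
         (3, "Lamp", "Furniture")])
    conn.executemany(
        "INSERT INTO dim_date VALUES (?, ?, ?)",
        [(1, 2001, 1), (2, 2001, 2)])
    conn.executemany(
        "INSERT INTO fact_sales VALUES (?, ?, ?, ?)",
        [(1, 1, 10, 20.0), (2, 1, 5, 25.0), (3, 2, 1, 40.0), (1, 2, 4, 8.0)])
    conn.commit()
    return conn


def revenue_by_category(conn):
    """Join the fact table to the product dimension and sum revenue."""
    rows = conn.execute("""
        SELECT p.category, SUM(f.revenue)
        FROM fact_sales AS f
        JOIN dim_product AS p ON p.product_id = f.product_id
        GROUP BY p.category
        ORDER BY p.category
    """).fetchall()
    return dict(rows)


if __name__ == "__main__":
    print(revenue_by_category(build_star_schema()))
```

A snowflake schema would differ only in normalizing a dimension further, for example by moving category into its own table referenced from dim_product.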
2. Demonstrate and analyze the results of the following data mining techniques using WEKA on the data sets provided with
WEKA:
a) Classification (e.g., BayesNet, KNN, C4.5 Decision Tree, Neural Networks, SVM),
b) Regression (e.g., Linear Regression, Isotonic Regression, SVM for Regression),
c) Clustering (e.g., Simple K-means, Expectation Maximization (EM)),
d) Association rules (e.g., Apriori Algorithm, Predictive Accuracy, Confirmation Guided),
e) Feature Selection (e.g., Cfs Subset Evaluation, Information Gain, Chi-squared Statistic), and
f) Visualization (e.g., View different two-dimensional plots of the data).
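To make item (d) concrete outside of WEKA, the following is a rough sketch of the level-wise frequent-itemset search that underlies the Apriori algorithm. It is plain Python, not WEKA's implementation, and it stops at frequent itemsets without generating rules. The function name frequent_itemsets, the use of an absolute support count for min_support, and the sample transactions are assumptions made for the example.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Return {itemset: count} for itemsets found in >= min_support transactions."""
    transactions = [frozenset(t) for t in transactions]
    # Level 1 candidates: every distinct single item.
    items = sorted({item for t in transactions for item in t})
    candidates = [frozenset([item]) for item in items]
    result = {}
    k = 1
    while candidates:
        # Count how many transactions contain each candidate.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        frequent = {c: n for c, n in counts.items() if n >= min_support}
        result.update(frequent)
        # Join step: merge frequent k-itemsets into (k+1)-item candidates.
        k += 1
        joined = {a | b for a in frequent for b in frequent if len(a | b) == k}
        # Prune step: keep a candidate only if all its (k-1)-subsets are frequent.
        candidates = [
            c for c in joined
            if all(frozenset(s) in frequent for s in combinations(c, k - 1))
        ]
    return result


if __name__ == "__main__":
    baskets = [{"bread", "milk"}, {"bread", "butter"},
               {"bread", "milk", "butter"}, {"milk"}]
    for itemset, count in frequent_itemsets(baskets, min_support=2).items():
        print(sorted(itemset), count)
```

The prune step is what distinguishes this from brute-force enumeration: a candidate is discarded without counting as soon as one of its subsets is known to be infrequent.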
4. Write a program to implement BFS and DFS with respect to 2-D modeling.
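One possible starting point for this exercise is sketched below. The syllabus does not define "2-D modeling", so the sketch assumes it means a 2-D grid of cells in which 0 marks an open cell and 1 marks a blocked cell, with moves allowed in the four axis directions; the function names are likewise hypothetical. BFS uses a queue and returns the shortest path length, while DFS uses an explicit stack and returns the set of reachable cells. Both assume the start cell is open.

```python
from collections import deque


def neighbors(grid, r, c):
    """Yield the open (value 0) cells directly above, below, left and right."""
    for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
        nr, nc = r + dr, c + dc
        if 0 <= nr < len(grid) and 0 <= nc < len(grid[0]) and grid[nr][nc] == 0:
            yield nr, nc


def bfs_shortest_path_length(grid, start, goal):
    """Breadth-first search: number of moves from start to goal, or None."""
    queue = deque([(start, 0)])
    seen = {start}
    while queue:
        (r, c), dist = queue.popleft()  # FIFO order explores nearest cells first
        if (r, c) == goal:
            return dist
        for nxt in neighbors(grid, r, c):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return None


def dfs_reachable(grid, start):
    """Depth-first search: set of all cells reachable from start."""
    stack = [start]
    seen = {start}
    while stack:
        r, c = stack.pop()  # LIFO order follows one branch as deep as possible
        for nxt in neighbors(grid, r, c):
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen


if __name__ == "__main__":
    grid = [[0, 0, 0],
            [1, 1, 0],
            [0, 0, 0]]
    print(bfs_shortest_path_length(grid, (0, 0), (2, 0)))
    print(sorted(dfs_reachable(grid, (0, 0))))
```

The only structural difference between the two functions is the container: a queue gives BFS its shortest-path guarantee on an unweighted grid, while a stack gives DFS its depth-first visiting order.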
60 40 100
Weightage (%) 5 10 8 7 70
Weightage (%) 5 15 10 10 60
Text & References:
Text:
1 “Mastering Data Mining: The Art and Science of Customer Relationship Management”, by Berry and Linoff, John Wiley and Sons,
2001.
2 “Data Warehousing: Concepts, Techniques, Products and Applications”, by C.S.R. Prabhu, Prentice Hall of India, 2001.
References:
1 “Data Mining: Concepts and Techniques”, by J. Han and M. Kamber, Academic Press, Morgan Kaufmann Publishers, 2001.
2 “Data Mining”, by Pieter Adriaans and Dolf Zantinge, Addison Wesley, 2000.
3 “Data Mining with Microsoft SQL Server”, by Seidman, Prentice Hall of India, 2001.