STAT W4240x Data Mining 3 pts. Prerequisites: COMS W1003, W1004, W1005, W1007, or the equivalent. Corequisites: Either STAT W3105 or W4105, and either STAT W3107 or W4107.
Data Mining is a dynamic and fast-growing field at
the interface of Statistics and Computer Science. The emergence of massive
datasets containing millions or even billions of observations provides the
primary impetus for the field. Such datasets arise, for instance, in
large-scale retailing, telecommunications, and astronomy, and they pose computational and
statistical challenges. This course will provide an overview of current
research in data mining and will be suitable for graduate students from many
disciplines. Specific topics covered will include databases and data
warehousing, exploratory data analysis and visualization, descriptive
modeling, predictive modeling, pattern and rule discovery, text mining,
Bayesian data mining, and causal inference.