STAT W3026x Applied Data Mining 3 pts. Data Mining
is a dynamic and fast growing field at the interface of Statistics and
Computer Science. The emergence of massive datasets containing millions or
even billions of observations provides the primary impetus for the field.
Such datasets arise, for instance, in large-scale retailing,
telecommunications, astronomy, computational and statistical challenges. This
course will provide an overview of current practice in data mining. Specific
topics covered with include databases and data warehousing, exploratory data
analysis and visualization, descriptive modeling, predictive modeling,
pattern and rule discovery, text mining, Bayesian data mining, and causal
inference. The use of statistical software will be emphasized.