Data Mining - Corso di Laurea Magistrale in Informatica


Academic Year 2016/2017
(first semester)

Lecturer: Michelangelo Ceci


Contents (preliminary) and more - Academic Year 2016/2017 (6CFU) 

Notices to the students:

Causa interruzione delle attivita' didattiche per i giorni 9-10 gennaio, le lezioni di Data Mining riprenderanno il giorno giovedi' 12 gennaio alle ore 14:30.
La data del secondo esonero e' fissata per il giorno 18 gennaio 2016 alle ore 9:00 (aula Goedel). Vertera' sui moduli 5, 6 e 7.

Lecture Notes:

Introduction to the course

Knowledge Discovery in Databases: the process and the CRISP-DM methodology

Rule-based classification (see also Chapter 2 of the text by T. Mitchell, Machine Learning, Morgan Kaufmann, 1997, 1 A survey on the separate-and-conquer approach to rule-based learning, a paper on Multiple concept learning) Solved Example of CE

Decision trees (see also: Chapter 3 of the text by T. Mitchell, Machine Learning, Morgan Kaufmann, 1997; 1,2 for a survey on decision tree learning; 3, 4, 5 for the simplification of decision trees)

Bayesian framework for classification (see also: Chapter 6 of the text by T. Mitchell, Machine Learning, Morgan Kaufmann, 1997; Generative and Discriminative Classifiers: Naive Bayes and Logistic Regression , Article on hierarchical text categorization )

Parametric and non parametric regression; Stepwise Model Tree Induction (see also Sections 1.2, 2.1, 2.3, 4.6 of the text by A. Azzalini & B. Scarpa, Analisi dei dati e Data Mining. Springer, 2004; this paper for model trees)

Variable Associations (see also Chapter 6 of the text by A. Azzalini & B. Scarpa, Analisi dei dati e Data Mining. Springer, 2004; 1 for a perspective on the relation between association measures and association rules; 2 for a seminal paper on mining association rules)


Laboratory:

Practice on WEKA Presentation demonstrating all graphical user interfaces (GUI) in Weka.
Presentation which explains how to use Weka for exploratory data mining.
Introduction, Selection, Preprocessing, Transformation
Classification (J48, Naive Bayes Classifier and K-NN)
Regression
Association Analysis





Exams:

Instructions (Modalita' di svolgimento dell'esame): here
Prima prova di esonero del 13-12-2016: traccia, risultati
Seconda prova di esonero del 18-01-2017: traccia, risultati

Possible case studies :
#1 Association Rule Mining from Spatial Data for Crime Analysis: link pwd: dm1617
#2 Extraction of Biomedical Entities related to miRNA from Biomedical Literature: link pwd: dm1617
#3 Analysis of the Evolution of Call Data Records Network: link pwd: dm1617
#4 Prediction of relationships between long non-coding RNAs and diseases : Small Dataset - Big Dataset
Description (#1, #2, #3)
Description (#4)









Links:

Previous year's courses, held by Prof. Donato Malerba



Top of this page