Data Mining - Master's Degree Program in Computer Science
Academic Year 2016/2017
Contents (preliminary) and more - Academic Year 2016/2017 (6 CFU)
Notices to the students:
Because teaching activities are suspended on January 9-10, the Data Mining lectures will resume on Thursday, January 12, at 14:30.
The second partial exam (esonero) is scheduled for January 18, 2017, at 9:00 (room Goedel). It will cover modules 5, 6 and 7.
Introduction to the course
Knowledge Discovery in Databases: the process and the CRISP-DM methodology
Rule-based classification (see also: Chapter 2 of the text by T. Mitchell, Machine Learning, McGraw-Hill, 1997; 1 for a survey on the separate-and-conquer approach to rule-based learning; a paper on multiple concept learning). Solved example of CE
Decision trees (see also: Chapter 3 of the text by T. Mitchell, Machine Learning, McGraw-Hill, 1997; 1, 2 for a survey on decision tree learning; 3, 4, 5 for the simplification of decision trees)
Bayesian framework for classification (see also: Chapter 6 of the text by T. Mitchell, Machine Learning, McGraw-Hill, 1997; Generative and Discriminative Classifiers: Naive Bayes and Logistic Regression; an article on hierarchical text categorization)
Parametric and nonparametric regression; Stepwise Model Tree Induction (see also: Sections 1.2, 2.1, 2.3, 4.6 of the text by A. Azzalini & B. Scarpa, Analisi dei dati e Data Mining, Springer, 2004; this paper for model trees)
Variable Associations (see also Chapter 6 of the text by A. Azzalini & B. Scarpa, Analisi dei dati e Data Mining. Springer, 2004; 1 for a perspective on the relation between association measures and association rules; 2 for a seminal paper on mining association rules)
Practice on WEKA
Presentation demonstrating all graphical user interfaces (GUI) in Weka.
Presentation which explains how to use Weka for exploratory data mining.
Introduction, Selection, Preprocessing, Transformation
Classification (J48, Naive Bayes Classifier and K-NN)
Instructions (exam procedure): here
First partial exam (esonero) of 13-12-2016: exam text, results
Second partial exam (esonero) of 18-01-2017: exam text, results
Possible case studies:
#1 Association Rule Mining from Spatial Data for Crime Analysis: link pwd: dm1617
#2 Extraction of Biomedical Entities related to miRNA from Biomedical Literature: link pwd: dm1617
#3 Analysis of the Evolution of Call Data Records Network: link pwd: dm1617
#4 Prediction of relationships between long non-coding RNAs and diseases:
Small Dataset -
Description (#1, #2, #3)
Previous year's courses, held by Prof. Donato Malerba