Secção Nome Descrição
Ficha da unidade UC info
Hiperligação Self-assessment #1
Ficheiro Introduction to Data Science (in Portuguese)
Assignment Ficheiro DATASET
Ficheiro Guideline for diagnostic (in Portuguese)
Ficheiro Table with BMI information for Portugal (in Portuguese)
Ficheiro Explanation about the PPA variable (in Portuguese)
Ficheiro Tables of blood pressure for children (in English)
Hiperligação Paper #1 with results on a similar dataset (169 subjects)
Hiperligação Paper #2 with statistical results on the same dataset (7199 subjects)
Ficheiro Data Analysis: some good practices
Homeworks Hiperligação To read #1: Apples-to-apples: the pitfals of cross-validation
Hiperligação To read #2: The relationship between ROC and Precision-Recall curves
Hiperligação To read #3: refutation of the second paper
Python Books Hiperligação Python Data Science Handbook
Hiperligação Suggestions from python.org
Hiperligação O'Reilly Python books
Hiperligação Python resources
Theoretical Classes Ficheiro Presentation
Ficheiro Introduction to Data Mining
Ficheiro Data understanding and manipulation
Ficheiro Distances and dimensionality reduction (till slide 41, inclusive)
Hiperligação Recorded class
Ficheiro Distances and dimensionality reduction (cont. from slide 42)
Ficheiro Distances and dimensionality reduction (cont. from slide 55)
Ficheiro Data imputation
Ficheiro Data visualization
Ficheiro Basic Concepts in Classification
Ficheiro Basic Concepts in Classification: Decision Trees (from slide 15)
Ficheiro Naive Bayes classifier
Ficheiro Naive Bayes classifier (from slide 9) and Belief Networks
Ficheiro Evaluating the Performance of a Classifier
Ficheiro Evaluation Metrics
Ficheiro Evaluating the Performance of a Classifier (from slide 10)
Ficheiro Evaluation Metrics (revisited)
Ficheiro Regression and KNN
Ficheiro Python code associated with the regression and KNN slides
Ficheiro Melbourne data associated with regression and KNN slides
Ficheiro Support Vector Machines (SVM)
Ficheiro A little more detail on SVMs

Section 2.6.1.4 of this dissertation has a detailed and nice explanation about SVMs.

Ficheiro Artificial Neural Networls
Ficheiro Clustering
Ficheiro Ensembles
Ficheiro Basic Association Analysis
Practical Classes Ficheiro Entropy revisited
Ficheiro Distances revisited
Ficheiro species.csv
Hiperligação Code for decision trees (iris, with pruning)
Hiperligação Code for naive Bayes (German credit dataset)
Hiperligação Decision boundaries
Hiperligação Performance Evaluation of Classifiers
Hiperligação Regression and Logistic Regression
Hiperligação SVM exercises
Hiperligação Hierarchical Clustering

Here it is some Python code applying hierarchical clustering to the iris dataset.

Explore the various options of clustering, including k-means, k-means++ and dbscan. Identify differences between these different clustering methods.

Apply these methods and evaluate the quality of the generated clusters using your favorite dataset.

Pasta de ficheiros PPT_to_PDF