Apprentissage automatique et Fouille de données - Ecole Superieure en Informatique 08-MAI-1945 SIDI BEL ABBES

Département

Second cycle

Année d étude

2éme Année IASD

Semestre

Crédit

Coefficient

Enseignants du module

DIF Nassima

Annuaire enseignants

Pré requis :

Analyse 1,2,3,4
Algèbre 1,2,3
Probabilité 1,2
Recherche opérationnelle

OBJECTIFS :

Machine learning refers to a broad set of algorithms and related concerns for discovering patterns

in data, making new inferences based on data, and generally improving the performance of a

software system without direct programming. These methods are critical for data science. Data

scientists should understand the algorithms they apply, be able to implement them, if necessary,

and make principled decisions about their use.

Scope:

Broad categories of machine learning approaches (e.g.,supervised and unsupervised)
Algorithms and tools (i.e.,implementations of those algorithms) in each of the broad learning categories.
Problems related to model expressivity as well as availability of data, and techniques
Express formally the representational power of models learned by an algorithm, and relate that to issues such as expressiveness and overfitting.
Exhibit knowledge of methods to mitigate the effects of overfitting and curse of dimensionality in the context of machine learning algorithms.
Provide an appropriate performance metric for evaluating machine learning algorithms/tools for a given problem.
Differences in interpretability of learned models.
Solve the problem of overfitting, and unbalanced datasets.

CONTENU DU MODULE :

Introduction to ML

What is Machine Learning
ML Block diagram
Examples of Machine Learning applications

Regression

Linear Regression
Multiple Linear Regression

Assessing performances

Training/Test Error, ect
Positive and Negative Class
Overfitting and regularization
Cross-validation

Classification

Logistic Regression
Naïve Bayes Classifier
K-Nearest Neighbors
Support Vector Machines
Decision Trees

Clustering

K-Means Clustering
Hierarchical clustering
DBSCAN & HAC Algorithm

Feature Reduction/Dimensionality reduction

Principal Component Analysis
Kernel Principal Component Analysis
Non-Negative Matrix Factorization
Singular Value Decomposition

Ensembles methods

Bagging & boosting and its impact on bias and variance
boosting
Random forest
Gradient Boosting Machines and XGBoost

Consultez les ressources disponibles concernant ce module sur le moteur de recherche de la bibliothèque, ou accédez directement au cours de vos enseignants via la plateforme de téléenseignement de l’école « e-learn ».

e-Bibliothéque e-Learning