Session programme

Session Programme

Week One

Day 1 – Introduction to Data Science

  • Overview.
  • Introduction to Data Science.
  • Applications of Data Science.
  • Types of Data.
  • The Five Steps of Data Science.
  • Descriptive, Predictive, and Prescriptive Analytics.
  • Big Data.
  • Installing Anaconda

Day 2 – Python Programming
  • Python for Data Science.
  • List comprehensions, Lambda expressions, etc.
  • Pandas, Numpy, Matplotlib, Seaborn, etc.
  • Data Handling.
  • Attendees will be assumed to have a background in programming.

Day 3  Statistics and Probability
  • Concepts of Statistics
  • Basis of Experimentation, normalization, and random sampling.
  • Hypothesis testing, confidence intervals, interpretation of p-values.
  • Introduction to Probability Theory.

Day 4 – Data Science Concepts
  • Correlation vs Causation.
  • Classification vs Regression.
  • Evaluation of Classifiers and Regressors.

Day 5 – Time Series, NLP, and Visualization
  • Time Series and Properties.
  • Natural Language Processing (NLP).
  • Visualization using Matplotlib, Seaborn, and Plotly.

Week Two

Day 6 – Machine Learning
  • What is Machine Learning?
  • Supervised vs Unsupervised Learning.
  • Classification vs Regression.
  • Bias and Variance.
  • Linear Regression.
  • Logistic Regression.
  • Neural Networks.

Day 7 – Clustering and Decision Trees
  • Agglomerative Clustering.
  • Divisive Clustering.
  • K-Means and DBSCAN.
  • EM Clustering.
  • Decision Trees and ID3.

Day 8 – Ensembles, Bagging, and Boosting
  • Ensemble Learning.
  • Bagging and Boosting.
  • Random Forests.
  • AdaBoost, XGBoost, LightGBM, and CATBoost.

Day 9 – Deep Learning
  • What is Deep Learning.
  • Building deep learning models with Keras.
  • Recurrent Neural Networks (including LSTMs).
  • Convolutional Neural Networks.

Day 10 – Miscellaneous Topics
  • Naïve Bayes and SVMs.
  • Data Privacy and Ethics.
  • Data Science on the Cloud.

https://www.um.edu.mt/event/datascience2022/programme