Programme

Part 1 - Introduction to Python for Data Science

Days 1-3- Introduction to Python Programming for Data Science

  • Installing and Working with Anaconda.
  • Variables, Lists, Arrays, Dictionaries, etc.
  • Control and Looping Structures.
  • Methods.
  • Reading and Saving Data.
  • Data handling.


Part 2 - Data Science with Python

Day 4 – Data Visualisation and Statistics in Python

  • Concepts of Statistics.
  • Basis of Experimentation, normalisation, and random sampling.
  • Hypothesis testing, confidence intervals, interpretation of p-values.
  • Data Visualisation.

Day 5 – Introduction to Data Science

  • Introduction to Data Science.
  • Applications of Data Science.
  • Types of Data.
  • The Five Steps of Data Science.
  • Descriptive, Predictive, and Prescriptive Analytics.
  • Linear Regression as a First Model.
  • Mean Squared Error (MSE) as a Validation Metric.

Day 6 – Model Training and Validation

  • Classification as a supervised Learning Example.
  • Logistic Regression.
  • Formalisation of Training and Model Validation Approaches. 

Day 7 – Clustering and Model Selection

  • Clustering and Data Annotation.
  • Data Quality Verification - Inter-rater and Intra-Rater Comparisons.
  • Measures for Model Selection (t-tests, ANOVA, Kohen Kappa, etc.)

Day 8 – Classification Methods

  • Further Methods for Classification of Tabular Data.
  • Decision Trees and Random Forests as Ensemble Models.

Day 9 - Modelling of Temporal Data

  • Continuous and Discrete State-Space Temporal Data.
  • Continuous State-Space Data: AR, MA, ARMA and ARIMA Models.
  • Discrete State-Space Data: Markov Chains Models.

Day 10 – Neural Network and Deep Learning Models

  • Feedforward Neural Networks and the Back Propagation Algorithm.
  • Various Deep Neural Network such as CNNs and RNNs with Examples.

 


https://www.um.edu.mt/events/datascience2024/programme/