Course catalogue doctoral education - VT24

    Startpage
  • Ansökan kan ske mellan 2023-10-16 och 2023-11-15
Application closed
Print
Title Multivariate Prediction Models, Machine Learning and AI with Applications in Precision Medicine
Course number 5694
Programme Epidemiologi
Language English
Credits 1.5
Date 2023-11-13 -- 2023-11-17
Responsible KI department Institutionen för medicinsk epidemiologi och biostatistik
Specific entry requirements Epidemiology I, Introduction to epidemiology; Epidemiology II, Design of epidemiological studies; Biostatistics I, Introduction for epidemiologists; Biostatistics II, Logistic regression for epidemiologists; and Biostatistics III: Survival analysis for epidemiologists, or equivalent courses
Purpose of the course This course aims to provide an introduction to both supervised and unsupervised methodologies for prediction modelling with a focus on biomedical applications, molecular epidemiology and personalised medicine. The main objective of the course is to provide basic theory and to facilitate for the course participants to acquire practical knowledge that will enable to apply covered methodologies in their own research.
Intended learning outcomes After successfully completing this course you as a student are expected to be able to:
- Perform and assess basic quality control and outlier detection
- Apply unsupervised and supervised statistical learning methods to detect patterns in data
- Devise cross-validation strategies for parameter estimation, model selection and prediction performance evaluation
- Make informed judgement of how to apply basic principles for variable selection
- Critically evaluate prediction models and artificial intelligence in real-world applications
- Conceptually design and devise applications of machine learning and deep learning in real-world applications
Contents of the course Personalised medicine is a cornerstone of tomorrow’s health care, and is based on the idea of stratifying patients into groups based on e.g. disease risk, prognosis or probability of treatment response and administrate the most suitable therapy for each individual. The capabilities to generate vast amounts of quantitative clinical, imaging, and molecular data from DNA- and RNA-sequencing and other molecular profiling methods are providing unprecedented opportunity for implementation of personalized precision medicine approaches in the health care system. Molecular profiling typically generates data with tens of thousands of variables of which only a subset is relevant for treatment decisions. Similarly, imaging data from e.g. radiology and digital pathology provides information rich data to inform patient management. The promise of personalised medicine relies on our ability to turn the vast molecular datasets into clinically actionable predictive models of individualised diagnostics, prognostication, and therapy response. Development and application of statistical learning methods, prediction modelling, artificial intelligence, and deep learning are central components in developing these models, and in developing the biomarker panels that can be used for subtyping, risk stratification and prediction of treatment response. This course provides an introduction to statistical learning methods, prediction models, and deep learning that are relevant for personalised medicine with a focus on real-world applications.

The course covers basic theory and introduction to modern statistical and machine learning methods for prediction modelling and deep learning in high-dimensional data, together with applied data analysis through computer-based exercises. Lectures and exercises will cover the full process going from the initial data set and through data normalisation, quality control, outlier detection, application of unsupervised learning methods, application of supervised learning methods, variable selection, cross-validation, model evaluation, and recently developed methods (e.g. deep learning, conformal prediction).
Teaching and learning activities The course is based on a combination of lectures, which cover methods and theory, together with computer-based exercises in R, where real-world data are analysed and interpreted. Previous experience from practical experience applying statistical models in a computer-based environment (e.g R, SAS, Stata, Matlab, Python) is strongly recommended.
Compulsory elements The individually written examination.
Examination The individual examination will be performed as a take-home examination. It consists of an individually written lab-report where results from an applied data analysis mini-project should be summarised and critically evaluated. Students who do not obtain a passing grade in the first examination will be offered a second examination within two months of the final day of the course.
Literature and other teaching material Suggested course literature:
Elements of Statistical Learning, Hastie, Tibshirani and Friedman (2009). Springer-Verlag,
Freely available at https://statweb.stanford.edu/~tibs/ElemStatLearn/
Number of students 8 - 25
Selection of students Eligible doctoral students will be prioritized according to 1) the relevance of the course syllabus for the applicant’s doctoral project (according to written information), 2) date for registration as a doctoral student (priority given to earlier registration date). To be considered, submit a completed application form. Give all information requested, including a short description of current research training and motivation for attending, as well as an account of previous courses taken.
More information It is recommended to have taken an introductory course in R or to have equivalent experience prior to taking this course.
Additional course leader Additional course leader: Martin Eklund, Department of Medical Epidemiology and Biostatistics, Karolinska Institutet.
Latest course evaluation Course evaluation report
Course responsible Mattias Rantalainen
Institutionen för medicinsk epidemiologi och biostatistik

mattias.rantalainen@ki.se
Contact person Gunilla Nilsson Roos
Institutionen för medicinsk epidemiologi och biostatistik
08-524 822 93
gunilla.nilsson.roos@ki.se