Federated Machine Learning
Vast amount of data is collected and stored every day. Nevertheless, when looking at the most pressing problems of
our time, how come that complex machine models, for instance for cancer treatment, are only trained on a few 1000
samples? The core problem of this issue is that the data is available but must not be shared due to legal
restrictions and bureaucratic hurdles, in particular when we are talking about sensitive data.
In this project, you will help to elevate this problem by utilizing an approach called federated machine learning.
Here, the data remains at the safe site of storage and the machine learning comes to the data. A small, local,
model is trained on each site, and then combined into a global model while only transmitting anonymous model
parameters. With that, we will be able to train high-quality models while not infringing on data security and
The project is part of a large consortium project FeatureCloud. You will implement federated solutions for
specific problems which seamlessly integrate into the FeatureCloud platform. Potential projects are:
- Federated Clustering Algorithms
- Federated Similarity Calculation
- Federated Cluster Evaluation
- Federated Network Enrichment
Impact of fibre, red and processed meat on risk of chronic inflammatory diseases: a prospective UK Biobank cohort study on prognostic factors and personalised medicine in the UK Biobank
UKBiobank is an incredible large datasoruce and has collected data from 500.000 British individuals enabling the
investigation of gene-lifestyle interactions in relation to development of diseases. Data is already available and
requires advanced data science skill to cope with the sheer amount of available data.
The aim of this project to programm statistical models to investigate gene-diet interactions in an efficient way.
You will learn to perform observational and advanced computer analyses such as interaction studies, case-control
and case-only studies. Further, you will learn how to do network and pathway analyses.
You will be supervised by Richard Röttger and Vibeke Andersen, professor, Research Leader, Research Unit for
Molecular Diagnostics and Clinical Research, University Hospital of Southern Denmark.
Practical Information. The study is designed as a Master Thesis project and can also be done in a team:
Two students working on their own project, but collaborating on methods, interpretation, writing manuscript,
etc., are preferred. You will be responsible for writing a manuscript under supervision. It is not crucial to
have in depth knowledge of Medicine or Biology. Could form a basis for a PhD study, if wanted.