Master Theses Topics – 2025/26

MLG proposes the following MA thesis topics for this academic year. NB: Number of topics is limited. If interested please contact the supervisor asap.

Cloud-Native MLOps for Real-Time Digital Twins

Supervisors: Gianluca Bontempi, Gian Marco Paldino

Digital Twins (DTs) are virtual replicas of physical systems that are becoming essential in Industry 4.0 for monitoring, simulation, and optimization. Machine Learning Operations (MLOps) bridges the gap between laboratory ML models and robust, scalable, and maintainable models required for live DT environments. To achieve the scalability and real-time responsiveness required by modern DTs, MLOps is best implemented on a cloud platform.

This thesis will address the practical challenges of operationalizing ML models for Digital Twins using a cloud-native approach on e.g. Amazon Web Services (AWS). The student will explore, design, and implement a complete, automated MLOps pipeline on AWS for a DT in either the renewable energy or traffic simulation domain. The core of this thesis is not just to build a model, but to build the industrial-grade, cloud-native infrastructure around it, covering data ingestion (e.g. Kinesis, S3), CI/CD (e.g. CodePipeline), model training and deployment (e.g. Amazon SageMaker), and monitoring (e.g. CloudWatch).

The ideal candidate will be passionate about bridging the gap between academic research and real-world application. Required skills include expert proficiency in Python, a strong foundation in Machine Learning, and a keen interest in software engineering, automation, and cloud technologies (experience with Docker or AWS is a major plus). Registration in the MA computational intelligence module is required.

Industry-Ready Tools & Resources:

Frugal Machine Learning for Sustainable Renewable Energy Systems

Supervisors: Gianluca Bontempi, Gian Marco Paldino

The increasing complexity of machine learning (ML) models has led to a significant rise in their computational and energy demands. This “Red AI” trend poses a challenge to the sustainable development of artificial intelligence. In response, the field of “Frugal Machine Learning” or “Green AI” has emerged, focusing on the creation of ML models that are not only accurate but also efficient in terms of computational resources, energy consumption, and carbon footprint. This is particularly relevant in the renewable energy sector, where ML is crucial for tasks like forecasting energy production and demand to ensure grid stability.

This thesis aims to explore the intersection of frugal machine learning and renewable energy. The core objective is to investigate and apply state-of-the-art techniques for measuring and reducing the carbon footprint of ML models used in the context of renewable energy forecasting. The student will conduct a comprehensive review of frugal ML methodologies and will practically apply and benchmark tools designed to estimate the CO2 emissions of computation, such as the Python package `CodeCarbon` and the `Green Algorithms calculator`. A key innovative aspect of this thesis will be to design and implement a scheduler that intelligently times the training of ML models to coincide with periods of peak renewable energy production, thereby minimizing the reliance on fossil fuel-based energy sources.

The student should have a strong background in Machine Learning and Python programming, an interest in interdisciplinary research, particularly in the application of AI to sustainability and energy systems, and be proactive and capable of working independently. Registration in the MA module on computational intelligence is preferred.

References:

Curriculum Learning in the Laser Learning Environment

Supervisors: Tom Lenaerts, Yannick Molinghen

The Laser Learning Environment (LLE) is a cooperative Multi-Agent environment that has shown to be challenging due to its unique properties. At MLG, we’ve shown that agents were unable to accurately estimate the value of key states due to State Space Bottlenecks (SSBs) and identified this as the cause of poor performance in state-of-the-art mixing networks.

The objective of this thesis is to investigate Curriculum Learning (CL) methods and assess their ability to learn better policies. A particular focus will be on Unsupervised Environment Design (UED), where a meta-agent generates increasingly difficult yet feasible tasks. You’ll develop ways for a meta-agent to incrementally design an LLE environment and perform experiments to draw conclusions on CL regarding generalization capabilities of agents in LLE.

References:

  • “Laser Learning Environment: a new cooperative environment for coordination-critical tasks”
  • Laser Learning Environment Github repository: https://github.com/yamoling/lle
  • “Unrewarded subgoals, a persisting problem in cooperative multi-agent Markov decision processes”
  • “Value-Decomposition Networks For Cooperative Multi-Agent Learning”
  • “QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning”
  • “QPLEX: Duplex Dueling Multi-Agent Q-learning”
  • “Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design”
Explainable multi-agent reinforcement learning

Supervisors: Tom Lenaerts, Yannick Molinghen

For years, we’ve observed what looks like collaborative behavior in cooperative multi-agent reinforcement learning, but the reasons behind these behaviors remain unanswered. Previous work on single-agent RL distills policies in decision trees to provide clearer explanations on agent intentions. Other methods learn structural causal models of the environment or decompose reward signals into multiple signals corresponding to different game events.

This thesis aims to determine the best methods to analyze agent intentions when they exhibit collaborative behavior in multi-agent reinforcement learning. You’ll first evaluate these methods on single-agent scenarios before advancing to cooperative multi-agent ones. The suggested environment is the Laser Learning Environment (LLE).

References:

  • “Distilling Deep Reinforcement Learning Policies in Soft Decision Trees”
  • “Explainable Reinforcement Learning Through a Causal Lens”
  • “Explainable Reinforcement Learning: A Survey”
  • “Principle Component Analysis”
  • “Visualizing data using t-SNE”
  • “Explainable Reinforcement Learning via Reward Decomposition”
Exploration methods for model-based multi-agent reinforcement learning

Supervisors: Yannick Molinghen, Tom Lenaerts

Reinforcement Learning comes in two flavors: model-based and model-free. Some algorithms learn a model of the environment to make predictions without requiring actual steps, which might be costly. Meanwhile, some single-agent exploration methods based on intrinsic curiosity build internal world models and check their accuracy to compute intrinsic rewards.

This thesis investigates how model-based multi-agent reinforcement learning can leverage internal environment models to improve exploration, comparing that to model-free MARL algorithms. The suggested environment is the Laser Learning Environment (LLE).

References:

  • “Mastering Atari with Discrete World Models”
  • “A Possibility for Implementing Curiosity and Boredom in Model-Building Neural Controllers”
  • “Curiosity-driven Exploration by Self-supervised Prediction”
  • “Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration”
Fast variable selection without shrinkage

Supervisor: Maarten Jansen

Selecting optimal models from broad non-nested model spectrums can be driven by criteria balancing good training set prediction and model complexity. Optimization over variable numbers is combinatorially complex and not feasible for high-dimensional data. This problem can be approximated by replacing the counting measure with a sum of magnitude estimators, changing a combinatorial problem into a convex, quadratic programming one.

This thesis applies variable selection in sparse inverse problems, or in deblurring and denoising images, using gradient projection or iterative thresholding.

Machine Learning for Causal Discovery

Supervisors: Gianluca Bontempi and Gianmarco Paldino

This thesis focuses on designing and implementing machine learning methods for probability distribution classification to discover causal directionality from data.

The student should be an expert in R and Python programming, registered in the MA module on computational intelligence, proficient in Machine Learning, and passionate about interdisciplinary applied research.

References:

  • https://link.springer.com/article/10.1007/s10115-021-01621-0
  • CauseMe platform
Methods for omics data clustering

Supervisor: Matthieu Defrance

Clustering analysis is routinely performed on omics data to explore or discover underlying cell identities. The high dimensionality and significant sparsity of these data (with false zero count observations) make clustering computationally challenging. This project studies state-of-the-art techniques for omics data clustering, emphasizing neural network approaches for initial data embedding.

Reference: https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-021-04210-8

Contact: Matthieu Defrance (matthieu.defrance@ulb.be)

Methods for classification of rare diseases using omics data

Supervisor: Matthieu Defrance

High-throughput sequencing and genome-wide analyses have profoundly impacted genetic diagnosis of rare diseases. Beyond classical genetic variants calling, new methods based on epigenetic or transcriptomic alterations have emerged. This project develops and evaluates supervised classification methods for rare diseases classification.

Reference: Erfan Aref-Eshghi et al. Evaluation of DNA Methylation Episignatures for Diagnosis and Phenotype Correlations in 42 Mendelian Neurodevelopmental Disorders.

Contact: Matthieu Defrance (matthieu.defrance@ulb.be)

Trade-offs in decision-making under uncertainty

Supervisors: Tom Lenaerts, Axel Abels

Real-world decision-making requires balancing multiple, possibly conflicting objectives. Concerns like interpretability, fairness, and execution speed often conflict with primary performance metrics. This project evaluates algorithms for decision-making under uncertainty (multi-armed bandits) regarding these secondary objectives, potentially extending into procedural fairness and interpretability in contextual bandits.

References:

  • Patil, Vishakha, et al. “Achieving fairness in the stochastic multi-armed bandit problem.”
  • Turgay, Eralp, Doruk Oner, and Cem Tekin. “Multi-objective contextual bandit problem with similarity information.”
  • Lattimore, Tor, and Csaba Szepesvári. Bandit algorithms.
Learning correlated equilibria

Supervisor: Tom Lenaerts

This thesis examines how learning and evolution may find correlated equilibria, an extension of Nash equilibria in games. You’ll analyze the literature, formulate the state-of-the-art, and implement and test suggested approaches on learning problems to evaluate their usefulness.

References:

  • Aumann, R.J. (1987). Correlated equilibrium as an expression of Bayesian rationality.
  • Milgrom, P., and Roberts, J. (1991). Adaptive and sophisticated learning in normal form games.
  • Foster, D.P., and Vohra, R.V. (1997). Calibrated learning and correlated equilibrium.
  • Hart, S., and Mas‐Colell, A. (2000). A simple adaptive procedure leading to correlated equilibrium.
  • Cripps, M. (1991). Correlated equilibria and evolutionary stability.
  • Metzger, L.P. (2018). Evolution and correlated equilibrium.
  • Arifovic, J., Boitnott, J.F., and Duffy, J. (2019). Learning correlated equilibria: An evolutionary approach.
Knowledge graphs and drug repurposing

Supervisors: Tom Lenaerts, Inas Bosch and Nassim Versbraegen

This thesis explores drug-disease association potential using knowledge graphs (KG) and KG embeddings. You’ll identify relevant contributions in this field, focus on one or two approaches to confirm published results, and examine their applicability for rare diseases and multi-mutant disease contexts.

References:

  • Himmelstein, D. S., et al. (2017). Systematic integration of biomedical knowledge prioritizes drugs for repurposing.
  • Roessler, H. I., et al. (2021). Drug repurposing for rare diseases.
  • Bang, D., et al. (2023). Biomedical knowledge graph learning for drug repurposing.
  • Johnson, R., et al. (2024). Graph Artificial Intelligence in Medicine.
  • Perdomo-Quinteiro, P., & Belmonte-Hernández, A. (2024). Knowledge Graphs for drug repurposing.
  • Wang, Q., et al. (2017). Knowledge graph embedding: A survey of approaches and applications.
Identification of epistasis using machine learning

Supervisors: Tom Lenaerts and Nassim Versbraegen

This thesis examines state-of-the-art machine learning methods for discovering and analyzing epistatic interactions between variants and genes. Epistasis occurs when different genetic loci contribute to a phenotype non-additively.

You’ll first document available methods for researching epistatic effects, creating a taxonomy based on technology or data usage. Then, you’ll select methods and datasets to implement and analyze, comparing their effectiveness in examining epistatic interactions.

References:

  • Cordell, H. J. (2009). Detecting gene–gene interactions that underlie human diseases.
  • Niel, C., et al. (2015). A survey about methods dedicated to epistasis detection.
  • Chicco, D., & Faultless, T. (2021). Brief survey on machine learning in epistasis.
  • Russ, D. (2023). Efficient strategies for epistasis detection in genome-wide data.
  • Chang, Y. C., et al. (2020). GenEpi: gene-based epistasis discovery using machine learning.
  • Abd El Hamid, M. M., et al. (2021). Machine learning for detecting epistasis interactions.
Evaluation of Alphafold structures for oligogenic diseases

Supervisors: Tom Lenaerts and Nassim Versbraegen

This thesis investigates structural protein knowledge relevance for the OLIDA oligogenic diseases database. With Alphafold, we can now predict structures for disease cases involving multiple genes, potentially improving disease understanding and treatment.

You’ll collect protein sequence and structural knowledge for OLIDA instances, considering Alphafold’s confidence in generated structures for mutated regions. A comparison with monogenetic variants will be made, followed by developing a prototype for next-generation predictive methods.

References:

  • Abramson, J., et al. (2024). Accurate structure prediction of biomolecular interactions with AlphaFold 3.
  • Desai, D., et al. (2024). Review of AlphaFold 3: transformative advances in drug design and therapeutics.
  • Lee, C. Y., et al. (2024). Systematic discovery of protein interaction interfaces using AlphaFold.
  • Sebastiano, M. R., et al. (2022). AI-based protein structure databases for rare diseases research.
  • Visibelli, A., et al. (2024). Molecular Origins of the Mendelian Rare Diseases.
  • Scafuri, B., et al. (2022). Computational methods for pharmacological chaperones for rare diseases.
  • Schmidt, A., et al. (2023). Predicting missense variant pathogenicity using AlphaFold2 features.
Variant pathogenicity prediction with gene and protein language models

Supervisors: Tom Lenaerts and Nassim Versbraegen

With language model success and clear associations between natural language and protein/genetic language, methods are emerging to improve variant pathogenicity prediction. This thesis investigates the state-of-the-art of such methods and explores their potential to enhance our team’s approaches.

References:

  • Lin, W., et al. (2024). Enhancing missense variant pathogenicity prediction with protein language models.
  • Brandes, N., et al. (2023). Genome-wide prediction of disease variant effects with a deep protein language model.
  • Molotkov, I., et al. (2024). Making sense of missense: challenges in variant pathogenicity prediction.
  • Fan, X., et al. (2023). SHINE: protein language model-based pathogenicity prediction.
  • Zhan, H., & Zhang, Z. (2024). DYNA: Disease-Specific Language Model for Variant Pathogenicity.
  • Sayeed, M. A., et al. (2024). Gene Pathogenicity Prediction using Genomic Foundation Models.