Journals and Conferences Publications

796 entries « ‹ 3 of 16 › »

2023

Tubella, Andrea Aler; Mollo, Dimitri Coelho; Lindström, Adam Dahlgren; Devinney, Hannah; Dignum, Virginia; Ericson, Petter; Jonsson, Ana; Kampik, Timotheus; Lenaerts, Tom; Mendez, Julian Alfredo; Nieves, Juan Carlos

ACROCPoLis: A Descriptive Framework for Making Sense of Fairness Proceedings Article

In: Proceedings of the 6th ACM Conference on Fairness, Accountability, and Transparency, FAccT 2023, pp. 1014-1025, Association for Computing Machinery, 2023, (Conference: 6th ACM Conference on Fairness, Accountability, and Transparency(6: 12/6/2023-15/06/2023: Chicago)).

Abstract | Links | BibTeX

Nachtegael, Charlotte; Stefani, Jacopo De; Lenaerts, Tom

ALAMBIC: Active Learning Automation with Methods to Battle Inefficient Curation Proceedings Article

In: Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, pp. 117–127, Association for Computational Linguistics, 2023, (Conference: European Chapter of the Association for Computational Linguistics(17: 2 May 2023 to 4 May 2023: Dubrovnik, Croatia)).

Abstract | Links | BibTeX

Abels, Axel; Lenaerts, Tom; Trianni, Vito; Nowe, Ann

Expertise Trees Resolve Knowledge Limitations in Collective Decision-Making Proceedings Article

In: Proceedings of the 40th International Conference on Machine Learning: ICML’23, pp. 79-90, PMLR, 2023, (Conference: 40th International Conference on Machine Learning(Honolulu Hawaii USA)).

Abstract | Links | BibTeX

Claeskens, G.; Jansen, Maarten

Comments on: Statistical inference and large-scale multiple testing for high-dimensional regression models Journal Article

In: Test, vol. 32, no. 4, pp. 1177-1179, 2023, (DOI: 10.1007/s11749-023-00896-5).

Claeskens, G.; Jansen, Maarten; Zhou, Jing

Discussion on: “A scale-free approach for false discovery rate control in generalized linear models” by Dai, Lin, Zing, Liu. Journal Article

In: Journal of the American Statistical Association, vol. 118, no. 543, pp. 1573-1577, 2023, (Language of publication: fr).

Bhattacharya, Shreya; Lefèvre, Laure; Hayakawa, Hisashi; Jansen, Maarten; Clette, Frédéric L.

Scale Transfer in 1849: Heinrich Schwabe to Rudolf Wolf Journal Article

In: Solar physics, vol. 298, no. 1, pp. 1-12, 2023, (Language of publication: fr).

2022

Piron, Anthony; Szymczak, Florian; Alvelos, Maria De Oliveira; Defrance, Matthieu; Lenaerts, Tom; Eizirik, Decio L.; Cnop, Miriam

RedRibbon: A new rank-rank hypergeometric overlap pipeline to compare gene and transcript expression signatures Journal Article

In: BioRxiv, 2022, (DOI: https://doi.org/10.1101/2022.08.31.505818).

Abstract | Links | BibTeX

@article{info:hdl:2013/353212d,

title = {RedRibbon: A new rank-rank hypergeometric overlap pipeline to compare gene and transcript expression signatures},

author = {Anthony Piron and Florian Szymczak and Maria De Oliveira Alvelos and Matthieu Defrance and Tom Lenaerts and Decio L. Eizirik and Miriam Cnop},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/353212/3/2022.08.31.505818v1.full.pdf},

year  = {2022},

date = {2022-01-01},

journal = {BioRxiv},

abstract = {Motivation. High throughput omics technologies have generated a wealth of large protein, gene and transcript datasets that have exacerbated the need for new methods to analyse and compare big datasets. Rank-rank hypergeometric overlap is an important threshold-free method to combine and visualize two ranked lists of P-values or fold-changes, usually from differential gene expression analyses. Here, we introduce a new rank-rank hypergeometric overlap-based method aimed at both gene level and alternative splicing analyses at transcript or exon level, hitherto unreachable as transcript numbers are an order of magnitude larger than gene numbers.Results. We tested the tool on synthetic and real datasets at gene and transcript levels to detect correlation and anti-correlation patterns and found it to be fast and accurate, even on very large datasets thanks to an evolutionary algorithm based minimal P-value search. The tool comes with a ready-to-use permutation scheme allowing the computation of adjusted P-values at low time cost. Additionally, the package is a drop-in replacement to previous packages as a compatibility mode is included, allowing to re-run older studies with close to no change to existing pipelines. RedRibbon holds the promise to accurately extricate detailed information from large analyses.Availability. RNA-sequencing datasets are available through the Gene Expression Omnibus (GEO) portal with accession numbers GSE159984, GSE133218, GSE137136, GSE98485, GSE148058 and GSE108413. The C libraries and R package code are open to the community with a permissive licence (GPL3) and available for download from GitHub https://github.com/antpiron/ale, https://github.com/antpiron/cRedRibbon and https://github.com/antpiron/RedRibbon.},

note = {DOI: https://doi.org/10.1101/2022.08.31.505818},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Grolaux, Robin; Hardy, Alexis; Olsen, Catharina; Dooren, Sonia Van; Smits, Guillaume; Defrance, Matthieu

Identification of differentially methylated regions in rare diseases from a single-patient perspective Journal Article

In: Clinical Epigenetics, vol. 14, no. 1, 2022, (DOI: 10.1186/s13148-022-01403-7).

Abstract | Links | BibTeX

@article{info:hdl:2013/353081,

title = {Identification of differentially methylated regions in rare diseases from a single-patient perspective},

author = {Robin Grolaux and Alexis Hardy and Catharina Olsen and Sonia Van Dooren and Guillaume Smits and Matthieu Defrance},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/353081/1/doi_336725.pdf},

year  = {2022},

date = {2022-01-01},

journal = {Clinical Epigenetics},

volume = {14},

number = {1},

abstract = {Abstract Background DNA methylation (5-mC) is being widely recognized as an alternative in the detection of sequence variants in the diagnosis of some rare neurodevelopmental and imprinting disorders. Identification of alterations in DNA methylation plays an important role in the diagnosis and understanding of the etiology of those disorders. Canonical pipelines for the detection of differentially methylated regions (DMRs) usually rely on inter-group (e.g., case versus control) comparisons. However, these tools might perform suboptimally in the context of rare diseases and multilocus imprinting disturbances due to small cohort sizes and inter-patient heterogeneity. Therefore, there is a need to provide a simple but statistically robust pipeline for scientists and clinicians to perform differential methylation analyses at the single patient level as well as to evaluate how parameter fine-tuning may affect differentially methylated region detection. Result We implemented an improved statistical method to detect differentially methylated regions in correlated datasets based on the Z-score and empirical Brown aggregation methods from a single-patient perspective. To accurately assess the predictive power of our method, we generated semi-simulated data using a public control population of 521 samples and investigated how the size of the control population, methylation difference, and region size affect DMR detection. In addition, we validated the detection of methylation events in patients suffering from rare multi-locus imprinting disturbance and evaluated how this method could complement existing tools in the context of clinical diagnosis. Conclusion In this study, we present a robust statistical method to perform differential methylation analysis at the single patient level and describe its optimal parameters to increase DMRs identification performance. Finally, we show its diagnostic utility when applied to rare disorders.},

note = {DOI: 10.1186/s13148-022-01403-7},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Bizet, Martin; Defrance, Matthieu; Calonne, Emilie; Bontempi, Gianluca; Sotiriou, Christos; Fuks, Franccois; Jeschke, Jana

Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods. Journal Article

In: Epigenetics, vol. 17, no. 13, pp. 2434-2454, 2022, (DOI: 10.1080/15592294.2022.2135201).

Abstract | Links | BibTeX

@article{info:hdl:2013/353467b,

title = {Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods.},

author = {Martin Bizet and Matthieu Defrance and Emilie Calonne and Gianluca Bontempi and Christos Sotiriou and Franccois Fuks and Jana Jeschke},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/353467/5/KEPI_17_2135201.pdf},

year  = {2022},

date = {2022-01-01},

journal = {Epigenetics},

volume = {17},

number = {13},

pages = {2434-2454},

abstract = {Illumina Infinium DNA Methylation (5mC) arrays are a popular technology for low-cost, high-throughput, genome-scale measurement of 5mC distribution, especially in cancer and other complex diseases. After the success of its HumanMethylation450 array (450k), Illumina released the MethylationEPIC array (850k) featuring increased coverage of enhancers. Despite the widespread use of 850k, analysis of the corresponding data remains suboptimal: it still relies mostly on Illumina's default annotation, which underestimates enhancerss and long noncoding RNAs. Results: We have thus developed an approach, based on the ENCODE and LNCipedia databases, which greatly improves upon Illumina's default annotation of enhancers and long noncoding transcripts. We compared the re-annotated 850k with both 450k and reduced-representation bisulphite sequencing (RRBS), another high-throughput 5mC profiling technology. We found 850k to cover at least three times as many enhancers and long noncoding RNAs as either 450k or RRBS. We further investigated the reproducibility of the three technologies, applying various normalization methods to the 850k data. Most of these methods reduced variability to a level below that of RRBS data. We then used 850k with our new annotation and normalization to profile 5mC changes in breast cancer biopsies. 850k highlighted aberrant enhancer methylation as the predominant feature, in agreement with previous reports. Our study provides an updated processing approach for 850k data, based on refined probe annotation and normalization, allowing for improved analysis of methylation at enhancers and long noncoding RNA genes. Our findings will help to further advance understanding of the DNA methylome in health and disease.},

note = {DOI: 10.1080/15592294.2022.2135201},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Rivière, Quentin; Corso, Massimiliano; Ciortan, Madalina; Noël, Grégoire; Verbruggen, Nathalie; Defrance, Matthieu

Exploiting Genomic Features to Improve the Prediction of Transcription Factor-Binding Sites in Plants. Journal Article

In: Plant and Cell Physiology, vol. 63, no. 10, pp. 1457-1473, 2022, (DOI: 10.1093/pcp/pcac095).

Abstract | Links | BibTeX

@article{info:hdl:2013/352290,

title = {Exploiting Genomic Features to Improve the Prediction of Transcription Factor-Binding Sites in Plants.},

author = {Quentin Rivière and Massimiliano Corso and Madalina Ciortan and Grégoire Noël and Nathalie Verbruggen and Matthieu Defrance},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/352290/3/Riviere_et_al.pdf},

year  = {2022},

date = {2022-01-01},

journal = {Plant and Cell Physiology},

volume = {63},

number = {10},

pages = {1457-1473},

abstract = {The identification of transcription factor (TF) target genes is central in biology. A popular approach is based on the location by pattern matching of potential cis-regulatory elements (CREs). During the last few years, tools integrating next-generation sequencing data have been developed to improve the performance of pattern matching. However, such tools have not yet been comprehensively evaluated in plants. Hence, we developed a new streamlined method aiming at predicting CREs and target genes of plant TFs in specific organs or conditions. Our approach implements a supervised machine learning strategy, which allows decision rule models to be learnt using TF ChIP-chip/seq experimental data. Different layers of genomic features were integrated in predictive models: the position on the gene, the DNA sequence conservation, the chromatin state and various CRE footprints. Among the tested features, the chromatin features were crucial for improving the accuracy of the method. Furthermore, we evaluated the transferability of predictive models across TFs, organs and species. Finally, we validated our method by correctly inferring the target genes of key TFs controlling metabolite biosynthesis at the organ level in Arabidopsis. We developed a tool-Wimtrap-to reproduce our approach in plant species and conditions/organs for which ChIP-chip/seq data are available. Wimtrap is a user-friendly R package that supports an R Shiny web interface and is provided with pre-built models that can be used to quickly get predictions of CREs and TF gene targets in different organs or conditions in Arabidopsis thaliana, Solanum lycopersicum, Oryza sativa and Zea mays.},

note = {DOI: 10.1093/pcp/pcac095},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Ciortan, Madalina; Defrance, Matthieu

GNN-based embedding for clustering scRNA-seq data Journal Article

In: Bioinformatics, vol. 38, no. 4, pp. 1037-1044, 2022, (DOI: 10.1093/bioinformatics/btab787).

Abstract | Links | BibTeX

@article{info:hdl:2013/343811b,

title = {GNN-based embedding for clustering scRNA-seq data},

author = {Madalina Ciortan and Matthieu Defrance},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/343811/3/btab787.pdf},

year  = {2022},

date = {2022-01-01},

journal = {Bioinformatics},

volume = {38},

number = {4},

pages = {1037-1044},

abstract = {Abstract Motivation Single-cell RNA sequencing (scRNA-seq) provides transcriptomic profiling for individual cells, allowing researchers to study the heterogeneity of tissues, recognize rare cell identities and discover new cellular subtypes. Clustering analysis is usually used to predict cell class assignments and infer cell identities. However, the high sparsity of scRNA-seq data, accentuated by dropout events generates challenges that have motivated the development of numerous dedicated clustering methods. Nevertheless, there is still no consensus on the best performing method. Results graph-sc is a new method leveraging a graph autoencoder network to create embeddings for scRNA-seq cell data. While this work analyzes the performance of clustering the embeddings with various clustering algorithms, other downstream tasks can also be performed. A broad experimental study has been performed on both simulated and scRNA-seq datasets. The results indicate that although there is no consistently best method across all the analyzed datasets, graph-sc compares favorably to competing techniques across all types of datasets. Furthermore, the proposed method is stable across consecutive runs, robust to input down-sampling, generally insensitive to changes in the network architecture or training parameters and more computationally efficient than other competing methods based on neural networks. Modeling the data as a graph provides increased flexibility to define custom features characterizing the genes, the cells and their interactions. Moreover, external data (e.g. gene network) can easily be integrated into the graph and used seamlessly under the same optimization task. Availability and implementation https://github.com/ciortanmadalina/graph-sc. Supplementary information Supplementary data are available at Bioinformatics online.},

note = {DOI: 10.1093/bioinformatics/btab787},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Renaux, Alexandre; Terwagne, Chloé CT; Cochez, Michael; Tiddi, Ilaria; Nowé, Ann; Lenaerts, Tom

A knowledge graph approach for interpretable prediction of pathogenic genetic interactions Miscellaneous

2022, (Conference: European Conference on Computational Biology (ECCB) 2022 (2022-07: Sitges, Spain)).

Abstract | Links | BibTeX

Abels, Axel; Lenaerts, Tom; Trianni, Vito; Nowé, Ann

A New Approach to Handle Non-Stationarity in Collective Decision-Making Miscellaneous

2022, (Conference: ACM Collective Intelligence conference (CI)(Virtual)).

Montero-Porras, Eladio; Gruji’c, Jelena; Domingos, Elias Fernandez; Lenaerts, Tom

Inferring Strategies from Observations in Long Iterated Prisoner’s Dilemma Experiments Miscellaneous

2022, (Conference: Complex Systems Conference 2022(17-21/10/2022: Palma de Mallorca, Spain)).

Versbraegen, Nassim; Gravel, Barbara; Nachtegael, Charlotte; Renaux, Alexandre; Verkinderen, Emma; Nowé, Ann; Lenaerts, Tom; Papadimitriou, Sofia

Taking the prediction of pathogenic variant-combinations to the next level with VarCoPP2.0 Miscellaneous

2022, (Conference: European Conference on Computational Biology (21: 12-21 September 2022: Sitges, Barcelona)).

Montero-Porras, Eladio; Grujić, Jelena; Domingos, Elias Fernandez; Lenaerts, Tom

Inferring Strategies from Observations in Long Iterated Prisoner’s Dilemma Experiments Miscellaneous

2022, (Conference: International Conference on Social Dilemmas(19-22/07/2022: Coppenhagen, Denmark)).

Grolaux, Robin; Hardy, Alexis; Olsen, Catharina; Dooren, Sonia Van; Smits, Guillaume; Defrance, Matthieu

Identification of differentially methylated regions in rare diseases from a single-patient perspective Journal Article

In: Clinical Epigenetics, vol. 14, no. 1, 2022, (DOI: 10.1186/s13148-022-01403-7).

Abstract | Links | BibTeX

@article{info:hdl:2013/353081b,

title = {Identification of differentially methylated regions in rare diseases from a single-patient perspective},

author = {Robin Grolaux and Alexis Hardy and Catharina Olsen and Sonia Van Dooren and Guillaume Smits and Matthieu Defrance},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/353081/1/doi_336725.pdf},

year  = {2022},

date = {2022-01-01},

journal = {Clinical Epigenetics},

volume = {14},

number = {1},

abstract = {Abstract Background DNA methylation (5-mC) is being widely recognized as an alternative in the detection of sequence variants in the diagnosis of some rare neurodevelopmental and imprinting disorders. Identification of alterations in DNA methylation plays an important role in the diagnosis and understanding of the etiology of those disorders. Canonical pipelines for the detection of differentially methylated regions (DMRs) usually rely on inter-group (e.g., case versus control) comparisons. However, these tools might perform suboptimally in the context of rare diseases and multilocus imprinting disturbances due to small cohort sizes and inter-patient heterogeneity. Therefore, there is a need to provide a simple but statistically robust pipeline for scientists and clinicians to perform differential methylation analyses at the single patient level as well as to evaluate how parameter fine-tuning may affect differentially methylated region detection. Result We implemented an improved statistical method to detect differentially methylated regions in correlated datasets based on the Z-score and empirical Brown aggregation methods from a single-patient perspective. To accurately assess the predictive power of our method, we generated semi-simulated data using a public control population of 521 samples and investigated how the size of the control population, methylation difference, and region size affect DMR detection. In addition, we validated the detection of methylation events in patients suffering from rare multi-locus imprinting disturbance and evaluated how this method could complement existing tools in the context of clinical diagnosis. Conclusion In this study, we present a robust statistical method to perform differential methylation analysis at the single patient level and describe its optimal parameters to increase DMRs identification performance. Finally, we show its diagnostic utility when applied to rare disorders.},

note = {DOI: 10.1186/s13148-022-01403-7},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Bizet, Martin; Defrance, Matthieu; Calonne, Emilie; Bontempi, Gianluca; Sotiriou, Christos; Fuks, Franccois; Jeschke, Jana

Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods. Journal Article

In: Epigenetics, vol. 17, no. 13, pp. 2434-2454, 2022, (DOI: 10.1080/15592294.2022.2135201).

Abstract | Links | BibTeX

@article{info:hdl:2013/353467d,

title = {Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods.},

author = {Martin Bizet and Matthieu Defrance and Emilie Calonne and Gianluca Bontempi and Christos Sotiriou and Franccois Fuks and Jana Jeschke},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/353467/5/KEPI_17_2135201.pdf},

year  = {2022},

date = {2022-01-01},

journal = {Epigenetics},

volume = {17},

number = {13},

pages = {2434-2454},

abstract = {Illumina Infinium DNA Methylation (5mC) arrays are a popular technology for low-cost, high-throughput, genome-scale measurement of 5mC distribution, especially in cancer and other complex diseases. After the success of its HumanMethylation450 array (450k), Illumina released the MethylationEPIC array (850k) featuring increased coverage of enhancers. Despite the widespread use of 850k, analysis of the corresponding data remains suboptimal: it still relies mostly on Illumina's default annotation, which underestimates enhancerss and long noncoding RNAs. Results: We have thus developed an approach, based on the ENCODE and LNCipedia databases, which greatly improves upon Illumina's default annotation of enhancers and long noncoding transcripts. We compared the re-annotated 850k with both 450k and reduced-representation bisulphite sequencing (RRBS), another high-throughput 5mC profiling technology. We found 850k to cover at least three times as many enhancers and long noncoding RNAs as either 450k or RRBS. We further investigated the reproducibility of the three technologies, applying various normalization methods to the 850k data. Most of these methods reduced variability to a level below that of RRBS data. We then used 850k with our new annotation and normalization to profile 5mC changes in breast cancer biopsies. 850k highlighted aberrant enhancer methylation as the predominant feature, in agreement with previous reports. Our study provides an updated processing approach for 850k data, based on refined probe annotation and normalization, allowing for improved analysis of methylation at enhancers and long noncoding RNA genes. Our findings will help to further advance understanding of the DNA methylome in health and disease.},

note = {DOI: 10.1080/15592294.2022.2135201},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Piron, Anthony; Szymczak, Florian; Alvelos, Maria De Oliveira; Defrance, Matthieu; Lenaerts, Tom; Eizirik, Decio L.; Cnop, Miriam

RedRibbon: A new rank-rank hypergeometric overlap pipeline to compare gene and transcript expression signatures Journal Article

In: BioRxiv, 2022, (DOI: https://doi.org/10.1101/2022.08.31.505818).

Abstract | Links | BibTeX

@article{info:hdl:2013/353212,

title = {RedRibbon: A new rank-rank hypergeometric overlap pipeline to compare gene and transcript expression signatures},

author = {Anthony Piron and Florian Szymczak and Maria De Oliveira Alvelos and Matthieu Defrance and Tom Lenaerts and Decio L. Eizirik and Miriam Cnop},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/353212/3/2022.08.31.505818v1.full.pdf},

year  = {2022},

date = {2022-01-01},

journal = {BioRxiv},

abstract = {Motivation. High throughput omics technologies have generated a wealth of large protein, gene and transcript datasets that have exacerbated the need for new methods to analyse and compare big datasets. Rank-rank hypergeometric overlap is an important threshold-free method to combine and visualize two ranked lists of P-values or fold-changes, usually from differential gene expression analyses. Here, we introduce a new rank-rank hypergeometric overlap-based method aimed at both gene level and alternative splicing analyses at transcript or exon level, hitherto unreachable as transcript numbers are an order of magnitude larger than gene numbers.Results. We tested the tool on synthetic and real datasets at gene and transcript levels to detect correlation and anti-correlation patterns and found it to be fast and accurate, even on very large datasets thanks to an evolutionary algorithm based minimal P-value search. The tool comes with a ready-to-use permutation scheme allowing the computation of adjusted P-values at low time cost. Additionally, the package is a drop-in replacement to previous packages as a compatibility mode is included, allowing to re-run older studies with close to no change to existing pipelines. RedRibbon holds the promise to accurately extricate detailed information from large analyses.Availability. RNA-sequencing datasets are available through the Gene Expression Omnibus (GEO) portal with accession numbers GSE159984, GSE133218, GSE137136, GSE98485, GSE148058 and GSE108413. The C libraries and R package code are open to the community with a permissive licence (GPL3) and available for download from GitHub https://github.com/antpiron/ale, https://github.com/antpiron/cRedRibbon and https://github.com/antpiron/RedRibbon.},

note = {DOI: https://doi.org/10.1101/2022.08.31.505818},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Rivière, Quentin; Corso, Massimiliano; Ciortan, Madalina; Noël, Grégoire; Verbruggen, Nathalie; Defrance, Matthieu

Exploiting Genomic Features to Improve the Prediction of Transcription Factor-Binding Sites in Plants. Journal Article

In: Plant and Cell Physiology, vol. 63, no. 10, pp. 1457-1473, 2022, (DOI: 10.1093/pcp/pcac095).

Abstract | Links | BibTeX

@article{info:hdl:2013/352290b,

title = {Exploiting Genomic Features to Improve the Prediction of Transcription Factor-Binding Sites in Plants.},

author = {Quentin Rivière and Massimiliano Corso and Madalina Ciortan and Grégoire Noël and Nathalie Verbruggen and Matthieu Defrance},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/352290/3/Riviere_et_al.pdf},

year  = {2022},

date = {2022-01-01},

journal = {Plant and Cell Physiology},

volume = {63},

number = {10},

pages = {1457-1473},

abstract = {The identification of transcription factor (TF) target genes is central in biology. A popular approach is based on the location by pattern matching of potential cis-regulatory elements (CREs). During the last few years, tools integrating next-generation sequencing data have been developed to improve the performance of pattern matching. However, such tools have not yet been comprehensively evaluated in plants. Hence, we developed a new streamlined method aiming at predicting CREs and target genes of plant TFs in specific organs or conditions. Our approach implements a supervised machine learning strategy, which allows decision rule models to be learnt using TF ChIP-chip/seq experimental data. Different layers of genomic features were integrated in predictive models: the position on the gene, the DNA sequence conservation, the chromatin state and various CRE footprints. Among the tested features, the chromatin features were crucial for improving the accuracy of the method. Furthermore, we evaluated the transferability of predictive models across TFs, organs and species. Finally, we validated our method by correctly inferring the target genes of key TFs controlling metabolite biosynthesis at the organ level in Arabidopsis. We developed a tool-Wimtrap-to reproduce our approach in plant species and conditions/organs for which ChIP-chip/seq data are available. Wimtrap is a user-friendly R package that supports an R Shiny web interface and is provided with pre-built models that can be used to quickly get predictions of CREs and TF gene targets in different organs or conditions in Arabidopsis thaliana, Solanum lycopersicum, Oryza sativa and Zea mays.},

note = {DOI: 10.1093/pcp/pcac095},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Papadimitriou, Sofia; Gravel, Barbara; Nachtegael, Charlotte; Baere, Elfride De; Loeys, Bart; Vikkula, Miikka; Smits, Guillaume; Lenaerts, Tom

The importance of good data quality and proper pathogenicity reporting in the medical genetics field: the case of oligogenic diseases Miscellaneous

2022, (Conference: Rare Med Symposium(8-12-2022: Gent)).

Abstract | Links | BibTeX

@misc{info:hdl:2013/366742,

title = {The importance of good data quality and proper pathogenicity reporting in the medical genetics field: the case of oligogenic diseases},

author = {Sofia Papadimitriou and Barbara Gravel and Charlotte Nachtegael and Elfride De Baere and Bart Loeys and Miikka Vikkula and Guillaume Smits and Tom Lenaerts},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/366742/3/Abstract_GRD.pdf},

year = {2022},

date = {2022-01-01},

abstract = {Background/Aims:Reports of oligogenic cases (i.e. individuals whose disease phenotype can only be explained by the co-occurrence of multiple variants in several genes) have been rapidly increasing, in an effort to close the gap of missing genetic diagnoses. Nevertheless, the quality of this data had never been properly assessed, especially as standards and guidelines for such cases are currently missing. This work, aimed to collect all reported oligogenic cases in one database, OLIDA, assess the quality of the reported information and provide, for the first time, recommendations for their proper reporting. Methods:318 research articles reporting oligogenic cases were extracted from PubMed. Independent curators collected the relevant oligogenic information (i) from the articles and (ii) from public relevant databases. With this data, a transparent curation protocol was developed assigning a confidence score to each oligogenic case based on the amount of pathogenic evidence at the genetic and functional level. The collection and assessment of this data led to the creation of OLIDA, the Oligogenic Diseases Database. Results:OLIDA contains information on oligogenic cases linked to 177 different genetic diseases. Each instance is linked with a confidence score depicting the quality of the associated genetic and functional pathogenic evidence. The data revealed that the majority of papers do not provide proper genetic evidence excluding a monogenic model, while this evidence is rarely coupled with functional experiments for confirmation. Our recommendations stress the necessity of fulfilling both conditions. The use of multiple extended pedigrees showing a clear segregation of the reported variants, control cohorts of a suitable size, as well as functional experiments showing the synergistic effect of the involved variants are essential for this purpose. Conclusion:With our work we reveal the recurrent issues on the reporting of oligogenic cases and stress the need for the development of standards in the field. As the number of papers identifying oligogenic causes to disease is increasing rapidly, initiating this discussion is imperative.},

note = {Conference: Rare Med Symposium(8-12-2022: Gent)},

keywords = {},

pubstate = {published},

tppubtype = {misc}

}

Background/Aims:Reports of oligogenic cases (i.e. individuals whose disease phenotype can only be explained by the co-occurrence of multiple variants in several genes) have been rapidly increasing, in an effort to close the gap of missing genetic diagnoses. Nevertheless, the quality of this data had never been properly assessed, especially as standards and guidelines for such cases are currently missing. This work, aimed to collect all reported oligogenic cases in one database, OLIDA, assess the quality of the reported information and provide, for the first time, recommendations for their proper reporting. Methods:318 research articles reporting oligogenic cases were extracted from PubMed. Independent curators collected the relevant oligogenic information (i) from the articles and (ii) from public relevant databases. With this data, a transparent curation protocol was developed assigning a confidence score to each oligogenic case based on the amount of pathogenic evidence at the genetic and functional level. The collection and assessment of this data led to the creation of OLIDA, the Oligogenic Diseases Database. Results:OLIDA contains information on oligogenic cases linked to 177 different genetic diseases. Each instance is linked with a confidence score depicting the quality of the associated genetic and functional pathogenic evidence. The data revealed that the majority of papers do not provide proper genetic evidence excluding a monogenic model, while this evidence is rarely coupled with functional experiments for confirmation. Our recommendations stress the necessity of fulfilling both conditions. The use of multiple extended pedigrees showing a clear segregation of the reported variants, control cohorts of a suitable size, as well as functional experiments showing the synergistic effect of the involved variants are essential for this purpose. Conclusion:With our work we reveal the recurrent issues on the reporting of oligogenic cases and stress the need for the development of standards in the field. As the number of papers identifying oligogenic causes to disease is increasing rapidly, initiating this discussion is imperative.

Ciortan, Madalina; Defrance, Matthieu

GNN-based embedding for clustering scRNA-seq data Journal Article

In: Bioinformatics, vol. 38, no. 4, pp. 1037-1044, 2022, (DOI: 10.1093/bioinformatics/btab787).

Abstract | Links | BibTeX

@article{info:hdl:2013/343811c,

title = {GNN-based embedding for clustering scRNA-seq data},

author = {Madalina Ciortan and Matthieu Defrance},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/343811/3/btab787.pdf},

year  = {2022},

date = {2022-01-01},

journal = {Bioinformatics},

volume = {38},

number = {4},

pages = {1037-1044},

abstract = {Abstract Motivation Single-cell RNA sequencing (scRNA-seq) provides transcriptomic profiling for individual cells, allowing researchers to study the heterogeneity of tissues, recognize rare cell identities and discover new cellular subtypes. Clustering analysis is usually used to predict cell class assignments and infer cell identities. However, the high sparsity of scRNA-seq data, accentuated by dropout events generates challenges that have motivated the development of numerous dedicated clustering methods. Nevertheless, there is still no consensus on the best performing method. Results graph-sc is a new method leveraging a graph autoencoder network to create embeddings for scRNA-seq cell data. While this work analyzes the performance of clustering the embeddings with various clustering algorithms, other downstream tasks can also be performed. A broad experimental study has been performed on both simulated and scRNA-seq datasets. The results indicate that although there is no consistently best method across all the analyzed datasets, graph-sc compares favorably to competing techniques across all types of datasets. Furthermore, the proposed method is stable across consecutive runs, robust to input down-sampling, generally insensitive to changes in the network architecture or training parameters and more computationally efficient than other competing methods based on neural networks. Modeling the data as a graph provides increased flexibility to define custom features characterizing the genes, the cells and their interactions. Moreover, external data (e.g. gene network) can easily be integrated into the graph and used seamlessly under the same optimization task. Availability and implementation https://github.com/ciortanmadalina/graph-sc. Supplementary information Supplementary data are available at Bioinformatics online.},

note = {DOI: 10.1093/bioinformatics/btab787},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Abels, Axel; Domingos, Elias Fernandez; Lenaerts, Tom; Trianni, Vito; Nowé, Ann

Bias Mitigation in Decision-Making with Expert Advice Miscellaneous

2022, (Conference: Benelux AI Conference (BNAIC) and Benelux machine learning conference (Benelearn)(7-9/11/2022: Antwerpen, Belgique)).

Abels, Axel; Lenaerts, Tom; Trianni, Vito; Nowé, Ann

A Novel Approach to Handle Non-stationarity in Collective Decision-Making with Experts Miscellaneous

2022, (Conference: ACM Collective Intelligence Conference 2022(20-21 Octobre 2022: Online)).

Piron, Anthony; Colli, Maikel Luis; Defrance, Matthieu; Eizirik, Decio L.; Mercader, Josep Maria; Cnop, Miriam

Identification of novel type 1 and type 2 diabetes genes by colocalisation of human islet eQTL and GWAS variants Miscellaneous

2022, (Conference: EASD Annual Meeting of the European Association for the Study of Diabetes(58th: 19 – 23 September 2022: Stockholm, Sweden)).

Montero-Porras, Eladio; Gruji’c, Jelena; Domingos, Elias Fernandez; Lenaerts, Tom

Inferring Strategies from Observations in Long Iterated Prisoner’s Dilemma Experiments Miscellaneous

2022, (Conference: Complex Systems Conference 2022(17-21/10/2022: Palma de Mallorca, Spain)).

Versbraegen, Nassim; Gravel, Barbara; Nachtegael, Charlotte; Renaux, Alexandre; Verkinderen, Emma; Nowé, Ann; Lenaerts, Tom; Papadimitriou, Sofia

Taking the prediction of pathogenic variant-combinations to the next level with VarCoPP2.0 Miscellaneous

2022, (Conference: European Conference on Computational Biology (21: 12-21 September 2022: Sitges, Barcelona)).

Montero-Porras, Eladio; Grujić, Jelena; Domingos, Elias Fernandez; Lenaerts, Tom

Inferring Strategies from Observations in Long Iterated Prisoner’s Dilemma Experiments Miscellaneous

2022, (Conference: International Conference on Social Dilemmas(19-22/07/2022: Coppenhagen, Denmark)).

Terrucha, Ines; Domingos, Elias Fernandez; Santos, Francisco C; Simoens, Pieter; Lenaerts, Tom

The art of compensation : how hybrid teams solve collective risk dilemmas Miscellaneous

2022, (Conference: Adaptive and Learning Agents (ALA) Workshop(9-10/5/2022: Auckland, NZ)).

Nachtegael, Charlotte; Gravel, Barbara; Dillen, Arnau; Smits, Guillaume; Nowe, Ann; Papadimitriou, Sofia; Lenaerts, Tom

Scaling up the oligogenic diseases research with OLIDA: the Oligogenic Diseases Database Miscellaneous

2022, (Conference: Genomics of Rare Disease 2022).

Abstract | Links | BibTeX

@misc{info:hdl:2013/352609b,

title = {Scaling up the oligogenic diseases research with OLIDA: the Oligogenic Diseases Database},

author = {Charlotte Nachtegael and Barbara Gravel and Arnau Dillen and Guillaume Smits and Ann Nowe and Sofia Papadimitriou and Tom Lenaerts},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/352609/3/poster_GenomicsDiseases2022.pdf},

year  = {2022},

date = {2022-01-01},

abstract = {The study of genetic variation associated with disease has shown the inadequacy of the “one gene - one disease phenotype” paradigm for many cases, leading to the notion of a conceptual continuum starting from monogenic disorders to oligogenic and polygenic diseases. An important step towards understanding non-Mendelian disorders was the creation of the Digenic Diseases Database (DIDA), collecting curated scientific information on digenic variant combinations involved in digenic diseases. Different machine learning methods aiming to tackle the cause of digenic diseases have successfully used DIDA as a benchmark dataset and have been in turn used in scientific studies analysing novel oligogenic cases. While this marked a new age of predictive tools and underlined the importance of DIDA, these advances also demonstrated the need to expand further in the genetic disease continuum, beyond digenic diseases, in a continuous and more careful manner. Moreover, a structured re-evaluation of the inclusion of oligogenic combinations in such a database and their pathogenic link to diseases has become essential, in order to aid researchers in using high-quality and properly curated information when assessing their medical cases. We present OLIDA (https://olida.ibsquare.be/), the Oligogenic Diseases Database, which reinvents DIDA, containing newly and fully re-curated data and freely accessible information on oligogenic variant combinations, i.e. combinations of variants in multiple genes involved in an oligogenic disease, published in the scientific literature until February 2020. The database includes 916 oligogenic variant combinations, 192 of them involving more than two genes, linked to 159 genetic diseases. OLIDA provides, for the first time in the field, a structured protocol for the evaluation of the pathogenicity of each oligogenic combination, based on the genetic and functional evidence supporting it, paying special attention to their joint variant effect. The evidence is derived from a combination of the results presented in the scientific papers and information from knowledge databases, and is depicted with a confidence score. OLIDA further follows the FAIR principles on data management. To conclude, OLIDA is the first database containing oligogenic variant combinations and, for each, a confidence score of its pathogenic involvement in the associated disease. With this work, we are initiating the important discussion on how the evidence of pathogenicity related to oligogenic diseases should be reported and evaluated in the scientific literature, a concept that becomes increasingly important with the growing amount of data in the field.},

note = {Conference: Genomics of Rare Disease 2022},

keywords = {},

pubstate = {published},

tppubtype = {misc}

}

The study of genetic variation associated with disease has shown the inadequacy of the “one gene – one disease phenotype” paradigm for many cases, leading to the notion of a conceptual continuum starting from monogenic disorders to oligogenic and polygenic diseases. An important step towards understanding non-Mendelian disorders was the creation of the Digenic Diseases Database (DIDA), collecting curated scientific information on digenic variant combinations involved in digenic diseases. Different machine learning methods aiming to tackle the cause of digenic diseases have successfully used DIDA as a benchmark dataset and have been in turn used in scientific studies analysing novel oligogenic cases. While this marked a new age of predictive tools and underlined the importance of DIDA, these advances also demonstrated the need to expand further in the genetic disease continuum, beyond digenic diseases, in a continuous and more careful manner. Moreover, a structured re-evaluation of the inclusion of oligogenic combinations in such a database and their pathogenic link to diseases has become essential, in order to aid researchers in using high-quality and properly curated information when assessing their medical cases. We present OLIDA (https://olida.ibsquare.be/), the Oligogenic Diseases Database, which reinvents DIDA, containing newly and fully re-curated data and freely accessible information on oligogenic variant combinations, i.e. combinations of variants in multiple genes involved in an oligogenic disease, published in the scientific literature until February 2020. The database includes 916 oligogenic variant combinations, 192 of them involving more than two genes, linked to 159 genetic diseases. OLIDA provides, for the first time in the field, a structured protocol for the evaluation of the pathogenicity of each oligogenic combination, based on the genetic and functional evidence supporting it, paying special attention to their joint variant effect. The evidence is derived from a combination of the results presented in the scientific papers and information from knowledge databases, and is depicted with a confidence score. OLIDA further follows the FAIR principles on data management. To conclude, OLIDA is the first database containing oligogenic variant combinations and, for each, a confidence score of its pathogenic involvement in the associated disease. With this work, we are initiating the important discussion on how the evidence of pathogenicity related to oligogenic diseases should be reported and evaluated in the scientific literature, a concept that becomes increasingly important with the growing amount of data in the field.

Piron, Anthony; Szymczak, Florian; Alvelos, Maria De Oliveira; Defrance, Matthieu; Lenaerts, Tom; Eizirik, Decio L.; Cnop, Miriam

RedRibbon: A new rank-rank hypergeometric overlap pipeline to compare gene and transcript expression signatures Journal Article

In: BioRxiv, 2022, (DOI: https://doi.org/10.1101/2022.08.31.505818).

Abstract | Links | BibTeX

@article{info:hdl:2013/353212c,

title = {RedRibbon: A new rank-rank hypergeometric overlap pipeline to compare gene and transcript expression signatures},

author = {Anthony Piron and Florian Szymczak and Maria De Oliveira Alvelos and Matthieu Defrance and Tom Lenaerts and Decio L. Eizirik and Miriam Cnop},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/353212/3/2022.08.31.505818v1.full.pdf},

year  = {2022},

date = {2022-01-01},

journal = {BioRxiv},

abstract = {Motivation. High throughput omics technologies have generated a wealth of large protein, gene and transcript datasets that have exacerbated the need for new methods to analyse and compare big datasets. Rank-rank hypergeometric overlap is an important threshold-free method to combine and visualize two ranked lists of P-values or fold-changes, usually from differential gene expression analyses. Here, we introduce a new rank-rank hypergeometric overlap-based method aimed at both gene level and alternative splicing analyses at transcript or exon level, hitherto unreachable as transcript numbers are an order of magnitude larger than gene numbers.Results. We tested the tool on synthetic and real datasets at gene and transcript levels to detect correlation and anti-correlation patterns and found it to be fast and accurate, even on very large datasets thanks to an evolutionary algorithm based minimal P-value search. The tool comes with a ready-to-use permutation scheme allowing the computation of adjusted P-values at low time cost. Additionally, the package is a drop-in replacement to previous packages as a compatibility mode is included, allowing to re-run older studies with close to no change to existing pipelines. RedRibbon holds the promise to accurately extricate detailed information from large analyses.Availability. RNA-sequencing datasets are available through the Gene Expression Omnibus (GEO) portal with accession numbers GSE159984, GSE133218, GSE137136, GSE98485, GSE148058 and GSE108413. The C libraries and R package code are open to the community with a permissive licence (GPL3) and available for download from GitHub https://github.com/antpiron/ale, https://github.com/antpiron/cRedRibbon and https://github.com/antpiron/RedRibbon.},

note = {DOI: https://doi.org/10.1101/2022.08.31.505818},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Montero-Porras, Eladio; Grujić, Jelena; Domingos, Elias Fernandez; Lenaerts, Tom

Inferring strategies from observations in long iterated Prisoner’s dilemma experiments Journal Article

In: Scientific reports, vol. 12, no. 1, 2022, (DOI: 10.1038/s41598-022-11654-2).

Abstract | Links | BibTeX

Piron, Anthony; Colli, Maikel Luis; Defrance, Matthieu; Eizirik, Decio L.; Mercader, Josep Maria; Cnop, Miriam

Identification of novel type 1 and type 2 diabetes genes by colocalisation of human islet eQTL and GWAS variants Miscellaneous

2022, (Conference: EASD Annual Meeting of the European Association for the Study of Diabetes(58th: 19 – 23 September 2022: Stockholm, Sweden)).

Montero-Porras, Eladio; Lenaerts, Tom; Gallotti, Riccardo; Gruji’c, Jelena

Fast deliberation is related to unconditional behaviour in iterated Prisoners’ Dilemma experiments Journal Article

In: Scientific Reports, vol. 12, no. 1, 2022, (DOI: 10.1038/s41598-022-24849-4).

Abstract | Links | BibTeX

Nachtegael, Charlotte; Gravel, Barbara; Dillen, Arnau; Smits, Guillaume; Nowe, Ann; Papadimitriou, Sofia; Lenaerts, Tom

Scaling up oligogenic diseases research with OLIDA: The Oligogenic Diseases Database Journal Article

In: Database, vol. 2022, 2022, (DOI: 10.1093/database/baac023).

Abstract | Links | BibTeX

@article{info:hdl:2013/342417b,

title = {Scaling up oligogenic diseases research with OLIDA: The Oligogenic Diseases Database},

author = {Charlotte Nachtegael and Barbara Gravel and Arnau Dillen and Guillaume Smits and Ann Nowe and Sofia Papadimitriou and Tom Lenaerts},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/342417/3/baac023.pdf},

year  = {2022},

date = {2022-01-01},

journal = {Database},

volume = {2022},

abstract = {Improving the understanding of the oligogenic nature of diseases requires access to high-quality, well-curated Findable, Accessible, Interoperable, Reusable (FAIR) data. Although first steps were taken with the development of the Digenic Diseases Database, leading to novel computational advancements to assist the field, these were also linked with a number of limitations, for instance, the ad hoc curation protocol and the inclusion of only digenic cases. The OLIgogenic diseases DAtabase (OLIDA) presents a novel, transparent and rigorous curation protocol, introducing a confidence scoring mechanism for the published oligogenic literature. The application of this protocol on the oligogenic literature generated a new repository containing 916 oligogenic variant combinations linked to 159 distinct diseases. Information extracted from the scientific literature is supplemented with current knowledge support obtained from public databases. Each entry is an oligogenic combination linked to a disease, labelled with a confidence score based on the level of genetic and functional evidence that supports its involvement in this disease. These scores allow users to assess the relevance and proof of pathogenicity of each oligogenic combination in the database, constituting markers for reporting improvements on disease-causing oligogenic variant combinations. OLIDA follows the FAIR principles, providing detailed documentation, easy data access through its application programming interface and website, use of unique identifiers and links to existing ontologies. Database URL: https://olida.ibsquare.be},

note = {DOI: 10.1093/database/baac023},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Domingos, Elias Fernandez; Terrucha, Ines; Suchon, Remi; Grujić, Jelena; Burguillo, Juan J. C.; Santos, Francisco C.; Lenaerts, Tom

Delegation to artificial agents fosters prosocial behaviors in the collective risk dilemma Journal Article

In: Scientific reports, vol. 12, no. 1, 2022, (DOI: 10.1038/s41598-022-11518-9).

Abstract | Links | BibTeX

@article{info:hdl:2013/349554b,

title = {Delegation to artificial agents fosters prosocial behaviors in the collective risk dilemma},

author = {Elias Fernandez Domingos and Ines Terrucha and Remi Suchon and Jelena Grujić and Juan J. C. Burguillo and Francisco C. Santos and Tom Lenaerts},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/349554/1/doi_333198.pdf},

year  = {2022},

date = {2022-01-01},

journal = {Scientific reports},

volume = {12},

number = {1},

abstract = {Home assistant chat-bots, self-driving cars, drones or automated negotiation systems are some of the several examples of autonomous (artificial) agents that have pervaded our society. These agents enable the automation of multiple tasks, saving time and (human) effort. However, their presence in social settings raises the need for a better understanding of their effect on social interactions and how they may be used to enhance cooperation towards the public good, instead of hindering it. To this end, we present an experimental study of human delegation to autonomous agents and hybrid human-agent interactions centered on a non-linear public goods dilemma with uncertain returns in which participants face a collective risk. Our aim is to understand experimentally whether the presence of autonomous agents has a positive or negative impact on social behaviour, equality and cooperation in such a dilemma. Our results show that cooperation and group success increases when participants delegate their actions to an artificial agent that plays on their behalf. Yet, this positive effect is less pronounced when humans interact in hybrid human-agent groups, where we mostly observe that humans in successful hybrid groups make higher contributions earlier in the game. Also, we show that participants wrongly believe that artificial agents will contribute less to the collective effort. In general, our results suggest that delegation to autonomous agents has the potential to work as commitment devices, which prevent both the temptation to deviate to an alternate (less collectively good) course of action, as well as limiting responses based on betrayal aversion.},

note = {DOI: 10.1038/s41598-022-11518-9},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Montero-Porras, Eladio; Grujić, Jelena; Domingos, Elias Fernandez; Lenaerts, Tom

Inferring strategies from observations in long iterated Prisoner’s dilemma experiments Journal Article

In: Scientific reports, vol. 12, no. 1, 2022, (DOI: 10.1038/s41598-022-11654-2).

Abstract | Links | BibTeX

Montero-Porras, Eladio; Lenaerts, Tom; Gallotti, Riccardo; Gruji’c, Jelena

Fast deliberation is related to unconditional behaviour in iterated Prisoners’ Dilemma experiments Journal Article

In: Scientific Reports, vol. 12, no. 1, 2022, (DOI: 10.1038/s41598-022-24849-4).

Abstract | Links | BibTeX

Han, The Anh T. A. H.; Lenaerts, Tom; Santos, Francisco C.; Pereira, Luís Moniz

Voluntary safety commitments provide an escape from over-regulation in AI development Journal Article

In: Technology in society, vol. 68, 2022, (DOI: 10.1016/j.techsoc.2021.101843).

Abstract | Links | BibTeX

@article{info:hdl:2013/339040,

title = {Voluntary safety commitments provide an escape from over-regulation in AI development},

author = {The Anh T. A. H. Han and Tom Lenaerts and Francisco C. Santos and Luís Moniz Pereira},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/339040/3/AIES_agreement-2.pdf},

year  = {2022},

date = {2022-01-01},

journal = {Technology in society},

volume = {68},

abstract = {With the introduction of Artificial Intelligence (AI) and related technologies in our daily lives, fear and anxiety about their misuse as well as their inherent biases, incorporated during their creation, have led to a demand for governance and associated regulation. Yet regulating an innovation process that is not well understood may stifle this process and reduce benefits that society may gain from the generated technology, even under the best intentions. Instruments to shed light on such processes are thus needed as they can ensure that imposed policies achieve the ambitions for which they were designed. Starting from a game-theoretical model that captures the fundamental dynamics of a race for domain supremacy using AI technology, we show how socially unwanted outcomes may be produced when sanctioning is applied unconditionally to risk-taking, i.e. potentially unsafe, behaviours. We demonstrate here the potential of a regulatory approach that combines a voluntary commitment approach reminiscent of soft law, wherein technologists have the freedom of choice between independently pursuing their course of actions or establishing binding agreements to act safely, with either a peer or governmental sanctioning system of those that do not abide by what they pledged. As commitments are binding and sanctioned, they go beyond the classic view of soft law, akin more closely to actual law-enforced regulation. Overall, this work reveals how voluntary but sanctionable commitments generate socially beneficial outcomes in all scenarios envisageable in a short-term race towards domain supremacy through AI technology. These results provide an original dynamic systems perspective of the governance potential of enforceable soft law techniques or co-regulatory mechanisms, showing how they may impact the ambitions of developers in the context of the AI-based applications.},

note = {DOI: 10.1016/j.techsoc.2021.101843},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Bizet, Martin; Defrance, Matthieu; Calonne, Emilie; Bontempi, Gianluca; Sotiriou, Christos; Fuks, Franccois; Jeschke, Jana

Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods. Journal Article

In: Epigenetics, vol. 17, no. 13, pp. 2434-2454, 2022, (DOI: 10.1080/15592294.2022.2135201).

Abstract | Links | BibTeX

@article{info:hdl:2013/353467,

title = {Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods.},

author = {Martin Bizet and Matthieu Defrance and Emilie Calonne and Gianluca Bontempi and Christos Sotiriou and Franccois Fuks and Jana Jeschke},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/353467/5/KEPI_17_2135201.pdf},

year  = {2022},

date = {2022-01-01},

journal = {Epigenetics},

volume = {17},

number = {13},

pages = {2434-2454},

abstract = {Illumina Infinium DNA Methylation (5mC) arrays are a popular technology for low-cost, high-throughput, genome-scale measurement of 5mC distribution, especially in cancer and other complex diseases. After the success of its HumanMethylation450 array (450k), Illumina released the MethylationEPIC array (850k) featuring increased coverage of enhancers. Despite the widespread use of 850k, analysis of the corresponding data remains suboptimal: it still relies mostly on Illumina's default annotation, which underestimates enhancerss and long noncoding RNAs. Results: We have thus developed an approach, based on the ENCODE and LNCipedia databases, which greatly improves upon Illumina's default annotation of enhancers and long noncoding transcripts. We compared the re-annotated 850k with both 450k and reduced-representation bisulphite sequencing (RRBS), another high-throughput 5mC profiling technology. We found 850k to cover at least three times as many enhancers and long noncoding RNAs as either 450k or RRBS. We further investigated the reproducibility of the three technologies, applying various normalization methods to the 850k data. Most of these methods reduced variability to a level below that of RRBS data. We then used 850k with our new annotation and normalization to profile 5mC changes in breast cancer biopsies. 850k highlighted aberrant enhancer methylation as the predominant feature, in agreement with previous reports. Our study provides an updated processing approach for 850k data, based on refined probe annotation and normalization, allowing for improved analysis of methylation at enhancers and long noncoding RNA genes. Our findings will help to further advance understanding of the DNA methylome in health and disease.},

note = {DOI: 10.1080/15592294.2022.2135201},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Paldino, Gian Marco; Caro, Fabrizio De; Stefani, Jacopo De; Vaccaro, Alfredo A.; Villacci, Domenico D.; Bontempi, Gianluca

A Digital Twin Approach for Improving Estimation Accuracy in Dynamic Thermal Rating of Transmission Lines Journal Article

In: Energies, vol. 15, no. 6, 2022, (DOI: 10.3390/en15062254).

Abstract | Links | BibTeX

Marquis, Bastien; Jansen, Maarten

Information criteria bias correction for group selection Journal Article

In: Statistical papers, 2022, (Language of publication: fr).

Cimpeanu, Theodor; Santos, Francisco C.; Pereira, Luís Marcelo; Lenaerts, Tom; Han, The Anh T. A. H.

Artificial intelligence development races in heterogeneous settings Journal Article

In: Scientific reports, vol. 12, no. 1, 2022, (DOI: 10.1038/s41598-022-05729-3).

Montero-Porras, Eladio; Grujić, Jelena; Domingos, Elias Fernández; Lenaerts, Tom

Inferring strategies from observations in long iterated Prisoner’s dilemma experiments Journal Article

In: Scientific reports, vol. 12, no. 1, 2022, (DOI: 10.1038/s41598-022-11654-2).

Marquis, Bastien; Jansen, Maarten

Information criteria bias correction for group selection Journal Article

In: Statistical papers, vol. 63, no. 5, pp. 1387-1414, 2022, (Language of publication: fr).

Ciortan, Madalina; Defrance, Matthieu

GNN-based embedding for clustering scRNA-seq data Journal Article

In: Bioinformatics, vol. 38, no. 4, pp. 1037-1044, 2022, (DOI: 10.1093/bioinformatics/btab787).

Bizet, Martin; Defrance, Matthieu; Calonne, Emilie; Bontempi, Gianluca; Sotiriou, Christos; Fuks, Franccois; Jeschke, Jana

Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods. Journal Article

In: Epigenetics, vol. 17, no. 13, pp. 2434-2454, 2022, (DOI: 10.1080/15592294.2022.2135201).

Abstract | Links | BibTeX

@article{info:hdl:2013/353467c,

title = {Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods.},

author = {Martin Bizet and Matthieu Defrance and Emilie Calonne and Gianluca Bontempi and Christos Sotiriou and Franccois Fuks and Jana Jeschke},

url = {https://dipot.ulb.ac.be/dspace/bitstream/2013/353467/5/KEPI_17_2135201.pdf},

year  = {2022},

date = {2022-01-01},

journal = {Epigenetics},

volume = {17},

number = {13},

pages = {2434-2454},

abstract = {Illumina Infinium DNA Methylation (5mC) arrays are a popular technology for low-cost, high-throughput, genome-scale measurement of 5mC distribution, especially in cancer and other complex diseases. After the success of its HumanMethylation450 array (450k), Illumina released the MethylationEPIC array (850k) featuring increased coverage of enhancers. Despite the widespread use of 850k, analysis of the corresponding data remains suboptimal: it still relies mostly on Illumina's default annotation, which underestimates enhancerss and long noncoding RNAs. Results: We have thus developed an approach, based on the ENCODE and LNCipedia databases, which greatly improves upon Illumina's default annotation of enhancers and long noncoding transcripts. We compared the re-annotated 850k with both 450k and reduced-representation bisulphite sequencing (RRBS), another high-throughput 5mC profiling technology. We found 850k to cover at least three times as many enhancers and long noncoding RNAs as either 450k or RRBS. We further investigated the reproducibility of the three technologies, applying various normalization methods to the 850k data. Most of these methods reduced variability to a level below that of RRBS data. We then used 850k with our new annotation and normalization to profile 5mC changes in breast cancer biopsies. 850k highlighted aberrant enhancer methylation as the predominant feature, in agreement with previous reports. Our study provides an updated processing approach for 850k data, based on refined probe annotation and normalization, allowing for improved analysis of methylation at enhancers and long noncoding RNA genes. Our findings will help to further advance understanding of the DNA methylome in health and disease.},

note = {DOI: 10.1080/15592294.2022.2135201},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Simar, Cédric; Petit, Robin; Bozga, Nichita; Leroy, Axelle; Alvarez, Ana Maria Cebolla; Petieau, Mathieu; Bontempi, Gianluca; Chéron, Guy

Riemannian classification of single-trial surface EEG and sources during checkerboard and navigational images in humans. Journal Article

In: PloS one, vol. 17, no. 1, pp. e0262417, 2022, (DOI: 10.1371/journal.pone.0262417).

Abstract | Links | BibTeX

Jansen, Maarten

Wavelets from a Statistical Perspective Book

CRC Press, 2022, (Language of publication: fr).

Marquis, Bastien; Jansen, Maarten

Information criteria bias correction for group selection Journal Article

In: Statistical papers, vol. 63, no. 5, pp. 1387-1414, 2022, (Language of publication: fr).

796 entries « ‹ 3 of 16 › »