Bioinformatics: HIRI researchers develop a new machine learning approach

The HIRI researchers involved in the study; from left: Sandra Gawlitt, Lars Barquist, Chase Beisel, Yanying Yu.
(c) HIRI/Nik Schölzel

Current study reveals how machine learning, data integration and AI contribute to better strategies in the fight against pathogens.

To combat viruses, bacteria and other pathogens, synthetic biology offers new technological approaches whose performance is being validated in experiments. Researchers from the Würzburg Helmholtz Institute for RNA-based Infection Research (HIRI) and the Helmholtz Artificial Intelligence Cooperation Unit (Helmholtz AI) have applied data integration and artificial intelligence (AI) to develop a machine learning approach that predicts the efficacy of CRISPR technologies more accurately than previous methods. The findings were published today in the journal Genome Biology.

The genome or DNA of an organism incorporates the blueprint for proteins and orchestrates the production of new cells. Aiming to combat pathogens, cure genetic diseases or achieve other positive effects, molecular biological CRISPR technologies are being used to specifically alter or silence genes and inhibit protein production.

One of these molecular biological tools is CRISPRi (from “CRISPR interference”). CRISPRi blocks genes and gene expression without modifying the DNA sequence. As with the CRISPR-Cas system, also known as “gene scissors”, this tool uses a ribonucleic acid (RNA) that serves as a guide to direct a nuclease (Cas) to its target. In contrast to gene scissors, however, the CRISPRi nuclease only binds to the DNA without cutting it. This binding prevents the corresponding gene from being transcribed, so the gene remains silent.

Until now, it has been challenging to predict the performance of this method for a specific gene. Researchers from the Würzburg Helmholtz Institute for RNA-based Infection Research (HIRI) in cooperation with the University of Würzburg and the Helmholtz Artificial Intelligence Cooperation Unit (Helmholtz AI) have now developed a machine learning approach using data integration and artificial intelligence (AI) to improve such predictions in the future.

The approach

CRISPRi screens are a highly sensitive tool for investigating the effects of reduced gene expression. In their study, published today in the journal Genome Biology, the scientists used data from multiple genome-wide CRISPRi essentiality screens to train a machine learning model. Their goal: to better predict the efficacy of the engineered guide RNAs deployed in the CRISPRi system.

“Unfortunately, genome-wide screens only provide indirect information about guide efficiency. Hence, we have applied a new machine learning method that disentangles the efficacy of the guide RNA from the impact of the silenced gene,” explains Lars Barquist. The computational biologist initiated the study and heads a bioinformatics research group at the Würzburg Helmholtz Institute, a site of the Braunschweig Helmholtz Centre for Infection Research in cooperation with the Julius-Maximilians-Universität Würzburg.
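The idea of separating guide efficacy from the effect of the silenced gene can be illustrated with a toy calculation. The Python sketch below is not the authors' model: it swaps in a much simpler technique, centring the data within each gene and fitting a single least-squares slope, and every name and number in it (`GUIDES`, `naive_slope`, `within_gene_fit`, the feature values) is a hypothetical stand-in chosen for illustration.

```python
from collections import defaultdict

# Hypothetical toy screen: (gene, guide_feature, log2 fold change).
# Each value was constructed as gene_effect + (-0.5) * guide_feature,
# with assumed gene effects of -4, -2 and 0. Not real screen data.
GUIDES = [
    ("geneA", 0.0, -4.0), ("geneA", 1.0, -4.5), ("geneA", 2.0, -5.0),
    ("geneB", 1.0, -2.5), ("geneB", 2.0, -3.0), ("geneB", 3.0, -3.5),
    ("geneC", 2.0, -1.0), ("geneC", 3.0, -1.5), ("geneC", 4.0, -2.0),
]


def mean(values):
    values = list(values)
    return sum(values) / len(values)


def slope(xs, ys):
    """Least-squares slope of ys on xs (intercept included)."""
    mx, my = mean(xs), mean(ys)
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sum((x - mx) ** 2 for x in xs)
    return num / den


def naive_slope(guides):
    """Ignore gene identity and regress depletion on the guide feature."""
    return slope([x for _, x, _ in guides], [y for _, _, y in guides])


def within_gene_fit(guides):
    """Separate a shared guide-feature slope from per-gene offsets."""
    by_gene = defaultdict(list)
    for gene, x, y in guides:
        by_gene[gene].append((x, y))

    # Centre feature and depletion within each gene, then pool all guides.
    xc, yc = [], []
    for rows in by_gene.values():
        mx = mean(x for x, _ in rows)
        my = mean(y for _, y in rows)
        xc += [x - mx for x, _ in rows]
        yc += [y - my for _, y in rows]
    w = slope(xc, yc)

    # Whatever the slope does not explain is attributed to the gene.
    gene_effects = {
        gene: mean(y - w * x for x, y in rows)
        for gene, rows in by_gene.items()
    }
    return w, gene_effects


if __name__ == "__main__":
    print("naive slope:", naive_slope(GUIDES))
    w, effects = within_gene_fit(GUIDES)
    print("within-gene slope:", w)
    print("gene effects:", effects)
```

In this constructed data set, guides targeting the more strongly depleted genes happen to have lower feature values. The naive fit therefore returns a slope of 0.5, although the data were built with a slope of -0.5. The within-gene fit recovers -0.5 together with the gene offsets of -4, -2 and 0.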

Supported by additional AI tools (“Explainable AI”), the team established comprehensible design rules for future CRISPRi experiments. The study authors validated their approach by conducting an independent screen targeting essential bacterial genes, showing that their predictions were more accurate than previous methods.

“The results have shown that our model outperforms existing methods and provides more reliable predictions of CRISPRi performance when targeting specific genes,” says Yanying Yu, PhD student in Lars Barquist’s research group and first author of the study.

The scientists were particularly surprised to find that the guide RNA itself is not the primary factor in determining CRISPRi depletion in essentiality screens. “Certain gene-specific characteristics related to gene expression appear to have a greater impact than previously assumed,” explains Yu.

The study also reveals that integrating data from multiple data sets significantly improves predictive accuracy and enables a more reliable assessment of guide RNA efficiency. “Expanding our training data by pulling together multiple experiments is essential to create better prediction models. Prior to our study, lack of data was a major limiting factor for prediction accuracy,” summarizes junior professor Barquist. The published approach is expected to help researchers plan more effective CRISPRi experiments and to serve both biotechnology and basic research. “Our study provides a blueprint for developing more precise tools to manipulate bacterial gene expression and ultimately help to better understand and combat pathogens,” says Barquist.
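One practical hurdle when pooling screens is that depletion values from different experiments are rarely on the same scale. The release does not say how the screens were combined, so the following Python sketch only assumes one common preprocessing step: standardising each screen separately before concatenating the rows. The screen names, guide names and values are hypothetical.

```python
from statistics import mean, pstdev

# Hypothetical log2 fold changes for the same guides in two screens.
# The second screen is assumed to have a compressed dynamic range.
SCREENS = {
    "screen_1": {"guide_a": -1.0, "guide_b": -3.0, "guide_c": -5.0},
    "screen_2": {"guide_a": -0.5, "guide_b": -1.5, "guide_c": -2.5},
}


def standardize(scores):
    """Convert one screen's values to z-scores (mean 0, unit spread)."""
    values = list(scores.values())
    mu = mean(values)
    sd = pstdev(values)
    return {guide: (value - mu) / sd for guide, value in scores.items()}


def pool_screens(screens):
    """Standardise each screen on its own, then stack all rows."""
    rows = []
    for name, scores in screens.items():
        for guide, z in standardize(scores).items():
            # Keep the screen label so a model could still use it as a feature.
            rows.append({"screen": name, "guide": guide, "z": z})
    return rows


if __name__ == "__main__":
    for row in pool_screens(SCREENS):
        print(row)
```

After standardisation, each guide receives the same z-score in both toy screens, so the pooled table can be used as a single, larger training set.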

The results at a glance:
– Gene features matter: The characteristics of targeted genes have a significant impact on guide RNA depletion in genome-wide screens.
– Data integration improves predictions: Combining data from multiple CRISPRi screens significantly improves the accuracy of prediction models and enables more reliable estimates of guide RNA efficiency.
– Designing better CRISPRi experiments: The study provides valuable insights for designing more effective CRISPRi experiments by predicting guide RNA efficiency, enabling precise gene-silencing strategies.

Funding:
The study was supported by funds from the Bavarian State Ministry of Science and Art through the bayresq.net research network.

This press release is also available on our homepage: https://www.helmholtz-hzi.de/en/news-events/news/view/article/complete/forschend….

Helmholtz Institute for RNA-based Infection Research:
The Helmholtz Institute for RNA-based Infection Research (HIRI) is the first institution of its kind worldwide to combine ribonucleic acid (RNA) research with infection biology. Based on novel findings from its strong basic research program, the institute’s long-term goal is to develop innovative therapeutic approaches to better diagnose and treat human infections. HIRI is a site of the Braunschweig Helmholtz Centre for Infection Research (HZI) in cooperation with the Julius-Maximilians-Universität Würzburg (JMU) and is located on the Würzburg Medical Campus. More information at http://www.helmholtz-hiri.de.

Helmholtz Centre for Infection Research:
Scientists at the Helmholtz Centre for Infection Research (HZI) in Braunschweig and its other sites in Germany study bacterial and viral infections and the body’s defence mechanisms. They have profound expertise in natural compound research and its exploitation as a valuable source of novel anti-infectives. As a member of the Helmholtz Association and the German Center for Infection Research (DZIF), the HZI performs translational research that lays the groundwork for the development of new treatments and vaccines against infectious diseases. http://www.helmholtz-hzi.de/en

Media Contact
Dr Britta Grigull
Head of Communications
Helmholtz Institute for RNA-based Infection Research (HIRI)
britta.grigull@helmholtz-hiri.de
+49 (0)931 31 81801

Original publication:

Yu Y, Gawlitt S, Barros de Andrade e Sousa L, Merdivan E, Piraud M, Beisel C, Barquist L:
Improved prediction of bacterial CRISPRi guide efficiency from depletion screens through mixed-effect machine learning and data integration. Genome Biology (2024)
DOI: 10.1186/s13059-023-03153-y
https://doi.org/10.1186/s13059-023-03153-y

Media Contact

Dr. Andreas Fischer, Press and Communications
Helmholtz Centre for Infection Research
