Jun 4, 2025

Cloud Research Lab at the Donnelly Centre, University of Toronto accelerates health research at unprecedented rate

Computational Biology & AI, Faculty, Research

New lab uses AWS cloud computing and AI to shave years off data analysis for large-scale biomedical research

Composite image of (left) two researchers sitting in front of a computer with the one on the left holding and pointing to a 3D-printed model of a virus and (right) a man in a grey blazer and brown tie leaning against a railing
Two computational biologists looking at data and a 3D-printed model produced through the Cloud Research Lab (left) and Assistant Professor Artem Babaian (right)
By Anika Hazra

June 4, 2025Toronto – The Donnelly Centre for Cellular and Biomolecular Research at the University of Toronto (U of T), in collaboration with Amazon Web Services (AWS), has launched the Cloud Research Lab (CRL), an initiative to equip Canadian biomedical researchers with cutting-edge cloud computing resources. By providing advanced computational power, the CRL will accelerate both emerging and ongoing research projects to drive innovation in biomedical science. 

The new lab helps position Canada as a global leader in computational research by offering capacity and training to support data-intensive research. The goal of the lab is to enable researchers to scale up their analyses through access to the full potential of AWS cloud computing resources, thereby enhancing the breadth of data that can be analyzed and reducing processing time.

“Cloud computing is transforming research worldwide, and this latest investment by AWS ensures our students and faculty have the most advanced technology at their fingertips,” said Dr. Stéphane Angers, Director of the Donnelly Centre and Charles H. Best Chair in Medical Research at U of T. “The CRL will propel the Donnelly Centre forward as the nation’s leading research facility for computational biology and serve as a model for how strategic investment in cloud computing can accelerate discovery across the entire landscape of Canadian science. Our collaboration with Amazon Web Services enables us to push the boundaries of discovery while maximizing cost efficiency, ensuring Canadian-funded grants have an even greater impact.”

A legacy of innovation at U of T

U of T consistently ranks among the world’s top institutions, recognized by Times Higher Education’s 2025 World University Rankings as one of the top 10 public universities and its 2025 World University Rankings by Subject as one of the top 10 universities in the medical and health category.

The Donnelly Centre, established in 2005, has played a pivotal role in U of T’s leadership in biomedical research by fostering cross-disciplinary collaboration and technological innovation.

The CRL was founded by Dr. Artem Babaian, who joined the Donnelly Centre as an Assistant Professor of Molecular Genetics in 2022 following his development of Serratus, an open-science cloud computing platform that harnesses the power of AWS, which led to the discovery of more than 130,000 RNA viruses. The project analyzed 5.7 million publicly available biological samples—equivalent to 20 million gigabytes of data—in just 11 days. Serratus transformed researchers’ ability to identify genetic sequences of new RNA viruses, including coronaviruses.

The Serratus platform was also used in a study involving Babaian that identified a new class of sub-viral biological entities, known as obelisks. Initially, the research team spent months analyzing 100 human gut datasets to find two obelisk species. By comparison, using cloud-based technology, they re-analyzed 7.2 million datasets in just three weeks—uncovering more than 29,000 obelisk species.

Driving cutting-edge research with cloud technology

AWS is providing the CRL with powerful computing infrastructure to help researchers complete projects and report findings faster than ever before.

“Our mission at AWS is to democratize access to cloud and AI technology and ensure everyone can benefit from the wide breadth of services we offer,” said Coral Kennett, Head of Provincial Government & Education at AWS Canada. “We recognized an immediate need for access to computing services by Canadian researchers to be competitive on a global scale, and we will help fulfil this need.”

The CRL has already powered groundbreaking research, including the world’s largest database of assembled sequencing data, created by Dr. Babaian and Dr. Rayan Chikhi, Researcher and Group Leader at the Institut Pasteur and Researcher at Centre national de la recherche scientifique (CNRS).

With support from the CRL, the team transformed 100 petabytes of raw sequencing data from 27 million samples into a searchable 350-terabyte dataset. To accomplish this, the AWS batch process deployed up to 2.2 million CPUs over six days, enabling the simultaneous processing of millions of samples. The resulting database substantially improves access to sequencing data, creating new opportunities for enzyme discovery for medical applications.

Advancing early-stage research

The CRL is also supporting early-stage research with the potential to drive biomedical advancements. One of these projects, led by Dr. Philipp Maass, Assistant Professor of Molecular Genetics at U of T and Scientist at SickKids Research Institute, is leveraging cloud computing to analyze 3D spatial genome organization in human cells. His team optimized high-performance computing workflows using AWS cloud architecture, reducing analysis time from 13 years to just 10 minutes.

Another project in the works is to develop and train deep learning models on large mass spectrometry datasets using the AWS Cloud. Dr. Hannes Röst, Associate Professor of Molecular Genetics at the Donnelly Centre, is doing this work with the goal of improving methods for identifying small molecules to advance their potential applications in drug discovery, disease diagnosis and environmental science.

“Cloud computing is revolutionizing the way researchers are conducting science,” said Dr. Babaian. “Everyone working in the field of computational biology is aware that the volume of data we’re facing means we can’t continue performing data analysis the way we did five years ago. Cloud computing best practices have allowed the CRL to accelerate research at an unprecedented rate. I’m thrilled that my colleagues and I can now approach research in a way that saves us money and—more importantly—years in research time.”

The Cloud Research Lab is powered by Amazon Web Services and supported by the Canadian Institutes of Health Research.

About the Donnelly Centre

The Donnelly Centre for Cellular and Biomolecular Research is a research hub at the University of Toronto’s Temerty Faculty of Medicine, where scientists from diverse fields work together to advance medicine and health. Founded in 2005, the Donnelly Centre is a global leader in research on systems biology, regenerative medicine and disease modelling.

For more information about the Donnelly Centre, visit thedonnellycentre.utoronto.ca. Follow us on X, LinkedIn and Bluesky.

Media Contact

Anika Hazra
Research and Communications Officer, Donnelly Centre for Cellular and Biomolecular Research

anika.hazra@utoronto.ca
416-946-8253