Computational Tools Will Help Identify Microbes in Complex Environmental Samples

Credit: K. Konstantinidis and D. Tsementzi, Georgia Tech Georgia Tech graduate students Despina Tsementzi and Miguel Rodriguez prepare to sample water from Lake Seminole in Georgia, one of the lakes on the Chattahoochee River system. The water samples will be used to study microbial communities as part of research in the laboratory of Kostas Konstantinidis at Georgia Tech.

A new National Science Foundation-supported project will provide computational tools designed to help identify and characterize the gene diversity of the residents of these microbial communities. The project, being done by researchers at the Georgia Institute of Technology and Michigan State University, will allow clinicians and scientists to compare the genomic information of organisms they encounter against the growing volumes of data provided by the world’s scientific community.

The tools will be hosted on a web server designed to be used by researchers who may not have training in the latest bioinformatics techniques. A prototype system containing a limited number of computational tools is already available at http://enve-omics.ce.gatech.edu and is attracting more than 500 users each month.

“Across many areas of science, we are dealing with communities of microorganisms, and one challenge we’ve had is to identify them because we haven’t had good tools to tell apart individual microbes from the mixtures,” said Kostas Konstantinidis, an associate professor in the School of Civil and Environmental Engineering at Georgia Tech and the project’s principal investigator. “Our tools will be designed to deal with the genomes of whole communities of organisms.”

Current techniques identify individual microbes by examining their small subunit ribosomal RNA (SSU rRNA) genes, but the new tools will allow scientists to analyze entire genomes and meta-genomes.

“With the dawn of the genomic era, we can now get the whole genome of these organisms to see not only the ribosomal RNA, but also all the genes in the genome to get a better understanding of what the each organism’s potential might be,” said Konstantinidis. “There will be many advantages for looking at all the genes instead of just one, the SSU rRNA, such as to identify which organisms encode toxins or the enzymes for breaking down pollutants.”

Collaborators on the three-year project include scientists who operate the Ribosomal Database Project at Michigan State University: Jim Tiedje, director of Michigan State University’s Center for Microbial Ecology and James Cole, a Michigan State University research assistant professor and director of the Ribosomal Database Project.

The ability to identify and enumerate the organisms in complex communities using culture-independent, genomic technologies and associated bioinformatics algorithms is becoming more important as scientists study organisms that can’t be grown in the lab. The majority of the world’s organisms resist traditional lab culture, meaning they have to be studied in the field and identified through genetic information.

Konstantinidis and his research group are studying such communities in the water of lakes in Chattahoochee River system in Georgia and elsewhere. They are examining how these communities respond to perturbations, such as oil or pesticide spills, and the role that different members of the community play in breaking down pollutants.

“These tools actually come from our research practice,” said Konstantinidis. “We came to the point where we couldn’t process the data to answer the questions we wanted to ask. That led us to this new project to develop the tools we and others need to interrogate the data and get the information we are looking for.”

A single liter of lake water may contain as many as 500 different species, and together, their genomic information can total tens of billions of gene-coding letters. From Lake Lanier alone, the team has generated 200 gigabytes of genomic data.

“We want to figure out what organisms are there, and what genes they encode,” Konstantinidis explained. “The tools we are developing will allow us to do this.”

The tools developed in the project will be useful to both clinical microbiologists and environmental researchers. “This will not be specific to any one discipline,” he said. “As long as people are working with microbes, this will be helpful to them because some of the questions are universal.”

The system will also be built to provide user-friendly help to scientists who may not have training in the latest genomic and bioinformatics techniques. “There is a big need for big data analysis, and there are not many trained people right now,” Konstantinidis said. “These tools will make the lives of researchers easier.”

Among the challenges ahead is building an infrastructure able to handle the growing amounts of genomic information produced worldwide.

“We will have to develop some computational solutions for the problems of keeping up with all the new data becoming available,” said Konstantinidis. “We need to make tools that have high throughput to keep up with data volumes that are increasing geometrically.”

The system will initially operate on servers at Georgia Tech and Michigan State University, but if demand and data grow, additional resources may be sought, such as the National Science Foundation’s XSEDE supercomputer.

This research is supported by the National Science Foundation under award DBI-1356288. The opinions expressed in this article are those of the authors and do not necessarily reflect the official views of the National Science Foundation.

Research News
Georgia Institute of Technology
177 North Avenue
Atlanta, Georgia 30332-0181 USA

Media Relations Contacts: John Toon (404-894-6986) (jtoon@gatech.edu) or Brett Israel (404-385-1933) (brett.israel@comm.gatech.edu).

Writer: John Toon

Media Contact

John Toon newswise

More Information:

http://www.gatech.edu

All latest news from the category: Life Sciences and Chemistry

Articles and reports from the Life Sciences and chemistry area deal with applied and basic research into modern biology, chemistry and human medicine.

Valuable information can be found on a range of life sciences fields including bacteriology, biochemistry, bionics, bioinformatics, biophysics, biotechnology, genetics, geobotany, human biology, marine biology, microbiology, molecular biology, cellular biology, zoology, bioinorganic chemistry, microchemistry and environmental chemistry.

Back to home

Comments (0)

Write a comment

Newest articles

A new puzzle piece for string theory research

Dr. Ksenia Fedosova from the Cluster of Excellence Mathematics Münster, along with an international research team, has proven a conjecture in string theory that physicists had proposed regarding certain equations….

Climate change can cause stress in herring larvae

The occurrence of multiple stressors undermines the acclimatisation strategies of juvenile herring: If larvae are exposed to several stress factors at the same time, their ability to respond to these…

Making high-yielding rice affordable and sustainable

Plant biologists show how two genes work together to trigger embryo formation in rice. Rice is a staple food crop for more than half the world’s population, but most farmers…