Thesis Topic Details

Topic ID:
3151
Title:
Distributed data analysis for metagenomics
Supervisor:
Srikumar Venugopal
Research Area:
Data-Intensive Computing, Bioinformatics
Associated Staff
Assessor:
Fethi Rabhi
Topic Details
Status:
Active
Type:
R & D
Programs:
CS
Group Suitable:
Industrial:
No
Pre-requisites:
--
Description:
Microbes play a central role in nutrient cycling and the health of the planet and, in numbers of billions per litre of water, they represent the bulk of the ocean life. The problem with marine microbes is that they are characterised by a staggering diversity: in a drop of water coexist microbes tens of times more diverse than humans and trees. High-throughput genome sequencers allow analysis of thousands of genes and genomes of microbes from a bucket of water without growing these organisms in the lab. However, researchers are struggling to analyse the torrent of data produced by these machines with currently available genome analysis tools (such as BLAST).

This project will explore the use of distributed techniques for quickly analysing large volumes of genomic data. In particular, we will explore the use of graphics processor units (GPUs) for data analysis and develop novel algorithms and techniques for efficient processing of data.
Comments:
This project will be carried out using CUDA development environment from NVIDIA. Student should be capable of programming in C++/Java
Past Student Reports
 
No Reports Available. Contact the supervisor for more information.

Check out all available reports in the CSE Thesis Report Library.

NOTE: only current CSE students can login to view and select reports to download.