Department of Computer Science

Algorithms for DNA sequencing data

The genome of an oragnism can be investigated with DNA sequencing. DNA sequencing breaks the genome into small fragments and reports the nucleotide sequence of these fragments, i.e. substrings of the genome. We develop data structures and algorithms for analysing this kind of sequencing data. Possible topics for the summer internship include (i) lossy compression of sequencing data and (ii) indexing discriminating substrings of genomic data. The actual topic will be tailored according to the interests of the chosen applicant.

Programming skills and knowledge of algorithms and data structures is needed. Knowledge of biology or bioinformatics is beneficial but not necessary. The topics in this project are suitable for Master's thesis work.

More detailed descriptions of possible topics

Group: Algorithms for Biological Sequencing Data

Supervisor: Leena Salmela, leena.salmela@helsinki.fi