Friday 3:15 p.m.–4 p.m.
How to interpret your own genome using (mostly) Python.
- Audience level:
We’ve entered the era of the $1000 human genome, and soon it will be straightforward to get your own genome sequenced by a commercial company. But what does the data mean? What information can you get out of your genomic sequence? And what are the barriers to deeper analysis? What kinds of algorithms and databases are used in genomic analysis? All this, and more, will be revealed in this talk.
Biology is entering an exciting time, where human genomes are fairly inexpensive to sequence, computer algorithms for analyzing the data are becoming more sophisticated, and databases for understanding genomic function are growing in size. Suppose you get your genome sequenced — what’s in there, how would you interpret it, what software might you use, and what kind of limits are there to the information you can pull out of your genome sequence? All of the analysis will be done in an open github repo with the presentation built on top of an IPython Notebook, of course. Also, many of the basic mapping and querying algorithms involved in genomic analysis are simple enough that I will provide Python pseucodode of them, as well as provide references to further discussions of the algorithms and databases.