News
Last days to register for second exam!Written on 21.03.22 by Sven Rahmann Dear all, please note that the exam starts in one week from today. Thank you for your attention! |
Online Talk series for women in bioinformatics (women only!)Written on 14.03.22 by Sven Rahmann Are you interested in a scientific career as a woman in bioinformatics? You have questions and worries regarding how to get there? The online lecture series "Women in Bioinformatics" provides the answers. Successful female scientists present their research, their career path and answer your… Read more Are you interested in a scientific career as a woman in bioinformatics? You have questions and worries regarding how to get there? The online lecture series "Women in Bioinformatics" provides the answers. Successful female scientists present their research, their career path and answer your questions. Registration is now open for the talk by Prof. Dr. Alice McHardy on 10.05.22, 14:30-16:00 (CEST). Prof. Dr. McHardy holds a diploma in biochemistry and a doctoral degree in bioinformatics, both from Bielefeld University in Germany. From 2005 to 2007 she first was a postdoc and then a permanent staff member in the Bioinformatics and Pattern Discovery Group at the IBM T.J. Watson Research Center in Yorktown Heights, USA. She then became the head of the independent research group for Computational Genomics and Epidemiology at the Max Planck Institute for Computer Science in Saarbrücken. In 2010, she was appointed Chair of Algorithmic Bioinformatics at Heinrich Heine University in Düsseldorf. In 2014, she became head of the Department of Computational Biology of Infection Research at the Helmholtz Centre for Infection Research in Braunschweig and was appointed as a full professor at TU Braunschweig. Registration is also still possible for the next talk by Prof. Dr. Olga Kalinina on 22.03.22, 16:30-18:00 (CET). Prof. Dr. Kalinina is professor for Drug Bioinformatics at the Helmholtz Institute for Pharmaceutical Research Saarland (HIPS). She received a M.Sc. degree with distinction in mathematics from the Moscow State University in 2003 and a Ph.D. from the Engelhardt Institute for Molecular Biology of the Russian Academy of Sciences in 2007. She continued her research in bioinformatics as a postdoctoral fellow at the European Laboratory for Molecular Biology, for which she was awarded an EMBO Long Term Fellowship from the European Molecular Biology Organization, and later at the University of Heidelberg, working with Prof. Dr. Robert B. Russell (2007-2011). In 2012, she established her independent junior group at the Max Planck Institute for Informatics in the Department for Computational Biology and Applied Algorithmics lead by Prof. Dr. Dr. Thomas Lengauer, where she was researching resistance mechanisms in human viruses until 2018. Participation is free but registration is required for every talk. Details can be found at: https://www.bio.ifi.lmu.de/women-in-bioinformatics. Please note that the time of the lecture may vary. For further updates on upcoming talks please register for our Newsletter (https://lists.lrz.de/mailman/listinfo/wbioinformatics) or follow us on Twitter (https://twitter.com/WBioinformatics). Note: This lecture series is open to women only! |
Re-exam dates and time slot pollWritten on 05.03.22 (last change on 06.03.22) by Sven Rahmann Update on the SPAB re-exam The re-exam will be on March 28 + 29 (Mon/Tue). Update on the SPAB re-exam The re-exam will be on March 28 + 29 (Mon/Tue). Official registration in LSF will be opened next week. Please make sure you do both: get a time slot and register in LSF if you want to participate in the re-exam. Have a nice vacation if you can under the current circumstances Sven |
Q&A session today, 14:00Written on 14.02.22 by Sven Rahmann As announced in the last lecture, I will be answering your questions today from 14:00 - 15:00 or as long as there are questions. This is to answer questions that require discussion and cannot be easily asked on the forum. (In general, the forum is preferable, so everyone gets to read questions and… Read more As announced in the last lecture, I will be answering your questions today from 14:00 - 15:00 or as long as there are questions. This is to answer questions that require discussion and cannot be easily asked on the forum. (In general, the forum is preferable, so everyone gets to read questions and answers.) If you want to ask something today, please use the normal lecture link at 14:00. That's in 1 hour. |
Important: Exam RegistrationWritten on 07.02.22 by Sven Rahmann Dear all, many of you have registered for the exam in the LSF. Great. If you haven't, the exam number is 10159. Please don't forget to also ask me for a time slot on either Thursday (preferred) or on Friday (almost booked out) if you have not done so. Dear all, many of you have registered for the exam in the LSF. Great. If you haven't, the exam number is 10159. Please don't forget to also ask me for a time slot on either Thursday (preferred) or on Friday (almost booked out) if you have not done so. If at all possible, please take a spot on Thursday the 17th. The exam will be done on zoom by default.
|
Announcement - Women in Bioinformatics Talk SeriesWritten on 02.02.22 by Sven Rahmann Note: These talks are specially for women in bioinformatics. Male students cannot register! Are you interested in a scientific career as a woman in bioinformatics? You have questions and worries regarding how to get there? The online lecture series "Women in Bioinformatics" provides the answers.… Read more Note: These talks are specially for women in bioinformatics. Male students cannot register! Are you interested in a scientific career as a woman in bioinformatics? You have questions and worries regarding how to get there? The online lecture series "Women in Bioinformatics" provides the answers. Successful female scientists present their research, their career path and answer your questions. Registration is now open for the talk by Prof. Dr. Olga Kalinina on 22.03.22, 16:30-18:00. Prof. Dr. Kalinina is professor for Drug Bioinformatics at the Helmholtz Institute for Pharmaceutical Research Saarland (HIPS). She received a M.Sc. degree with distinction in mathematics from the Moscow State University in 2003 and a Ph.D. from the Engelhardt Institute for Molecular Biology of the Russian Academy of Sciences in 2007. She continued her research in bioinformatics as a postdoctoral fellow at the European Laboratory for Molecular Biology, for which she was awarded an EMBO Long Term Fellowship from the European Molecular Biology Organization, and later at the University of Heidelberg, working with Prof. Dr. Robert B. Russell (2007-2011). In 2012, she established her independent junior group at the Max Planck Institute for Informatics in the Department for Computational Biology and Applied Algorithmics lead by Prof. Dr. Dr. Thomas Lengauer, where she was researching resistance mechanisms in human viruses until 2018. Registration is also still possible for the next talk by Prof. Dr. Caroline Friedel on 15.02.22, 16:30-18:00. Prof. Friedel is professor for Bioinformatics at the Institute of Informatics at the Ludwig-Maximilians-Universität München. Her research focus is on development of analysis methods for large-scale sequencing data, integrated multi-omics data analysis and virus-host transcriptomics with a particular focus on DNA viruses. Participation is free but registration is required for every talk. Details can be found at: https://www.bio.ifi.lmu.de/women-in-bioinformatics. Please note that the time of the lecture may vary. For further updates on upcoming talks please register for our Newsletter (https://lists.lrz.de/mailman/listinfo/wbioinformatics) or follow us on Twitter (https://twitter.com/WBioinformatics). -- Prof. Dr. Caroline Friedel Lehr- und Forschungseinheit für Bioinformatik Institut für Informatik Ludwig-Maximilians-Universität München Amalienstrasse 17, 4. Stock, Büro 402 80333 München E-Mail: Caroline.Friedel@bio.ifi.lmu.de |
Course evaluation and number of exercise sheetsWritten on 18.01.22 by Sven Rahmann We have just added evaluation links to the Materials section. Please evaluate the course until 03-Feb-2022, and please do so separately for the lecture and the tutorial (there are two separate links!). Some of you also asked how many exercise sheets there will be, and the answer is 13. We have just added evaluation links to the Materials section. Please evaluate the course until 03-Feb-2022, and please do so separately for the lecture and the tutorial (there are two separate links!). Some of you also asked how many exercise sheets there will be, and the answer is 13. We hope that this information helps you with planning. |
Today's TutorialWritten on 18.01.22 by Jens Zentgraf Dear all, |
Pre-recorded lecture todayWritten on 11.01.22 by Sven Rahmann Unfortunately, today, Tuesday, January 11, I have to provide a pre-recorded lecture. Apologies, and I hope you enjoy it nevertheless, although it's a bit on the abstract side. |
Exam Dates (First Option): Feb 17 + 18Written on 06.01.22 by Sven Rahmann Dear all, the first set of exam dates (oral exam, ~30 minutes) has now been determined: Feb 17+18. Dear all, the first set of exam dates (oral exam, ~30 minutes) has now been determined: Feb 17+18. A second set of exam dates will be shortly before the start of the summer semester (dates to be determined; you can offer proposals). Also, PDFs and videos of recent lectures, as well as assignment 10 (due in one week) are now online.
|
Today's lecture canceledWritten on 04.01.22 by Sven Rahmann Due to illness, today's lecture must unfortunately be canceled. Enjoy the free morning, or take the time to review past material. See you on Thursday! |
Deadline for assignment 8 extendedWritten on 16.12.21 by Sven Rahmann Dear all, the deadline for assignment 8 has been extended till Sunday evening, in case you don't want to go on holidays any time soon. By all means, if you are ready, submit tonight. But if you need more time, feel free to extend your work over the weekend. If questions arise, please use the… Read more Dear all, the deadline for assignment 8 has been extended till Sunday evening, in case you don't want to go on holidays any time soon. By all means, if you are ready, submit tonight. But if you need more time, feel free to extend your work over the weekend. If questions arise, please use the forum instead of personal emails. Thank you and have a nice break! |
State of the CourseWritten on 02.12.21 by Sven Rahmann Dear all, we have so far covered discrete probability theory and important computational aspects. We still need to go on a bit with theory (until the holidays perhaps) and learn some basics of statistics, in particular foundations of test theory and estimation theory. To do this properly, we… Read more Dear all, we have so far covered discrete probability theory and important computational aspects. We still need to go on a bit with theory (until the holidays perhaps) and learn some basics of statistics, in particular foundations of test theory and estimation theory. To do this properly, we need to use continuous distributions (in addition to discrete ones). So the plan for next week is continuous distributions and introduction to testing (which I already somewhat did today). We will use differential gene expression analysis and differential methylation analysis as typical examples from "omics" to highlight the challenges. After the holidays, we will look into even more applications, and at the same time introduce more powerful models and procedures. I hope that this gives you some orientation. Slides for this week (Lectures 13 and 14, one common PDF) and videos are online, as well as the new assignment sheet from today, due in one week on December 9. This one is more programming-heavy, but not too difficult. Keep up the good work! |
Suggestion: Advent of CodeWritten on 02.12.21 by Sven Rahmann If you would like to expand your experience in coding, you can do the "advent of code" challenges. Every day in December (1 to 25), you receive a two-part problem embedded in a story. Given some (individualized) input file, you have to compute a solution of the problem. It starts out quite simple, and… Read more If you would like to expand your experience in coding, you can do the "advent of code" challenges. Every day in December (1 to 25), you receive a two-part problem embedded in a story. Given some (individualized) input file, you have to compute a solution of the problem. It starts out quite simple, and the first few problems can always be solved with a few lines of code. Later, it becomes more and more challenging. The first part of the problem on any day is usually simpler than the second part, which is only unlocked after you have solved the first part. Best of all: it's free (but you can make a donation to the maintainer/inventor if you want to, or buy merchandise). The website is her: https://adventofcode.com/2021 Note: The coding challenges are typically not related to this course (i.e., there are almost never any probability-related tasks), but it is good coding experience in general! Have fun! |
Thursday tutorial continues to be on-campusWritten on 01.12.21 by Jens Zentgraf Dear all, some participants of the Thursday tutorial have informed us that they would like to continue having an on-campus tutorial. Therefore we will stay with an on-campus tutorial.
|
Going Fully Online?Written on 30.11.21 by Sven Rahmann Dear all, unfortunately, due to illness, today's online tutorial could not take part. Jens will discuss the past solutions next time. Given the changes in university regulations, we are thinking about going fully online, i.e., also holding the Thursday tutorial online. On the other hand, the… Read more Dear all, unfortunately, due to illness, today's online tutorial could not take part. Jens will discuss the past solutions next time. Given the changes in university regulations, we are thinking about going fully online, i.e., also holding the Thursday tutorial online. On the other hand, the exchange works much better in an on-campus tutorial. Therefore: all of you who are currently visiting the on-campus tutorial, please inform myself or Jens Zentgraf (ideally both of us) if you would like to continue on-campus. If nobody wants to come to the on-campus tutorial, it will become an online tutorial from this week on! Thank you! |
Lecture 12 + PCR App; Assignment 06Written on 24.11.21 by Sven Rahmann As I am travelling this Thursday, there is only a (short) pre-recorded lecture. I re-explain càdlàg functions and show again how to to logarithmic addition, with an example. Then I showcase the PCR app (runs with streamlit locally in your browser), which turned out to still have some bugs. In the… Read more As I am travelling this Thursday, there is only a (short) pre-recorded lecture. I re-explain càdlàg functions and show again how to to logarithmic addition, with an example. Then I showcase the PCR app (runs with streamlit locally in your browser), which turned out to still have some bugs. In the downloadable code, these bugs are fixed, so you can now play with it and, if you like, work on the bonus task of Assignment 06 (due December 2), which is also now online. |
About Assignment 5 (Task 1)Written on 22.11.21 by Sven Rahmann Dear all, we had an error in assignment 5 task 1: The entries of the random matrix were given as +1 or -1, but they should be + or - 1 / sqrt(d). The sheet has been updated last Friday. But if you solved the problem with the original formulation (+-1), this is also fine. You can simply re-scale… Read more Dear all, we had an error in assignment 5 task 1: The entries of the random matrix were given as +1 or -1, but they should be + or - 1 / sqrt(d). The sheet has been updated last Friday. But if you solved the problem with the original formulation (+-1), this is also fine. You can simply re-scale the solution accordingly. To give you some background, this type of "random projection" is important in practice. By multiplying with a random matrix of a well-chosen type (such as in this task), you can project points from n-dimensional space into d-dimensional space (where d is much smaller than n) and approximately keep lengths and distances unchanged (in expectation). |
Lecture 10 (Thu Nov 18) not live (but pre-recorded)Written on 18.11.21 by Sven Rahmann Dear all, with sick kids, I will not be able to come to the university today (Thursday Nov 18) to give the lecture live. I have therefore recorded it now, so it is available for viewing in the cloud (see Materials, Lecture 10 Video). I will extend the lecture later by a presentation of a small app… Read more Dear all, with sick kids, I will not be able to come to the university today (Thursday Nov 18) to give the lecture live. I have therefore recorded it now, so it is available for viewing in the cloud (see Materials, Lecture 10 Video). I will extend the lecture later by a presentation of a small app to investigate the PCR distribution. Next Monday, we will start a new topic (on "numerical probability"). Next Thursday, there will be no lecture because Lecture 10 got quite long, and there will be the additional app video (which you could watch next Thursday during the lecture time). Some not so good news: With increasing numbers of infections, the 3G policy will be more heavily enforced. on campus with an increased number of random and non-random checks of vaccination status. You need your "3G proof" at all times on campus or you may get in trouble. Please remember that, especially if you come to an on-campus tutorial. Stay healthy by solving probability paradoxes! |
Week 3 SummaryWritten on 04.11.21 by Sven Rahmann This week, we finished discussing discrete probability distributions and introduced conditional probabilities and independence. Please check out the visualization app for discrete distributions (Materials for Lecture 05). Download the code, take a look, and install the requirements, as shown in… Read more This week, we finished discussing discrete probability distributions and introduced conditional probabilities and independence. Please check out the visualization app for discrete distributions (Materials for Lecture 05). Download the code, take a look, and install the requirements, as shown in Assignment 03. If you can, get some extra programming points by extending the app with the finite (n-element) Zipf distribution. Bayes' Theorem today is an extremely important piece of knowledge. Be sure to understand how to apply it, e.g., to the rare disease example. |
Assignment 02 published; lectures onlineWritten on 28.10.21 by Sven Rahmann Dear participants, the PDF of assignment 2 has been published in the Materials section (a submission for it will be opened later today). Also, all slides and videos up to lecture 4 have been published in the Materials section. If you notice any errors, please let us know by email or forum. Thanks! |
TutorialsWritten on 25.10.21 by Sven Rahmann Dear all, the system has made an automatic tutorial assignment. Please note: You can come to the other tutorial as well. The assignment is non-binding. In fact, please do the following: Come to the online tutorial if you can (Tue at 10, after the lecture, different zoom link!). Dear all, the system has made an automatic tutorial assignment. Please note: You can come to the other tutorial as well. The assignment is non-binding. In fact, please do the following: Come to the online tutorial if you can (Tue at 10, after the lecture, different zoom link!). If neither option works for you, please send an email to Jens Zentgraf with all your available time slots. |
FIrst Assignment PublishedWritten on 21.10.21 by Sven Rahmann Dear all, please find the first assignment sheet under "Materials". Have fun with the tasks. For the programming task, we recommend Python. If you use a different language, please remember that you need arbitrary precision integers; there are some huge numbers involved! |
Lecture starts today (Tue Oct 19!)Written on 19.10.21 by Sven Rahmann Dear all, the lecture starts today (Tue Oct 19) at 08:30. If you miss it, don't worry! A recording is available; the link in the Materials section. You need to be registered, though, to access it. There will be no tutorials this week, but you will receive a first problem set on Thursday. |
Registration now openWritten on 05.10.21 by Sven Rahmann You can now register for this course. When selecting a tutorial, please note that the on-campus tutorial is subject to regulations imposed by the university (3G). If in doubt, consider the online tutorial. Depending on the number of registrations, we may offer more than one online tutorial. |
Statistics, Probability and Applications in Bioinformatics
Specialized course, B.Sc. and M.Sc. Bioinformatics, Saarland University.
Elective course, M.Sc. CS, DS/AI, and related, Saarland University
Prerequisites | Mathematics (especially analysis and linear algebra); solid programming skills (required!) |
Credits | 9 ECTS credits |
Required time | 4V+2Ü (4 hours of lectures, 2 hours of tutorials per week) |
Language | English |
Registration | click on Registration in the menu header |
Details | available after registration in the Course Management system |
Times | Tuesday + Thursday, 08:30 - 10:00 (starts on Tue Oct 19!) |
Mode | online lecture; both online and offline tutorial options |
Link | Zoom link available under "Materials" after registration |
Target audience (IMPORTANT!)
This course is offered as a specialized lecture in the B.Sc. or M.Sc. Bioinformatics degrees, for 9 ECTS credits.
It can be taken by other students at Saarland Informatics campus (CS, DS/AI, etc.), but please discuss this with the instructor.
There is some overlap with StatsLab, so if absolutely required, you can do this course as a substitute for StatsLab (with a reduced exam), but note that this course is designed for a different audience.
You will need programming skills to qualify for the exam. Best would be Python, but you can use a language of your choice.
Please do not waste your time by attempting this course without a solid basis in programming.
Topics
The following topics will be covered in the course; additional topics may be included, depending on time and current events.
Probability
- randomness
- uniform distributions on finite sets (Laplace spaces)
- elementary and advanced combinatorics
- finite, discrete and continuous probability spaces
- random variables
- discrete probability distributions and where they come from
- probability distributions and OOP, scipy.stats
- conditional probabilities
- Bayes’ Theorem, simple version
- continuous probability distributions
- a glimpse at measure theory
- posterior distributions
Statistics
- descriptive statistics
- moments of random variables (expectation, variance, …)
- parametric models
- statistical testing (frequentist view)
- statistical testing (Bayesian view)
- parameter estimation: moments, maximum likelihood
- parameter estimation in mixture models: EM algorithm
- regression (simple linear, logistic, robust, multiple)
- regularization and Bayesian view on estimation
- robust regression
- multiple regression
- logistic regression
Stochastic Processes
- stochastic processes
- Poisson process
- models for random sequences
- Markov chains
- Markov processes: models of sequence evolution
- Hidden Markov Models and applications
- Probabilistic Arthimetic Automata and applications
- distribution of DNA Motif Occurrences: compound Poisson
- significance of pairwise sequence alignment
- the PCR process
Applications in Bioinformatics
- tests for differential gene expression
- Bayesian view on differential gene expression
- high-dimensionality low-sample problem
- multiple testing