Computational Biology (BIOCB)
BIOCB 2010 - Introduction to Computational Biology (3 Credits)
This class is designed to inspire students in the life sciences to see the power of computational biology in advancing the research frontier. In addition to providing students with foundational concepts in modern biology, the course will introduce core principles in computer science, mathematics, and statistics, including AI and machine learning. The course will also teach many fundamental skills in manipulating large data sets, including genome sequences, functional genomic data, protein structures, etc. The course will consist of two lectures and one practical session each week. In the practical's, students will learn the use of many of the latest software tools and will develop some basic programming skills. Students will be able to explore their own interests in greater depth in a term project.
Distribution Requirements: (BIO-AG, BSC-AG, DLG-AG, MQL-AG, OPHLS-AG), (BIO-AS, SDS-AS)
Last Four Terms Offered: Fall 2025, Fall 2024, Fall 2023
Learning Outcomes:
- Formulate succinct and focused research questions that make effective use of computation in diverse problems in genomics, population genetics, medical genetics, systems biology, structural biology, phylogenetics, and conservation biology.
- Explain key computational, mathematical, and statistical concepts that underlie modern data analysis, including machine learning/artificial intelligence.
- Write short computer programs in python and R to organize and analyze complex data collected from across the biological sciences.- Write short computer programs python and R that simulate biological data.
- Apply evolutionary principles to explain patterns of diversity both within and across species.
- Critically assess the validity of computational research from across the life sciences.
BIOCB 3620 - Dynamic Models and Data in Biology (4 Credits)
Life is dynamic and ever-changing. Because of this, a central tool used to study living systems are dynamic models. This course provides an introductory survey of the development, computer implementation, and applications of dynamic models in biology, as well as statistical and machine learning methods for building such models from biological data. Case-study format covering broad range of biological applications including gene regulation, neurobiology, physiology, behavior, epidemiology and ecology. Students learn mathematical methods for interpreting and building biological systems models, as well as computational methods for simulating models on the computer using a scripting and graphics environment.
Distribution Requirements: (BIO-AG, BSC-AG, DLS-AG)
Last Four Terms Offered: Spring 2026, Spring 2025
Learning Outcomes:
- Students will be able to read a dynamic model, interpreting its equations as statements about underlying biological processes and the assumptions made about the rates and consequences of those processes.
- Students will be able to adapt existing models for applications to related systems or alternative scenarios.
- Students will be able to write computer programs (using R) to numerically solve low-dimensional matrix equations (deterministic and stochastic), difference equation, differential equation, and agent-based models for biological systems.
- Students will be able to write computer programs (using R) to estimate the parameters and structure of dynamic models from real data.
- Students will be able to locate equilibria, compute Jacobians, evaluate local stability through eigenvalue calculations and other methods, and interpret these results in terms of asymptotic system dynamics and bifurcations.
- Students will be able to read and understand biological research papers that use modeling as a primary methodology.
- Students will be able to formulate meaningful research questions about biological systems that can be addressed using dynamic models and data, and apply the skills learned in the course to answer those questions.
BIOCB 4100 - Advanced Conservation Biology (4 Credits)
Crosslisted with ENTOM 4100, NTRES 4100
This course integrates technical approaches to biodiversity conservation, with a focus on biological analysis of species facing extinction risk. Students will learn quantitative tools for analyzing variation at genetic, population, and landscape levels. The curriculum covers stage-structured population dynamics, predation, gene flow, inbreeding, extinction processes, and harvesting effects. Analytical methods include population-projection models, perturbation analysis, metapopulation models, population viability analysis, and various genetic diversity assessments. Special attention is given to evaluating extinction risk in data-deficient species such as insects. By developing these quantitative skills, students will critically evaluate assumptions underlying conservation plans and assessments, including IUCN criteria and endangered species classification. The course is suitable for all biology and conservation-related majors.
Prerequisites: NTRES 3100 or BIOEE 3610 or ENTOM 4550; BIOMG2800 or NTRES 2830 or ENTOM 4700; or permission of instructor.
Distribution Requirements: (BIO-AS), (BSC-AG, DLG-AG, MQL-AG, OPHLS-AG)
Exploratory Studies:
(CU-SBY)
Last Four Terms Offered: Spring 2026, Fall 2024, Fall 2023, Fall 2022Learning Outcomes:
- Apply ecological and genetic principles to analyze patterns of biodiversity and assess extinction risks for threatened species.
- Conduct population viability analyses (PVA) to quantify extinction risk and evaluate conservation interventions.
- Analyze genetic diversity metrics including heterozygosity, inbreeding coefficients, and effective population size to inform conservation breeding programs.
- Evaluate challenges in assessing extinction risk for data-deficient species, with special emphasis on insects and other understudied taxa.
- Demonstrate proficiency in computational tools (R and other software) for conservation data analysis.
- Effectively communicate complex conservation analyses through scientific writing and data visualization.
BIOCB 4350 - From Regression to LLMs: Introduction to Machine Learning for Computational Biology (4 Credits)
This course will provide a rigorous treatment of computational statistics and machine learning methods used to analyze big biological data types. Analysis methods covered will include: generalized linear models, support vector machines, regularized linear models, kernel methods, random forests, neural networks, large language models. While the course will be focused on analysis methods and connections among methods, applications making use of specific big biological data types will be covered. An understanding of method limitations will be prioritized, as well as how to critically assess when a desired conclusion can be justified. Methods will be implemented in the computer lab in Python, where some previous exposure to programming will be assumed.
Prerequisites: exposure to R and Python.
Distribution Requirements: (DLS-AG, MQL-AG, OPHLS-AG, PSC-AG)
Last Four Terms Offered: Spring 2026
Learning Outcomes:
- Theory & Practice - Students will be able to explain the formal mathematical framework for each of the following methods, as well as implement a version of each in Python: generalized linear models, support vector machines, regularized linear models, kernel methods, random forests, neural networks, large language models.
- Data Analysis - Students will apply each of the following methods to both simulated and real biological data, including genomic, image, and health record data: generalized linear models, support vector machines, regularized linear models, kernel methods, random forests, neural networks, large language models.
- Communicating & Arguing with Data - Students will plot, present, and interpret in writing the outcomes from applying each of the following methods to real genomic, image, or health record data: generalized linear models, support vector machines, regularized linear models, kernel methods, random forests, neural networks, large language models.
- Data Interpretation and Critique - Students will be able to rigorously explain the limitations, and the implications of these limitations, when using the following methods to draw specific and alternative biological conclusions from real data: generalized linear models, support vector machines, regularized linear models, kernel methods, random forests, neural networks, large language models.
BIOCB 4381 - Biomedical Data Mining and Modeling (3 Credits)
A biomedical data science course using Python and available bioinformatics tools and techniques for the analysis of molecular biological data, including biosequences, microarrays, and networks. This course emphasizes practical skills rather than theory. Topics include advanced Python programming, R and Bioconductor, sequence alignment, MySQL database (DBI), web programming and services (CGI), genomics and proteomics data mining and analysis, machine learning, and methods for inferring and analyzing regulatory, protein-protein interaction, and metabolite networks.
Prerequisites: at least one introductory course in computer programming (any language) and one in statistical methods, or permission of the instructor.
Distribution Requirements: (BSC-AG, DLS-AG, MQL-AG, OPHLS-AG)
Last Four Terms Offered: Fall 2024, Fall 2023, Fall 2022, Fall 2021
Learning Outcomes:
- Demonstrate familiarity with the basics of applied statistical methodology.
- Demonstrate familiarity with statistical software and a programming language.
- Demonstrate ability to perform complex data mining of biological datasets using a programming language.
- Demonstrate ability to effectively communicate the results of a statistical analysis to biologists.
- Demonstrate familiarity with statistical and computational tools for high throughput genomic data.
- Demonstrate ability to build stand-alone softwares, web tools, and databases for analyzing biological data.
BIOCB 4810 - Principles of Population Genetics (4 Credits)
Crosslisted with BIOMG 4810
Population genetics studies the genetic composition of biological populations and how allele frequencies change over time and space. It serves as the theoretical foundation for understanding evolution. This course offers a comprehensive introduction to the core concepts and methods of population genetics, with a focus on linking observed patterns of genetic variation to the underlying evolutionary processes that generate them. Throughout the course, we emphasize the interplay between analytical theory, computer simulations, and the analysis of genetic data from both natural and experimental populations. We will also explore current efforts to connect genotype to phenotype and, ultimately, to fitness. Case studies will include the evolution of drug resistance, genetic ancestry inference, experimental evolution, and the genetic structure and demographic history of human populations.
Distribution Requirements: (BIO-AS), (BSC-AG, DLS-AG, MQL-AG, OPHLS-AG)
Last Four Terms Offered: Fall 2025, Fall 2023, Spring 2022, Fall 2020
Learning Outcomes:
- Explain and interpret the fundamental evolutionary processes that shape patterns of genetic variation within and among populations.
- Critically evaluate current research in population genetics.
- Design and implement simple computer simulations of evolutionary processes to test hypotheses and theoretical predictions.
- Apply appropriate statistical and computational methods to population genomic datasets to draw evolutionary inferences, and evaluate the statistical power, assumptions, and conceptual limitations of these methods.
- Describe and analyze applications of population genetics in conservation biology, agriculture, and medicine.
- Communicate clearly and thoughtfully about the ethical and societal implications of population genetics research.
BIOCB 4830 - Quantitative Genomics and Genetics (4 Credits)
A rigorous treatment of analysis techniques used to understand complex genetic systems. This course covers both the fundamentals and advances in statistical methodology used to analyze disease and agriculturally relevant and evolutionarily important phenotypes. Topics include mapping quantitative trait loci (QTLs), application of microarray and related genomic data to gene mapping, and evolutionary quantitative genetics. Analysis techniques include association mapping, interval mapping, and analysis of pedigrees for both single and multiple QTL models. Application of classical inference and Bayesian analysis approaches is covered and there is an emphasis on computational methods.
Prerequisites: STSCI 3080 and introductory statistics or equivalent.
Distribution Requirements: (OPHLS-AG)
Last Four Terms Offered: Spring 2026, Spring 2025, Spring 2024, Spring 2023
Learning Outcomes:
- Students will learn a statistical modeling strategy that is both basic and general, as well as how to apply this strategy to learn information about biological systems when analyzing genome-wide data. More specifically, students will learn the mathematics and interpretation of linear statistical models.
- Students will learn what these models can be used to infer when applied to genome-wide genetic and related data.
- Students will learn how to effectively and efficiently analyze large-scale genomic data and how to program in R for this purpose.
- Students will learn the limits of interpretation when applying these statistical models to genomic data when inferring information about a biological system.
BIOCB 4840 - Computational Genetics and Genomics (4 Credits)
Crosslisted with CS 4775
Computational methods for analyzing genetic and genomic data. Topics include sequence alignment, hidden Markov Models for discovering sequence features, motif finding using Gibbs sampling, phylogenetic tree reconstruction, inferring haplotypes, and local and global ancestry inference. Prior knowledge of biology is not necessary to complete this course.Grad students must do a final project that involves original research and that in most circumstances will involve programming and real data.Undergrads will not need to do research but will do a final project that involves a sizeable amount of programming, comprehensive literature review of a topic, or similar.
Prerequisites: STSCI 2200 and CS 2110; or equivalents.
Distribution Requirements: (BIO-AS, SDS-AS), (OPHLS-AG)
Last Four Terms Offered: Fall 2025, Fall 2024, Fall 2023, Fall 2022
Learning Outcomes:
- Understand computational algorithms used for the analysis of genetic and genomic data
- Formulate computational approaches for solving problems in computational genomics
- Understand challenges and limitations in inference methods used in computational genetics and genomics
BIOCB 4910 - Quantitative Approaches to Population Genetics (3 Credits)
This course covers the latest development and cutting-edge research topics in population genetics, aiming to enable students to perform research in population genetics. The first part will cover coalescent theory and inference involving complex demography. The second part will discuss natural selection and methods for inferring selection. We will allude to the complexity of demographic history and natural selection and their importance in explaining genomic patterns. The third part will introduce new data types and the challenges and opportunities with these data. We will dive into genotype likelihood and will emphasize the importance of simulation in population genetics. The course will be mostly delivered through lecturing, each interspersed with short conversations about reading assignments. Coursework involves reading literature, solving problem sets, and a course project.
Prerequisites: introduction to statistics or equivalent.
Distribution Requirements: (BIO-AG, BSC-AG, DLS-AG, MQL-AG)
Last Four Terms Offered: Spring 2026, Spring 2025, Spring 2024, Spring 2023
Learning Outcomes:
- Derive, compute, and interpret population genetic statistics grounded in coalescent and other evolutionary theories, and relate these quantities to observable patterns of genetic data.
- Apply population genetic models and inference frameworks to analyze genetic datasets, formulate and test evolutionary hypotheses, and draw evidence-based conclusions.
- Design and implement forward- and backward-in-time simulations (e.g., using tools such as SLiM and msprime) that incorporate key evolutionary processes, including mutation, recombination, migration, genetic drift, and natural selection.
- Conduct rigorous statistical analyses of simulated and empirical data, evaluate model fit, and interpret results in the context of evolutionary questions.
- Critically assess the assumptions, limitations, and appropriate use of analytical methods and published findings in population genetics.
- Communicate quantitative and evolutionary concepts effectively through clear written, oral, and visual scientific presentations.
BIOCB 5100 - Advanced Conservation Biology (4 Credits)
Crosslisted with ENTOM 5100, NTRES 5100
This course integrates technical approaches to biodiversity conservation, with a focus on biological analysis of species facing extinction risk. Students will learn quantitative tools for analyzing variation at genetic, population, and landscape levels. The curriculum covers stage-structured population dynamics, predation, gene flow, inbreeding, extinction processes, and harvesting effects. Analytical methods include population-projection models, perturbation analysis, metapopulation models, population viability analysis, and various genetic diversity assessments. Special attention is given to evaluating extinction risk in data-deficient species such as insects. By developing these quantitative skills, students will critically evaluate assumptions underlying conservation plans and assessments, including IUCN criteria and endangered species classification. The course is suitable for all biology and conservation-related majors.
Prerequisites: NTRES 3100 or BIOEE 3610 or ENTOM 4550; BIOMG2800 or NTRES 2830 or ENTOM 4700; or permission of instructor.
Last Four Terms Offered: Spring 2026
Learning Outcomes:
- Apply ecological and genetic principles to analyze patterns of biodiversity and assess extinction risks for threatened species.
- Conduct population viability analyses (PVA) to quantify extinction risk and evaluate conservation interventions.
- Analyze genetic diversity metrics including heterozygosity, inbreeding coefficients, and effective population size to inform conservation breeding programs.
- Evaluate challenges in assessing extinction risk for data-deficient species, with special emphasis on insects and other understudied taxa.
- Demonstrate proficiency in computational tools (R and other software) for conservation data analysis.
- Effectively communicate complex conservation analyses through scientific writing and data visualization.
BIOCB 6010 - Introduction to Computational Biology (3 Credits)
This class is designed to inspire students in the life sciences to see the power of computational biology in advancing the research frontier. In addition to providing students with foundational concepts in modern biology, the course will introduce core principles in computer science, mathematics, and statistics, including AI and machine learning. The course will also teach many fundamental skills in manipulating large data sets, including genome sequences, functional genomic data, protein structures, etc. The course will consist of two lectures and one practical session each week. In the practicals, students will learn the use of many of the latest software tools and will develop some basic programming skills. Students will be able to explore their own interests in greater depth in a term project.
Last Four Terms Offered: Fall 2025, Fall 2024, Fall 2023
Learning Outcomes:
- Formulate succinct and focused research questions that make effective use of computation in diverse problems in genomics, population genetics, medical genetics, systems biology, structural biology, phylogenetics, and conservation biology.
- Explain key computational, mathematical, and statistical concepts that underlie modern data analysis, including machine learning/artificial intelligence.
- Write short computer programs in python and R to organize and analyze complex data collected from across the biological sciences.
- Write short computer programs python and R that simulate biological data.
- Apply evolutionary principles to explain patterns of diversity both within and across species.
- Critically assess the validity of computational research from across the life sciences.
BIOCB 6350 - From Regression to LLMs: Introduction to Machine Learning for Computational Biology (4 Credits)
This course will provide a rigorous treatment of computational statistics and machine learning methods used to analyze big biological data types. Analysis methods covered will include: generalized linear models, support vector machines, regularized linear models, kernel methods, random forests, neural networks, large language models. While the course will be focused on analysis methods and connections among methods, applications making use of specific big biological data types will be covered. An understanding of method limitations will be prioritized, as well as how to critically assess when a desired conclusion can be justified. Methods will be implemented in the computer lab in Python, where some previous exposure to programming will be assumed.
Prerequisites: exposure to R and Python,
Last Four Terms Offered: Spring 2026
Learning Outcomes:
- Theory & Practice - Students will be able to explain the formal mathematical framework for each of the following methods, as well as implement a version of each in Python: generalized linear models, support vector machines, regularized linear models, kernel methods, random forests, neural networks, large language models.
- Data Analysis - Students will apply each of the following methods to both simulated and real biological data, including genomic, image, and health record data: generalized linear models, support vector machines, regularized linear models, kernel methods, random forests, neural networks, large language models.
- Communicating & Arguing with Data - Students will plot, present, and interpret in writing the outcomes from applying each of the following methods to real genomic, image, or health record data: generalized linear models, support vector machines, regularized linear models, kernel methods, random forests, neural networks, large language models.
- Data Interpretation and Critique - Students will be able to rigorously explain the limitations, and the implications of these limitations, when using the following methods to draw specific and alternative biological conclusions from real data: generalized linear models, support vector machines, regularized linear models, kernel methods, random forests, neural networks, large language models.
BIOCB 6381 - Biomedical Data Mining and Modeling (3 Credits)
A biomedical data science course using Python and available bioinformatics tools and techniques for the analysis of molecular biological data, including biosequences, microarrays, and networks. This course emphasizes practical skills rather than theory. Topics include advanced Python programming, R and Bioconductor, sequence alignment, MySQL database (DBI), web programming and services (CGI), genomics and proteomics data mining and analysis, machine learning, and methods for inferring and analyzing regulatory, protein-protein interaction, and metabolite networks.
Prerequisites: at least one introductory course in computer programming (any language) and one in statistical methods, or permission of the instructor.
Last Four Terms Offered: Fall 2024, Fall 2023, Fall 2022, Fall 2021
Learning Outcomes:
- Demonstrate familiarity with the basics of applied statistical methodology.
- Demonstrate familiarity with statistical software and a programming language.
- Demonstrate ability to perform complex data mining of biological datasets using a programming language.
- Demonstrate ability to effectively communicate the results of a statistical analysis to biologists.
- Demonstrate familiarity with statistical and computational tools for high throughput genomic data.
- Demonstrate ability to build stand-alone softwares, web tools, and databases for analyzing biological data.
BIOCB 6620 - Dynamic Models and Data in Biology (4 Credits)
Life is dynamic and ever-changing. Because of this, a central tool used to study living systems are dynamic models. This course provides an introductory survey of the development, computer implementation, and applications of dynamic models in biology, as well as statistical and machine learning methods for building such models from biological data. Case-study format covering broad range of biological applications including gene regulation, neurobiology, physiology, behavior, epidemiology and ecology. Students learn mathematical methods for interpreting and building biological systems models, as well as computational methods for simulating models on the computer using a scripting and graphics environment.
Last Four Terms Offered: Spring 2026, Spring 2025
Learning Outcomes:
- Students will be able to read a dynamic model, interpreting its equations as statements about underlying biological processes and the assumptions made about the rates and consequences of those processes.
- Students will be able to adapt existing models for applications to related systems or alternative scenarios.
- Students will be able to write computer programs (using R) to numerically solve low-dimensional matrix equations (deterministic and stochastic), difference equation, differential equation, and agent-based models for biological systems.
- Students will be able to write computer programs (using R) to estimate the parameters and structure of dynamic models from real data.
- Students will be able to locate equilibria, compute Jacobians, evaluate local stability through eigenvalue calculations and other methods, and interpret these results in terms of asymptotic system dynamics and bifurcations.
- Students will be able to read and understand biological research papers that use modeling as a primary methodology.
- Students will be able to formulate meaningful research questions about biological systems that can be addressed using dynamic models and data, and apply the skills learned in the course to answer those questions.
BIOCB 6810 - Principles of Population Genetics (4 Credits)
Crosslisted with BIOMG 6810
Population genetics studies the genetic composition of biological populations and how allele frequencies change over time and space. It serves as the theoretical foundation for understanding evolution. This course offers a comprehensive introduction to the core concepts and methods of population genetics, with a focus on linking observed patterns of genetic variation to the underlying evolutionary processes that generate them. Throughout the course, we emphasize the interplay between analytical theory, computer simulations, and the analysis of genetic data from both natural and experimental populations. We will also explore current efforts to connect genotype to phenotype and, ultimately, to fitness. Case studies will include the evolution of drug resistance, genetic ancestry inference, experimental evolution, and the genetic structure and demographic history of human populations.
Last Four Terms Offered: Fall 2025, Fall 2023
Learning Outcomes:
- Explain and interpret the fundamental evolutionary processes that shape patterns of genetic variation within and among populations.
- Critically evaluate current research in population genetics.
- Design and implement simple computer simulations of evolutionary processes to test hypotheses and theoretical predictions.
- Apply appropriate statistical and computational methods to population genomic datasets to draw evolutionary inferences, and evaluate the statistical power, assumptions, and conceptual limitations of these methods.
- Describe and analyze applications of population genetics in conservation biology, agriculture, and medicine.
- Communicate clearly and thoughtfully about the ethical and societal implications of population genetics research.
BIOCB 6830 - Quantitative Genomics and Genetics (4 Credits)
A rigorous treatment of analysis techniques used to understand complex genetic systems. This course covers both the fundamentals and advances in statistical methodology used to analyze disease and agriculturally relevant and evolutionarily important phenotypes. Topics include mapping quantitative trait loci (QTLs), application of microarray and related genomic data to gene mapping, and evolutionary quantitative genetics. Analysis techniques include association mapping, interval mapping, and analysis of pedigrees for both single and multiple QTL models. Application of classical inference and Bayesian analysis approaches is covered and there is an emphasis on computational methods.
Prerequisites: STSCI 3080 and introductory statistics or equivalent.
Last Four Terms Offered: Spring 2026, Spring 2025, Spring 2024, Spring 2023
Learning Outcomes:
- Students will learn a statistical modeling strategy that is both basic and general, as well as how to apply this strategy to learn information about biological systems when analyzing genome-wide data. More specifically, students will learn the mathematics and interpretation of linear statistical models.
- Students will learn what these models can be used to infer when applied to genome-wide genetic and related data.
- Students will learn how to effectively and efficiently analyze large-scale genomic data and how to program in R for this purpose.
- Students will learn the limits of interpretation when applying these statistical models to genomic data when inferring information about a biological system.
BIOCB 6840 - Computational Genetics and Genomics (4 Credits)
Computational methods for analyzing genetic and genomic data. Topics include sequence alignment, hidden Markov Models for discovering sequence features, motif finding using Gibbs sampling, phylogenetic tree reconstruction, inferring haplotypes, and local and global ancestry inference. Prior knowledge of biology is not necessary to complete this course.Grad students must do a final project that involves original research and that in most circumstances will involve programming and real data.
Prerequisites: STSCI 2200 and CS 2110; or equivalents.
Last Four Terms Offered: Fall 2025, Fall 2024, Fall 2023, Fall 2022
Learning Outcomes:
- Understand computational algorithms used for the analysis of genetic and genomic data
- Formulate computational approaches for solving problems in computational genomics
- Understand challenges and limitations in inference methods used in computational genetics and genomics
BIOCB 6890 - Current Topics in Population Genomics (1 Credit)
Graduate seminar on current topics in population genetics. Readings are chosen primarily from current scientific literature. Participation in discussion and presentation of at least one paper required for course credit.
Prerequisites: BIOMG 4810, BIOCB 4810 or permission of instructor.
Last Four Terms Offered: Spring 2026, Fall 2025, Spring 2025, Fall 2024
BIOCB 6910 - Quantitative Approaches to Population Genetics (3 Credits)
This course covers the latest development and cutting-edge research topics in population genetics, aiming to enable students to perform research in population genetics. The first part will cover coalescent theory and inference involving complex demography. The second part will discuss natural selection and methods for inferring selection. We will allude to the complexity of demographic history and natural selection and their importance in explaining genomic patterns. The third part will introduce new data types and the challenges and opportunities with these data. We will dive into genotype likelihood and will emphasize the importance of simulation in population genetics. The course will be mostly delivered through lecturing, each interspersed with short conversations about reading assignments. Coursework involves reading literature, solving problem sets, and a course project.
Prerequisites: BIOMG 2800 or BIOEE 1780; STSCI 3080 or STSCI 2200; or equivalents.
Last Four Terms Offered: Spring 2026, Spring 2025, Spring 2024, Spring 2023
Learning Outcomes:
- Derive, compute, and interpret population genetic statistics grounded in coalescent and other evolutionary theories, and relate these quantities to observable patterns of genetic data.
- Apply population genetic models and inference frameworks to analyze genetic datasets, formulate and test evolutionary hypotheses, and draw evidence-based conclusions.
- Design and implement forward- and backward-in-time simulations (e.g., using tools such as SLiM and msprime) that incorporate key evolutionary processes, including mutation, recombination, migration, genetic drift, and natural selection.
- Conduct rigorous statistical analyses of simulated and empirical data, evaluate model fit, and interpret results in the context of evolutionary questions.
- Critically assess the assumptions, limitations, and appropriate use of analytical methods and published findings in population genetics.
- Communicate quantitative and evolutionary concepts effectively through clear written, oral, and visual scientific presentations.
BIOCB 7200 - Statistical and Computational Genetics (1 Credit)
Weekly seminar series on recent advances in computational genomics. A selection of the latest papers in the field are read and discussed. Methods are stressed, but biological results and their significance are also addressed.
Prerequisites: BIOCB 4840 or BIOCB 6840 or CS 4775 or equivalent.
Last Four Terms Offered: Spring 2026, Spring 2025, Spring 2024, Spring 2023
BIOCB 7210 - Topics in Quantitative Genomics (1 Credit)
Weekly seminar series on recent advances in quantitative genomics. A selection of the latest papers in the field is read and discussed. Methods are stressed, but biological results and their significance are also addressed.
Prerequisites: BIOCB 4830 or BIOCB 6830 or permission of instructor.
Last Four Terms Offered: Spring 2026, Spring 2024, Spring 2023, Spring 2022
BIOCB 7600 - Data Driven Models in Biology (1 Credit)
Graduate seminar on methods for building models of biological systems using data, with an emphasis on recent methods including machine learning tools. Students will read and discuss recent literature in this area and, through group discussions, develop strategies for applying methods within their own research domains. Participation in discussion and presentation of at least one paper required for course credit.
Last Four Terms Offered: Fall 2025, Spring 2025, Spring 2024
Learning Outcomes:
- Describe and explain methods for developing mathematical and machine learning models using biological data.
- Describe recent developments in data-driven modeling and how these methods can be applied to study biological systems.
- Present results of recent scientific studies to their peers (4) discuss applications of data-driven modeling methods within their own research domains.
BIOCB 7700 - Topics in Statistical Genetics (1 Credit)
Weekly seminar series on recent advances in statistical genetics. A selection of the latest papers in the field are read and discussed. Methods are stressed, but biological results and their significance are also addressed.
Last Four Terms Offered: Fall 2025, Fall 2024
Learning Outcomes:
- After completing this course, students should be able to critically read a statistical genetics paper.
- After completing this course, assess the strengths and weaknesses of a scientific paper.
- After completing this course, determine whether a paper's conclusions follow from the presented analyses.
- After completing this course, describe the current state of statistical genetics research.
BIOCB 7800 - Genomic Methods in Evolution (1 Credit)
This class is aimed at students interested in designing experiments in evolutionary genomics. Evolutionary biology is at an exciting moment where “any organism…any genome” is possible. However, the pace of sequencing technology and computational methods development is rapid and before carrying out a research project, it is critical to use the appropriate methods for the focal research question. Specific topics will include laboratory methods, sample selection and collection, and computational approaches to data analysis. Students will read and discuss recent literature where genomics have been applied to fundamental questions in ecology, evolution, and conservation and, through discussions, develop strategies for applying these tools to their own research domains. Students are expected to present at least one paper and participate in course discussion for class credit.
Learning Outcomes:
- Explain different methods for generating and analyzing evolutionary genomic data.
- Explain how emerging genomic technology can be applied to fundamental questions in evolutionary biology.
- Present results of recent evolutionary genomic studies to their peers.
- Identify the appropriate methods for genomic analysis within their own research domains.
- Discuss the opportunity, limitations, and ethical considerations when applying genomics to questions across biological fields.
BIOCB 7900 - Communication Skills for Computational Biologists (1 Credit)
This class will help graduate students develop skills for disseminating their research in computational biology. This will include five topics. First, students will strengthen their writing skills, with a particular emphasis on writing compelling scientific proposals in the field of computational biology, which is a requirement for the A-exam. Second, students will learn how to design effective figures to communicate findings. Third, students will learn how to give scientific presentations, which Computational Biology Ph.D. students are required to do annually. Fourth, students will learn techniques for making and presenting posters. Fifth, students will develop strategies for communicating their research to non-specialists, including the public and stakeholders who might apply their research.
Learning Outcomes:
- Write a compelling scientific proposal in the field of computational biology.
- Use generative AI effectively in writing science as well as understand the limitations and ethical issues surrounding this.
- Design a figure that effectively communicates key results of a scientific study.
- Give a 15-minute presentation aimed at scientific colleagues that effectively conveys the rationale of the study, the scientific background, and how the new research being presented adds to our knowledge of an important scientific problem.
- Design and present a poster that is visually compelling.
- Concisely discuss both the rationale and the results of a research project in computational biology to a non-specialist audience.