Date of Award

2024

Degree Type

Thesis

Degree Name

Master of Science in Biological and Environmental Sciences (MSBES)

Department

Biological Sciences

First Advisor

Jon Puritz

Abstract

Restriction site-associated DNA sequencing (RADseq) has emerged as an effective tool to survey natural populations for genetic structure and diversity. When undergoing bioinformatic processing, markers generated from RADseq are aligned to a reference sequence prior to variant calling, which may introduce reference bias in downstream analysis. Here, the benefits of building a variant graph are evaluated based on RADseq markers to reduce reference bias, improve variant calling accuracy, and estimate fixation index (FST) and nucleotide diversity (π). Results indicate graph-based alignments improve SNP detection and provide more accurate estimates of per-site FST across a range of demographic conditions. No differences were found in estimates of nucleotide diversity based on simulated data with graph-based methods. On the contrary, in real data, graph-based methods returned greater amounts of genetic variation and higher estimates of average FST and π. This study concludes that graph-based reference construction and alignment improves the accuracy of biological inference drawn from RADseq data.

Creative Commons License

Creative Commons Attribution-Noncommercial 4.0 License
This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 4.0 License.

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.