(25-04-24) Successful master thesis defense by Erika Castaneda Sastre

Master thesis front page

Abstract

Chromosomes, carriers of an organism’s hereditary information, can be categorized into two main types: allosomes and autosomes. Allosomes, also known as sex chromosomes, play a crucial role in sex determination and regulating sex-related traits. Despite considerable diversity, they share standard features and exhibit variations in gene content and pairing systems. Understanding sex chromosomes is crucial for agricultural and disease control efforts, where genetic approaches utilizing sex-specific traits show promise. However, identifying allosomes, especially in non-model organisms, poses challenges. Here, we explore the use of supervised machine learning models, including logistic regression, random forests, support vector machines, and k-nearest neighbors, to classify contigs as autosomes or allosomes based on whole-genome sequencing data. The predictive capability of features like coverage, heterozygosity, and GC content is assessed. The results underscore the importance of feature combinations and model selection for accurate classification.

Kumar Saurabh Singh
Kumar Saurabh Singh
Assistant Professor