Back to Search
Start Over
Predicting Dog Phenotypes from Genotypes
- Source :
- Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference. 2022
- Publication Year :
- 2022
-
Abstract
- We analyze dog genotypes (i.e., positions of dog DNA sequences that often vary between different dogs) in order to predict the corresponding phenotypes (i.e., unique observed characteristics). More specifically, given chromosome data from a dog, we aim to predict the breed, height, and weight. We explore a variety of linear and non-linear classification and regression techniques to accomplish these three tasks. We also investigate the use of a neural network (both in linear and non-linear modes) for breed classification and compare the performance to traditional statistical methods. We show that linear methods generally outperform or match the performance of non-linear methods for breed classification. However, we show that the reverse is true for height and weight regression. Finally, we evaluate the results of all of these methods based on the number of input features used in the analysis. We conduct experiments using different fractions of the full genomic sequences, resulting in input sequences ranging from 20 SNPs to ∼200k SNPs. In doing so, we explore the impact of using a very limited number of SNPs for prediction. Our experiments demonstrate that these phenotypes in dogs can be predicted with as few as 0.5% of randomly selected SNPs (i.e., 992 SNPs) and that dog breeds can be classified with 50% balanced accuracy with as few as 0.02% SNPs (i.e., 40 SNPs).
- Subjects :
- Enginyeria agroalimentària::Ciències de la terra i de la vida::Biologia [Àrees temàtiques de la UPC]
Neural Networks
Genotype
phenotype
Gossos
Performance
ADN
Classification technique
DNA sequences
Polymorphism, Single Nucleotide
Chromosomes
Non-linear regression
Computer
Dogs
single nucleotide polymorphism
Nonlinear classification
genomics
Animals
animal
Genomes
Polymorphism
Input features
Nonlinear mode
DNA
Single Nucleotide
Genomics
ComputingMethodologies_PATTERNRECOGNITION
Phenotype
dog
Neural-networks
Computer vision
Non-linear methods
Neural Networks, Computer
Linear methods
Forecasting
Regression techniques
Subjects
Details
- ISSN :
- 26940604
- Volume :
- 2022
- Database :
- OpenAIRE
- Journal :
- Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
- Accession number :
- edsair.doi.dedup.....4b070ec9b49c12eae76ed0ada2d327f6