Chemistry and Chemical Biology ETDs

Publication Date

Spring 7-15-2025

Abstract

Data-driven approaches to solving problems in biology and chemistry require utilization of reliable techniques and machine learning algorithms are the modern reliable approach. This work presents three problems that involve use of supervised learning techniques when classification is the goal and unsupervised learning techniques when global data representation is the goal.

In the first problem, we demonstrate the use of unsupervised clustering techniques, self-organizing maps and K-means, to ascertain analyte detection capabilities of carbon nitride dots. In the second problem, we add scalability features to a functional group classification model applied to infrared data and evaluate its ability to inform technologists of materials aging trends. In the last problem, we compare stacking and mixture of experts ensemble techniques’ aptitudes for base calling in new microarray resequencing technologies for evolving viral genomes.

Language

English

Keywords

Machine Learning, Chemistry, Data Science, Microarrays, SARS-CoV-2

Document Type

Dissertation

Degree Name

Chemistry and Chemical Biology

Level of Degree

Doctoral

Department Name

Department of Chemistry and Chemical Biology

First Committee Member (Chair)

Dr. Jeremy Edwards

Second Committee Member

Dr. Sherman Garver

Third Committee Member

Dr. Yi He

Fourth Committee Member

Dr. John Grey

Fifth Committee Member

Dr. Adam Halasz

Sixth Committee Member

Dr. Koushik Ghosh

Share

COinS