Document Type
Article
Publication Date
3-5-2018
Publication Title
Journal of Applied Statistics
Volume
45
Issue
12
Pages
2773-2787
Abstract
This study explores the performance of machine learning algorithms on the classification of fossil teeth in the Family Bovidae. Isolated bovid teeth are typically the most common fossils found in southern Africa and they often constitute the basis for paleoenvironmental reconstructions. Taxonomic identification of fossil bovid teeth, however, is often imprecise and subjective. Using modern teeth with known taxons, machine learning algorithms can be trained to classify fossils. Previous work by Brophy et al. [Quantitative morphological analysis of bovid teeth and implications for paleoenvironmental reconstruction of plovers lake, Gauteng Province, South Africa, J. Archaeol. Sci. 41 (2014), pp. 376–388] uses elliptical Fourier analysis of the form (size and shape) of the outline of the occlusal surface of each tooth as features in a linear discriminant analysis (LDA) framework. This manuscript expands on that previous work by exploring how different machine learning approaches classify the teeth and testing which technique is best for classification. In addition to LDA, four other machine learning techniques were considered (neural networks, nuclear penalized multinomial regression,random forests, and support vector machines) with support vector machines and random forests performing the best in terms of log loss and classification rate.
Recommended Citation
Matthews, Gregory J.; Brophy, Juliet K.; Luetkemeier, Maxwell; Gu, Hongie; and Thiruvathukal, George K.. A Comparison of Machine Learning Techniques for Taxonomic Classification of Teeth from the Family Bovidae. Journal of Applied Statistics, 45, 12: 2773-2787, 2018. Retrieved from Loyola eCommons, Mathematics and Statistics: Faculty Publications and Other Works, http://dx.doi.org/10.1080/02664763.2018.1441381
Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 License.
Copyright Statement
© Informa UK Limited 2018
Comments
Author Posting. © Informa UK Limited 2018. This article is posted here by permission of Taylor & Francis for personal use, not for redistribution. The article was published in the Journal of Applied Statistics, 2018, https://doi.org/10.1080/02664763.2018.1441381.