Document Type
Article
Publication Date
11-18-2018
Abstract
This study explores the performance of machine learning algorithms on the classification of fossil teeth in the Family Bovidae. Isolated bovid teeth are typically the most common fossils found in southern Africa and they often constitute the basis for paleoenvironmental reconstructions. Taxonomic identification of fossil bovid teeth, however, is often imprecise and subjective. Using modern teeth of known taxa, machine learning algorithms can be trained to classify fossils. Previous work by Brophy et al. [Quantitative morphological analysis of bovid teeth and implications for paleoenvironmental reconstruction of Plovers Lake, Gauteng Province, South Africa, J. Archaeol. Sci. 41 (2014), pp. 376–388] uses elliptical Fourier analysis of the form (size and shape) of the outline of the occlusal surface of each tooth as features in a linear discriminant analysis (LDA) framework. This manuscript expands on that previous work by exploring how different machine learning approaches classify the teeth and testing which technique is best for classification. In addition to LDA, four other machine learning techniques were considered (neural networks, nuclear penalized multinomial regression, random forests, and support vector machines), with support vector machines and random forests performing the best in terms of log loss and classification rate.
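The two evaluation criteria named in the abstract, log loss and classification rate, can be illustrated with a minimal sketch. It assumes each classifier returns a probability for every candidate taxon per tooth. The function names, the labels `taxon_a` through `taxon_c`, and the probabilities are all hypothetical and are not taken from the study.

```python
import math

def log_loss(probs, labels, eps=1e-15):
    """Mean negative log-probability assigned to the true class (lower is better)."""
    total = 0.0
    for row, label in zip(probs, labels):
        # Clip to avoid log(0) when a classifier assigns zero probability.
        p = min(max(row[label], eps), 1.0 - eps)
        total -= math.log(p)
    return total / len(labels)

def classification_rate(probs, labels):
    """Fraction of specimens whose most probable class matches the true class."""
    hits = sum(
        1 for row, label in zip(probs, labels)
        if max(row, key=row.get) == label
    )
    return hits / len(labels)

# Hypothetical predicted probabilities for three teeth (illustrative only).
probs = [
    {"taxon_a": 0.7, "taxon_b": 0.2, "taxon_c": 0.1},
    {"taxon_a": 0.1, "taxon_b": 0.6, "taxon_c": 0.3},
    {"taxon_a": 0.3, "taxon_b": 0.4, "taxon_c": 0.3},
]
labels = ["taxon_a", "taxon_b", "taxon_c"]  # hypothetical true taxa

print(log_loss(probs, labels))
print(classification_rate(probs, labels))
```

Log loss penalizes confident wrong predictions more heavily than classification rate does, which is one reason the two metrics can rank classifiers differently.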
Publication Source (Journal or Book title)
Journal of Applied Statistics
Number
461
First Page
2773
Last Page
2787
Recommended Citation
Matthews, G., Brophy, J., Luetkemeier, M., Gu, H., & Thiruvathukal, G. (2018). A comparison of machine learning techniques for taxonomic classification of teeth from the Family Bovidae. Journal of Applied Statistics, 45 (15), 2773-2787. https://doi.org/10.1080/02664763.2018.1441381