Efficiency of Learning in Experience-Limited Domains: Generalization Beyond the WUG Test
Document Type
Conference Proceeding
Publication Date
1-1-2019
Abstract
Learning to read English requires learning the complex statistical dependencies between orthography and phonology. Previous research has focused on how these statistics are learned in neural network models provided with as much training as needed. Children, however, are expected to acquire this knowledge in a few years of school with only limited instruction. We examined how these mappings can be learned efficiently, defined by tradeoffs between the number of words that are explicitly trained and the number that are correct by generalization. A million models were trained, varying the sizes of randomly-selected training sets. For a target corpus of about 3000 words, training sets of 200-300 words were most efficient, producing generalization to as many as 1800 untrained words. Composition of the 300 word training sets also greatly affected generalization. The results suggest directions for designing curricula that promote efficient learning of complex material.
Publication Source (Journal or Book title)
Proceedings of the 41st Annual Meeting of the Cognitive Science Society: Creativity + Cognition + Computation, CogSci 2019
First Page
1566
Last Page
1571
Recommended Citation
Cox, C., Borkenhagen, M., & Seidenberg, M. (2019). Efficiency of Learning in Experience-Limited Domains: Generalization Beyond the WUG Test. Proceedings of the 41st Annual Meeting of the Cognitive Science Society: Creativity + Cognition + Computation, CogSci 2019, 1566-1571. Retrieved from https://repository.lsu.edu/psychology_pubs/2336