Search
Skip to Search Results- 3speech recognition
- 2audio segmentation
- 2forced alignment
- 1acoustics
- 1cognitive science
- 1energy efficiency
-
2018-01-01
Matthew C. Kelley, Benjamin V. Tucker
The present paper investigates the effect of different inputs on the accuracy of a forced alignment tool built using deep neural networks. Both raw audio samples and Mel-frequency cepstral coefficients were compared as network inputs. A set of experiments were performed using the TIMIT speech...
-
2018-01-01
Matthew C. Kelley, Benjamin V. Tucker
Poster for the paper "A comparison of input types to a deep neural network-based forced aligner," presented at Interspeech 2018. Typo in alignment matrix (O[2,2] referenced O[1,2] instead of O[1,1]) updated on June 4, 2019. PAPER ABSTRACT: The present paper investigates the effect of different...
-
2022-01-01
Matthew C. Kelley, Benjamin V. Tucker
Using phonological neighborhood density has been a common method to quantify lexical competition. It is useful and convenient but has shortcomings that are worth reconsidering. The present study quantifies the effects of lexical competition during spoken word recognition using acoustic distance...