Date of Publication: 17th April 2018
Abstract: This paper presents an approach to confidence scoring for phone durations in speech recognition. The confidence score is derived from the correspondence between a Hidden Markov Model (HMM) based forced aligner and a Multi-Layer Perceptron (MLP) based frame classifier. Phone duration for noise is also factored into the approach, which makes it more reliable.
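As a rough illustration of the idea of scoring a phone duration by how well two systems agree, the sketch below computes, for each aligned segment, the fraction of frames whose classifier label matches the aligner's phone label. The abstract does not give the scoring formula, so this agreement ratio is an assumption, as are the function name `duration_confidence`, the segment format, and the toy data. Noise or silence is treated here as just another label (`"sil"`), which is also an assumption.

```python
from typing import List, Tuple

# Hypothetical segment format: (phone label, start frame inclusive, end frame exclusive),
# standing in for the output of an HMM-based forced aligner.
Segment = Tuple[str, int, int]


def duration_confidence(
    segments: List[Segment], frame_labels: List[str]
) -> List[Tuple[str, int, float]]:
    """Return (phone, duration_in_frames, confidence) for each aligned segment.

    Confidence is assumed to be the fraction of frames inside the segment
    whose frame-classifier label agrees with the aligner's phone label.
    """
    scored = []
    for phone, start, end in segments:
        frames = frame_labels[start:end]
        if not frames:
            # Empty segment: no evidence, so report zero confidence.
            scored.append((phone, 0, 0.0))
            continue
        agree = sum(1 for label in frames if label == phone)
        scored.append((phone, len(frames), agree / len(frames)))
    return scored


# Toy data, invented for illustration only.
segments = [("sil", 0, 3), ("k", 3, 5), ("ae", 5, 9)]
# Per-frame labels, standing in for the argmax output of an MLP frame classifier.
frame_labels = ["sil", "sil", "sil", "k", "ae", "ae", "ae", "ae", "t"]

scores = duration_confidence(segments, frame_labels)
for phone, duration, confidence in scores:
    print(phone, duration, confidence)
```

In this toy case the silence segment gets full agreement, while the `k` segment, where the classifier switches to `ae` one frame early, gets a lower score, flagging its duration as less trustworthy.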
Reference: