Bayesian Enhancement of Speech Signals

Bayesian Enhancement of Speech Signals Jeremy Reed

Outline • Speech Model • Bayes application • MCMC algorithm • Results

Speech Model • Predict current speech sample from p previous samples (AR process) • Justified by physics • Lossless acoustic tubes • Time for vocal tract to change shape • Use a window of T samples for short-time analysis

Speech Model • x1 are corrupted or “bad” samples • Prior for e~N(0, σe2) • Prior, p(a, σe2)=p(a, σe2)~IG(σe2; αe, βe) • αe, βe chosen to be broad enough to incorporate a (approach Jeffrey’s Prior) • AR coefficients are normal with ML mean and variance related to error and samples

Speech Model • vt is the channel noise • vt~ N(0, σv2) • Inverse Gamma for prior on σv2 • Can use different distribution if have prior knowledge on the channel’s characteristics

Bayesian Speech Enhancement • x is the clean speech sequence • y is x plus additive noise, v • θ is a vector containing the parameters of the speech and noise

Algorithm • Window audio segment of T samples, overlapping successive windows by p samples • Assign initial values to a, σv2, and σe2 by using values from last p samples of previous windows • For first window, inferences for these parameters drawn from p(x,θ|y)

Algorithm • Perform Gibbs sampling for unknown parameters:

Algorithm • Rv is the covariance matrix for the corrupted samples and assumed diag(σv2)

Results – 440 Hz Sine Wave

Results - Speech

Bayesian Enhancement of Speech Signals

Bayesian Enhancement of Speech Signals

Presentation Transcript

Wavelet-Based Speech Enhancement

Subspace Methods for Speech Enhancement

Speech Enhancement

Automotive Speech Enhancement of Today: Applications, Challenges and Solutions

Advanced Speech Enhancement in Noisy Environments

Speech Enhancement Using Spectral Subtraction

Chapter 6 Linear Predictive Coding (LPC) of Speech Signals

Bayesian integration of visual and auditory signals for spatial localization

Wavelet-Based Speech Enhancement

Speech Enhancement EE 516 Spring 2009

Speech Enhancement

A Tutorial on Bayesian Speech Feature Enhancement

„Bandwidth Extension of Speech Signals“

Speech Enhancement using Excitation Source Information

Bayesian Methods for Speech Enhancement

Speech Enhancement for ASR

Wearable Speech Enhancement

Speech Enhancement through Noise Reduction

Signal Subspace Speech Enhancement

Bayesian Learning for Models of Human Speech Perception

Wavelet-Based Speech Enhancement