Date of Award

1997

Degree Type

Dissertation

Degree Name

Doctor of Philosophy in Electrical Engineering

Department

Electrical Engineering

First Advisor

G. Faye Boudreaux-Bartels

Abstract

In this thesis, we analyze the complexity involved in the production of unvoiced speech signals with measures from nonlinear dynamics and chaos theory. Previous research successfully characterized some speech signals as chaotic. However, in this dissertation, we use multifractal measures to postulate the presence of various fractal regimes present in the attractors of unvoiced speech signals. We extend prior work which used only correlation dimension D₂ and Lyapunov Exponents to analyze some speech sounds. We capture the chaotic properties of unvoiced speech signals in the embedded vector space more succinctly by not only estimating the correlation dimension D₂, but also estimating the generalized dimension D_q. The (non-constant) generalized dimension were estimated from phase space reconstructed vectors of single scalar variable realization of unvoiced speech signals. The largest of those dimensions is an indicator of the minimum dimension required in the phase space of any realistic dynamic model of speech signals.

Results of the generalized dimension estimation support the hypothesis that unvoiced speech signals indeed have multifractal measures. The multifractal analysis also reveals that unvoiced speech signals exhibit low-dimensional chaos as well as "soft" turbulence. This is in contrast to the opinion that unvoiced speech signals are generated from what is technically known as "hard" turbulent flow, in which the dimension of a dynamical model is very high. Unvoiced speech signals may actually be generated from "soft" turbulent flow.

In this dissertation, we explore the relationship between the estimated generalized dimension D_q and the singularity spectrum ƒ(α). Existing algorithms for accurately estimated the resulting singularity spectrum ƒ(α) from the samples of generalized dimensions D_q of a multifractal chaotic time series use either (a) linear interpolation of the known, coarsely sampled, D_q values or (b) a finely sampled D_q curve obtained at great computational/experimental expense. Also, in conventional techniques the derivative in the expression for Legendre transform necessary to go from D_q to ƒ(α) is approximated using first order centered difference equation. Finely sampling the D_q is computationally intensive and the simple linear approximations to interpolation and differentiation give erroneous end points in the ƒ(α) curve. We propose using standard min-max filter design methods to more accurately interpolate between known samples of the D_q values and compute the differentiation needed to evaluate the Legendre transform. We use optimum (min-max) interolators and differentiators designed with the Parks-McClellan algorithm. We have computed the generalized dimensions and singularity spectrum of 20 unvoiced speech sounds from the ISOLET database. The results not only indicated multifractality of certain unvoiced speech sounds, but also may lead to nonlinear maps that may be useful in improving the nonlinear dynamical modeling of speech sounds.

This new approach to ƒ(α) singularity spectrum calculation exhibits computational reduction and improved accuracy. The proposed method also provides estimates of the generalized dimensions at D_∞ and D_-∞ which are almost impossible to obtain from real data with limited number of data samples. Also, the asymmetric spread of α values with the corresponding ƒ(α) around the maximum of ƒ(α) reveal the inhomogeneity in the attractors of unvoiced speech signals just like the variations in the D_q values. The asymmetric spread of α values may also be an indication that the turbulent energy fields generated during unvoiced speech production are made of non-homogeneous fractals.

Recommended Citation

Adeyemi, Olufemi A., "Multifractal Analysis of Unvoiced Speech Signals" (1997). Open Access Dissertations. Paper 468.
https://digitalcommons.uri.edu/oa_diss/468

Download

COinS

DOI

https://doi.org/10.23860/diss-adeyemi-olufemi-1997

Open Access Dissertations

Multifractal Analysis of Unvoiced Speech Signals

Date of Award

Degree Type

Degree Name

Department

First Advisor

Abstract

Recommended Citation

DOI

Terms of Use

Search

Browse

Author Corner

Open Access Dissertations

Multifractal Analysis of Unvoiced Speech Signals

Author

Date of Award

Degree Type

Degree Name

Department

First Advisor

Abstract

Recommended Citation

Share

DOI

Terms of Use

Search

Browse

Author Corner