MIMO Beamforming System For Speech Enhancement in Realistic
MIMO Beamforming System For Speech Enhancement in Realistic
https://doi.org/10.1007/s10772-018-9530-9
Abstract
Multiple noise sources in a realistic environment severely degrade the quality and intelligibility of the desired speech signal,
thus posing a severe problem for many speech applications. Several noise reduction algorithms have been proposed with
a main goal to solve this problem. However, the good performances of such algorithms are severely impaired in realistic
environment under multi-noise sources condition. In this paper, the author treats the noise cancellation system as a multiple-
input multiple-output (MIMO) beamformer system. The proposed approach consists of two steps. First, the noise signals are
generated by applying the white noise sources to a MIMO AR system. Then, the noisy microphone signals are sequentially
processed by employing multi-channel linear prediction error filters (MCLPEFs) and multi-channel adaptive noise estima-
tion filters (MCANEFs) in the lower path of the proposed beamformer. The MCLPEFs are used to whiten the input signals,
while the MCANEFs are used as a MIMO system identification to perform the modeling process of the noise signals. Finally,
the noise signals in the upper path are subtracted from the estimated noises in the lower path to recover an enhanced speech
signal. Moreover, the performance of the proposed MIMO approach was validated under a realistic environment with real
noise sources.
Keywords Adaptive beamforming · Multiple-input multiple-output · Microphone array · Noise estimation · Multi-noise
reduction
13
Vol.:(0123456789)
International Journal of Speech Technology
were including the additions of two adaptive line enhancers Instead of choosing the filters of the quiescent weight vec-
(ALEs) in the primary and reference channels of the stand- tor independently of the filters for the blocking matrix, they
ard system so that the narrowband noise can be efficiently are the same. The steering of the array to the position of the
eliminated by these ALEs in the first stage, while the stand- desired source is implicitly included in the quiescent weight
ard system concentrates on the wide band noise cancella- vector. The problem of adapting the blocking matrix and
tion in the second stage. The proposed system has shown its the multi-channel noise canceller is addressed by switching
capability for reducing both the narrowband and wideband between the adaptation of the blocking matrix and the adapta-
noise signals and its performance was shown to be better tion of the multi-channel noise canceller depending on pres-
than that of the standard system. ence of the desired signal and presence of noise, respectively.
In this paper, the author treats the microphone array sys- The decision is performed by a full-band energy-based voice
tem as a multiple-input multiple-output (MIMO) optimum activity detector.
filter. This gives new insights into the merits of the beam- The way to compute the filters for the blocking matrix
former system which take reverberation of the input signals and for the quiescent weight vectors is altered in Linhui et al.
into account by modeling the propagation by room impulse (2017). The determination of optimum filters for the block-
responses. Such that the reverberations are cancelled at the ing matrix can be seen from a system identification point of
output of the linear predictor parts in the proposed system. view. Using this interpretation, it is suggested in Linhui et al.
In addition, this paper deals with the concept of speech (2017) to estimate the blocking matrix by a system identi-
enhancement under conditions of multi and various noise fication technique which exploits the non-stationarity of
sources in realistic environment. The proposed MIMO sys- the desired signal. As in Van Comprenolle (1990), speech
tem consists of two steps. In the first step, the noisy micro- pauses are required to determine the blocking matrix filters.
phone signals are whitened by using multi-channel linear The tracking problem for the sidelobe canceling path is thus
prediction filters in the lower path of the proposed MIMO not resolved. Other researchers have examined the improved
system. In the second step, the background noise signals GSC for speech enhancement in a reverberating environment
can be reconstructed from the prediction error signals by (Hoshuyama et al. 1999; Herbordt and Kellermann 2003; Her-
estimating the transfer functions of noise generation sys- bordt et al. 2007). Hoshuyama et al. (1999) used a three-block
tems. These estimations are performed by the adaptive noise structure similar to the GSC. However, the blocking matrix
estimation filters in the multi-channel noise canceller. The has been modified to operate adaptively. In order to limit the
enhanced speech signal is obtained by subtracting the recon- leakage of the desired speech signal, which is responsible
structed background noise signals from the noise signals in for distortion in the output signal, a quadratic constraint is
the upper path. The rest of this paper is organized as follows. imposed on the norm of the noise canceller coefficients. Alter-
In Sect. 2, the research background is illustrated, while in natively, use of the leaky LMS algorithm has been suggested.
Sect. 3 the problem formulation and notation is introduced. In Herbordt and Kellermann (2003), the relation of the robust
Section 4 provides the principle of the proposed MIMO sys- GSC after Hoshuyama et al. (1999) with wideband linearly-
tem for multiple noise cancellation and dereverberations. In constrained least squares error (LCLSE) beamforming in the
Sect. 5, simulation results and discussions will be presented time-domain using spatio-temporal constraints was introduced.
to demonstrate the performance of the proposed approach for The frequency-domain implementation of the robust GSC
dereverberation and multi-noise reduction. This Section also beamformer was introduced to accelerate the convergence rate
discusses the performance of the proposed MIMO system (Herbordt et al. 2007).
for variety of actual noise sources in an actual room acoustic
environment. Conclusions are given in Sect. 6. 3 Problem formulation
13
International Journal of Speech Technology
y1 (k )
Room e out (k ) + s j (k ) j = 1,...J N y2 ( k ) Room
s1 (k )
∑ WFB (k ) s 2 (k )
Upper path H (k )
y1 (k ) h11 - y N (k )
h21 h
s1 ( k ) s J (k )
sˆ j (k )
y2 ( k ) h12
N1
eˆ (k )
h22 s 2 (k ) WANEF (k ) WLPEF (k )
N N
hN 2 Lower path
h1 J
y N (k ) h2 J Fig. 2 Schematic diagram of the proposed MIMO system
s J (k )
hNJ
𝐇 is a J(L + M − 1) × NM convolution matrix expressed as 𝐂s (k) = diag{𝐂(k), 𝐂(k − 1), ..., 𝐂(k − M + 1)} (12)
⎡ 𝐇1,1 𝐇1,2 ... 𝐇1,N ⎤ [ ]T
⎢ 𝐇 𝐇 ... 𝐇2,N ⎥ 𝐜s (k) = 𝐜T (k), 𝐜T (k − 1), ..., 𝐜T (k − M + 1) , (13)
𝐇 = ⎢ 2,1 2,2 (7)
⎢ . . ⎥⎥
⎣ 𝐇J,1 𝐇J,2 ... 𝐇J,N ⎦ respectively. The constraint Eq. (10) repeats the spatial con-
straints 𝐂T (k − m)𝐰(k − m) = 𝐜(k − m) for M successive
time instant m = 0, 1, ..., M − 1 . We demand that the desired
⎡ 𝐡j,n 0 ... 0 ⎤ speech signal processed by a fixed beamformer weight vec-
⎢ 0 𝐡j,n ⎥
=⎢ (8) tor 𝐖FB (k) is not distorted at the output of the proposed
0 ⎥⎥
𝐇j,n
⎢ approach over a data block of M successive samples. With
⎣ 0 ... 0 𝐡j,n ⎦
the vector 𝐱(k) of the desired signal at the microphones
[ ]T
𝐡j,n = [hj,n (0), hj,n (1), ..., hj,n (L − 1)]T 𝐱(k) = 𝐱1T (k) 𝐱2T (k), ..., 𝐱NT (k) (14)
(9)
j = 1, 2, ..., J and n = 1, 2, ..., N
13
International Journal of Speech Technology
[ ]T [ ]T
𝐱n (k) = 𝐱n (k) 𝐱n (k − 1), ..., 𝐱n (k − (M − 1)) , (15) 𝐰n (k) = wn (0) wn (1), ..., wn (MLPEF − 1) n = 1, 2, ..., N
(23)
such a constraint can be put into the form of (10) with C = 1 are the tap coefficients of the nth LPEF which are updated by
as follows: normalized least mean square (NLMS) adaptive algorithm
𝐂s (k) = diag{𝐱(k), 𝐱(k − 1), ..., 𝐱(k − M + 1)} (16) (Haykin and Kailath 2002). By minimizing the mean square
values of the prediction error signal set ̂
𝐞(k) , we obtain
𝐜s (k) = 𝐂Ts (k)𝐖FB (k) (17) [ ]+ { }
where 𝐖𝐋𝐏𝐄𝐅 = 𝐇T E{𝐬(k − 1)𝐬T (k − 1)}𝐇 𝐇T E 𝐬(k − 1)𝐬T (k) 𝐇
(24)
[ ]T where A+ is the Moore–Penrose generalized inverse of
𝐖𝐅𝐁 (k) = 𝐖𝐓𝐅𝐁 (k), 𝐖𝐓𝐅𝐁 (k − 1), ..., 𝐖𝐓𝐅𝐁 (k − M + 1)
(18) matrix A (Harville 1997), and E{.} is an expectation opera-
Note that the constraint vector 𝐜𝐬 (k) depends on the fixed tor. Assuming the source signals 𝐬j (k) can be generated by
beamformer weight vector 𝐖𝐅𝐁 (k) so that the design of 𝐜𝐬 (k) the MIMO AR process, we can write 𝐬j (k) as (Kailath et al.
is replaced by the appropriate choice of the weight vector 2000)
𝐖𝐅𝐁 (k) . This has the advantage that any beamformer design
𝐬j (k) = 𝐂Tj 𝐬j (k − 1) + 𝐞j (k) (25)
can be used to specify the constraints, or equivalently, to
specify the beamformer response with respect to the desired where 𝐞j (k) = [ej (k) 0, ..., 0]T , and 𝐂𝐣 is the B × B companion
signal. The fixed beamformer weight vector may be realized as matrix defined as
a delay and sum (DS) beamformer (as assumed in this paper),
if the position of the desired source only varies little within a ⎡ aj,1 1 0 . 0⎤
given interval of direction of arrival. When designing the fixed ⎢a 0 1 . 0⎥
𝐂𝐣 = ⎢ j,2 (26)
1 ⎥⎥
, j = 2, 3, ..., J
beamformer, one has to assure that the mainlobe width of the ⎢ .
fixed beamformer is sufficiently wide for not distorting the ⎣ aj,B 0 . . 0⎦
desired signal. Robustness against array perturbations can be
obtained by using appropriate designs of fixed beamformers. where aj is Z-transform of an AR polynomial of order B
given by
4.2 MCLPEF weights 𝐖LPEF (k) { }
aj (z) = 1 − aj,1 z−1 + .... + aj,B z−B , (27)
As can be seen from structure of the proposed system (Fig. 2),
we have two signal paths. The first is upper path, which con- and B is the order of the AR polynomials. We then have
tains a desired speech and residual noises after applying fixed { } { }
beamformer weights. The second path is the lower path, E 𝐬(k − 1)𝐬T (k) = E 𝐬(k − 1)𝐬T (k − 1) 𝐂𝐣 (28)
which consists of multi-channel linear prediction error filters
where 𝐬(k − 1) and 𝐞(k) are assumed to be orthogonal.
(MCLPEFs) and multi-channel adaptive noise estimation fil-
Assuming that E{𝐬(k − 1)𝐬T (k − 1)} is positive definite, we
ters (MCANEFs). The input signals of the MCLPEFs are the
can replace it with 𝐗T 𝐗 , where X is a matrix. Thus, the
noisy microphone signals yn (k) n = 1, 2, ..., N .
linear prediction error filters can be expressed as (Delcroix
In a matrix form, the output signal of the MCLPEFs can
et al. 2005)
be expressed as
( )−1
𝐞(k) = 𝐲(k) − 𝐲T (k − 1)𝐖𝐋𝐏𝐄𝐅
̂ 𝐖𝐋𝐏𝐄𝐅 = 𝐇T 𝐇𝐇T 𝐂𝐣 𝐡1 (29)
(19)
= 𝐬T (k)𝐇 − 𝐬T (k − 1)𝐇𝐖𝐋𝐏𝐄𝐅 where 𝐡1 = [𝐡T1,1 𝐡T2,1 , ..., 𝐡TJ,1 ]T is the first column of H.
Using (2), (19) and (29), the prediction error signal ̂ 𝐞(k)
where becomes
[ T ]T
̂ 𝐞T2 (k), ..., ̂
𝐞1 (k) ̂
𝐞(k) = ̂ 𝐞TN (k) (20) 𝐞(k) = 𝐬T (k)𝐇 − 𝐬T (k − 1)𝐇𝐖𝐋𝐏𝐄𝐅
̂
= 𝐬T (k)𝐇 − 𝐬T (k − 1)𝐇𝐇𝐓 (𝐇𝐇𝐓 )−𝟏 𝐂𝐣 𝐡𝟏
[ ]T
[ ] (30)
̂
𝐞n (k) = ̂
en (k) ̂
en (k − 1), ..., ̂
en (k − (MLPEF − 1)) n = 1, 2, ..., N = (𝐬T (k) − 𝐬T (k − 1)𝐂𝐣 𝐡𝟏
(21) = 𝐞T (k)𝐡𝟏
[ ]T
𝐖𝐋𝐏𝐄𝐅 (k) = 𝐰T1 (k) 𝐰T2 (k), ..., 𝐰TN (k) (22)
Equation (30) shows that the prediction error signals
𝐞(k) are proportional to the white noise sources 𝐞(k) . The
̂
13
International Journal of Speech Technology
s1 (k )
The system 𝐖ANEF (k) is driven by N input signals ̂ 𝐞(k)
according to (20). The output signal of the MIMO system
e out (k ) sj e(k ) 𝐞(k) j = 2, 3, ..., J are subtracted from the
̂𝐬j (k) = 𝐖TANEF (k)̂
∑ ∑ MIMO Noise
Generation speech reference signal 𝐬𝐣 (k) j = 1, 2, ..., J . This yields the
- J J-1 J-1
error signal 𝐞𝐨𝐮𝐭 (k) as follows 𝐞𝐨𝐮𝐭 (k) = 𝐬j (k) − ̂𝐬j (k) . The
error signal 𝐞𝐨𝐮𝐭 (k) is used for formulating cost function in
sˆ j (k ) order to determine 𝐖ANEF (k) according to LS optimization
WANEF (k )
N N criterion.
Using the singular value decomposition (SVD) for solv-
Fig. 4 MIMO system identification ing the LS problem, the solution can be computed as (Golub
and Loan 1989)
[ T ]+
MCLPEFs takes reverberation (multi-path propagation with 𝐖ANEF (k) = ̂
𝐞 (k) 𝐬j (k) j = 1, 2, ...J (32)
respect to the input source signals) into account by mod-
eling the propagation by room impulse response, such that The important observation from Fig. 4 is that the
the reflected signal paths are cancelled at the output of the
𝐖ANEF (k) performs very well even when the number of input
MCLPEFs. However, the prediction error signals becomes signals J (or equivalently the number of noise sources) are
white (Haykin and Kailath 2002). In the MCANEFs, we may equal to the number of microphones N. This is the unique
estimates the MIMO AR polynomials of the noise generation feature of the proposed system, while the good performance
systems in order to reconstruct the noise signals precisely. of the existing techniques is severely impaired in case J ≥ N .
The influences of these parameters (J, N and filter length
4.3 MCANEF weights 𝐖ANEF (k) MANEF of the adaptive noise estimation filters) on the per-
formance of the proposed MIMO system are discussed in
In this part, we consider the reconstruction of the noises the following section.
from whitened signals (output of LPEFs). Assume that
the noise sources are generated by exciting a MIMO finite
order AR system with a white noise sources as shown in 5 Simulation results
Fig. 3. The principle of MIMO system identification model
is shown in Fig. 4. The MCANEFs is placed in paral- 5.1 Simulation environments
lel to MIMO noise generation system. The output signals
𝐬𝐣 (k) j = 1, 2, ..., J are subtracted from the output ̂𝐬𝐣 (k) of The simulation room has dimensions 6 m × 3 m × 2.5 m.
the modeling system, which yields the error signals 𝐞𝐨𝐮𝐭 (k) . It consists of a microphone array, a desired speech source
Minimization of the 𝐞𝐨𝐮𝐭 (k) according to LSE cost function, s1 (k) and multi white Gaussian noise sources. The speech
allows to identify the MIMO noise generation system by source is located at 1 m from the center of the microphone
the modeling system 𝐖ANEF (k) . Note that the noise signals array at broadside direction, and the noise sources are
𝐬𝐣 (k) j = 2, ..., J and speech s1 (k) are assumed to be desired located at a distance of 2 m from the array center at direc-
signals and disturbance, respectively. tions 𝜃n = 0, ± 30, ± 45, 90 and 180◦ . Unless otherwise
The MCANEFs cannot estimate the desired speech sig- indicated, we will only use the noise source at 𝜃n = 45◦ (in
nal s1 (k) . This is due to the fact that the residual speech Sect. 5.3 and 5.4, multi noise sources will be used). The used
components are whitened by the MCLPEFs in the lower signals are sampled at 16 kHz with 16 bit resolution. The
path. Then the whitened residual speech components acoustic room impulse responses are calculated using the
13
International Journal of Speech Technology
13
International Journal of Speech Technology
30
28 16
26
Proposed (N=6)
24 14
22 Proposed (N=4)
12
Noise Reduction [dB]
Fig. 7 Noise reduction performance for T60 = 64 ms (a) and T60 = 200 ms (b) over the filter length MANEF and as a function of the number of
microphones N
20
10 GSC (N=2)
GSC (N=4) 18
GSC (N=6) 16
8 Proposed (N=2)
Proposed (N=4) 14
Speech Distortion [dB]
Speech Distortion [dB]
Proposed (N=6)
6 12
10
GSC (N=2)
4 8 GSC (N=4)
GSC (N=6)
6 Proposed (N=2)
2
4 Proposed (N=4)
Proposed (N=6)
0 2
0
0 500 1000 1500 2000 0 500 1000 1500 2000
Filter Length Filter Length
Fig. 8 Speech distortion performance for T60 = 64 ms (a) and T60 = 200 ms (b) over the filter length MANEF and as a function of the number of
microphones N
5.3 Noise reduction performance for multi‑noise microphones. For example, when number of noises = 4 and
sources N = 4, obviously the noise reduction by GSC (5.541 dB) is
equal to noise reduction by DS (5.451 dB). This means that
In this part, we consider all the noise sources as they are the noise reduction ability of the GSC is performed by DS
given in Sect. 5.1. The T60 of the simulated room is 200 ms. in its reference path only and there are no degrees of free-
Table 1 summarizes the simulation results. Many observa- dom for noise reduction in multi-channel noise canceller.
tions and interdependencies which have been described ear- Second, the proposed system can achieve almost perfect
lier in Sect. 4 can clearly be noticed from this table. First noise reduction even when the number of noise sources is
of all, as the number of noises increases, the noise reduc- equal to N. For example, when number of noises = 4 and
tion ability of the GSC decreases, particularly when the N = 4 the noise reduction performance of the proposed sys-
number of noises is equal or greater than the number of tem is (20.211 dB). The noise reduction improvement with
13
International Journal of Speech Technology
Table 1 Performance of noise reduction using different algorithms, T60 = 200 ms , MLPEF = 32 taps , and MANEF = 256 taps
Noise source Two microphones (N = 2) Four microphones (N = 4) Six microphones (N = 6)
DS (dB) GSC (dB) Proposed (dB) DS (dB) GSC (dB) Proposed (dB) DS (dB) GSC (dB) Proposed (dB)
respect to GSC is about (14.67 dB). This improvement is which can be used to remove the undesired low-frequency
also depicted in Fig. 9. The reason behind this has been components. Figure 10b, c illustrates the spectrograms
explained in Sect. 4. Finally, the performance of the pro- of the enhanced speech for both GSC and the proposed
posed system decreases with increasing the number of noise MIMO system, respectively. It can be seen that the spec-
sources and more than the number of adaptive noise estima- trograms contain some of such low frequency compo-
tion filters 𝐖ANEF (k). nents. Comparing these three results, it reveals that the
proposed system have achieved almost perfect multi-noise
5.4 Performance validation with actual noise cancellation. However, the output of GSC still consists of
signals a significant amount of noises. Apparently, GSC is less
effective than the proposed system in terms of multi-noise
In this experimental four microphones, i.e. N = 4 arranged suppression. Since GSC employ only the channel informa-
in a linear array with spacing 5 cm are used. The micro- tion from the desired speech to the microphones while
phones are omni-directional electrets condenser of type the proposed system use all channel information includ-
ATP-20M. One desired speech s1 (k) and four noise ing those from noise sources. Figure 11 shows another
sources, i.e., s2 (k) , s3 (k) , s4 (k) and s5 (k) are considered. results for moving car, where the actual car noise signals
These noises are white Gaussian noises played by loud- are recorded inside the moving car then played back by
speakers. The Labview (2005) with appropriate data the loudspeakers s2 and s3 . It can be seen that the proposed
acquisition card is used to implement the proposed MIMO system has the potential for reducing the actual car noises.
system. For comparison, the GSC system is also imple- Also, note that the amount of the undesirable cancellation
mented and tested. All the parameters are chosen to be in the desired speech signal is high when using the GSC.
same as those in the simulated acoustic environment, if not This cancellation in the desired signal becomes less with
mentioned otherwise. To show the overlapping in the time the use of proposed system. In order to compare the multi-
and frequency domains, we used spectrogram to display noise reduction ability of the DS, GSC and the proposed
the results. Figure 10a shows the spectrogram of the total system for a variety of actual noise sources. The SNR
signals received by first microphone. Note that to show improvement are calculated and summarized in Table 2.
the realistic result, we have not applied a high-pass-filter, From Table 2, it may be noted that the superiority of the
proposed system with respect to GSC is obvious.
13
International Journal of Speech Technology
Fig. 10 Spectrograms of the noisy signal, enhanced speech using Fig. 11 Practical results for actual car noises
GSC, and proposed system
13
International Journal of Speech Technology
that the proposed MIMO system has a better multi-noise- Herbordt, W., Buchner, H., Nakamura, S., & Kellermann, W. (2007).
reduction and lower speech-distortion performances than Multichannel bin-wise robust frequency-domain adaptive filtering
and its application to adaptive beamforming. IEEE Transactions
standard fixed and adaptive beamforming techniques for all on Audio, Speech, and Language Processing, 15(4), 1340–1351.
reverberation times. This due to the channel information for Herbordt, W., & Kellermann, W. (2003). Adaptive beamforming
all input signals are considered in the proposed approach. for audio signal acquisition. In J. Benesty & Y. Huang (Eds.),
Because the proposed approach loses no degrees of freedom Adaptive signal processing: Applications to real-world problems
(pp. 155–194). Berlin: Springer.
for multi-noise cancellation, it can be implemented with a Hoshuyama, O., Sugiyama, A., & Hirano, A. (1999). A robust adaptive
small number of microphones. The performance of the pro- beamformer for microphone arrays with a blocking matrix using
posed MIMO system was validated under a realistic environ- constrained adaptive filters. IEEE Transactions on Signal Process-
ment with real noise sources. ing, 47(10), 2677–2684.
Kailath, T., Sayed, A. H., & Hassidi, B. (2000). Linear estimation.
Upper Saddle River, NJ: Prentice-Hall.
Linhui, Sun, Min, & Yang, SuZ. (2017). An adaptive speech endpoint
detection method in low SNR environments. International Jour-
References nal of Speech Technology, September, 20(3), 651–658.
Mohammed, H. S., Rihan, A. M., NassarAdel, M. A., El-Fishawy, S.,
& Abd El-Samie, E. (2017). Efficient compression and reconstruc-
Allen, J., & Berkley, D. A. (1979). Image method for efficiently simu- tion of speech signals using compressed sensing. International
lating small room acoustics. Journal of the Acoustical Society of Journal of Speech Technology, 20(4), 851–857.
America, 66, 943–950. Mohammed, J. R. (2009a). An efficient method for combining adap-
Bitzer, J., Simmer, K. U., & Kammeyer, K. D. (1999). Theoretical noise tive echo and noise canceller in hands-free systems. International
reduction limits of the generalized sidelobe canceller (GSC) for Journal of Adaptive Control and Signal Processing, 23, 278–292
speech enhancement. Proceeding of IEEE International Confer- Mohammed, J. R. (2009b). Adaptive noise reduction and acoustic echo
ence on Acoustics, Speech, and Signal Processing (ICASSP), 5, cancellation using adaptive filters in hands-free communication
2965–2968) systems, PhD Thesis, Punjab Engineering College, Panjab Uni-
Delcroix, M., Hikichi, T., & Miyoshi, M. (2005). Blind dereverbera- versity, Chandigarh, India, 20 November 2009.
tion algorithm for speech signals based on multi-channel linear Mohammed, J. R. (2017). Development of two-input adaptive noise
prediction. Acoustical Science and Technology, 26(5), 432–439. canceller with ability to cancel wideband and narrowband noise
Djendi, M. (2018). An efficient wavelet-based adaptive filtering algo- signals. International Journal of Speech Technology, 20(3),
rithm for automatic blind speech enhancement. International 741–751.
Journal of Speech Technology, 21(2), 355–367. National Instruments, LabVIEW Fundamentals, Version 8.0, User
Faucon, G., Mezalek, S. T., & Le Bouquin, R. (1989). Study and com- Manual, Part Number 324029A-01, August 2005.
parison of three structure for enhancement of noisy speech. Pro- Neubauer, R. O. (2001). Existing reverberation time formulae—a
ceeding of IEEE International Conference on Acoustics, Speech, comparison with computer simulated reverberation times. In 8th
and Signal Processing, 1, 385–388. International Congress on Signal and Vibration, Hong Kong, July,
Golub, G. H., & Van Loan, C. F. (1989). Matrix computations 2001 (pp. 805–812).
(2nd edn.). Baltimore: John Hopkins University Press. Rashmirekha, R., & Mohanty, M. N. (2018). Performance analysis of
Griffiths, L. J., & Jim, C. W. (1982). An alternative approach to linearly adaptive variational mode decomposition approach for speech
constrained adaptive beamforming. IEEE Transactions on Anten- enhancement. International Journal of Speech Technology, 21(2),
nas and Propagation, 30(1), 27–34. 369–381.
Harville, D. A. (1997). Matrix algebra from a statistician’s perspective. Van Comprenolle, D. (1990). Switching adaptive filters for enhanc-
New York: Springer. ing noisy reverberant speech from microphone array recordings.
Haykin, S., & Kailath, T. (2002). Adaptive filter theory (4th edn.). In Proceeding of IEEE ICASSP, Albuquerque NM, April 1990
Upper Saddle River, NJ: Prentice-Hall, Pearson Education, Inc. (Vol. 2, pp. 833–836).
13