Logo PTI
Polish Information Processing Society
Logo RICE

Annals of Computer Science and Information Systems, Volume 10

Proceedings of the Second International Conference on Research in Intelligent and Computing in Engineering

Application of Variational Mode Decomposition on Speech Enhancement

, ,

DOI: http://dx.doi.org/10.15439/2017R27

Citation: Proceedings of the Second International Conference on Research in Intelligent and Computing in Engineering, Vijender Kumar Solanki, Vijay Bhasker Semwal, Rubén González Crespo, Vishwanath Bijalwan (eds). ACSIS, Vol. 10, pages 293296 ()

Full text

Abstract. Enhancement of speech signal and reduction of noise from speech is still a challenging task for researchers. Out of many methods signal decomposition method attracts a lot in recent years. Empirical Mode Decomposition (EMD) has been applied in many problems of decomposition. Recently Variational Mode Decomposition (VMD) is introduced as an alternative to it that can easily separate the signals of similar frequencies. This paper proposes the signal decomposition algorithm as VMD for denoising and enhancement of speech signal. VMD decomposes the recorded speech signal into several modes. Speech contaminated with different types of noise is adaptively decomposed into various components is said to be Intrinsic Mode Functions (IMFs) by shifting process as in Empirical Mode decomposition (EMD) method. Next to it the denoising technique is applied using VMD. Each of the decomposed modes is compact. The simulation result shows that the proposed method is well suited for the speech enhancement and removal of noise by restoring the original signal.

References

  1. N. E. Huang, Z. Shen, S. R. Long, M. C. Wu, H. H. Shih, Q. Zheng, N.-C. Yen, C. C. Tung, and H. H. Liu, “The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis,” Proc. Royal Soc. A: Math., Phys. Eng. Sci., vol. 454, no. 1971, pp. 903–995, Mar. 1998.
  2. Mihir Narayan Mohanty, Aurobinda Routray, Ashok Kumar Pradhan, P.Kabisatpathy, “Power Quality Disturbances Classification using Support Vector Machines with optimized time-frequency kernels”, International Journal of Power Electronics,Vol.4, No 2, pp181-196, 2012.
  3. P. Loizou. Speech Enhancement: Theory and Practice. CRC Press, 2007.
  4. S. Boll. Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics, Speech and Signal Processing, 27(2):113 – 120, Apr. 1979.
  5. Yang Lu and Philipos C. Loizou, “A geometric approach to spectral subtraction. Speech Communication”, 50(6), pp 453–466, 2008.
  6. Y. Ephraim and H.L. Van Trees, “A signal subspace approach for speech enhancement. Speech and Audio Processing”, IEEE Transactions on, 3(4), pp 251 –266, Jul. 1995.
  7. Rashmirekha Ram, Mihir Narayan Mohanty, “Performance Analysis of Adaptive Algorithms for Speech Enhancement Applications”, Indian Journal of Science & Technology, 2016. (In press)
  8. V. Anoop, P.V.Rao and M. Nidhin, “Performance Analysis of Speech Enhancement Methods Using Adaptive Algorithms and Optimization Techniques”, IEEE, ICCSP,2015.
  9. S.Kamath, P.Loizou, “A Multi-band spectral subtraction method for enhancing speech corrupted by colored noise”, International Conference on Acoustics, Speech and signal processing, Vol.4,pp 4160-4164, 2002.
  10. Y. Ephraim, D. Malah, “Speech Enhancement using a Minimum Mean Square Error Log Spectral Amplitude Estimator”, IEEE Transactions on Acoustics, Speech and Signal Processing, Vol-33, pp 443-445, 1985.
  11. H.Sameti, “HMM-based strategies for Enhancement of Speech Signals Embedded in Nonstationary noise”, IEEE Transactions on Acoustics, Speech and Signal Processing, Vol-6, pp445-455, 1998.
  12. Junfeng Li , Shuichi Sakamoto, Satoshi Hongo, Masato Akagi, Yoiti Suzuki, “Adaptive β-order generalized spectral subtraction for speech enhancement”, Signal Processing, Elsevier Journal, No-88, pp 2764– 2776, 2008.
  13. Sayed. A. Hadei, M. Lotfizad, “A Family of Adaptive Filter Algorithms in Noise Cancellation for Speech Enhancement”, International Journal of Computer and Electrical Engineering, Vol. 2, No. 2, 2010.
  14. Quanshen Mai, Dongzhi He, Yibin Hou, Zhangqin Huang, “A Fast Adaptive Kalman Filtering Algorithm for Speech Enhancement”, IEEE International Conference on Automation Science and Engineering Trieste, Italy, August 24-27, 2011.
  15. Wu Caiyun, “Study of Speech Enhancement System Using Combinational Adaptive Filtering”, International Conference on Computer Distributed Control and Intelligent Enviromental Monitoring, 2012.
  16. Semwal, Vijay Bhaskar, Kaushik Mondal, and G. C. Nandi. "Robust and Accurate Feature Selection for Humanoid Push Recovery and Classification: Deep Learning Approach." Neural Computing and Applications, pp 1-10, 2015.
  17. Kumari, Pinki, and Abhishek Vaish. "Feature-Level Fusion of Mental Task’s Brain Signal for an Efficient Identification System." Neural Computing and Applications, 27.3, pp 659-669, 2016.
  18. Kumari, Pinki, and Abhishek Vaish. "Information-Theoretic Measures on Intrinsic Mode Function for the Individual Identification using EEG sensors." IEEE Sensors Journal 15.9 pp 4950-4960, 2015.
  19. Semwal, V.B., Singha, J., Sharma, P. et al., "An Optimized Feature Selection Technique based on Incremental Feature Analysis for Bio-Metric Gait Data Classification" Multimed Tools Appl., http://dx.doi.org/10.1007/s11042-016-4110, 2016.