Isolating sections of human voice from noisy audio

Amit, 16 Sep 2015
Hello to all the DSP experts. As a novice with a problem, I have a humble request for guidance.
I am working with several 15-hour audio recordings captured in an industrial environment. They contain a noisy background with the constant whirring of machines (much like driving in a noisy vehicle), no major repetitive noise, occasional sudden sharp 'pings' (alarm signals from certain machines), and, on top of that, infrequent, sporadic human voice.
My main interest is in hearing the human communication. Currently, I am listening to the entire 15 hours!
I am not looking for a perfect solution; I just want to save as much time as possible by trimming the portions where no human voice is present.
Is there any way I can achieve this through signal processing?
Thank you all very much.
  1 Comment
Amit, 16 Sep 2015
Any starting points, even if not perfect solutions, will be highly appreciated. Thanks.


Accepted Answer

Image Analyst, 16 Sep 2015
I think you need to use Independent Component Analysis (ICA), a common technique for Blind Source Separation (BSS), to solve the "cocktail party problem". I'm not an expert in that, so all I can do is refer you to these links:
  1 Comment
Amit, 16 Sep 2015
Thanks, Image Analyst. While I try this, is there any 'makeshift' approach? My 15-hour samples typically contain 10-30 minutes of human voice. Can anything coarser be done? Thanks again.
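
One coarse starting point of the kind asked for here is a plain energy-based activity detector. This was not proposed in the thread; it is only a minimal sketch, and it assumes the machine noise is roughly constant in level, so that frames noticeably louder than the background are worth listening to. The function names, frame length, 6 dB margin, minimum duration and padding below are all illustrative assumptions, not tuned values. Very short bursts, such as the alarm 'pings', are dropped by the minimum-duration check. The sketch uses only the Python standard library and expects the audio as a list of float samples.

```python
import math


def frame_levels_db(samples, frame_len):
    """Return the mean power in dB of each non-overlapping frame."""
    levels = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        power = sum(x * x for x in frame) / frame_len
        levels.append(10.0 * math.log10(power + 1e-12))  # avoid log(0)
    return levels


def find_active_segments(samples, rate, frame_s=0.1, margin_db=6.0,
                         min_dur_s=0.3, pad_s=0.5):
    """Return (start_s, end_s) spans that are louder than the background.

    All thresholds are hypothetical defaults chosen for illustration.
    """
    frame_len = max(1, int(rate * frame_s))
    frame_dur = frame_len / rate
    levels = frame_levels_db(samples, frame_len)
    if not levels:
        return []

    # Assumption: at least 10% of frames are background only, so the
    # 10th percentile of the frame levels approximates the noise floor.
    floor_db = sorted(levels)[len(levels) // 10]
    active = [lvl > floor_db + margin_db for lvl in levels]

    # Collect runs of active frames, dropping runs that are too short
    # (for example, brief alarm pings).
    min_frames = max(1, round(min_dur_s / frame_dur))
    runs = []
    run_start = None
    for i, flag in enumerate(active + [False]):  # sentinel closes last run
        if flag and run_start is None:
            run_start = i
        elif not flag and run_start is not None:
            if i - run_start >= min_frames:
                runs.append((run_start * frame_dur, i * frame_dur))
            run_start = None

    # Pad each run so word onsets are not clipped, then merge overlaps.
    total = len(levels) * frame_dur
    merged = []
    for t0, t1 in runs:
        t0 = max(0.0, t0 - pad_s)
        t1 = min(total, t1 + pad_s)
        if merged and t0 <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], t1))
        else:
            merged.append((t0, t1))
    return merged
```

Calling `find_active_segments(samples, rate)` returns a list of time spans in seconds, which can then be used to cut the recording down to the parts worth reviewing. A single global noise floor is a strong assumption for a 15-hour recording; if the machine noise drifts, estimating the floor separately per block (say, per few minutes) may work better. This detector also cannot tell speech from any other sustained loud event, so it narrows down what to listen to rather than isolating voice.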


More Answers (0)
