VGGishPreprocess not accepting input

Question

Joshua Gardner 2022년 4월 8일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1691785-vggishpreprocess-not-accepting-input

댓글: Joshua Gardner 2022년 4월 11일

Hello,

I'm trying to do some transfer learning on the VGGish network with my own data but can't seem to get VGGishPreprocess to accept it.

The code is:

net=vggish;
downloadFolder = fullfile(tempdir,'Data_Test1');
datasetLocation = tempdir;
ads = audioDatastore(downloadFolder,'IncludeSubfolders',true,'LabelSource','foldernames');
labelTable = countEachLabel(ads)
numClasses = size(labelTable,1);
[adsTrain, adsValidation] = splitEachLabel(ads,0.8);
countEachLabel(adsTrain)
countEachLabel(adsValidation)
overlapPercentage = 75;
fs = 16000;
[trainFeatures,trainLabels] = vggishPreprocessNEW(adsTrain,overlapPercentage); 

I should note that

vggishPreprocessNEW

is the variation of vggishPreprocess that is supplied in this article, rather than the one offered in the Audio Toolbox.

The error i end up with is:

I have run the same code on the ESC-10 dataset that is provided, which works fine.

I have copied ESC-10's file structure and format, reducing all my input files to the same size and length.

I am now unsure what else is left to test, I feel I am running my test in exactly the same way but with a different output.

System is MacBook Pro M1, 2020, Monterey 12.1, 8GB

Many thanks in advance for any help, it is greatly appreciated.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

jibrahim 2022년 4월 8일

1
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1691785-vggishpreprocess-not-accepting-input#answer_938240

MATLAB Online에서 열기

Hi Joshua,

First, sorry we had a local funtion name with the same name as a function we ship. We are fixing this.

The function you copied and you're working off does not do well when the audio has multiple channels (for example, stereo). I suspect this is why you are seeing this error. If you only care about one channel, you can use indexing to just use the first channel of the audio. If you care about all channels, use this code instead (it uses the documented vggishPreprocess function. Make sure you do not have local function with that name around):

function [predictor,response,segmentsPerFile] = vggishPreprocessDatastore(ads,overlap)
% Preallocate cell arrays for the predictors and responses
numFiles = numel(ads.Files);
predictor = cell(numFiles,1);
response = predictor;
segmentsPerFile = zeros(numFiles,1);
% Extract predictors and responses for each file
for ii = 1:numFiles
    [audioIn,info] = read(ads);
    duration = size(audioIn,1)/info.SampleRate;
    if duration>=0.975
        % Use the documented vggishPreprocess function
        predictor{ii} = vggishPreprocess(audioIn,info.SampleRate,"OverlapPercentage",overlap);
        numHops = size(predictor{ii},4);
        response{ii} = repelem(info.Label,numHops);
        segmentsPerFile(ii) = numHops;
    else % signal too short - no results
        predictor{ii} = [];
        response{ii} = [];
        segmentsPerFile(ii)=0;
    end
end
% Concatenate predictors and responses into arrays
predictor = cat(4,predictor{:});
response = cat(2,response{:});
end

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

Joshua Gardner 2022년 4월 11일

Hi Jibrahim,

both these solutions worked, I forgot my dataset was in stereo!

thanks so much for the response, I really appreciate it

댓글을 달려면 로그인하십시오.

VGGishPreprocess not accepting input

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

VGGishPreprocess not accepting input

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 1 이전 댓글 -1개 표시이전 댓글 -1개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기