Shilpa Sonawane

Last seen: 대략 1개월 전 | 2021년부터 활동

Followers: 0 Following: 0

통계

순위
35,145
of 297,672

평판
1

참여
24 질문
2 답변

답변 채택
70.83%

획득한 표
1

배지 보기

Feeds

질문

How do I add generated audio to a silent video?
I have written following code to add generated audio into a silent video. It executes smoothly. But the audio doesn't get added...

대략 2개월 전 | 답변 수: 1 | 0

1

답변

질문

How to use speechClient("wav2vec2.0") command
I need to convert speech into text.I used speechClient("wav2vec2.0") command but got the following error Error using speechClie...

10개월 전 | 답변 수: 1 | 0

1

답변

질문

How to see matlab code for any inbuilt function
I used bilstm layer. I want to see the code for modification. please guide.

대략 1년 전 | 답변 수: 1 | 0

1

답변

질문

How to find image region if coordinates are provided?
I have coordinates of lip. I have to find lip region from coordinates. Please guide.

대략 1년 전 | 답변 수: 1 | 0

1

답변

질문

How to plot video frames along with entire audio and corresponding spectrogram for comparison in one graph?
I have three signals namely video, audio and its spectrogram. Need to display result in one graph for analysis

1년 초과 전 | 답변 수: 1 | 0

1

답변

질문

How to find word error rate of spoken sentence for regression based model?
I am working on visual speech synthesis. I have used GRID dataset which consists of short sentences. The developed model is regr...

1년 초과 전 | 답변 수: 1 | 0

1

답변

답변 있음
how to plot features of resnet-50 when input given is image
Thank you Sir. I will definately go through it.

1년 초과 전 | 0

답변 있음
how to select relative time stamp from target vector using deep learning
Thank you so much. I will try it definately.

1년 초과 전 | 0

| 수락됨

질문

how to plot features of resnet-50 when input given is image
I have images of mouth ROI applied as an input to resnet-50. i want to plot visual features. What are the steps to plot the fea...

1년 초과 전 | 답변 수: 2 | 0

2

답변

질문

How to find audio samples from MFCC?
I had calculated MFCC of audio frames. I have to reconctruct audio samples from MFCC. Please provide guidance.

거의 2년 전 | 답변 수: 1 | 0

1

답변

질문

How to generate audio from colored MFCC in the form of image?
In above fig, 41 images are tiled. Each image is generated mfcc. Its size is 28x28x3. It is rgb image. I have to find inverse ...

거의 2년 전 | 답변 수: 2 | 0

2

답변

질문

how to select relative time stamp from target vector using deep learning
i have 4 input signals namely, 4-D array to store frames of video , 4-D array to store frames of mfccs, 1-d array time stamps ...

거의 2년 전 | 답변 수: 2 | 0

2

답변

질문

How to find time step of video signal?
I have to find out time step of video signal and need to arrnge input data as (Timestep,96,96,3). please guide how to find o...

거의 2년 전 | 답변 수: 1 | 0

1

답변

질문

How to generate speech signal from spectral envelope, aperiodicity, fundamental frequency, V/UV signal
I have implemented the network as shown in fig which takes 2 inputs namely, video input and mfcc(audio) input. Video in...

거의 2년 전 | 답변 수: 0 | 0

0

답변

질문

How to provide input without datastore to multiple input deep neural network?
I have used the network shown in fig which takes 2 inputs namely video input(no. of images) & second is mfcc of audio signal o...

거의 2년 전 | 답변 수: 1 | 0

1

답변

질문

how to connect new network to existing model
I have combined output of 2 networks. I need to connect combined output with 3rd network. I am unable to do it. I have to con...

거의 2년 전 | 답변 수: 1 | 0

1

답변

질문

Input data must be a formatted dlarray.
I have used the code of VAE to generate image. My aim is to find probaility distribution of mfcc signal. Input is MFCC matrix of...

대략 2년 전 | 답변 수: 2 | 0

2

답변

질문

Predictors must be a numeric array, a datastore, or a table.
clear all; close all; clc; load mfcc_S6_8_01.mat; load mfcc_S6_8_02.mat; ...

대략 2년 전 | 답변 수: 1 | 0

1

답변

질문

How to save mfcc of audio frames in image format to provide input to cnn?
I have used spectrogram command to view mfcc. I used command "savsas" to sava it in image format. But when i oped it, fig shows...

대략 2년 전 | 답변 수: 1 | 0

1

답변

질문

How to use pretrain networks like vgg19,resnet18?
I would like to use pretrained network like VGG19, RESNET18. The output of this network should be connected to LSTM. Which laye...

2년 초과 전 | 답변 수: 1 | 0

1

답변

질문

How to concatenate video frames into single matrix for cnn?
I have multiple videos. Each video has different number of frames. I am trying to combine all the frames of all videos for train...

2년 초과 전 | 답변 수: 1 | 0

1

답변

질문

How to use trainNetwork function for input as a video?
I have written code to recognize characters 'A' & 'B'. But during training , i got following error. Error using trainNetwork ...

2년 초과 전 | 답변 수: 1 | 0

1

답변

질문

How to display vid variable in the form of image from mat file?
I downloaded AV Letters dataset for AVSR. It consists of mat file. I read mat file which consists of two variable. load A1_Anya...

3년 초과 전 | 답변 수: 1 | 0

1

답변

질문

How to resolve this error-Invalid training data. For image, sequence-to-label, and feature classification tasks, responses must be categorical.
%% Totla RGB images=55, img4training size=35x60x3x55, YL=training Labels- 55x1 load traindata1.mat; layers = [ imageInp...

3년 초과 전 | 답변 수: 1 | 1

1

답변

질문

How visual speech recognition is done using CNN and LSTM in Matlab?
In ViSUAL ASR, both audio and video inputs are there to recognize isolated words.I have seperated audio and video frames. How to...

3년 초과 전 | 답변 수: 2 | 0

2

답변

질문

How to read audio and video frame of avi file?
Input to program is avi file. i am using videoFReader function to read audio and respective video frame from avi file. But i go...

거의 4년 전 | 답변 수: 1 | 0

1

답변