How to use the dataset in visual question answering
I am working on a visual question answering problem: the model accepts an image and a question about it, then generates an answer to the question. I built a network with two parts. The first is a CNN model that handles the image input. The second is an ensemble LSTM+BiLSTM model that handles the text. My dataset has columns for the image path, the question, and the answer, and I have completed all preprocessing steps on it. My problem now is how to tell the model to take the image and the text, process them separately, and then fuse the two.
The network I built is shown above. The "in" layer has to accept the text, which is the question, and "im_in" has to accept the image. I don't know how to handle the dataset.
Can you suggest a specific method for building a model for the visual question answering problem in MATLAB?
Regards,
Answers (1)
Prince Kumar
19 November 2021
Hi Suheer Al-Hadhrami,
You can use "Multiple-Input Networks".
Please refer to the documentation: https://www.mathworks.com/help/deeplearning/ug/multiple-input-and-multiple-output-networks.html
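For the dataset side of the question, here is a minimal sketch of one way to pair each image with its question and answer using combined datastores. It is not taken from your code: the variable names, the image files written to a temporary folder, and the 50-feature question encoding are all assumptions made for illustration. The idea is one datastore per network input plus one for the responses, combined so that each read returns a full sample.

```matlab
% Stand-in data (assumed): three small random images written to a temp folder,
% so the sketch runs without your files. Replace with your image path column.
imgFolder = tempname;
mkdir(imgFolder);
imagePath = strings(3,1);
for k = 1:3
    imagePath(k) = fullfile(imgFolder, "img" + k + ".png");
    imwrite(uint8(255*rand(32,32,3)), imagePath(k));
end

% Assumed output of your text preprocessing: one numeric sequence per question,
% stored as a numFeatures-by-numTimeSteps matrix (here 50 features).
questionSeq = {rand(50,7); rand(50,5); rand(50,9)};

% Assumed answer labels, one per row of the dataset.
answer = categorical(["cat"; "dog"; "cat"]);

% One datastore per network input, plus one for the responses.
dsImage    = imageDatastore(imagePath);
dsQuestion = arrayDatastore(questionSeq, 'OutputType', 'same');
dsAnswer   = arrayDatastore(answer);

% Combine them so that each read returns one image, one question
% sequence, and one answer, in that column order.
dsTrain = combine(dsImage, dsQuestion, dsAnswer);
sample  = read(dsTrain);
```

The three datastores must contain the same number of observations in the same row order. For multiple-input training, the predictor columns should follow the order of the network's input layers, with the responses in the last column. Resizing the images to the size expected by "im_in" is left out here. Whether image and sequence inputs can be mixed in one network when training with trainNetwork may depend on your release, so please check the linked page for your version.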
The following link might be useful too