How can I make my neural network support any size of image input?
There are three levels at which you can write code for a vision-related deep learning task.
Highest level: build a complete layerGraph and train it with the trainNetwork function.
Middle level: build a layerGraph without a loss layer, and instead calculate the loss and gradients in an eval function. You can also specify a custom learning rate schedule. This level allows some customization while still exploiting the easy-to-use features of the highest level (a sketch of what I mean is below).
Lowest level: this level has no concept of a layer, so you have to manage the parameters yourself. It is really messy and time-consuming to build and train a network this way.
My question is: the highest and middle levels both require a fixed input size, i.e., an imageInputLayer, and imageInputLayer only supports a fixed image size. I do not want to trouble myself with lowest-level coding, so how can I make my NN take inputs of any size?
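For concreteness, here is a minimal sketch of what I mean by the middle level (the sizes and layers are just placeholders, not my actual network); note that even here the imageInputLayer pins the input to one size:
layers = [
    imageInputLayer([200 300 3],'Normalization','none') % fixed size required here
    convolution2dLayer(3,16,'Padding','same')
    reluLayer
    fullyConnectedLayer(10)
    softmaxLayer];
net = dlnetwork(layerGraph(layers)); % no output (loss) layer
% inside the custom training loop:
% [loss,gradients] = dlfeval(@modelGradients,net,dlX,Y);
function [loss,gradients] = modelGradients(net,dlX,Y)
    dlYPred = forward(net,dlX);                   % forward pass
    loss = crossentropy(dlYPred,Y);               % loss computed outside the layerGraph
    gradients = dlgradient(loss,net.Learnables);  % gradients for adamupdate/sgdmupdate
end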
0 Comments
Accepted Answer
Ryan Comeau
10 May 2020
Hello,
I wish it were possible to just dump in images of multiple sizes as well. Unfortunately, each image would yield convolution maps of a different size and number. How would it then make sense to pass these into a fully connected layer and fit them? It would be like sorting oranges by size when half of your input oranges are apples - a strange task.
There is, however, a solution to this problem: your input images need to be rescaled to your network's input size. This is an important preprocessing step. Here is some code that resizes an image:
image=imread('path/to/image');
number_rows=200; %depending on the input size of your network
number_cols=300; %depending on input size.
rescaled_image=imresize(image,[number_rows number_cols]);
It may seem unintuitive, but computers don't see the same way we do and the scale of things doesn't always matter.
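If your images are in an imageDatastore, an augmentedImageDatastore can also do this resizing on the fly during training, so you don't have to resize every file yourself. A rough sketch (the folder path and sizes are placeholders for your own):
imds = imageDatastore('path/to/image/folder','IncludeSubfolders',true,'LabelSource','foldernames');
input_size = [200 300]; % rows and columns of your network's input layer
augimds = augmentedImageDatastore(input_size,imds); % resizes each image as it is read
% net = trainNetwork(augimds,layers,options); % pass augimds instead of imds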
Hope this helps,
RC
1 Comment
Jacques Boutet de Monvel
31 May 2022
If it is true that there is no way to feed an image of unprescribed size to a fully convolutional network, that is too bad! It misses one of the most attractive and elegant features of FCNs: the ability to process an image of any size in a seamless, translation-invariant way at prediction time, even though the network was trained on much smaller image patches. This is very useful, and even crucial, for segmentation applications.
Why not implement this feature, at least to give users the choice? This is one thing that could (still) make MatConvNet more attractive than the MATLAB DL toolbox, despite all the toolbox's impressive features.
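In the meantime, one workaround at prediction time is to tile the large image into patches of the network's input size and stitch the score maps back together (accepting possible seams at the tile borders). A rough sketch, where net, the tile size, the path, and the number of classes are all placeholders, and predict is assumed to return an H-by-W-by-num_classes score map for the segmentation network:
I = imread('path/to/large_image'); % hypothetical path
tile_size = [200 300];             % must match the network input size
[H,W,~] = size(I);
num_classes = 5;                   % placeholder
scores = zeros(H,W,num_classes,'single');
for r = 1:tile_size(1):H
    for c = 1:tile_size(2):W
        rows = r:min(r+tile_size(1)-1,H);
        cols = c:min(c+tile_size(2)-1,W);
        tile = I(rows,cols,:);
        % pad edge tiles up to the full input size, then crop the result back
        tile = padarray(tile,[tile_size(1)-numel(rows) tile_size(2)-numel(cols)],'replicate','post');
        tile_scores = predict(net,tile);
        scores(rows,cols,:) = tile_scores(1:numel(rows),1:numel(cols),:);
    end
end
[~,labels] = max(scores,[],3); % per-pixel class indices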