Classification problem with separate files

Alexander Koch on 4 Jul 2022
Commented: Alexander Koch on 9 Jul 2022
Hello everybody,
I am trying to solve a classification problem with multiple input files, each of which is a separate example of the problem. Every xlsx file contains information about the pores (size, shape, position) in a specimen made of different materials. I have about 100 files with at least 4000 pores in each file. One of the pores is the critical pore, which leads to failure when the specimen is loaded. That means only one pore per file is classified as critical; all the others are classified as uncritical. The task is to find the critical pore in the data with the help of machine learning. I cannot combine the separate files, since the pore distribution is slightly different for each specimen.
I hope the problem is clear.
Thanks for your answer.
Comments: 2
Hiro Yoshino on 5 Jul 2022
It seems that this is not a classification problem but an anomaly/outlier detection problem.
I bet the number of critical pores is much smaller than the number of other pores, i.e., the classes are heavily imbalanced.
There are several approaches to this kind of problem. Some are based on estimating a probability distribution, and others on constructing a decision boundary.
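As a rough illustration of the distribution-based family, one option is to score each pore by its Mahalanobis distance to the rest of the pores in the same specimen. The sketch below uses synthetic data in place of an xlsx file; the three feature columns, the planted unusual pore at row 1234, and the candidate list length of 10 are all assumptions for illustration. It also assumes that the critical pore is unusual in feature space, which may not hold for real specimens.

```matlab
% Synthetic stand-in for one specimen: 4000 pores x 3 features.
% The feature meanings (size, shape, position) are assumptions.
rng(0);                        % reproducible random numbers
X = randn(4000, 3);            % ordinary pores
X(1234, :) = [8 8 8];          % one planted, clearly unusual pore

% Squared Mahalanobis distance of every pore to this specimen's own
% mean and covariance (mahal is in Statistics and Machine Learning Toolbox).
d2 = mahal(X, X);

% Rank pores from most to least unusual and keep a short candidate list.
[~, order] = sort(d2, 'descend');
candidates = order(1:10);
```

Scoring each specimen against its own distribution avoids pooling the files, which matches the constraint that the pore distributions differ between specimens.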
You should take a look at this site to see what would suit your case:
I would recommend using a one-class SVM as a starting point, since it is not based on a probability distribution.
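A minimal sketch of that suggestion, assuming one model is fitted per specimen: when all labels belong to a single class, `fitcsvm` trains a one-class SVM, and the scores returned by `predict` can be used to rank pores. The data here is synthetic; the `OutlierFraction` value of 0.01 and the candidate list length are guesses that would need tuning, and with real data each file would first be loaded (for example with `readtable`) into a numeric feature matrix.

```matlab
% Synthetic stand-in for one specimen: 4000 pores x 3 features (assumed).
rng(0);
X = randn(4000, 3);
X(1234, :) = [8 8 8];          % one planted, clearly unusual pore

% With a single class label, fitcsvm performs one-class learning.
% OutlierFraction is the expected share of anomalous pores (a guess here).
mdl = fitcsvm(X, ones(size(X, 1), 1), ...
    'KernelScale', 'auto', ...
    'Standardize', true, ...
    'OutlierFraction', 0.01);

% Negative scores fall outside the learned boundary;
% the lowest scores mark the most unusual pores.
[~, score] = predict(mdl, X);
[~, order] = sort(score, 'ascend');
candidates = order(1:10);
```

Because only one pore per specimen is critical, the ranked candidate list is probably more useful than the hard inlier/outlier decision, and it can be checked against the known critical pore in each of the files.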
Alexander Koch on 9 Jul 2022
Thank you! That was really helpful.


Answers (0)
