How can I find and remove the nonzero duplicates in each column of a matrix

Question

Matt Talebi 2016년 7월 4일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/293598-how-can-i-find-and-remove-the-nonzero-duplicates-in-each-column-of-a-matrix

댓글: Matt Talebi 2016년 7월 7일

X is a n-by-n matrix of integers ranging from 0 to n. I want to find nonzero duplicate entries in each column and remove them.

댓글 수: 2
없음 표시없음 숨기기

Image Analyst 2016년 7월 4일

편집: Image Analyst 2016년 7월 4일

Unless there is exactly the same number of elements to remove in each column, you can't. For example, you can't "remove" 3 elements from column 1 and 8 elements from column 2. Can you give an example of input and output and how you used unique() to try to solve it?

Matt Talebi 2016년 7월 6일

편집: per isakson 2016년 7월 6일

MATLAB Online에서 열기

The number of duplicates in each column is either 1 or none. Also the duplicate, if exists, is always the same integer as the column number. Example:

X = [ 1 2 3 4 5
9 5 3 8
5 4 0 1
7 3 2 0
1 6 7 9 ];

As can be seen in the 3rd column, 3 is a duplicate. Also in my real data set, the first row is always the column number (similar to this example). If it's not too much to ask I want Y to be:

Y = [ 1 2 5 4 5
9 4 3 8
5 6 0 1
7 0 2 0
1 0 7 9 ];

where duplicates removed by shifting one element up and adding two zeros at the end to balance the matrix (the order of numbers should be preserved). Otherwise, I think it should be still fine for me to be able to just identify columns with duplicate and then remove them manually. Thank you for your time!

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

per isakson 2016년 7월 6일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/293598-how-can-i-find-and-remove-the-nonzero-duplicates-in-each-column-of-a-matrix#answer_227797

편집: per isakson 2016년 7월 6일

MATLAB Online에서 열기

Given

"matrix of integers"
"the first row is always the column number"
"the duplicate, if exists, is always the same integer as the column number"

Try this

X = [ 1 2 3 4 5
      2 9 5 3 8
      7 5 4 0 1
      6 7 3 2 0
      3 1 6 7 9 ];
Y = nan( size(X) );
for jj = 1 : size( X, 2)
    isdub = X( :, jj ) == jj;
    if  any( isdub(2:end) ) 
        col = X(:,jj);
        col( isdub ) = [];
        Y(:,jj) = cat( 1, col, zeros(sum(isdub),1) );
    else
        Y(:,jj) = X(:,jj);
    end
end

result

>> Y
Y =
     1     2     5     4     5
     2     9     4     3     8
     7     5     6     0     1
     6     7     0     2     0
     3     1     0     7     9
>>

This code trades performance for readability.

&nbsp

Requirement of comment: "modify the codes ... keep the one in the first row and only remove the other one"

Y = nan( size(X) );
for jj = 1 : size( X, 2)
    col = X(2:end,jj);
    isdub = col == jj;
    if  any( isdub ) 
        col( isdub ) = [];
        Y(:,jj) = cat( 1, jj, col, zeros(sum(isdub),1) );
    else
        Y(:,jj) = X(:,jj);
    end
end

result

>> Y
Y =
   2     3     4     5
   9     5     3     8
   5     4     0     1
   7     6     2     0
   1     0     7     9

댓글 수: 2
없음 표시없음 숨기기

Matt Talebi 2016년 7월 6일

Thanks Per, it works flawlessly! Just as a minor modification can you possibly modify the codes such that, once a duplicate detected in a column, keep the one in the first row and only remove the other one (to preserve the column numbers).

Matt Talebi 2016년 7월 7일

All good now, thanks a lot!

댓글을 달려면 로그인하십시오.

How can I find and remove the nonzero duplicates in each column of a matrix

댓글 수: 2
없음 표시없음 숨기기

채택된 답변

댓글 수: 2
없음 표시없음 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

Community Treasure Hunt

How can I find and remove the nonzero duplicates in each column of a matrix

댓글 수: 2 없음 표시없음 숨기기

채택된 답변

댓글 수: 2 없음 표시없음 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 2
없음 표시없음 숨기기

댓글 수: 2
없음 표시없음 숨기기