Finding the index of dupplicate rows in a matrix

조회 수: 30 (최근 30일)
Danielle Leblance
Danielle Leblance 2017년 2월 6일
편집: Stephen23 2017년 2월 6일
I have matrix M 275935x2 . I want to remove duplicate rows. I tried two methods: Method 1)
M2=unique(M,'rows') % and it gave M2 179109x2 double
Method 2)
x0=find(hist(M,unique(M))>1); % it gave only 8301 duplicate values.
Which method is correct? I want to find th eindices of duplicate rows and not simply remove them. Any help is appreciated

답변 (1개)

Stephen23
Stephen23 2017년 2월 6일
편집: Stephen23 2017년 2월 6일
>> A = randi(1e4,275935,2);
>> [B,~,Y] = unique(A,'rows','stable');
>> [C,X] = hist(Y,unique(Y));
>> Z = ismember(Y,X(C>1)); % indices of repeated rows of A
For example this random data set had
>> nnz(Z)
ans =
816
row that occur most than once. To get the indices of the duplicate rows, try this:
[U,W] = unique(A,'rows','stable');
D = setdiff(1:size(A,1),W); %indices of duplicate rows.
  댓글 수: 2
Danielle Leblance
Danielle Leblance 2017년 2월 6일
I am sure there is something wrong. I am attaching the data.the unique function gives a matrix B which is different than the one that I obtain if I remove the duplicates Z
Stephen23
Stephen23 2017년 2월 6일
편집: Stephen23 2017년 2월 6일
This works for me:
>> load matlab.mat
>> [B,W] = unique(M,'rows','stable');
>> D = setdiff(1:size(M,1),W); % indices of duplicate rows.
And now compare:
>> M(D,:) = [];
>> isequal(M,B)
ans =
1

댓글을 달려면 로그인하십시오.

카테고리

Help CenterFile Exchange에서 Creating and Concatenating Matrices에 대해 자세히 알아보기

태그

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by