Deleting duplicates based on conditions of multiple columns

조회 수: 31 (최근 30일)
Nick
Nick 2020년 12월 28일
답변: Akash kumar 2022년 7월 31일
Hi,
I have a large dataset (100m rows x 40 columns ) and I would like to delete any row that has duplicates on a few specific columns. See example below:
A = [1 10 4; 1 10 4; 1 11 5; 1 11 5; 1 12 6; 1 12 7; 1 13 8; 2 4 25; 2 10 28; 2 10 28; 3 5 33; 4 25 23; 4 23 24];
I would like to delete all rows where the three columns have duplicate within each specific column. So in this example, row 2, 4 and 9 would be deleted because e.g.
row 1 and 2 have duplicates in each of the three columns and so I'd want to delete one of the two (doesn't matter which one).
I suspect the answer is somewhere along the use of unique and logical indexing but haven't managed to figure it out. Any help would be much appreciated. (I'm using Matlab 2018b)
Thanks
  댓글 수: 3
Nick
Nick 2020년 12월 28일
Thanks for this but unfortunately, this would work for this sample only I think. The actual dataset has 40 columns and i'd like to remove the rows based on the dupicates of 3 columns only, rather than all.
Nick
Nick 2020년 12월 28일
Just found the answer. This way you can find the unique rows amongst a number of columns (in this case, columns 1, 2 and 3) and then produce the original table without the duplicate values.
[C,ia] = unique(A(:,1:3),'rows')
A_new = A(ia,:)

댓글을 달려면 로그인하십시오.

채택된 답변

Nick
Nick 2020년 12월 28일
[C,ia] = unique(A(:,1:3),'rows')
A_new = A(ia,:)

추가 답변 (1개)

Akash kumar
Akash kumar 2022년 7월 31일
% With Index Number:- Shows the which index or Row value is extract from
% the A Matrix. I thinks, It can help you.
A = [1 10 4; 1 10 4; 1 11 5; 1 11 5; 1 12 6; 1 12 7; 1 13 8; 2 4 25; 2 10 28; 2 10 28; 3 5 33; 4 25 23; 4 23 24]';
[B index]=unique(AA(1:3,:).','rows', 'stable')
B = 10×3
1 10 4 1 11 5 1 12 6 1 12 7 1 13 8 2 4 25 2 10 28 3 5 33 4 25 23 4 23 24
index = 10×1
1 3 5 6 7 8 9 11 12 13

카테고리

Help CenterFile Exchange에서 Cell Arrays에 대해 자세히 알아보기

제품

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by