Finding the indices of duplicate values in one array
    조회 수: 468 (최근 30일)
  
       이전 댓글 표시
    
Given one array A=[ 1 1 2 3 5 6 7].
I need help to known the indices where there are duplicate values.
Thanks
댓글 수: 0
답변 (9개)
  Stephan Koehler
      
 2019년 7월 16일
        A = [1 2 3 2 5 3]
[v, w] = unique( A, 'stable' );
duplicate_indices = setdiff( 1:numel(A), w )
this should work too, and is elegant
댓글 수: 2
  Image Analyst
      
      
 2019년 11월 11일
				Use histcounts and look for bins with more than 2 counts.  
A = [1 2 3 2 5 3]
[counts, edges] = histcounts(A)
A =
     1     2     3     2     5     3
counts =
     1     2     2     0     1
edges =
  Columns 1 through 5
       0.5       1.5       2.5       3.5       4.5
  Column 6
       5.5
You can see that the bins for 2 and 3 both have 2 counts so there are multiples of 2 and 3 in A.
Note: This will find any repeats, and they don't have to be consecutive.  If you want to look for consecutive repeats, call the diff() function and look for zeros.
  Image Analyst
      
      
 2018년 5월 11일
        
      편집: Image Analyst
      
      
 2018년 5월 12일
  
      Here's one way:
A = [-2 0 1 1 2 3 5 6 6 6 7 11 40]
% Elements 3, 4, 8, 9, and 10 are repeats.
% Assume A is integers and get edges
edges = min(A) : max(A)
[counts, values] = histcounts(A, edges)
repeatedElements = values(counts >= 2)
% Assume they're integers
% Print them out and collect indexes of repeated elements into an array.
indexes = [];
for k = 1 : length(repeatedElements)
  indexes = [indexes, find(A == repeatedElements(k))];
end
indexes % Report to the command window.
You get [3,4,8,9,10] as you should.
댓글 수: 8
  Steven Lord
    
      
 2025년 3월 10일
				The last bin includes both the left and right edges, while the earlier bins include only the left edges. This is stated in the description of the edges input on the histcounts documentation page: "Bin edges, specified as a vector. The first vector element specifies the leading edge of the first bin. The last element specifies the trailing edge of the last bin. The trailing edge is only included for the last bin."
So add on a number that's greater than the maximum element in your data. Inf is a good choice.
A = [-2 0 1 1 2 3 5 6 6 6 7 11 40 40]
edges = [min(A):max(A) Inf];
[counts1, edges1] = histcounts(A, edges);
repeatedElements = edges1(counts1 >= 2)
Or you could use non-equally spaced bins containing the unique elements from your data.
[counts2, edges2] = histcounts(A, [unique(A) Inf])
repeatedElements = edges2(counts2 >= 2)
If your data spans a wide range this can reduce the number of bins histcounts uses.
whos counts1 edges1 counts2 edges2
The unique approach uses 10 bins, the non-unique approach uses 43. This is a fairly small difference for your sample A, but the impact is much larger if you have a distant outlier.
B = [A 5000]; % 5000 is far from the rest of the elements in A
edges = [min(B):max(B) Inf];
[counts1, edges1] = histcounts(B, edges);
[counts2, edges2] = histcounts(B, [unique(B) Inf]);
whos counts1 edges1 counts2 edges2
  Walter Roberson
      
      
 2025년 3월 10일
				Using edges = [min(B):max(B) Inf]; assumes that the input data is integer.
  Adam
      
      
 2017년 4월 21일
        
      편집: Adam
      
      
 2017년 4월 21일
  
      [~, uniqueIdx] = unique( A );
duplicateLocations = ismember( A, find( A( setdiff( 1:numel(A), uniqueIdx ) ) ) );
then
find( duplicateLocations )
will give you the indices if you want them rather than a logical vector.
There are probably neater methods though.
If you want only the duplicates after the first then simply
setdiff( 1:numel(A), uniqueIdx )
should do the job.
댓글 수: 9
  CompViscount
 2022년 9월 20일
				
      편집: CompViscount
 2022년 9월 20일
  
			Commenting here as it's led me to overall the best answer here, it just has a mistake. The "find" in the 2nd line changes the values into indices before passing to ismember, which just makes the output nonsense. I removed that. Using the same numbers as image analyst above:
A=[ 1 1 2 3 5 6 6 7]
[~, uniqueIdx] = unique(A);
dupeIdx = ismember( A, A( setdiff( 1:numel(A), uniqueIdx ) ) );
dupes = A(dupeIdx)
dupeLoc = find(dupeIdx)
  Jan
      
      
 2018년 5월 12일
        
      편집: Jan
      
      
 2021년 7월 2일
  
      function Ind = IndexOfMultiples(A)
T   = true(size(A));
off = false;
A   = A(:);
for iA = 1:numel(A)
  if T(iA)          % if not switched already
    d = (A(iA) == A);
    if sum(d) > 1   % More than 1 occurrence found
       T(d) = off;  % switch all occurrences
    end
  end
end
Ind = find(~T);
end
If the input has more than 45 elements, this is faster:
function T = isMultiple(A)
% T = isMultiple(A)
% INPUT:  A: Numerical or CHAR array of any dimensions.
% OUTPUT: T: TRUE if element occurs multiple times anywhere in the array.
%
% Tested: Matlab 2009a, 2015b(32/64), 2016b, 2018b, Win7/10
% Author: Jan, Heidelberg, (C) 2021
% License: CC BY-SA 3.0, see: creativecommons.org/licenses/by-sa/3.0/
T        = false(size(A));
[S, idx] = sort(A(:).');
m        = [false, diff(S) == 0];
if any(m)        % Any equal elements found:
   m(strfind(m, [false, true])) = true;
   T(idx) = m;   % Resort to original order
end
end
댓글 수: 2
  MRINAL BHAUMIK
 2021년 6월 28일
        
      편집: Walter Roberson
      
      
 2025년 4월 3일
  
      A=[ 1 1 2 3 5 6 7 6]
B = A'./A
B = B-diag(diag(B))
[pos1 pos2]=find(B==1)
댓글 수: 0
  Anamika
      
 2023년 7월 17일
        In MATLAB, you can find the indices of duplicate values in an array using the `find` function along with the `unique` function. Here's how you can do it:
A = [1 1 2 3 5 6 7];
% Finding the unique elements in the array
unique_elements = unique(A);
% Initializing an empty array to store the indices of duplicate values
duplicate_indices = [];
% Iterating through each unique element
for i = 1:numel(unique_elements)
    % Finding the indices of occurrences of the current unique element
    indices = find(A == unique_elements(i));
    % If there are more than one occurrence, add the indices to the duplicate_indices array
    if numel(indices) > 1
        duplicate_indices = [duplicate_indices indices];
    end
end
% Displaying the indices of duplicate values
disp(duplicate_indices);
Running this code will give you the indices of the duplicate values in the array A. In this case, the output will be: 1 2
This means that the duplicate values are located at indices 1 and 2 in the array A.
댓글 수: 0
  Eduardo Gonzalez Rodriguez
 2023년 7월 13일
        Here is my solution to find repeated values and their counts
function [dup, counts] = duplicates(A)
    [dup,~,n] = unique(A, 'rows', 'stable');
    counts = accumarray(n, 1, [], @sum);
    dup(counts==1) = [];
    counts(counts==1) = [];
댓글 수: 0
  Piotr
      
 2023년 5월 11일
        Hello,
here is my attempt to solve it. I faced similar problem but in my case I wanted to have the result in two column representation. Each row contains indices of repeated values.
A = [ 1 1 2 3 5 6 7 6];
nk = nchoosek(1:length(A),2);
nk(diff(A(nk),[],2)~=0,:) = [];
disp(nk)
Cheers, Piotr
댓글 수: 0
  Tim
 2025년 4월 3일
        
      편집: Tim
 2025년 4월 3일
  
      Posting @CompViscount's  comment as an answer because it gives a logical array identifying which elements have more than one entry in the array. This is the answer I need. 
A=[ 1 1 2 3 5 6 6 7]
[~, uniqueIdx] = unique(A);
dupeIdx = ismember( A, A( setdiff( 1:numel(A), uniqueIdx ) ) )
A = [-2 0 1 1 2 3 6 5 6 6 6 7 11 40 40]
[~, uniqueIdx] = unique(A);
dupeIdx = ismember( A, A( setdiff( 1:numel(A), uniqueIdx ) ) )
댓글 수: 0
참고 항목
카테고리
				Help Center 및 File Exchange에서 Matrix Indexing에 대해 자세히 알아보기
			
	Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!



















