Using the MD5 Hash for Duplicate File Deletion

버전 1.0.0.0 (1.91 KB) 작성자: Michael Kleder
This function uses an MD5 hash to rapidly detect and delete duplicate files in a directory.
다운로드 수: 4.4K
업데이트 날짜: 2004/12/15

라이선스 없음

This function rapidly compares large numbers of files for identical content by computing the MD5 hash of each file and detecting duplicates. The probablility of two non-identical files having the same MD5 hash, even in a hypothetical directory containing as many as a million files, is exceedingly remote. Thus, since hashes rather than file contents are compared, the process of detecting duplicates is greatly accelerated.

You must have the file md5DLL.dll on your MATLAB path to use this function. The function is stored in the MATLAB Central File Exchange, file #3784, and was written by Hans-Peter Suter. The URL for the download site is:
http://www.mathworks.com/matlabcentral/fileexchange/loadFile.do?objectId=3784&objectType=file

This function is intended for MS Windows operating systems. This is because MATLAB requires on the order of 0.1 seconds to execute an operating system command to delete each file, but can rapidly create and run an operating system batch file to perform the file deletions much faster. Since I use MATLAB on a Windows PC, this function creates a batch file for that platform. Futhermore, the md5DLL.dll file is specific to Windows.

인용 양식

Michael Kleder (2024). Using the MD5 Hash for Duplicate File Deletion (https://www.mathworks.com/matlabcentral/fileexchange/5393-using-the-md5-hash-for-duplicate-file-deletion), MATLAB Central File Exchange. 검색됨 .

MATLAB 릴리스 호환 정보
개발 환경: R11.1
모든 릴리스와 호환
플랫폼 호환성
Windows macOS Linux
카테고리
Help CenterMATLAB Answers에서 File Compression에 대해 자세히 알아보기

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!
버전 게시됨 릴리스 정보
1.0.0.0

Bug, caught by Peter Blumen. Thanks Peter!