Fast 2D GPU-based convolution

버전 1.0.0.0 (49 KB) 작성자: Alexander Huth

Graphics chip assisted fast 2d convolution

팔로우

4.2

(5)

다운로드 수: 4K

업데이트 날짜: 2008/7/16

라이선스 보기

cudaconv - Performs 2d convolution using an NVIDIA graphics chipset.

For large datasets (~1 million elements) and especially for large kernels (performance does not scale much with kernel size) cudaconv can outperform conv2 by as much as 5000%.

I did not create this algorithm.. it is adapted from an example included in the CUDA SDK and wrapped in MATLAB-compatible C code.

With very large data matrices, it can *completely* crash your computer(/graphics driver?), so beware. In testing, I found an upper limit on convolution size (limited either by the size the CUDA FFT function can accept or the size of a 2D texture) of roughly 2^20 elements, so above that the code breaks the convolution into smaller pieces. If you are feeling adventurous, feel free to raise that limit, but be aware that at those sizes cudaconv is already roughly 50-100x faster than conv2.

인용 양식

Alexander Huth (2024). Fast 2D GPU-based convolution (https://www.mathworks.com/matlabcentral/fileexchange/20220-fast-2d-gpu-based-convolution), MATLAB Central File Exchange. 검색됨 2024/10/6.

MATLAB 릴리스 호환 정보

개발 환경: R2007a

모든 릴리스와 호환

플랫폼 호환성

Windows macOS Linux

태그 태그 추가

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

cudaconv/

버전	게시됨	릴리스 정보
1.0.0.0	2008/7/16	Updated help, included testing script and image of benchmarks.	다운로드