Feeds
질문
Sum of squares profiling on GPU
I was profiling some code that runs on my GPU and came across something rather puzzling that I haven't been able to sort out... ...
거의 11년 전 | 답변 수: 1 | 2
1
답변질문
3D gpuArray vs cells of 2D gpuArrays major speed difference!
Can anybody explain why these codes have drastically different runtimes? I have a shared setup routine clear all y = ...
대략 11년 전 | 답변 수: 1 | 0
1
답변질문
Using multiple GPUs in a parfor type of loop
I am working on a machine learning problem where I am training my classifier using a GPU and the parallel computing toolbox. I ...
11년 초과 전 | 답변 수: 1 | 0
1
답변질문
Breaking up a computation vs "..." to continue line - Huge Speed Difference !?
I was profiling an ode solver today and found something very strange. The system has 9 states and the computation of the ode RH...
11년 초과 전 | 답변 수: 1 | 0
1
답변질문
Matrix multiply slices of 3d Matricies
Given two 3d matricies, A and B with size(A) = (n, m, k) and size(B) = (m, p, k) perform matrix multipl...
11년 초과 전 | 답변 수: 3 | 0
3
답변질문
Linear combination of cell arrays
Is there a compact way (without loops) to take linear combinations of cell arrays that contain the same type of data (matrices o...
11년 초과 전 | 답변 수: 1 | 0
1
답변질문
How do you make a callable object?
I want to define a class (much like griddedInterpolant) that has callable objects. Is there a particular method name that is in...
대략 12년 전 | 답변 수: 1 | 2
1
답변질문
Fastest way to dot product all columns of two matricies of same size
I have come up against this problem over and over, and I have a nice solution, but it seems non-optimal from a speed sense. Does...
12년 초과 전 | 답변 수: 2 | 0