Please help: I'm trying to get this code to run faster on my GPU. It is currently the main bottleneck of my function. Can this code be vectorized to remove the for-loop? vector_1 is a gpuArray.
for i=1:n
    A = vector_1((i-1)+(1:10));
    B = vector_1((i-1)+16+(1:10));
    AA(i+10) = (A'*B)/10;
    BB(i+10) = (A'*A)/10;
end
Best Answer
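As a rough sketch of one way the loop in the question could be vectorized (not necessarily the approach taken in the answer itself): build a 10-by-n index matrix whose column i holds the indices used in iteration i, gather both windows in one indexing operation, and reduce along the columns. This assumes vector_1 is a real-valued column vector with at least n+25 elements and a MATLAB release with implicit expansion (R2016b or later). The names idx and the random test data are hypothetical and only there for illustration; a plain array is used so the snippet runs without a GPU, and the same indexing and sum calls are expected to work on a gpuArray.

```matlab
% Hypothetical test data; in the question, vector_1 is a gpuArray.
n = 1000;
vector_1 = rand(n + 25, 1);   % assumed: real column vector, length >= n+25

% Column i of idx holds the 10 indices used in loop iteration i.
idx = (1:10)' + (0:n-1);      % 10-by-n via implicit expansion (R2016b+)

% Indexing a vector with a matrix returns an array shaped like the matrix.
A = vector_1(idx);            % windows starting at i
B = vector_1(idx + 16);       % windows starting at i+16

% Preallocate with the same class as vector_1 (also covers gpuArray).
AA = zeros(1, n + 10, 'like', vector_1);
BB = zeros(1, n + 10, 'like', vector_1);

% Column-wise dot products replace A'*B and A'*A (real data assumed;
% for complex data A' would also conjugate, so use conj(A).*B instead).
AA(11:n+10) = sum(A .* B, 1) / 10;
BB(11:n+10) = sum(A .* A, 1) / 10;
```

The index matrix costs roughly 10*n extra elements of memory per window. If that becomes a problem for large n, a sliding sum over the elementwise products (for example with conv and a vector of ones) is an alternative worth testing.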