In the latest release, GPU calls execute asynchronously to the CPU. But do also GPU calls themselves run asynchronously? I.e. does multiple calls using e.g. feval(CUDA_kernel, …) execute at the same time, or does the GPU wait for subsequent calls to finish? Does one need to impose wait() between feval to guarantee that the calls execute in order on the GPU? Experimental tests indicate that wait() is not needed, but it would be nice with a proper guarantee.
MATLAB: Asynchronous GPU calculations
cudagpu
Best Answer