For 1000x1000 matrices, this is already about 6 times faster:
n = size(X, 1);
X = X.';   % transpose so that each point is a column (column-major access)
Y = Y.';
dx = zeros(n, n);
dy = zeros(n, n);
for j = 1:n
    Xj = X(:, j);
    Yj = Y(:, j);
    for i = j+1:n
        dx(i, j) = sqrt(sum(bsxfun(@minus, X(:, i), Xj) .^ 2));
        dy(i, j) = sqrt(sum(bsxfun(@minus, Y(:, i), Yj) .^ 2));
        dx(j, i) = dx(i, j);   % the distance matrix is symmetric
        dy(j, i) = dy(i, j);
    end
end
dz = max(dx, dy);
The original function took 29.5 sec (R2016b, Core2Duo, Win7/64), and the cleaned version 5.2 sec.
Here the data are processed columnwise, which is much faster because neighboring elements of a column are adjacent in memory. The comparison by max() is done once, outside the loops. Finally, the resulting matrix is symmetric, so each distance needs to be computed only once and can be copied to the mirrored element.
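As a minimal illustration of the memory-layout effect (timings are machine-dependent): MATLAB stores matrices column-major, so a column A(:, j) is one contiguous block, while a row A(i, :) is strided across memory:

```matlab
A = rand(4000);
tic
for j = 1:size(A, 2)
    c = A(:, j);    % contiguous read: fast
end
tColumn = toc;
tic
for i = 1:size(A, 1)
    r = A(i, :);    % strided read: slower
end
tRow = toc;
```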
I tried to vectorize the inner loop:
n = size(X, 1);
X = X.';
Y = Y.';
dx = zeros(n, n);
dy = zeros(n, n);
for j = 1:n
    dx(j+1:n, j) = sqrt(sum(bsxfun(@minus, X(:, j+1:n), X(:, j)) .^ 2, 1));
    dy(j+1:n, j) = sqrt(sum(bsxfun(@minus, Y(:, j+1:n), Y(:, j)) .^ 2, 1));
    dx(j, j+1:n) = dx(j+1:n, j);
    dy(j, j+1:n) = dy(j+1:n, j);
end
dz = max(dx, dy);
This takes 21 sec for 1000x1000 arrays, but for smaller 100x100 inputs it is faster: 1.2 sec instead of 2.2 sec (100 iterations).
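If the Statistics and Machine Learning Toolbox is available (an assumption, not stated in the question), the same pairwise Euclidean distances can be obtained without an explicit loop. Note that pdist treats each row as one point, so it is applied to the data before the transposition used above:

```matlab
% Sketch assuming the Statistics and Machine Learning Toolbox.
% pdist expects points in rows; squareform expands the compact
% distance vector to the full symmetric n-by-n matrix.
dx = squareform(pdist(X));
dy = squareform(pdist(Y));
dz = max(dx, dy);
```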
Now you have an efficient function as a starting point for parallelization or computation on the GPU. Maybe this is useful (I cannot test it):
% parfor requires output variables to be sliced by the loop variable,
% so collect the two distance matrices in a cell array:
XY = {X, Y};
D  = cell(1, 2);
parfor v = 1:2
    P = XY{v};
    d = zeros(n, n);
    for j = 1:n
        d(j+1:n, j) = sqrt(sum((P(:, j+1:n) - P(:, j)) .^ 2, 1));
        d(j, j+1:n) = d(j+1:n, j);
    end
    D{v} = d;
end
dx = D{1};
dy = D{2};
But a parfor over the inner j loop would use more cores.
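For the GPU path, a rough sketch (untested; assumes the Parallel Computing Toolbox and a supported GPU) is to build the whole distance matrix from one matrix product, using the identity |a - b|^2 = |a|^2 + |b|^2 - 2*a.'*b:

```matlab
% GPU sketch, assuming the Parallel Computing Toolbox; points in rows of X.
Xg = gpuArray(X);
s  = sum(Xg .^ 2, 2);                  % n-by-1 squared norms
G  = Xg * Xg.';                        % Gram matrix, n-by-n
d2 = bsxfun(@plus, s, s.') - 2 * G;    % squared pairwise distances
dx = gather(sqrt(max(d2, 0)));         % clamp tiny negative rounding errors
```

The same applies to Y, followed by dz = max(dx, dy). Be aware that this formulation trades accuracy for speed: for nearly coincident points the subtraction cancels and the result loses precision compared to the direct sqrt(sum((a-b).^2)) form.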