MATLAB: Is the MEX file I generated from the MATLAB code slower than the MATLAB code

coderMATLABmatlab codermexspeed

I expect that creating a MEX file from my MATLAB code should always provide a speed improvement. Why is this not the case?

Best Answer

It is important to remember that under the hood, a MEX file is simply a function that calls a C/C++ (or sometimes Fortran) subroutine. So when we are comparing the execution time of a MATLAB script to that of a MEX file, we are comparing the amount of time it takes the MATLAB script to be interpreted to the amount of time it takes the generated C/C++ code to execute.

Now it is reasonable to think that compiled C code should execute faster than M code is interpreted. However, it has been many years since MATLAB has been a true interpreted language. Today it is a just-in-time (JIT) compiled language with a large library of pre-compiled routines that are optimized for the target that MATLAB is running on. With MEX code generation, we are generating portable C code. Sometimes the MATLAB libraries are even multi-threaded while the generated code is not. Generally speaking, when an application mostly exercises pre-compiled binaries in MATLAB, or things that the JIT compiler handles well, the more realistic expectation is for MATLAB to be faster than the generated C code. The times when we see large speed-ups tend to correspond to those functions which are still implemented as complicated MATLAB functions in MATLAB. One might expect, therefore, to see speedups with QUADGK or QUAD2D, for example, but not with FFT.

With a simple script, we are really just comparing the C compiler to the MATLAB JIT compiler. For instance, the "mod" function is executing the exact same binary code in MATLAB as it is in a MEX file generated with MATLAB Coder. There just is not any opportunity for speedup here.

Related Solutions

MATLAB: Is the MATLAB Coder 2.1 (R2011b) generated MEX file slower than the MATLAB function

MATLAB Coder does not generally speed up MATLAB built-in functions like EIG, SVD, FFT, FFT2, FFTN, QR, LU, etc. In fact, the correct expectation is that MATLAB Coder generated MEX functions that are dominated by calculations with MATLAB built-in functions like these will be slightly slower than equivalent code running in MATLAB. This is because the MATLAB functions are compiled already, and sometimes they are even multi-threaded.

Specifically the FFT function in MATLAB (and thus the FFT2 function) is highly optimized for the PC. The generated C code on the other hand is more of a readable & portable C code and is not optimized for a particular platform. So the slowdown in this case is expected regardless of MEX or standalone compilation. As a general note, compiled code is not always faster than MATLAB and FFT is one of those cases.

However, starting in R2016a and R2017b, MATLAB Coder added the ability for the generated code to call LAPACK and FFTW respectively:

https://www.mathworks.com/help/coder/ug/generate-code-that-calls-lapack-functions.html

https://www.mathworks.com/help/coder/ug/speed-up-fast-fourier-transforms-in-generated-standalone-code-by-using-fftw-library-calls.html

Using these will give you optimized linear algebra and FFT routines respectively that are near the performance of MATLAB.

For releases prior to R2016a, here are some options you can try for improving performance:

1) Turn off debugging for the compiler to enable optimizations

2) Turn off array bounds checking and Ctrl+C checking

Make sure you have turned off the Ensure Memory Integrity option, and probably also the Enable Responsiveness checks. This usually makes a huge difference. The integrity checks prevent the C compiler from generating efficient code for computationally-intensive loops.

The link below illustrates how to control the runtime checks in MATLAB coder:

<http://www.mathworks.com/help/releases/R2012a/toolbox/coder/ug/br81dy0-1.html>

3) Before R2017b, MATLAB Coder generated code for a simple radix-2 FFT algorithm. MATLAB uses FFTW. For large data (even moderately sized, actually), FFTW performs better than radix-2 FFT. To use FFTW before R2017b, you will need to make your FFT function extrinsic by adding the following line to your MATLAB code:

coder.extrinsic('fft2');

When running generated code in the MATLAB environment, calls to extrinsic functions transfer control from the generated code to MATLAB. MATLAB Coder does not compile or generate code for extrinsic functions. The overhead associated with data copying may be a bottleneck, but since you are running this code as a MEX file, 'coder.extrinsic' for the FFT2 function may help.

There is a section in our Help Documentation on accelerating MATLAB algorithms with MATLAB Coder. This link can be a useful reference for such situations:

http://www.mathworks.com/help/releases/R2012a/toolbox/coder/ug/bswncul.html

MATLAB: Creating a MATLAB Executable in MATLAB verses Creating a C/C++ Executable in MATLAB

http://www.mathworks.com/matlabcentral/answers/223937-should-i-use-matlab-compiler-sdk-or-matlab-coder-to-integrate-my-matlab-applications-with-c-c

And of course there is the factor that MATLAB Coder costs a lot (MATLAB Compiler does too but not as much as MATLAB Coder)

Best Answer

Related Solutions

MATLAB: Is the MATLAB Coder 2.1 (R2011b) generated MEX file slower than the MATLAB function

MATLAB: Creating a MATLAB Executable in MATLAB verses Creating a C/C++ Executable in MATLAB

Related Question