MATLAB: Is half-precision slower than double-precision

datatypefp16fp8MATLABnativeNetworkneural

While executing same calculations in MATLAB, speed of double-precision variables is faster than half-precision.

Is this normal? Is there any way to speed up?

Best Answer

This is true.

n = 100;
t1= zeros(1,n);
t2= zeros(1,n);
for i = 1:n
a = ones(10,10,100,100);
b = zeros(10,10,100,100);
a2 = half(a);
b2 = half(b);
tic
temp = plus(a,b);
c = sum(temp(:));
t1(i) = toc;
tic
temp2 = plus(a2,b2);
c2 = sum(temp2(:));
t2(i) = toc;
end
sum(t1)
sum(t2)
ans = 
0.1456
ans=
0.2737

Computations with half-precision data types is slower than those with double precision data types. This is because, unlike double, half-precision is not a native data type in MATLAB and hence, requires additional tweak to do the computations.

Related Solutions

MATLAB: Does MATLAB support quadruple precision – 128-bit floating point arithmetics

The floating point arithmetic format that occupies 128 bits of storage is known as binary128 or quadruple precision.

The following blog post by MathWorks' Chief Mathematician Cleve Moler describes an implementation of quadruple precision programmed entirely in the MATLAB language:

https://blogs.mathworks.com/cleve/2017/05/22/quadruple-precision-128-bit-floating-point-arithmetic/

Please note that quadruple precision is not available as of MATLAB R2018b. As a workaround, it is possible to obtain more precision using Variable Precision Arithmetic from the Symbolic Math Toolbox. Another possibility is to use the Fixed-Point Toolbox to obtain more precision. Note that in either case, the numbers will not be of the floating point format.

MATLAB: Using FP16 data in MATLAB

If you have R2018b or later, you can fread as uint16 and then typecast into half type. E.g.,

% Generate some sample data
>> d = [-inf -pi 0 pi inf nan]
d =
      -Inf   -3.1416         0    3.1416       Inf       NaN
>> h = half(d)
h = 
  1×6 half row vector
      -Inf   -3.1406         0    3.1406       Inf       NaN
>> u = storedInteger(h)
u =
  1×6 uint16 row vector
   64512   49736       0   16968   31744   65024
% Convert the uint16 values to half precision
>> H = half.typecast(u)
H = 
  1×6 half row vector
      -Inf   -3.1406         0    3.1406       Inf       NaN

If you don't have R2018b or later, then the half type is not availalbe and you will be stuck with converting the values to single or double precision if you want to work with them in MATLAB. In that case, use fread as uint16 and then use the following FEX submission to turn the values into single or double:

https://www.mathworks.com/matlabcentral/fileexchange/23173-ieee-754r-half-precision-floating-point-converter

E.g.,

>> S = halfprecision(u,'single')
S =
  1×6 single row vector
      -Inf   -3.1406         0    3.1406       Inf       NaN

Best Answer

Related Solutions

MATLAB: Does MATLAB support quadruple precision – 128-bit floating point arithmetics

MATLAB: Using FP16 data in MATLAB

Related Question