The first principal component is the axis along which the data varies the most. So what happens if I remove that component while retaining all the remaining ones? My guess is that the data coalesces together in some way, but I am not sure.
Solved – What does removing the first principal component from data signify
covariance-matrix, eigenvalues, pca
Related Solutions
Imagine data points filling a 2D rectangle in the center of the coordinate system, with its sides oriented along the coordinate axes: from $-a$ to $a$ along the $x$-axis, and from $-b$ to $b$ along the $y$-axis.
The projection on $x$ is a uniform distribution with variance $a^2/3$. The projection on $y$ is also a uniform distribution, with variance $b^2/3$. Since $x$ and $y$ are obviously not correlated (if this is not obvious, ask yourself whether the correlation should be positive or negative; by symmetry it can only be zero), the covariance between them is zero. This yields the covariance matrix $$\begin{pmatrix}a^2/3 & 0\\ 0 & b^2/3\end{pmatrix}.$$ The task of PCA is to diagonalize the covariance matrix, but this one is already diagonal! This means that no rotation is necessary: the $x$- and $y$-axes are themselves the principal axes. If e.g. $a>b$, then the $x$-axis is the first PC.
This might be a bit counter-intuitive: it might seem that the projection on the diagonal should have larger variance than the projection on the longer side, but in fact it does not.
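Here is a minimal numerical sketch of this example, assuming numpy; the values $a=3$, $b=1$ and the sample size are arbitrary choices for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
a, b = 3.0, 1.0
n = 100_000
X = np.column_stack([rng.uniform(-a, a, n), rng.uniform(-b, b, n)])

# Sample covariance: close to diag(a^2/3, b^2/3) = diag(3, 1/3),
# so the coordinate axes are already the principal axes.
print(np.cov(X, rowvar=False).round(3))

# Variance of the projection onto the unit diagonal (1, 1)/sqrt(2) is
# (a^2/3 + b^2/3)/2 ~= 1.67, smaller than a^2/3 = 3 whenever a > b.
u = np.array([1.0, 1.0]) / np.sqrt(2)
print((X @ u).var().round(3))
```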
Bonus: Dzhanibekov effect
You seem to have meant a 3D rectangular parallelepiped rather than a 2D rectangle. The argument of course stays the same: the covariance matrix is $3\times 3$ but still diagonal, with the principal axes being the coordinate axes.
Incidentally, there is a curious effect in mechanics concerning a rotating solid body with three different moments of inertia (moment of inertia being the mechanics analogue of variance). It turns out that rotations around the axes with the largest and the smallest moments of inertia are stable, but rotation around the axis with the middle moment of inertia is unstable. Moreover, the rotating body will experience sudden "flips"; this is known as the Dzhanibekov effect, after the Russian cosmonaut who observed it in space. One can easily observe it when spinning a book or a table-tennis racket. See the following great threads on mathoverflow and on physics.SE and these videos:
Eigenvectors just give you the "directions" of the principal component axes; typically, those are unit vectors. In PCA, you order the eigenvectors by decreasing eigenvalues; the eigenvalues tell you how much "variance is explained" by the corresponding eigenvectors (your principal component axes). E.g., if you use PCA for dimensionality reduction on a linear task, you'd want to choose the top $k$ eigenvectors that explain most of the variance (contain the most information).
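The ordering step looks something like the sketch below, assuming a numpy data matrix `X` with rows as samples; `top_k_components` is a hypothetical helper name:

```python
import numpy as np

def top_k_components(X, k):
    Xc = X - X.mean(axis=0)                 # center the data
    cov = np.cov(Xc, rowvar=False)          # covariance matrix
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigh: for symmetric matrices
    order = np.argsort(eigvals)[::-1]       # sort by decreasing eigenvalue
    return eigvals[order][:k], eigvecs[:, order][:, :k]
```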
As mentioned above, you can calculate the "variance explained" from the magnitudes of the eigenvalues; I plotted it for the Iris dataset below:
In this plot, you can see that the first two principal components (the eigenvectors that correspond to the 2 largest eigenvalues) explain almost all of the variance in this dataset (>95 %).
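If you want to recompute these numbers yourself, a short sketch using scikit-learn (the original tutorial may compute this differently, and whether to standardize first is a modelling choice):

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

X = StandardScaler().fit_transform(load_iris().data)
pca = PCA().fit(X)

print(pca.explained_variance_ratio_)           # per-component ratios
print(pca.explained_variance_ratio_[:2].sum()) # first two PCs: ~0.96
```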
I have a short tutorial and code examples here if you want to reproduce the results.
Best Answer
Removing a dimension from a data cloud, such as removing its first PC, amounts to projecting the data points onto the (hyper)plane perpendicular to that dimension's axis. Imagine, as an example, that your data form a spheroid in 3D space. PC1 is the spheroid's main axis. Removing it means projecting onto the plane that this axis pierces at a 90-degree angle. You are then left with a circular data cloud lying in that plane.
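A minimal sketch of this projection, assuming numpy; the spheroid parameters and sample size are arbitrary illustration values:

```python
import numpy as np

rng = np.random.default_rng(0)
# Spheroid-like cloud: elongated along x (std 3), round in y and z (std 1).
X = rng.normal(size=(10_000, 3)) * np.array([3.0, 1.0, 1.0])

Xc = X - X.mean(axis=0)
_, _, Vt = np.linalg.svd(Xc, full_matrices=False)  # rows of Vt = PC axes
pc1 = Vt[0]                                        # first principal axis
X_removed = Xc - np.outer(Xc @ pc1, pc1)           # subtract PC1 component

# The remaining cloud is (almost) spherical within the plane orthogonal
# to PC1: covariance ~ diag(0, 1, 1) up to rotation.
print(np.cov(X_removed, rowvar=False).round(2))
```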