[Math] distributed incremental SVD

it.information-theorylinear algebrana.numerical-analysis

Hello all,

I need some theoretical pointers (formulas, articles, online links) on how to merge Singular Value Decompositions (SVD) of two matrices (two different sets of observations over the same set of features).

That is, I have two SVDs: $A=U_A*S_A*V^T_A$ and $B=U_B*S_B*V^T_B$ and want to know SVD $A|B=U_{A|B}*S_{A|B}*V_{A|B}$. The original matrices $A$ and $B$ are unavailable, the solution must make use of the $U_A, S_A, V_A, U_B, S_B, V_B$ matrices only.

I need this because I want to implement a distributed version of incremental SVD: have several computation nodes work on different sets of observations independently, and then merge their results into one.

Cheers!

Best Answer

The people aspiring for the Netflix prize like incremental SVDs. See

  1. https://issues.apache.org/jira/browse/MAHOUT-371
  2. B.M. Sarwar, G.Karypis, J.A. Konstan, and J. Reidl. Incremental singular value deocmposition algorithms for highly scalable recommender systems. In Proceedings of the Fifth International Conference on Computer and Information Technology (ICCIT), 2002. (ONLINE)
  3. M. Brand. Fast online svd revisions for lightweight recommender systems. In Proceedings of the 3rd SIAM International Conference on Data Mining, 2003. and his tech report

Have you tried searching the ACM digital library for parallel SVD or singular value decomposition?

EDIT1 based on new input : See the following two papers by Hall, Marshall and Martin

  1. Merging and Splitting Eigenspace Models (Section 3)
  2. On Adding and Subtracting Eigenspaces with EVD and SVD
Related Question