re_vision: Tutorial page 9

Previous: Tutorial page 8 - Geometry
Next: TODO

Table of contents

Sparse matrix representations

In many applications (e.g., finite element methods) it is common to deal with very large matrices where only a few coefficients are different than zero. Both in term of memory consumption and performance, it is fundamental to use an adequate representation storing only nonzero coefficients. Such a matrix is called a sparse matrix.

Declaring sparse matrices and vectors
The SparseMatrix class is the main sparse matrix representation of the Eigen's sparse module which offers high performance, low memory usage, and compatibility with most of sparse linear algebra packages. Because of its limited flexibility, we also provide a DynamicSparseMatrix variante taillored for low-level sparse matrix assembly. Both of them can be either row major or column major:

#include <Eigen/Sparse>
SparseMatrix<std::complex<float> > m1(1000,2000);         // declare a 1000x2000 col-major compressed sparse matrix of complex<float>
SparseMatrix<double,RowMajor> m2(1000,2000);              // declare a 1000x2000 row-major compressed sparse matrix of double
DynamicSparseMatrix<std::complex<float> > m1(1000,2000);  // declare a 1000x2000 col-major dynamic sparse matrix of complex<float>
DynamicSparseMatrix<double,RowMajor> m2(1000,2000);       // declare a 1000x2000 row-major dynamic sparse matrix of double

Although a sparse matrix could also be used to represent a sparse vector, for that purpose it is better to use the specialized SparseVector class:

SparseVector<std::complex<float> > v1(1000); // declare a column sparse vector of complex<float> of size 1000
SparseVector<double,RowMajor> v2(1000);      // declare a row sparse vector of double of size 1000

Note that here the size of a vector denotes its dimension and not the number of nonzero coefficients which is initially zero (like sparse matrices).

Overview of the internal sparse storage
In order to get the best of the Eigen's sparse objects, it is important to have a rough idea of the way they are internally stored. The SparseMatrix class implements the common and generic Compressed Column/Row Storage scheme. It consists of three compact arrays storing the values with their respective inner coordinates, and pointer indices to the begining of each outer vector. For instance, let m be a column-major sparse matrix. Then its nonzero coefficients are sequentially stored in memory in a column-major order (values). A second array of integer stores the respective row index of each coefficient (inner indices). Finally, a third array of integer, having the same length than the number of columns, stores the index in the previous arrays of the first element of each column (outer indices).

Here is an example, with the matrix:

0	3	0	0	0
22	0	0	0	17
7	5	0	1	0
0	0	0	0	0
0	0	14	0	8

and its internal representation using the Compressed Column Storage format:

Values:	22	7	3	5	14	1	17	8
Inner indices:	1	2	0	2	4	2	1	4

Outer indices:

As you can guess, here the storage order is even more important than with dense matrix. We will therefore often make a clear difference between the inner and outer dimensions. For instance, it is easy to loop over the coefficients of an inner vector (e.g., a column of a column-major matrix), but completely inefficient to do the same for an outer vector (e.g., a row of a col-major matrix).

The SparseVector class implements the same compressed storage scheme but, of course, without any outer index buffer.

Since all nonzero coefficients of such a matrix are sequentially stored in memory, random insertion of new nonzeros can be extremely costly. To overcome this limitation, Eigen's sparse module provides a DynamicSparseMatrix class which is basically implemented as an array of SparseVector. In other words, a DynamicSparseMatrix is a SparseMatrix where the values and inner-indices arrays have been splitted into multiple small and resizable arrays. Assuming the number of nonzeros per inner vector is relatively low, this slight modification allow for very fast random insertion at the cost of a slight memory overhead and a lost of compatibility with other sparse libraries used by some of our highlevel solvers. Note that the major memory overhead comes from the extra memory preallocated by each inner vector to avoid an expensive memory reallocation at every insertion.

To summarize, it is recommanded to use a SparseMatrix whenever this is possible, and reserve the use of DynamicSparseMatrix for matrix assembly purpose when a SparseMatrix is not flexible enough. The respective pro/cons of both representations are summarized in the following table:

	SparseMatrix	DynamicSparseMatrix
memory usage	***	**
sorted insertion	***	***
random insertion in sorted inner vector	**	**
sorted insertion in random inner vector	-	***
random insertion	-	**
coeff wise unary operators	***	***
coeff wise binary operators	***	***
matrix products	***	*()
transpose	**	***
redux	***	**
*= scalar	***	**
Compatibility with highlevel solvers (TAUCS, Cholmod, SuperLU, UmfPack)	***	-

Matrix and vector properties

Here mat and vec represents any sparse-matrix and sparse-vector types respectively.

Standard dimensions	mat.rows() mat.cols()	vec.size()
Sizes along the inner/outer dimensions	mat.innerSize() mat.outerSize()
Number of non zero coefficiens	mat.nonZeros()	vec.nonZeros()

Iterating over the nonzero coefficients

Iterating over the coefficients of a sparse matrix can be done only in the same order than the storage order. Here is an example:

SparseMatrixType mat(rows,cols);
for (int k=0; k\<m1.outerSize(); ++k)
  for (SparseMatrixType::InnerIterator it(mat,k); it; ++it)
  {
    it.value();
    it.row();   // row index
    it.col();   // col index (here it is equal to k)
    it.index(); // inner index, here it is equal to it.row()
  }

SparseVector<double> vec(size);
for (SparseVector<double>::InnerIterator it(vec); it; ++it)
{
  it.value(); // == vec[ it.index() ]
  it.index();
}

Filling a sparse matrix

Owing to the special storage scheme of a SparseMatrix, it is obvious that for performance reasons a sparse matrix cannot be filled as easily as a dense matrix. For instance the cost of a purely random insertion into a SparseMatrix is in O(nnz) where nnz is the current number of non zeros. In order to cover all uses cases with best efficiency, Eigen provides various mechanisms, from the easiest but slowest, to the fastest but restrictive one.

If you don't have any prior knowledge about the order your matrix will be filled, then the best choice is to use a DynamicSparseMatrix. With a DynamicSparseMatrix, you can add or modify any coefficients at any time using the coeffRef(row,col) method. Here is an example:

DynamicSparseMatrix<float> aux(1000,1000);
aux.reserve(estimated_number_of_non_zero); // optional
for (...)
  for each j                          // the j can be random
    for each i interacting with j     // the i can be random
      aux.coeffRef(i,j) += foo(i,j);

Then the DynamicSparseMatrix object can be converted to a compact SparseMatrix to be used, e.g., by one of our supported solver:

SparseMatrix<float> mat(aux);

In order to optimize this process, instead of the generic coeffRef(i,j) method one can also use:

```
 m.insert(i,j) = value; 
```
which assumes the coefficient of coordinate (row,col) does not already exist (otherwise this is a programming error and your program will stop).
```
 m.insertBack(i,j) = value; 
```
which, in addition to the requirements of insert(), also assumes that the coefficient of coordinate (row,col) will be inserted at the end of the target inner-vector. More precisely, if the matrix m is column major, then the row index of the last non zero coefficient of the j-th column must be smaller than i.

Actually, the SparseMatrix class also supports random insertion via the insert() method. However, its uses should be reserved in cases where the inserted non zero is nearly the last one of the compact storage array. In practice, this means it should be used only to perform random (or sorted) insertion into the current inner-vector while filling the inner-vectors in an increasing order. Moreover, with a SparseMatrix an insertion session must be closed by a call to finalize() before any use of the matrix. Here is an example for a column major matrix:

SparseMatrix<float> mat(1000,1000);
mat.reserve(estimated_number_of_non_zero);  // optional
for each j                                  // should be in increasing order for performance reasons
  for each i interacting with j             // the i can be random
    mat.insert(i,j) = foo(i,j);             // optional for a DynamicSparseMatrix
mat.finalize();

Finally, the fastest way to fill a SparseMatrix object is to insert the elements in a purely coherence order (increasing inner index per increasing outer index). To this end, Eigen provides a very low but optimal API and illustrated below:

SparseMatrix<float> mat(1000,1000);
mat.reserve(estimated_number_of_non_zero);  // optional
for(int j=0; j<1000; ++j)
{
  mat.startVec(j);                          // optional for a DynamicSparseMatrix
  for each i interacting with j             // with increasing i
      mat.insertBack(i,j) = foo(i,j);
}
mat.finalize();                             // optional for a DynamicSparseMatrix

Note that there also exist the insertBackByOuterInner(Index outer, Index, inner) function which allows to write code agnostic to the storage order.

Supported operators and functions

In the following sm denote a sparse matrix, sv a sparse vector, dm a dense matrix, and dv a dense vector. In Eigen's sparse module we chose to expose only the subset of the dense matrix API which can be efficiently implemented. Moreover, all combinations are not always possible. For instance, it is not possible to add two sparse matrices having two different storage order. On the other hand it is perfectly fine to evaluate a sparse matrix/expression to a matrix having a different storage order:

SparseMatrixType sm1, sm2, sm3;
sm3 = sm1.transpose() + sm2;                    // invalid
sm3 = SparseMatrixType(sm1.transpose()) + sm2;  // correct

Here are some examples of the supported operations:

s_1 *= 0.5;
sm4 = sm1 + sm2 + sm3;          // only if s_1, s_2 and s_3 have the same storage order
sm3 = sm1 * sm2;
dv3 = sm1 * dv2;
dm3 = sm1 * dm2;
dm3 = dm2 * sm1;
sm3 = sm1.cwiseProduct(sm2);    // only if s_1 and s_2 have the same storage order
dv2 = sm1.triangularView<Upper>().solve(dv2);

The product of a sparse matrix A by a dense matrix/vector dv with A symmetric can be optimized by telling that to Eigen:

res = A.selfadjointView<>() * dv;        // if all coefficients of A are stored
res = A.selfadjointView<Upper>() * dv;   // if only the upper part of A is stored
res = A.selfadjointView<Lower>() * dv;   // if only the lower part of A is stored

Using the direct solvers

To solve a sparse problem you currently have to use one or multiple of the following "unsupported" module:

SparseExtra_Module
- solvers: SparseLLT<SparseMatrixType>, SparseLDLT<SparseMatrixType> (#include <Eigen/SparseExtra>)
- notes: built-in basic LLT and LDLT solvers
CholmodSupport_Module
- solver: SparseLLT<SparseMatrixType, Cholmod> (#include <Eigen/CholmodSupport>)
- notes: LLT solving using Cholmod, requires a SparseMatrix object. (recommended for symmetric/selfadjoint problems)
UmfPackSupport_Module
- solver: SparseLU<SparseMatrixType, UmfPack> (#include <Eigen/UmfPackSupport>)
- notes: LU solving using UmfPack, requires a SparseMatrix object (recommended for squared matrices)
SuperLUSupport_Module
- solver: SparseLU<SparseMatrixType, SuperLU> (#include <Eigen/SuperLUSupport>)
- notes: (LU solving using SuperLU, requires a SparseMatrix object, recommended for squared matrices)
TaucsSupport_Module
- solver: SparseLLT<SparseMatrixType, Taucs> (#include <Eigen/TaucsSupport>)
- notes: LLT solving using Taucs, requires a SparseMatrix object (not recommended)

Warning:: Those modules are currently considered to be "unsupported" because 1) they are not documented, and 2) their API is likely to change in the future.

Here is a typical example:

#include <Eigen/UmfPackSupport>
// ...
SparseMatrix<double> A;
// fill A
VectorXd b, x;
// fill b
// solve Ax = b using UmfPack:
SparseLU<SparseMatrix<double>,UmfPack> lu_of_A(A);
if(!lu_of_A.succeeded()) {
  // decomposiiton failed
  return;
}
if(!lu_of_A.solve(b,&x)) {
  // solving failed
  return;
}

See also the class SparseLLT, class SparseLU, and class SparseLDLT.

Next: TODO