Both packages, alabama and Rsolnp, contain "[i]mplementations of the augmented Lagrange multiplier method for general nonlinear optimization" --- as the optimization task view says --- and are quite reliable and robust. They can handle equality and inequality constraints defined as (nonlinear) functions as well.
I have worked with both packages. Sometimes, constraints are a bit easier to formulate with Rsolnp, whereas alabama appears to be a bit faster at times.
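For illustration, here is a minimal sketch of the two calling conventions on a toy problem (the objective and constraints are invented purely for demonstration):

library(alabama)
library(Rsolnp)
## Toy problem: minimise x^2 + y^2 subject to x + y = 1 and x >= 0.2
fn <- function(p) sum(p^2)
## alabama::auglag encodes equalities as heq(p) == 0 and
## inequalities as hin(p) >= 0
auglag(par = c(1, 1), fn = fn,
       heq = function(p) p[1] + p[2] - 1,
       hin = function(p) p[1] - 0.2)
## Rsolnp::solnp takes equalities via eqfun/eqB and inequalities as
## bounds on ineqfun (a finite upper bound is used here to be safe)
solnp(pars = c(1, 1), fun = fn,
      eqfun = function(p) p[1] + p[2], eqB = 1,
      ineqfun = function(p) p[1], ineqLB = 0.2, ineqUB = 10)

Both calls should return the solution x = y = 0.5, at which the inequality constraint is inactive.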
There is also the package Rdonlp2, which relies on an external software library that is well known in the optimization community. Unfortunately, its license status is a bit uncertain at the moment.
As I've said in the comment, this is not a constrained optimisation problem, i.e. there are no constraints.
Define
$$\psi(x)=\begin{cases}T^2, & |x|>T\\ x^2, & |x|\le T\end{cases}$$
Then the optimisation problem can be rewritten as
$$w^*=\text{argmin}_w\sum_{i=1}^n\psi\left(y_i-\sum_{j=1}^p x_{ij}w_j\right),$$
where $p$ is the number of predictors.
Such minimisation problems were first considered by P. J. Huber in 1964, so this particular problem is a robust statistics problem. A more complicated example is least trimmed squares, where a portion of the largest (in absolute value) errors is discarded. The latter method is implemented in the R package robustbase in the function ltsReg.
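For example, a least trimmed squares fit would look like this (a sketch assuming a hypothetical data frame dat with response y and predictor x):

library(robustbase)
fit <- ltsReg(y ~ x, data = dat)  ## least trimmed squares regression
coef(fit)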
For your particular problem you can use optim. Here is an example (in R). First set up the functions and some dummy data:
psi <- function(x, T) {
    ## Cap residuals at the threshold T, then square:
    ## elementwise this yields min(x^2, T^2)
    x[abs(x) > T] <- T
    x^2
}
optfun <- function(w, T) {
    ## Objective: truncated squared loss of the regression residuals
    ## (y and X are taken from the calling environment)
    sum(psi(y - X %*% w, T = T))
}
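A quick sanity check of psi (residuals beyond the threshold are capped before squaring, so the loss is min(x^2, T^2)):

> psi(c(-3, 0.5, 3), T = 2)
[1] 4.00 0.25 4.00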
##Create sample data set
set.seed(13)
X <- cbind(1, rnorm(100))
y <- 1 + 0.5*X[, 2] + rnorm(100)/2
##Contaminate the data: move 10 points to high leverage
smpl <- sample(1:100, 10)
X[smpl, 2] <- 10
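For comparison (not part of the original example), an ordinary least squares fit on the contaminated data shows how strongly the ten leverage points drag the slope away from its true value of 0.5:

coef(lm(y ~ X[, 2]))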
Now run the optimisation:
> optim(c(0,0),optfun,T=2)
$par
[1] 0.95419635 0.03152925
$value
[1] 48.39998
$counts
function gradient
77 NA
$convergence
[1] 0
$message
NULL
Note that the slope estimate is far from its true value of 0.5. Reduce the value of $T$:
> optim(c(0,0),optfun,T=1)
$par
[1] 1.0072015 0.5557612
$value
[1] 31.67965
$counts
function gradient
63 NA
$convergence
[1] 0
$message
NULL
Now we recover estimates close to the true values. In this simple example I ignored all the possible optimisation issues, such as the choice of starting values and the choice of optimisation method. It is not hard to extend this code to produce standard errors of the coefficient estimates; see the package numDeriv.
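A rough sketch of that extension, under the simplifying assumption that the inverse Hessian of the objective at the optimum approximates the covariance of the estimates (a proper M-estimator sandwich variance would be more careful):

library(numDeriv)
fit <- optim(c(0, 0), optfun, T = 1)
H <- hessian(optfun, fit$par, T = 1)  ## numerical Hessian at the optimum
se <- sqrt(diag(solve(H)))            ## crude standard errors
se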
Best Answer
So I found a solution that works. This might be relevant for others who find the question in the future.
Here cat1 and cat2 are the sums which are required from the matrices.
The code that works: