Kriging with gstat package in R produces empty interpolation points

gstat kriging r sf

I have kriged a dataset with the gstat package in R, but it has produced an empty variable as a result. I have 1,167 measurement points within a field, and I am trying to interpolate them across 3,464 interpolation points within the same field. How can I achieve a Kriged result with estimated values at each point?

Note: The input_data and interpolation points can be found within a text file at this link; the contents within the file just need to be copied and pasted into an R window and run to generate the data frames. In addition, if desired, the field boundary shape file referred to later in this post can be found here.

These are the points that will be interpolated:

#Import libraries
library(pacman)
p_load(raster, 
       sf, 
       dplyr, 
       ggplot2, 
       scales, 
       magrittr, 
       gstat, 
       gridExtra, 
       sp,
       automap,
       mapview,
       leaflet,
       rgdal)

#Graph points for interpolation
input_data %>% 
  as.data.frame %>% 
  ggplot(aes(longitude, latitude)) +
  geom_point(aes(size = OM), color = 'red', alpha = 3/4) +
  ggtitle('Organic Matter Concentration') +
  coord_equal() +
  theme_bw()

Convert data frame into a spatial object

input_data_sf <- st_as_sf(input_data, coords = c('longitude', 'latitude'), crs = 4326)
crs(input_data_sf)

glimpse(input_data_sf)

Coordinate Reference System:
Deprecated Proj.4 representation: +proj=longlat +datum=WGS84 +no_defs 
WKT2 2019 representation:
GEOGCRS["WGS 84",
    DATUM["World Geodetic System 1984",
        ELLIPSOID["WGS 84",6378137,298.257223563,
            LENGTHUNIT["metre",1]]],
    PRIMEM["Greenwich",0,
        ANGLEUNIT["degree",0.0174532925199433]],
    CS[ellipsoidal,2],
        AXIS["geodetic latitude (Lat)",north,
            ORDER[1],
            ANGLEUNIT["degree",0.0174532925199433]],
        AXIS["geodetic longitude (Lon)",east,
            ORDER[2],
            ANGLEUNIT["degree",0.0174532925199433]],
    USAGE[
        SCOPE["Horizontal component of 3D system."],
        AREA["World."],
        BBOX[-90,-180,90,180]],
    ID["EPSG",4326]]

The coordinate reference system we are working with is WGS 84 (EPSG:4326), which is geographic (longitude/latitude in degrees) rather than projected.

Making sure that the data was converted properly:

plot(st_geometry(input_data_sf))

That looks good.

Fitting the variogram using gstat and a Matern model:

lzn.vgm <- variogram(log(OM)~1, input_data_sf)

lzn.fit <- fit.variogram(lzn.vgm, vgm('Mat'), fit.kappa = TRUE)

The following warning messages are produced after running the fit.variogram() command:

Warning messages:
1: In fit.variogram(o, m, fit.kappa = FALSE, fit.method = fit.method,  :
  No convergence after 200 iterations: try different initial values?
2: In fit.variogram(o, m, fit.kappa = FALSE, fit.method = fit.method,  :
  No convergence after 200 iterations: try different initial values?
3: In fit.variogram(o, m, fit.kappa = FALSE, fit.method = fit.method,  :
  No convergence after 200 iterations: try different initial values?
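
One way to act on the "try different initial values?" hint is to pass explicit starting values to vgm() instead of leaving them unset. The following is a minimal sketch, not a tuned fit: it uses the meuse dataset that ships with sp as a stand-in for the field data, and the rules used to pick nugget0, psill0 and range0 are rough assumptions read off the empirical variogram.

```r
library(sp)
library(gstat)

# Stand-in data: the meuse dataset shipped with sp (not the field data from this post).
data(meuse)
coordinates(meuse) <- ~x + y

# Empirical variogram of the log-transformed variable
emp <- variogram(log(zinc) ~ 1, meuse)

# Rough starting values taken from the empirical variogram (assumed heuristics)
nugget0 <- min(emp$gamma)            # smallest observed semivariance
psill0  <- max(emp$gamma) - nugget0  # partial sill above the nugget
range0  <- max(emp$dist) / 3         # a third of the longest lag distance

# Matern model with explicit initial values
init <- vgm(psill = psill0, model = "Mat", range = range0,
            nugget = nugget0, kappa = 0.5)

# Fit the model, letting gstat also search over kappa
fit <- fit.variogram(emp, init, fit.kappa = TRUE)
print(fit)
```

For the data in this post, the same pattern would apply with variogram(log(OM) ~ 1, input_data_sf) in place of the meuse call. Convergence is still not guaranteed, so it is worth plotting the fitted model against the empirical variogram before kriging.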

Plotting the variogram:

plot(lzn.vgm, lzn.fit)

After generating the semivariogram, we now must convert the interpolation points (the locations where values will be predicted) to a spatial object:

int_pnts <- st_as_sf(int_pnts, coords = c('POINT_X', 'POINT_Y'), crs = 4326)
crs(int_pnts)

glimpse(int_pnts)

Visualizing the measured points and interpolated points:

#Read field boundary shapefile
shp <- st_read("field_boundary.shp")

plot1 <- ggplot() +
  geom_sf(data = shp) +
  geom_sf(data = input_data_sf) +
  ggtitle("Sample points in field")

#Run this code if you don't download the field boundary
#plot1 <- ggplot() +
  #geom_sf(data = input_data_sf) +
  #ggtitle("Sample points in field")

plot1

plot2 <- ggplot() +
  geom_sf(data = shp) +
  geom_sf(data = int_pnts) +
  ggtitle("Interpolation points")

#Run this code if you don't download the field boundary:
#plot2 <- ggplot() +
  #geom_sf(data = int_pnts) +
  #ggtitle("Interpolation points")

plot2

Finally, perform the kriging:

lzn.kriged <- krige(log(OM) ~ 1, input_data_sf, int_pnts, model = lzn.fit)

The problem is here: when I convert the lzn.kriged output to a data frame to inspect it, both of the first two columns (var1.pred, the prediction, and var1.var, the kriging variance) contain NA values. How can I fix this so I can graph my interpolation?

#Convert kriged output to data frame
krig <- lzn.kriged %>% 
  as.data.frame
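
To see how widespread the NA values are, it can help to count them per column before plotting anything. Below is a minimal sketch; na_report() is a hypothetical helper (not part of gstat), and the toy data frame only mimics the var1.pred and var1.var columns that krige() returns.

```r
# Hypothetical helper: count rows and NA values in the two kriging output columns.
na_report <- function(kriged_df) {
  c(n       = nrow(kriged_df),
    na_pred = sum(is.na(kriged_df$var1.pred)),
    na_var  = sum(is.na(kriged_df$var1.var)))
}

# Toy stand-in with the same column names as a krige() result
toy <- data.frame(var1.pred = c(0.65, NA, 0.81),
                  var1.var  = c(0.042, NA, 0.043))

print(na_report(toy))
```

Applied to the krig data frame above, a result where na_pred equals n would indicate that no location received a prediction at all, which points at the variogram model or the coordinate system rather than at individual points.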

Best Answer

I don't really recommend doing kriging in lat-long degree coordinates. Project your data to a Cartesian coordinate system and try again. For example, if I convert to EPSG:3857 (Web Mercator) then it works.

> int_pnts=st_transform(int_pnts, 3857)
> input_data_sf = st_transform(input_data_sf, 3857)
> lzn.vgm <- variogram(log(OM)~1, input_data_sf)
> lzn.fit <- fit.variogram(lzn.vgm, vgm('Mat'), fit.kappa = TRUE)

This gives some warnings, which we'll ignore, and then kriging produces meaningful results:

> lzn.kriged <- krige(log(OM) ~ 1, input_data_sf, int_pnts, model = lzn.fit)
[using ordinary kriging]
> head(lzn.kriged)
Simple feature collection with 6 features and 2 fields
Geometry type: POINT
Dimension:     XY
Bounding box:  xmin: -10757430 ymin: 4738785 xmax: -10757310 ymax: 4738918
Projected CRS: WGS 84 / Pseudo-Mercator
  var1.pred   var1.var                  geometry
1 0.6545492 0.04198732 POINT (-10757334 4738803)
2 0.3821123 0.04326622 POINT (-10757376 4738785)
3 0.8089032 0.04263483 POINT (-10757315 4738882)
4 0.6558402 0.04288256 POINT (-10757384 4738896)
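
Because the model is fitted to log(OM), var1.pred and var1.var are on the log scale. A minimal sketch of back-transforming them follows; it uses a toy data frame built from the four rows printed above, and the column names OM_median and OM_mean_approx are made up for illustration.

```r
# Toy stand-in built from the four rows printed above; in practice these
# columns come from the lzn.kriged object returned by krige().
kriged <- data.frame(
  var1.pred = c(0.6545492, 0.3821123, 0.8089032, 0.6558402),
  var1.var  = c(0.04198732, 0.04326622, 0.04263483, 0.04288256)
)

# exp() of a log-scale prediction gives a median-type estimate of OM.
kriged$OM_median <- exp(kriged$var1.pred)

# Common lognormal approximation to the mean. It ignores the
# Lagrange-multiplier term of ordinary kriging, so treat it as approximate.
kriged$OM_mean_approx <- exp(kriged$var1.pred + kriged$var1.var / 2)

print(kriged)
```

The same column assignments should work directly on the sf object, after which something like ggplot() + geom_sf(data = lzn.kriged, aes(color = OM_median)) can be used to map the interpolated values.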

CRS 3857 is not the best choice; it's just something that is likely to "work" in terms of "producing a result" in most places on the globe. You should find a local coordinate system for your data, or use a UTM zone appropriate for your longitude.
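
As a rough sketch of picking a UTM zone: the helper below, utm_epsg(), is hypothetical (not an sf or gstat function) and applies the standard 6-degree zone rule for WGS 84 / UTM, ignoring the special zones around Norway and Svalbard. The example point is taken from the bounding box printed above and converted back from Web Mercator to degrees.

```r
# Hypothetical helper: EPSG code of the WGS 84 / UTM zone containing a point.
# Uses the standard 6-degree zones; special zones (Norway, Svalbard) are ignored.
utm_epsg <- function(lon, lat) {
  zone <- floor((lon + 180) / 6) + 1
  if (lat >= 0) 32600 + zone else 32700 + zone
}

# One point from the bounding box above, in EPSG:3857 metres
x <- -10757334
y <- 4738803

# Inverse spherical Web Mercator, to recover approximate lon/lat in degrees
R   <- 6378137
lon <- x / R * 180 / pi
lat <- (2 * atan(exp(y / R)) - pi / 2) * 180 / pi

epsg <- utm_epsg(lon, lat)
print(epsg)  # 32614, i.e. WGS 84 / UTM zone 14N
```

The resulting code can then be passed to st_transform() for both input_data_sf and int_pnts in place of 3857, before recomputing the variogram.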

Related Solutions

[GIS] Spatio-temporal block kriging with R package gstat

In sp, SpatialPoints*, SpatialPixels* and SpatialGrid* (with * omitted or replaced by DataFrame) do support more than 2 spatial dimensions, as OP has done, but SpatialPolygons* and SpatialLines* do not. With gstat you can do 3-D block kriging with 3-D blocks (using block = c(10,10,10)), but you cannot do this for non-rectangular blocks, as OP wants. It is perfectly OK to substitute time for the third dimension, but you are constrained to the metric ST variogram.

library(gstat)
vignette("st")

gives you more options for variogram models, but not for predicting block mean values (this is FYI, not an answer to the question).

The only answer to the question would be to do 3D conditional simulations, and aggregate point values over your arbitrary 3D (2D polygon + time extent) blocks. Tedious, but possible; also only along the 3D path, not along the path described in the ST vignette (krigeST does not do simulation - yet!).

[GIS] Preparing dataset to perform co-kriging in R gstat

You will find all you need in the excellent (and didactic) technical note from Rossiter (2012)*:

Technical Note: Co-kriging with the gstat package of the R environment for statistical computing.

Co-kriging will use different functions from those with univariate kriging (for example, ordinary kriging).

The datasets (target and co-variables) should remain in separate data frames, but within the same object of class gstat. And predictions (interpolation) are carried out with predict.gstat.

In Rossiter (2012), chapters 6 and 7 explain in detail how to do it:

(6) Modelling a bivariate co-regionalisation.
(7) Co-kriging with one co-variable.

Below is the main code from Rossiter (2012) which addresses the question. It uses the meuse dataset as example:

g <- gstat(NULL, id = "ltpb", form = ltpb ~ 1, data=meuse.pb) #target variable lead (pb).  
g <- gstat(g, id = "ltom", form = ltom ~ 1, data=meuse.co) #co-variable organic matter (om).  
v.cross <- variogram(g) #generate direct variograms and the cross-variogram.  
g <- gstat(g, id = "ltpb", model = m.ltpb.f, fill.all = TRUE) #add variogram models to the gstat object. Here, the variogram model fitted to the target variable in the previous chapter is used as the starting point for both the target variable and the co-variable.  
g <- fit.lmc(v.cross, g) #fit theoretical variograms to experimental ones (uses linear model of co-regionalisation).  
k.c <- predict.gstat(g, meuse.grid) #predicts values for target variable in the prediction grid.

*Rossiter, D.G. 2012. Technical Note: Co-kriging with the gstat package of the R environment for statistical computing. University of Twente, Faculty of Geo-Information Science & Earth Observation (ITC). Enschede (NL). Revision 2.3. 84p.
