[GIS] Function (sample code) to extract raster value per polygon in R

extractpolygonrraster

I have a raster image (Sentinel-2 band 4) and a shapefile covering the same area than the image (same projection and extension). The shapefile has different polygons which are agricultural fields.

I want to create a new file where, for each polygon, the mean value of the pixels that are inside it is calculated.

Could you please help me to find an appropriate R code for that?

Best Answer

The raster::extract function, when applied to polygons or with the buffer argument, returns a list object where each element in the list contains a vector of the raster values intersecting the polygon. If the input raster object was a stack or brick, containing multiple rasters, the list elements are a matrix rather than a vector.

Providing the fun argument to extract is just a way of aggregating or summarizing the values within the function, without having to the deal with the list object. This really only works with simple functions that operate on vectors, not matrices.

Here we can work through what extract returns and manipulate the results. First add the required packages and create some example data. We will create a raster (r) and some polygons (poly).

library(raster)
library(sp)   
poly <- raster(nrow=10, ncol=10)
  poly[] <- runif(ncell(poly)) * 10
    poly <- rasterToPolygons(poly, fun=function(x){x > 9})
      r <- raster(nrow=100, ncol=100)
        r[] <- runif(ncell(r)) 
plot(r)
  plot(poly, add=TRUE, lwd=4)

Now we can look at what extract returns when the fun argument is not provided.

( v <- extract(r, poly) )

You can see that it is a list with vectors of raster values corresponding to each polygon. Please note that the list elements are ordered, that is to say that the firs list element corresponds to the first polygon. Because of this, any summary of the list will stay ordered with the polygon data.

For illustration purposes, let's write our own function that calculates the proportion of values above a given threshold.

pct <- function(x, p=0.30) {
  if ( length(x[x >= p]) < 1 )  return(0) 
    if ( length(x[x >= p]) == length(x) ) return(1) 
     else return( length(x[x >= p]) / length(x) ) 
}

Now, we can apply this function to the list object using the lapply function. Since the object is a list it will return a list so, we simply wrap the call in unlist to coerce it into a vector.

unlist(lapply(v, pct))

Since this data is ordered with our polygons we can just assign it into a new column "pcts".

poly@data$pcts <- unlist(lapply(v, pct))
  poly@data
  spplot(poly, "pcts")

Here is a quick example of what happens when the list contains matrices resulting from passing extract a multiband object. In this case, any function passed to lapply would have to account for the different data structure eg., calling a specific column or operating on the entire matrix.

r.stack <- stack(r,r,r)
v <- extract(r.stack, poly) 
lapply(v, head) #display first 6 lines of each matrix

This is relevant because something like Sentinel data is multiband. To return the mean for each band and add it to the polygons data.frame you could apply something like the colMeans function and, using do.call, coerce to a matrix/data.frame.

as.data.frame(do.call(rbind, lapply(v, colMeans))) 
( poly@data <- data.frame(poly@data, do.call(rbind, lapply(v, colMeans))) )

You can also allow the extract function to to the recycling of the raster layer using the df=TRUE argument to achieve the same results.

extract(r.stack, poly, fun=mean, df=TRUE)

Related Solutions

[GIS] Extract median value for polygons from multiresolution raster data, using R (velox, raster -package)

velox maintainer here.

The development version of velox (available on github) now implements a small option for the VeloxRaster_extract method. If small = TRUE, velox will return raster values for all polygons that intersect with the raster extent. Specifically, for all small or oddly shaped polygons that do not intersect with any cell center, velox will look for intersections with entire cell boxes.

Using the development version of velox and setting the small option to TRUE, your example now yields no NA values:

# Install latest version of velox
library(devtools)
install_github("hunzikp/velox")

# Reproducible example:   
library(velox)
library(raster)

## Make VeloxRaster
mat <- matrix(1:100, 10, 10)
extent <- c(0,1,0,1)
vx <- velox(mat, extent=extent, res=c(0.1,0.1), crs="+proj=longlat     +datum=WGS84 +no_defs")

## Make SpatialPolygonsDataFrame
library(sp)
library(rgeos)
set.seed(0)
coords <- cbind(runif(10, extent[1], extent[2]), runif(10, extent[3], extent[4]))
sp <- SpatialPoints(coords)

# Default example
# from https://cran.r-project.org/web/packages/velox/README.html
spol_norm <- gBuffer(sp, width=0.2, byid=TRUE)
spdf_norm <- SpatialPolygonsDataFrame(spol_norm,     data.frame(id=1:length(spol_norm)), FALSE)

# Smaller buffer
spol_small<- gBuffer(sp, width=0.05, byid=TRUE)
spdf_small <- SpatialPolygonsDataFrame(spol_small, data.frame(id=1:length(spol_small)), FALSE)

plot(spdf_norm); par(new=F)
plot(spdf_small)

## Extract values and calculate mean, see results
(ex.mat.norm <- vx$extract(spdf_norm, fun=median))
(ex.mat.small <- vx$extract(spdf_small, fun=median, small = TRUE)) # -> No NA values anymore.

Extracting pixels from a raster based on a given tolerance value as in Photoshop

There is the Magic Wand plugin in QGIS that works similar to the magic wand tool known from graphic software. Install the plugin, then simply click on the map canvas and the plugin creates a polygon layer with the selection.

You could use this polygon to rasterize and use the Raster Calculator to mask the colors in the initial raster - or simply use the polygon layer for that.

You can change the Accuracy (Fast to Precise) and Color Threshold (Ambiguous to Strict).

One single click on the Ottoman Empire was enough to create Polygon styled in red hachures: $C:\Users\DU\Desktop\temp_new\320.tif$

Best Answer

Related Solutions

[GIS] Extract median value for polygons from multiresolution raster data, using R (velox, raster -package)

Extracting pixels from a raster based on a given tolerance value as in Photoshop

Related Question