R Programming – Plotting and Analyzing Extracted Elevation Data in R

elevationrrasterzonal statistics

I have a shapefile of county boundaries for New York state and elevation data downloaded through the elevatr package.

library(elevatr)
library(raster)
library(sf)
library(USAboundaries)

counties <- us_counties(map_date = "1930-01-01", resolution = "high", states = c("NY"))

counties_sf <- as(counties, "Spatial")
elevation_data <-get_elev_raster(counties_sf, z=9, src = "aws")
map <- extract(elevation_data, counties)
plot(elevation_data, axes=TRUE)
plot(st_geometry(counties), add=TRUE)

This produces an image like this:

That's all well and good, but how do I restrict the elevation data to just the area of the county boundaries, e.g. for computing zonal statistics or for creating a map of only the state?

The extract function from the raster package returns a sequential list of lists of elevation points, but as far as I can tell, there isn't a way to link those points back to the unique IDs of the actual counties that come from the shapefile, even using the extent function.

Ideally I'd like to work with everything using the sf (Simple Features) package, as in this question, because that's what I'm most familiar with. It's a little confusing for me to keep track of which packages return raster objects, which return sf objects, which return spatial polygon data frames, etc.

Best Answer

You can summarize values within each polygon by passing a custom function to the extract function in the raster package, or the exact_extract function in the exactextractr package (much faster, handles pixels partially covered by polygons.)

The raster::extract function expects a summarizing function with the signature function(x, na.rm), e.g.:

counties$second_lowest_point <- extract(
 elevation_data,
 counties, 
 fun=function(x, na.rm) {
   if (na.rm) { 
     sort(na.omit(x))[2]
   } else {
     sort(x)[2]
   }
 })

The exactextractr::exact_extract function expects a summarizing function with the signature function(x, w), where w is the fraction of the pixel that is covered by the polygon. Here we're taking the area-weighted mean of elevations > 400m:

counties$mean_elevation_over_400 <- exact_extract(
  elevation_data,
  counties,
  fun=function(x, w) { weighted.mean(x[x > 400]) })

Related Solutions

ArcGIS Desktop – Counting Raster Cells Within a Polygon

You can do this in two steps. First, use Con (Spatial Analyst) to convert cells > 50 to 1 and all other cells to 0. Then use Zonal Statistics as Table (Spatial Analyst) to count the number of "1" cells within your polygon.

enter image description here

[GIS] Extract median value for polygons from multiresolution raster data, using R (velox, raster -package)

velox maintainer here.

The development version of velox (available on github) now implements a small option for the VeloxRaster_extract method. If small = TRUE, velox will return raster values for all polygons that intersect with the raster extent. Specifically, for all small or oddly shaped polygons that do not intersect with any cell center, velox will look for intersections with entire cell boxes.

Using the development version of velox and setting the small option to TRUE, your example now yields no NA values:

# Install latest version of velox
library(devtools)
install_github("hunzikp/velox")

# Reproducible example:   
library(velox)
library(raster)

## Make VeloxRaster
mat <- matrix(1:100, 10, 10)
extent <- c(0,1,0,1)
vx <- velox(mat, extent=extent, res=c(0.1,0.1), crs="+proj=longlat     +datum=WGS84 +no_defs")

## Make SpatialPolygonsDataFrame
library(sp)
library(rgeos)
set.seed(0)
coords <- cbind(runif(10, extent[1], extent[2]), runif(10, extent[3], extent[4]))
sp <- SpatialPoints(coords)

# Default example
# from https://cran.r-project.org/web/packages/velox/README.html
spol_norm <- gBuffer(sp, width=0.2, byid=TRUE)
spdf_norm <- SpatialPolygonsDataFrame(spol_norm,     data.frame(id=1:length(spol_norm)), FALSE)

# Smaller buffer
spol_small<- gBuffer(sp, width=0.05, byid=TRUE)
spdf_small <- SpatialPolygonsDataFrame(spol_small, data.frame(id=1:length(spol_small)), FALSE)

plot(spdf_norm); par(new=F)
plot(spdf_small)

## Extract values and calculate mean, see results
(ex.mat.norm <- vx$extract(spdf_norm, fun=median))
(ex.mat.small <- vx$extract(spdf_small, fun=median, small = TRUE)) # -> No NA values anymore.

Best Answer

Related Solutions

ArcGIS Desktop – Counting Raster Cells Within a Polygon

[GIS] Extract median value for polygons from multiresolution raster data, using R (velox, raster -package)

Related Question