[GIS] Using rasterio to crop image using pixel coordinates instead of geographic coordinates

imagery python rasterio

I'm trying to extract a series of random image patches from a larger satellite image (from worldview3). I want the patches to have a uniform size (say 512×512). If it were a normal image, any number of libraries could do this easily, but I want to use rasterio so that each patch retains its geographic information.

So far the code I've written identifies the upper left pixel coordinate using

image.bounds

and uses the resulting bounding box to get the height and the width. Then I use numpy to generate a random coordinate within the image extent:

new_col = np.random.randint(min(bounds.left, bounds.right), max(bounds.left, bounds.right)-(size+1))
new_row = np.random.randint(min(bounds.top, bounds.bottom), max(bounds.top, bounds.bottom)-(size+1))

And then I subtract the patch size (512) from the new row and column and use those new coordinates as the minx, miny, maxx, maxy and crop from there using

mask(image, shapes=coords, crop=True)

With a non-georeferenced image where the upper left is (0,0) and the lower right is (M,N) this works flawlessly. Similarly for a NAIP image this seems to work. But with the worldview3 image the size is not uniform. I'll get an image size like 700×2000 for example, where I want it to be 512×512. My thought was that the NAIP image I tested with was in UTM, but that the worldview3 image was in lat/long so that subtracting the patch size from the random row and column didn't translate to a uniform pixel size. I still think the error has to do with lat/long but after reprojecting the image to UTM the problem persists.

So, is there any way I can crop the worldview3 image using rasterio but using pixel coordinates instead of geographic coordinates so that I can crop a uniform image size from the larger image but still retain the geographic information of the cropped image patch?

Best Answer

You can use a window (rasterio.windows.Window) to read by pixel offsets instead of building a geographic bounding box. The georeferencing of the patch can then be calculated from the window using the source dataset's window_transform method.

import random
import rasterio
from rasterio.windows import Window

with rasterio.open('input.tif') as src:

    # The size in pixels of your desired window
    xsize, ysize = 512, 512

    # Generate a random window origin (upper left) that ensures the window 
    # doesn't go outside the image. i.e. origin can only be between 
    # 0 and image width or height less the window width or height
    xmin, xmax = 0, src.width - xsize
    ymin, ymax = 0, src.height - ysize
    xoff, yoff = random.randint(xmin, xmax), random.randint(ymin, ymax)

    # Create a Window and calculate the transform from the source dataset    
    window = Window(xoff, yoff, xsize, ysize)
    transform = src.window_transform(window)

    # Create a new cropped raster to write to
    profile = src.profile
    profile.update({
        'height': ysize,
        'width': xsize,
        'transform': transform})

    with rasterio.open('output.tif', 'w', **profile) as dst:
        # Read the data from the window and write it to the output raster
        dst.write(src.read(window=window))
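
For intuition, the window transform is the source transform shifted by the window's pixel offset. The sketch below reproduces that arithmetic in plain Python, without rasterio. It assumes the affine transform is given as six coefficients (a, b, c, d, e, f) with x = a*col + b*row + c and y = d*col + e*row + f. The helper name shift_transform and the sample numbers are hypothetical, chosen only for illustration.

```python
def shift_transform(coeffs, col_off, row_off):
    """Return the affine coefficients for a window whose upper-left
    pixel sits at (col_off, row_off) in the source raster."""
    a, b, c, d, e, f = coeffs
    # Pixel size and rotation terms are unchanged; only the origin moves.
    new_c = a * col_off + b * row_off + c
    new_f = d * col_off + e * row_off + f
    return (a, b, new_c, d, e, new_f)


# Hypothetical north-up raster: 0.5 m pixels, origin at (300000, 4000000)
src_transform = (0.5, 0.0, 300000.0, 0.0, -0.5, 4000000.0)
win_transform = shift_transform(src_transform, col_off=100, row_off=200)
print(win_transform)
```

Because the pixel size terms are carried over untouched, a 512×512 window always covers 512×512 pixels, whatever units the CRS uses.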

Related Solutions

[GIS] How to calculate the image size knowing its coordinates and pixel size

For me it's basic mathematics.

According to your data:

minx = 286185.598266
maxx = 286223.863098
miny = 5180909.674438
maxy = 5180967.071686
pixel_size = 0.0463053

so you can calculate height and width of your image in meters:

width = maxx-minx
height = maxy-miny

and the numbers of pixels in each direction by dividing by the pixel_size

cols = width/pixel_size
rows = height/pixel_size

With your data this gives a raster of approximately 826 × 1240 pixels (columns × rows).
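
Put together as a runnable snippet, the calculation looks like this. Rounding to the nearest whole pixel is an assumption made here; the answer itself only gives the division.

```python
# Extent and pixel size taken from the question's data
minx, maxx = 286185.598266, 286223.863098
miny, maxy = 5180909.674438, 5180967.071686
pixel_size = 0.0463053

# Extent in map units divided by the pixel size gives the pixel counts.
# Rounding to the nearest integer is an assumption for this sketch.
cols = round((maxx - minx) / pixel_size)
rows = round((maxy - miny) / pixel_size)
print(cols, rows)
```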

[GIS] Pixel coordinate to world coordinates using Python

The arcs of the upper and lower parallels bounding the original tile, measured on a sphere of 6371 km radius, are both about 480 m long and differ by less than the 10 cm that one pixel measures. Even so, it should generally be considered a mistake to define the spatial resolution in linear units while treating everything else as angular.

It is better to say that the spatial resolution is 0.10 m × 0.10 m per pixel and that the coordinate system is flat. Within 480 m, I am a flat-earther: I measure with precision instruments over that distance and draw a plan of the terrain, not only without considering the Earth's curvature, but also without considering the reduction to the ellipsoid (not to mention the geoid undulation or the deviation of the vertical).

Let's say the system is a plane tangent to the sphere at the center of the image. Coordinates measured from the center can then be treated as eastings and northings, in meters, of a topocentric CRS. Alternatively, let's say the system is a cylinder whose axis lies in the plane of the equator and which is tangent to the spherical surface at the center of the image. Both options are valid within 480 m, can be calculated with pyproj, and are as easy to use with the sphere as with the WGS84 ellipsoid. How to do it depends on your pyproj version, which can differ if you are using a conda environment. In any case, how to do it would be another answer.

About your code, leaving aside the wrong assumption that every pixel can have the same spatial resolution when it is defined by geographic coordinates:

Here: y_new = y_center + (side/size) * dist_px_y * (dlat/side). When the pixel matrix has an even number of pixels per side, the center falls on the shared corner of four pixels. The distance from the center to the first pixel on either side is therefore half the pixel size, and the distance to any other pixel can be generalized as (coordinate of the pixel - 0.5) * pixel size.
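
The half-pixel rule can be sketched as follows. This assumes a matrix with an even number of pixels per side and 0-based indices counted from the left (or top) edge. The helper name center_offset is hypothetical and is not part of the code in the question.

```python
def center_offset(index, size, pixel_size):
    """Signed distance from the image center to the center of the pixel
    at a 0-based index, along one axis of a `size`-pixel-wide matrix.

    Positive values point toward increasing index; for rows, negate the
    result to get a northing offset, since row indices grow downward.
    """
    return (index + 0.5 - size / 2) * pixel_size


# 4-pixel-wide matrix with 0.1 m pixels: the two pixels next to the
# center sit half a pixel away, the outer two sit 1.5 pixels away.
offsets = [center_offset(i, 4, 0.1) for i in range(4)]
print(offsets)
```

Counting pixels from the center outward starting at 1, this is the same (coordinate - 0.5) * pixel size rule stated above.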

And here: x_new = x_min + (side/size) * ((size/2) + dist_px_x) * 360 * (1/(2 * np.pi * r * np.cos(np.deg2rad(y_new)))). I don't know why you go to the x origin using the arc length of the center parallel but then move to the right using the arc length of the current parallel. Each row of pixels further up will then have a different longitude.

I don't think any of that can produce a 16 m error, which is roughly the distance from the center to the edge of a submatrix. The problem may instead be in the distance_vector calculation.

As for the other formulas, I have not checked them, but I understand your intention and agree with the underlying math used to accomplish it.
