[GIS] How to speed up a query for nearest linestring to point data

linestringnearest neighborpointpostgispostgresql

I have processed a Strahler stream order layer with a lot of river linestrings (type: Multilinestring) and their corresponding strahler number (see information here). Besides I have a point layer.

I want to know which linestring is the nearest to the distinct points from the point layer to extract the strahler number for every point.

enter image description here

My query here is very slow. For around 500 points and 2000 linestrings it takes over 6 minutes:

SELECT DISTINCT ON
    (lname)
    lnumber,
    lname,
    strahler,
    min(ST_Length(geom)) AS distance
FROM (
    SELECT 
        ST_MakeLine(ST_ClosestPoint(strahler_streams.geom, landslide.geom), landslide.geom) AS geom,
        strahler,
        lname,
        lnumber
FROM landslide, strahler_streams) AS foo
    GROUP BY strahler, lname, lnumber, geom 
    ORDER BY lname, distance;

I know there is the Indexed Nearest Neighbour Search in PostGIS.

But when I try this…

SELECT
    landslide.lnumber,
    strahler_streams.strahler
FROM landslide a, strahler_streams b
    WHERE lnumber=2114
        ORDER BY b.geom <#> a.geom LIMIT 1

… it only works when I select one single point (WHERE lnumber=2114)

How can I speed up the query for the nearest neighbor search (linestring, point)?

Best Answer

With the advice of Mapperz I've edited my query:

SELECT DISTINCT ON
    (lnumber)
    lnumber,
    strahler,
    min(ST_Distance(ST_ClosestPoint(strahler_streams.geom, landslide.geom), landslide.geom)) AS distance
FROM landslide, strahler_streams
    WHERE ST_DWithin(strahler_streams.geom, landslide.geom, 2000.0)
        GROUP BY lnumber, strahler
        ORDER BY lnumber, distance

I've approximated the distance value until all point features are included. Before I count them with SELECT count(geom) FROM landslide.

I've run the query with EXPLAIN ANALYZE. About 500 point features and 9000 linestrings features were are involved in this query and it takes about, depending on the distance value, 250ms (for a distance of 2000m (EPSG:31468)).

Related Solutions

PostGIS – How to Get the Nearest Point on a Linestring to a Given Point

ad 1) Looking at the documentation for your used functions, I'd say: "Yes, all concerned linestrings will be found."

expand(geometry, float)

This function returns a bounding box expanded in all directions from the bounding box of the input geometry, by an amount specified in the second argument. Very useful for distance() queries, to add an index filter to the query.

A && B

The "&&" operator is the "overlaps" operator. If A's bounding box overlaps B's bounding box the operator returns true.

ad 2) You should be able to achieve what you want via:

st_line_interpolate_point(linestring, st_line_locate_point(LineString, Point))

st_line_interpolate_point(linestring, location)

Interpolates a point along a line. First argument must be a LINESTRING. Second argument is a float8 between 0 and 1 representing fraction of total 2d length the point has to be located.

st_line_locate_point(LineString, Point)

Returns a float between 0 and 1 representing the location of the closest point on LineString to the given Point, as a fraction of total 2d line length. You can use the returned location to extract a Point (line_interpolate_point)

PostGIS – How to Perform Nearest Neighbor Calculation in PostGIS?

a and b are alias table names to the same table. This is effectively a T1 CROSS JOIN T2 in DB-speak. This allows a self-join to say "how close one part is to another" in a single table.

SELECT 
  a.hgt AS a_hgt,
  b.hgt AS b_hgt,
  ST_Distance(a.the_geom, b.the_geom) AS distance_between_a_and_b
FROM 
  public."TestArea" AS a, public."TestArea" AS b
WHERE
  a.gid < b.gid AND a.area > 100 AND b.area > 100

You might want to add another WHERE clause to limit the number of rows, e.g., add AND ST_Distance(a.the_geom, b.the_geom) < 1000.0 so that all distances are less than a kilometer (if you have projected UTM).

Best Answer

Related Solutions

PostGIS – How to Get the Nearest Point on a Linestring to a Given Point

PostGIS – How to Perform Nearest Neighbor Calculation in PostGIS?

Related Question