R – Handling ‘sf::st_cast(“LINESTRING”) – Keeping First Linestring Only’ Warning

linestringrsf

I have a dataset with geometry column in which there are linestrings and multilinestrings. While keeping the linestrings I want to convert the multilinestrings to linestrings (which should potentially increase the number of rows of the sf dataframe). Unfortunately when I use sf::st_cast("LINESTRING") there is a warning telling me that it is getting rid of all except the first linestring when transforming. Is there a way to keep all linestrings from the multilinestring when using sf::st_cast. Reproducible example with warning below:

library(sf)
library(dplyr)

# sample dataframe - creating linestrings
df1 <- data.frame(lon = 1:10, lat = 1:10, var = c(1,1,1,2,2,2,3,3,4,4)) %>%
  st_as_sf(coords = c("lon", "lat"), dim = "XY") %>% group_by(var) %>%
  summarise(geometry = st_union(geometry), do_union = F) %>% 
st_cast("LINESTRING")

# creating a multilinestring
df2 <- df1[1:2,] %>% mutate(var = c(1,1)) %>% group_by(var) %>% 
  summarise(geometry = st_union(geometry), do_union = F) %>% 
  st_cast("MULTILINESTRING")

# combining the two
df <- rbind(df1, df2)

# trying to convert only the multilinestring to two linestrings not changing 
the already existing linestrings
df <- df %>% st_cast("LINESTRING")

# Warning message:
# In st_cast.MULTILINESTRING(X[[i]], ...) : keeping first linestring only

I can do it manually first converting everything to multilinestring, and after everything to linestring like in the following:

df <- df %>% st_cast("MULTILINESTRING") %>% st_cast("LINESTRING")

but is there maybe a better way of doing this?

Best Answer

One alternative, to apply an st_cast to "LINESTRING" over each row:

> do.call(rbind,lapply(1:nrow(df),function(i){st_cast(df[i,],"LINESTRING")}))
Simple feature collection with 6 features and 1 field
geometry type:  LINESTRING
dimension:      XY
bbox:           xmin: 1 ymin: 1 xmax: 10 ymax: 10
epsg (SRID):    NA
proj4string:    NA
  var                   geometry
1   1 LINESTRING (1 1, 2 2, 3 3)
2   2 LINESTRING (4 4, 5 5, 6 6)
3   3      LINESTRING (7 7, 8 8)
4   4    LINESTRING (9 9, 10 10)
5   1 LINESTRING (1 1, 2 2, 3 3)
6   1 LINESTRING (4 4, 5 5, 6 6)

cant really be much better than:

> st_cast(st_cast(df, "MULTILINESTRING"),"LINESTRING")
Simple feature collection with 6 features and 1 field
geometry type:  LINESTRING
dimension:      XY
bbox:           xmin: 1 ymin: 1 xmax: 10 ymax: 10
epsg (SRID):    NA
proj4string:    NA
  var                   geometry
1   1 LINESTRING (1 1, 2 2, 3 3)
2   2 LINESTRING (4 4, 5 5, 6 6)
3   3      LINESTRING (7 7, 8 8)
4   4    LINESTRING (9 9, 10 10)
5   1 LINESTRING (1 1, 2 2, 3 3)
6   1 LINESTRING (4 4, 5 5, 6 6)

I assume that's what you mean in your last line, you don't give code. This is probably pretty close to optimal. library(microbenchmark) reckons the two-casts is about 10 times faster on your little example:

Unit: milliseconds
  expr      min       lq      mean    median       uq       max neval
 apply 9.087103 9.411445 10.056437 10.061594 10.50437 12.969576   100
 casts 1.737474 1.819215  2.000212  1.866471  1.92306  4.406047   100

Related Solutions

[GIS] R: sf package points to multiple lines with st_cast

I think that the sf package need to know first how you want to create the lines from your points. I mean which pair of POINT make every LINESTRING. In my example that was defined inside the lapply function. Follow the reproducible and commented code below, hope that helps:

# Load library
library(sf)

# Create points data
multipoints <- st_multipoint(matrix(c(10, 10, 15, 20, 30, 30), nrow = 3, byrow = TRUE), dim = "XY")
points <- st_cast(st_geometry(multipoints), "POINT") 

# Number of total linestrings to be created
n <- length(points) - 1

# Build linestrings
linestrings <- lapply(X = 1:n, FUN = function(x) {

  pair <- st_combine(c(points[x], points[x + 1]))
  line <- st_cast(pair, "LINESTRING")
  return(line)

})

# One MULTILINESTRING object with all the LINESTRINGS
multilinetring <- st_multilinestring(do.call("rbind", linestrings))

# Plot
plot(multipoints, pch = 19, cex = 2)
plot(multilinetring[[1]], col = "orange", lwd = 2, add = TRUE)
plot(multilinetring[[2]], col = "green", lwd = 2, add = TRUE)

Creating Lines Between Point Pairs in R – Spatial Analysis

Here is a tidyverse method of doing it, starting with your table from before converting to sf. The approach is to create a long-form table where each row is a start or end point, but include a lineid so that you can group_by on it and summarise to union the right points together, and then st_cast to LINESTRING.

library(tidyverse)
library(sf)
#> Linking to GEOS 3.6.1, GDAL 2.2.3, proj.4 4.9.3
table <- structure(list(NOMBRE = c("AL011900", "AL011900", "AL011900", "AL011900", "AL021900", "AL021900", "AL021900", "AL041905", "AL041905", "AL041905", "AL041905"), LAT = c(15, 15.2, 15.3, 15.4, 19, 19.5, 20, 36.3, 37.9, 39.6, 41), LONG = c(-42.1, -43.4, -44.7, -45.6, -59.3, -60, -60.6, -48.6, -47.9, -47.1, -46), INT = c(18.0054, 18.0054, 18.0054, 18.0054, 33.4386, 36.0108, 38.583, 46.2996, 43.7274, 41.1552, 41.1552)), row.names = c(NA, -11L), class = c("tbl_df", "tbl", "data.frame"), spec = structure(list(cols = list(NOMBRE = structure(list(), class = c("collector_character", "collector")), LAT = structure(list(), class = c("collector_double", "collector")), LONG = structure(list(), class = c("collector_double", "collector")), FECHA = structure(list(format = ""), class = c("collector_datetime", "collector")), INT = structure(list(), class = c("collector_double", "collector"))), default = structure(list(), class = c("collector_guess", "collector"))), class = "col_spec"))

table_sf <- table %>%
  group_by(NOMBRE) %>%
  mutate(
    lineid = row_number(), # create a lineid
    LONG_end = lead(LONG), # create the end point coords for each start point
    LAT_end = lead(LAT)
  ) %>% 
  unite(start, LONG, LAT) %>% # collect coords into one column for reshaping
  unite(end, LONG_end, LAT_end) %>%
  filter(end != "NA_NA") %>% # remove nas (last points in a NOMBRE group don't start lines)
  gather(start_end, coords, start, end) %>% # reshape to long
  separate(coords, c("LONG", "LAT"), sep = "_") %>% # convert our text coordinates back to individual numeric columns
  mutate_at(vars(LONG, LAT), as.numeric) %>%
  st_as_sf(coords = c("LONG", "LAT")) %>% # create points
  group_by(NOMBRE, INT, lineid) %>%
  summarise() %>% # union points into lines using our created lineid
  st_cast("LINESTRING")

plot(table_sf[, 1:2])

You can see in the plot that each line between two points has its own INT as requested.

Example

Best Answer

Related Solutions

[GIS] R: sf package points to multiple lines with st_cast

Creating Lines Between Point Pairs in R – Spatial Analysis

Related Question