GeoPandas Colorizing – Colorizing Polygons Based on Color Values in DataFrame Column

geopandasmatplotlib

I preparing some choropleth election result maps in Geopandas. The assigned color for each electoral district must correspond with the winning party (a color that I pre-determine), and the intensity of the color (opacity, saturation, etc) must be adjusted based some variable such as the margin of victory. Basically your traditional post-election map job.

My GeoDataFrame is structured as such (note there are three or more parties):

district_no   winning_party  color    margin  geom  
0             Conservative   #C62828  .56     POLYGON(599240.6488817427....)
1             Conservative   #C62828  .78     POLYGON(589240.6488427823....)
2             Liberal        #283593  .34     POLYGON(563405.6788424563....)
3             Conservative   #C62828  .08     POLYGON(563405.6488424563....)
4             Labour         #FF9800  .22     POLYGON(583405.6918424563....)
5             Labour         #FF9800  .37     POLYGON(633405.6128424563....)
6             Liberal        #283593  .48     POLYGON(533405.6278424563....)
etc...        etc...         etc...   etc...   etc...

I have not been able to figure out how to achieve this with the .plot() method. The argument 'color' simply applies the inputted color to all rows, and I am unsure how using cmap would help me achieve this, especially given that I must also adapt the color intensity based the margin column.

The only solution I can conceive is to loop through each row in the GeoDataFrame, and one by one plot out each polygon by assigning the color column value to the color argument, and then use the margin value to adjust the alpha level. Then somehow, I would have to merge/append all the plots back together to rebuild a final map (I'm not even sure this is possible).

Surely there is a better way than this. I have scoured the Internet, but surprisingly, I cannot find any solutions for this multi-party choropleth maps. Any ideas?.

Best Answer

The column= keyword can be used if you have values in a column which need to be mapped to a color (with a certain color map). But if you already have actual color names that you want to use directly, you can use the color keyword.

You can pass a list/array of colors (with the same number of values as the number of rows) to this color keyword. For example when you have 5 rows:

gdf.plot(color=['r', 'g', 'b', y', 'k'])

So in your case, you can pass the the column (but the actual values, not the column name):

gdf.plot(color=gdf['color'])

Small example to illustrate:

>>> gdf = geopandas.read_file(geopandas.datasets.get_path('nybb'))
# adding a column with color names (gdf has 5 rows)
>>> gdf['color'] = ['#C62828', '#C62828', '#283593', '#FF9800', '#283593']
>>> gdf.plot(color=gdf['color'])

Also specifying the alpha based on column values is a bit more complicated, as the alpha keyword does not yet accept an array-like. One possible work-around is to combine the color and alpha in a RGBA tuple (4 floats).
Continuing the same example from above, the following seems to work:

# add a margin column
gdf['margin'] = [.56, .78, .34, .08, .48]

from matplotlib.colors import to_rgba
gdf['color_rgba'] = gdf.apply(
    lambda row: to_rgba(row['color'], alpha=row['margin']), axis=1)
gdf.plot(color=gdf['color_rgba'])

Related Solutions

GeoPandas – Merging a GeoDataFrame and Pandas DataFrame by Column

You should specify a common key using on parameter. I removed merged_ prefix for legibility.

df  = spatial_df.merge(tab_df, on='mukey', how='left')
# df  = tab_df.merge(spatial_df, on='mukey', how='right')
gdf = gpd.GeoDataFrame(df)

Sample spatial_df:

    col1 col2  mukey geometry
0   A    1.76  1     ...
1   B    0.40  2     ...
2   C    0.97  3     ...
3   D    2.24  4     ...
4   X    1.86  5     ...
5   Y    0.97  6     ...

Sample tab_df:

    col3 col4  mukey
0   100  0.95  2
1   200  0.15  3
2   300  0.10  4

Result:

    col1  col2  mukey geometry  col3   col4
0   A     1.76  1     ...       NaN    NaN
1   B     0.40  2     ...       100.0  0.95
2   C     0.97  3     ...       200.0  0.15
3   D     2.24  4     ...       300.0  0.10
4   X     1.86  5     ...       NaN    NaN
5   Y     0.97  6     ...       NaN    NaN

Folium – Plot GeoJSON Polygon with Custom Fill Color

this is the solution that I came up with:

import matplotlib
import geopandas as gpd
import matplotlib
import matplotlib.pyplot as plt

# gdf: One's own geopandas geoDataFrame
# colname: name of the gdf to be plotted into the Folium Map

xmin, ymin, xmax, ymax = gdf.total_bounds

centroidx = np.mean([xmin, xmax])
centroidy = np.mean([ymin, ymax])

map1 = folium.Map(
    location=[centroidy, centroidx],
    tiles='cartodbpositron',
    zoom_start=6,
)

cmap = matplotlib.cm.get_cmap('viridis')

vmin = gdf[colname].min()
vmax = gdf[colname].max()


norm = matplotlib.colors.SymLogNorm(vmin=vmin, vmax=vmax, linthresh=0.1)

def fetchHexFromValue(value):
  NormedValue = norm(value)
  RGBAValue = cmap(NormedValue)
  HEXValue = matplotlib.colors.to_hex(RGBAValue)
  return HEXValue



for idx, r in gdf.iterrows():

    lat = r["geometry"].centroid.y
    lon = r["geometry"].centroid.x
    folium.Marker(location=[lat, lon],
                  popup='idx:{0} <br> {1}: {2}'.format(idx,
                                                       colname, 
                                                       r[colname])
    ).add_to(map1)

gdf.explore(colname, cmap="viridis", m=map1)

map1

Best Answer

Related Solutions

GeoPandas – Merging a GeoDataFrame and Pandas DataFrame by Column

Folium – Plot GeoJSON Polygon with Custom Fill Color

Related Question