ArcGIS – Extracting Substring from Field using Python Parser of Field Calculator in ArcMap

arcgis-desktopfield-calculatorpython-parser

I have some parcel data where I need to extract a subdivision name from a long string. The format is always "Subdivision: ____ ______ _____" etc. BUT, there is no uniformity to what comes before or after "Subdivision: " or the actual name of the subdivision. In my example below it shows that "Block: " follows "Subdivision: " but that's not always the case.

I'd like to learn how to solve this issue using python, but VB can also be used. I was reading about re (edit: Regex) in python, but without some further explanation I'm a little lost. Here is a screen shot showing what the data looks like.

Any tips on where I should try to go with this?

Best Answer

For a more general approach, you could use a regex like r'\s*(\w+):\s*' in the re.split() function to build a dict of parcel "keys" and "values" (not sure of your parcel terminology).

This regex looks for:

\s* - zero or more whitespace characters
(\w+) - one or more alphanumeric (a-Z, 0-9, but not other characters), note that the () brackets indicate a capture group
: - followed by a colon
\s* - followed by zero or more whitespace characters

The re.split function returns a list of each section of text between the matches, but because because we've used brackets to specify a capture group, those captured groups are returned as well.

For example:

import re

parcel_text = 'Section: 3 Township: 8 Range: 88 Subdivision: Blah blah blah Block: G Lot: 9A'

print(re.split(r'\s*(\w+):\s*', parcel_text))
['', 'Section', '3', 'Township', '8', 'Range', '88', 'Subdivision', 'Blah blah blah', 'Block', 'G', 'Lot', '9A']

parcel_list = re.split(r'\s*(\w+):\s*', parcel_text)[1:]  # Strip the first element as it's an empty string for some reason
parcel_dict = dict(zip(parcel_list[0::2], parcel_list[1::2]))  
# [0::2] = makes a list of every 2nd element starting from 0, [1::] is the same except starting from 1
# zip "zips" those 2 lists together into a list of 2 element lists, i.e [['Section', '3'], ['Township', '8'], etc...]

print(parcel_dict)

{'Section': '3',
 'Township': '8',
 'Range': '88',
 'Subdivision': 'Blah blah blah',
 'Block': 'G',
 'Lot': '9A'}

You can turn that into a field calculator expression, something like:

Code block / Pre-logic Script Code

import re

def parse_parcel(parcel_text):
    parcel_list = re.split(r'\s*(\w+):\s*', parcel_text)[1:]
    parcel_dict = dict(zip(parcel_list[0::2], parcel_list[1::2]))
    return parcel_dict

Expression

parse_parcel (!your_parcel_field!).get('Subdivision')  #.get avoids a KeyError if there's no "Subdivision"

Related Solutions

[GIS] Auto-incrementing field in feature class using ArcGIS Desktop

you need to use this code within ArcMap and the field calculator. Add your feature class in the table of content, right click on it to open the table, right click on the name of the field and launch the field calculator.

Then you check for codeblock and copy the code you mentioned.

enter image description here

now for your code snippets, here is what I would do

rec=0 
def autoIncrement(a): 
 global rec 
 pStart = 1  
 pInterval = 1 
 if (rec == 0):  
  rec = pStart  
 else:  
  rec += pInterval  
 return "water" + str(a) + "-" +  format(rec, '04d')

you call this code using

autoIncrement(!name_of_field!)

where name_of_field contains the type of feature

EDIT : If you want to use the OBJECTID field directly, then a simple concatenation is enough

"WATER-" + str(!typrfield!) + "-" +  format(!OBJECTID!, '04d')

if your number has to depend on the type, it then makes sense to use the Python code block

rec1=0 
rec2=0
def autoIncrement(a): 
 global rec1
 global rec2 
 pStart = 1  
 pInterval = 1 
 if (a == 1):
  if (rec1 == 0):  
   rec1 = pStart  
  else:  
   rec1 += pInterval
  out = "water-1-" +  format(rec1, '04d')  
 else:
  if (rec2 == 0):  
   rec2 = pStart  
  else:  
   rec1 += pInterval
  out = "water-2-" +  format(rec2, '04d')  
 return out

ArcMap – Extract Numbers from String Field Using Python Parser

Try this:

def makestr(test):  # Add colon
     numlist = []   # Don't use name "list"
     for s in test:
         if s.isdigit():
             numlist.append(s)
     return ''.join(numlist)  # Return a value

Best Answer

Related Solutions

[GIS] Auto-incrementing field in feature class using ArcGIS Desktop

ArcMap – Extract Numbers from String Field Using Python Parser

Related Question