[GIS] Programmatically converting arbitrary XML data to shapefile

convertgeotoolsjavashapefilexml

Based on the answers below I decided to do some programming. I'll be using GEOtools for this, a java lib: http://www.geotools.org/

I have the following XML:

<shapes xmlns="http://www.meteogroup-maritime.com/spos/GISLayer" 
    name="Load line zones" transparency="50" onland="false">

 <shape id="0" description="SUMMER ZONE" color="#FFFFC90E">
   <polygon>
    <location lat="35" lon="-180"/>
       ....
    <location lat="-33" lon="-170"/>
    <location lat="-47" lon="-180"/>
   </polygon>

   <label lat="29" lon="-45" text="SUMMER ZONE" />
   <label lat="-30" lon="-153" text="SUMMER ZONE" />
 </shape>

  ...

</shapes>

I want to turn this data into a shapefile, preferably by using existing software, although I don't mind scripting.

How can I do this?

Best Answer

If you are comfortable with Python, you could use ElementTree to parse the XML and pyshp to create the shapefile.

Here is something you can start with:

from xml.etree import ElementTree
import shapefile
import os

xml_file = 'input.xml'
shape_file = 'output.shp'
projection = 'GEOGCS["GCS_WGS_1984",DATUM["D_WGS_1984",SPHEROID["WGS_1984",6378137.0,298.257223563]],PRIMEM["Greenwich",0.0],UNIT["Degree",0.0174532925199433]]'

tree = ElementTree.parse(xml_file)
w = shapefile.Writer(shapefile.POLYGON)

# create fields
w.field('ID', 'N', 6)
w.field('DESCRIP')

root = tree.getroot()
shapes = root.getchildren()

for shape in shapes:
    # assumes single-part, single-ring polygons
    part = []
    locations = shape[0].getchildren()
    for location in locations:
        # specify coordinates in X,Y order (longitude, latitude)
        part.append([float(location.get('lon')), float(location.get('lat'))])
    w.poly(parts=[part])

    # copy attributes
    w.record(int(shape.get('id')), shape.get('description'))
w.save(shape_file)

# create the PRJ file
with open(os.path.splitext(shape_file)[0] + os.extsep + 'prj', 'w') as prj:
    prj.write(projection)

Related Solutions

[GIS] Converting Shapefile data to GeoJSON

There are java bindings for GDAL/ogr - see http://gdal.org/java/ . No idea if they work on Android, though. Apparently ( https://www.google.de/search?q=gdal+on+android ) building gdal on Android is not really easy.

http://sourceforge.net/projects/javashapefilere/ seems to be another option.

Also https://stackoverflow.com/questions/2044876/does-anyone-know-of-a-library-in-java-that-can-parse-esri-shapefiles .

[GIS] Shapefile to Network xml

As I continued to explore yesterday, I discovered the networkx Python library, in particular its read_shp() and write_shp() functions.

import networkx
G = networkx.read_shp('linesfile.shp')
networkx.write_shp(G, './')

Got me a lines file with the original attributes and a points file with the nodes. I'm actually thrilled at the result, though there isn't a field for the node ID. Hopefully I can do this with just a spatial join.

Nodes and Links

The Solution

Well, I did it. Here's a reduced form of the python code I wrote. The full code with detailed comments is available in this gist

import networkx as nx
import lxml.etree as ET
G = nx.read_shp("fafnetworkLCC.shp")
G = nx.convert_node_labels_to_integers(G, first_label=0, 
        label_attribute = "coord")
# create element tree structure
network = ET.Element("network", 
    attrib={'name':"MATSim network exported from FAF shapefile."})
nodes = ET.SubElement(network, "nodes")

for i in range(len(G)):
    ET.SubElement(nodes, "node", 
            attrib={'id': str(G.nodes()[i]), 
                    'x':str(G.node[i]['coord'][0]), 
                    'y':str(G.node[i]['coord'][1]),
                    'type':"2"})

links = ET.SubElement(network, "links", 
        attrib={'capperiod': "01:00:00",
                'effectivecellsize': "7.5",
                'effectivelanewidth': "3.75"})
length = nx.get_edge_attributes(G, "MILES")
idvar  = nx.get_edge_attributes(G, "ID")

for i in range(len(G.edges())):
    startnode = G.edges()[i][0]
    endnode = G.edges()[i][1]
    ET.SubElement(links, "link", attrib={ 
        'id': str(idvar[(startnode, endnode)]),
        'from': str(startnode), 
        'to':   str(endnode), 
        'capacity': str(6000),
        'modes': "car",
        'oneway': str(1),
        'type': str(10),
        'length': str(length[(startnode, endnode)] * 1609.34)}) # convert to meters

tree = ET.ElementTree(network)

with open('network.xml', 'w') as f:
    f.write("""<?xml version="1.0" encoding="UTF-8" ?>
<!DOCTYPE network SYSTEM "http://www.matsim.org/files/dtd/network_v1.dtd">
""")
    tree.write(f, pretty_print = True)

Best Answer

Related Solutions

[GIS] Converting Shapefile data to GeoJSON

[GIS] Shapefile to Network xml

The Solution

Related Question