[GIS] How to load geographic data into Hadoop (Hive) and then to be able to call spatial functions on them

hadoophiveshapefile

I need to load some geographic data into Hadoop (Hive), and then, to be able to call some spatial functions on them using Hive.

My input data format is ESRI shapefiles (but could be another format, I could transform if necessary).

I know that ESRI has released a very basic framework for working with spatial data within Hadoop via Hive, GIS Tools for Hadoop by ESRI, but it seems that I need to have my data in ESRI JSON and I don't have ESRI tools to convert my data into that format.

Is there any alternative way to load geodata into Hadoop or is there a way to convert my ESRI shapefiles into ESRI JSON?

Best Answer

ST_Geometry for Hive in the GIS Tools for Hadoop, can load data from several different formats: Well-Known Text (WKT), Well-Known Binary (WKB), GeoJSON, Esri JSON, and coordinates as a list of numbers. Please see the documentation for the constructor functions.

As for shapefile, the java library to read shapefiles by Mansour Raad may be of use.

Related Question