Your AI can now contextualize physical world data using Wherobots Spatial AI Coding Tools Learn More

WherobotsDB

The spatial compute engine built for physical-world data

WherobotsDB is the distributed spatial engine with full geometric correctness at any scale. 300+ spatial functions for vector and raster data, with native Spark SQL for tabular operations. Built by the original creators of Apache Sedona.

300+ spatial functions
Vector, raster, topology, and spatial indexing.
3x faster, 46% lower cost
Spatial joins that run in minutes, not days.
From city block to the entire globe
Horizontally distributed on Apache Spark. Scales with your data.

The benchmark leader in spatial compute

WherobotsDB delivers the fastest and most capable engine for cost-efficient performance across general-purpose spatial SQL workloads. The following table was run across multiple compute engines using the Apache Sedona SpatialBench at Scale Factor 1000.

In a separate test, WherobotsDB completed a Texas-statewide raster spatial join in 3 minutes 28 seconds. BigQuery timed out.

Scale Factor 1000 Query Capability Matrix
SpatialBench Query # WherobotsDB Sedona on Serverless Spark Proprietary SQL Serverless Proprietary Spark Clusters with Spatial SQL Sedona on Managed Spark Clusters
1: Spatial filter, aggregation, sorting
2: Spatial filter, aggregation, sorting
3: Spatial filter, aggregation, sorting
4: Lightweight spatial join and aggregation
5: Spatial aggregation
6: Heavy spatial join
7: Geometry construction and access
8: Heavy distance join
9: Heavy polygon self spatial join
10: Heavy spatial left join
11: Multi-way spatial join
12: KNN join

Key capabilities

Unlimited Scale

Scale workloads seamlessly from small to a planetary scale.

Apache Spark & Sedona Compatible

Migrate existing Apache Spark and Sedona workloads without changing a single line of code. Fully compatible out-of-the-box.

Nothing to Manage But Your Code

Serverless by default, or deploy securely in your own VPC.

300+ Spatial Functions for Vector and Raster Data

High performance spatial joins with 300+ ST_ and RS_ functions built to accelerate productivity.

Low Cost + High Performance Spatial Operations

WherobotsDB delivers fast, cost-efficient performance across spatial SQL and python workloads*. WherobotsDB benchmarks at 3x faster with up to 45% better price-performance than comparable engines, and at lower cost than PostGIS at scale.

*This represents actual cost benchmarking results for SpatialBench queries 7-12 with a scale factor of 1000 across three of the most comparable deployment options.

WherobotsDB
Lowest
Proprietary SQL Serverless Engine
x2.92 higher
Apache Sedona on Serverless Spark Engine
x2.03 higher
Proprietary Cloud Managed Spark
x2.52 higher
Relative cost across operations that completed

Workload Flexibility

From Small to Planetary Scale Workloads

Perform city to global scale spatial data operations in a single solution.

City
Region
Country
Global
Productivity

Accelerate Innovation Economically

SQL and Python
300+ ST_ and RS_ Functions
Raster
Vector
Tabular
SPATIAL JOIN
Data Products

“Spark is incredibly powerful — but it’s also a huge learning curve, Wherobots shortened the painful part of Spark and gave us production-grade scalability without having to babysit clusters.”

Ben Hudson
Co-founder and Head of Applied Science, aarden.ai

Deployment & Compatibility

Zero Migration. Total Flexibility.

Keep your existing Apache Spark or Sedona code, run it serverless or in your VPC

Serverless by Default

  • Zero infrastructure to manage
  • Auto-scaling, cost-efficient
  • Works out-of-the-box

Deploy in Your Cloud (BYOC)

  • Full control in your VPC
  • Enterprise-ready security
  • Works behind your firewall

Go deeper

Get Started with WherobotsDB

Explore the platform, build powerful solutions, or automate your geospatial workflows.

Discover

Seamless access to the right datasets, AI models and solution accelerators for your use case.

Build

Start using WherobotsDB with familiar Apache Spark/Sedona syntax.

Automate

Orchestrate spatial data flows at scale with Apache AirFlow or our REST API.

FAQ

How does WherobotsDB compare to other geospatial analytics engines?

WherobotsDB is a distributed geospatial analytics engine with full geometric correctness at planetary scale. On the Apache Sedona SpatialBench at Scale Factor 1000, WherobotsDB completed all 12 spatial queries, including heavy spatial joins that the next best Spark-based engine with Spatial SQL could not finish within a 10-hour timeout. WherobotsDB benchmarks at 46% lower cost than that engine and delivers up to 3x faster performance on analytical queries. For raster-vector workloads, WherobotsDB completed a Texas-statewide raster spatial join in about 3.5 minutes. BigQuery timed out on the same test.

Which spatial functions are supported in Wherobots?

WherobotsDB includes 300+ spatial functions for vector and raster data, covering geometry processing, spatial predicates, coordinate reference systems, and large-scale spatial joins. Advanced functions include kNN joins for nearest neighbor queries, zonal statistics on raster layers, travel isochrones for accessibility analysis, GPS map matching for mobility data, and PMTiles generation for map visualization. Native Spark SQL handles tabular operations alongside the spatial functions.

What raster data processing functions does WherobotsDB support?

WherobotsDB includes 100+ raster RS_ functions for processing satellite imagery, elevation models, and sensor data at scale. These cover in-database and out-of-database raster loading, affine transformations (translation, scaling, rotation, shearing), union aggregation to combine multiple rasters into multi-band outputs, zonal statistics, map algebra, raster tiling, and output writers for GeoTIFF, ArcGrid, and PNG formats. WherobotsDB’s out-of-database raster architecture processes terabyte-scale imagery without moving data into the database, enabling planetary-scale inference and zonal statistics.

Why is WherobotsDB faster than general-purpose engines for spatial data analysis?

WherobotsDB is built from the ground up for spatial data workloads. The engine uses a Rust-native, Arrow-columnar execution layer with zero-copy geometry handling via GeoArrow, optimized at the compute, query planner, and function levels. Production teams report processing times dropping from 39 days to under one day (GeoPostcodes) and from 7 days to 30 minutes (Aarden.ai). WherobotsDB separates compute from storage, scales horizontally on Apache Spark, and maintains 100% code compatibility with open-source Apache Sedona.
Sphere

Get started

WherobotsDB is part of Wherobots, the AI Context Engine for the Physical World.