2X price-performance and 10X throughput: Isima’s ecommerce experiment with Z3 VMs
Aisha Wang
Product Manager, Z3
Storage-dense workloads like horizontal, scale-out databases and log analytics need a high density of SSD and consistent performance. And to preserve data in the event of an outage, they also need predictable maintenance. At Google Cloud Next ‘24, we announced the general availability of the Z3 virtual machine series, our first storage-optimized VM family. Z3 features an industry-leading 6M 100% random-read and 6M write IOPs, and offers incredibly dense storage configurations of up to 409 SSD (GiB):vCPU on next-generation local SSD hardware.
Isima, a Silicon Valley startup and ecommerce analytics cloud, was among the first to try it. Their platform bi(OS) provides serverless infrastructure for real-time retail and ecommerce data and AI applications. It features a scale-out SQL-friendly database and zero-code capabilities to onboard, process, and operate data for real-time data integration, feature stores, data science, cataloging, observability, DataOps, and business intelligence.
In this blog, we summarize Isima’s tests and findings when comparing Z3 to general-purpose N2 VMs. Spoiler alert: We're talking 2X better price-performance, 10X more throughput, and a whole lot more.
The test
As a Google Cloud partner, Isima got early access to Z3, and tested it on a series of demanding, real-world ecommerce workloads, including microservice calls, ad hoc analytics, visualization queries and more, all firing simultaneously:
Fig.1 - bi(OS) on Z3 ecommerce use cases
To push Z3 to its limits and mirror real-world, high-availability deployments, Isima deployed bi(OS) on three z3-highmem-88 instances, each split into five Docker containers across multiple zones. Each Docker container was allocated 16 vCPUs, 128GB RAM, and two 3TB SSDs. This setup allowed Isima to better compare Z3 with previous tests they performed using n2-highmem-16 instances.
Fig.2 - bi(OS) on Z3 Test Setup
To emulate excessive load and a variety of worst case scenarios, Isima tested:
-
Demand spikes: They hit the system with a brief peak load that saturated system resources to ~70%, then maintained a relentless 75% of that peak for a full 72 hours — all while demanding (and achieving) 99.999% reliability.
-
Select queries: They tested select queries to avoid unintentional caching effects by the OS or bi(OS). That is, when Isima read data (via select queries), they intentionally went back in time (e.g., querying data inserted 30 minutes ago) to ensure they were reading data from the Local SSD and not from RAM — something crucial for Z3. This way, they were confident that performance testing results could survive real-world workloads.
-
Multiple deployment scenarios: Both single-tenant and multi-tenant configurations were tested, validating Z3's ability to handle diverse real-world deployments.
-
Simulated maintenance events: Isima even factored in planned maintenance using Docker restarts, showing Z3's ability to handle disruptions without breaking a sweat.
The verdict
-
Throughput: bi(OS) on Z3 handled ~2X+ more throughput than the tests performed last year using n2-highmem-16, with 2X better price-performance.
- NVMe drive latencies: Write latencies were ~6X better, while read latencies were unchanged.
- Drive variance: Over the course of 72 hours, each drive on each z3-highmem-88 VM reported +/- 0.02ms variance in both read and write latencies.
We are thrilled to see these results for the new Z3 instances, and are confident that it will unlock the power of many more workloads. This isn't just about benchmarks, it's about real-world workloads. You can learn more about Z3 in the our detailed documentation. And if you’re looking for an agile, performant ecommerce platform, bi(OS) is available on the Google Marketplace with a free 30-day trial.