SpicepodSet
SpicepodSet CRD reference for deploying and managing Spicepod replicas.
A SpicepodSet (spice.ai/v1) deploys and manages one or more Spicepod replicas. The operator handles the full lifecycle: creating the workload, rolling updates, volume management, health monitoring, and crashloop protection.
The operator chooses the underlying workload type adaptively:
A single
Deploymentfor the simple case (novolume, nocluster,replicas <= 1).Per-replica
StatefulSets whenvolume,cluster, orreplicas > 1is configured, providing stable pod identities and ordered startup.
Minimal Example
apiVersion: spice.ai/v1
kind: SpicepodSet
metadata:
name: my-spicepod
namespace: default
spec:
replicas: 1
spicepod:
name: my-spicepod
kind: Spicepod
version: v1The spicepod field accepts the inline Spicepod definition (datasets, catalogs, models, views, etc.) and is preserved as-is by the operator.
Container Image
tag
latest-models
Image tag.
name
spiceai/spiceai
Image name.
registry
Docker Hub
Image registry.
pullPolicy
Always for :latest, else IfNotPresent
Image pull policy.
pullSecret
—
Name of a Kubernetes Secret holding credentials for a private registry.
Ports
httpPort
8090
HTTP API port.
flightPort
50051
Apache Arrow Flight port.
metricsPort
9090
Prometheus metrics port.
The Service exposes fixed ports 8080 (HTTP), 50051 (Flight), and 9090 (metrics) and maps targetPort to the configured Spiced ports above.
Resources
Environment Variables and Secrets
Persistent Storage
Enable persistent volumes with automatic resizing:
Volume shrinking is not supported. Decreasing storageRequests has no effect on existing PVCs.
When volume is configured, the operator deploys per-replica StatefulSets with PersistentVolumeClaims. Increasing storageRequests triggers automatic PVC resizing.
Service Account
enabled
false
Whether to use a ServiceAccount.
create
false
Whether to create a new ServiceAccount. If false, set name to reference an existing one.
name
SpicepodSet name
ServiceAccount name; required when create: false.
annotations
—
Annotations applied only to the ServiceAccount (ideal for IRSA eks.amazonaws.com/role-arn).
Update Strategies
RollingOrdered (default)
Updates pods one at a time in ordinal order, waiting for each to become Ready before proceeding.
RollingParallel
Updates pods in parallel. Set maxUnavailable to bound concurrent unavailable pods.
Parallel
Updates all pods simultaneously with no availability constraints.
BlueGreen
Brings up a parallel StatefulSet for the new generation, then atomically switches Service traffic once all new-generation pods are Ready. PVCs are ephemeral across generations.
minReadyForCutover
minReadyForCutoverSwitches the Service to the new generation as soon as minReadyForCutover new-generation pods are Ready, instead of waiting for all of them.
For
BlueGreenand instant-rollback: switches the version-pinned Service selector and promotesstatus.activeVersionearly.For
RollingOrdered/RollingParallel: causes the Service object to be created earlier in the rollout (selector is not version-pinned, so traffic is already routed to all matching Ready pods once the Service exists).Ignored for
Parallel. A value of0is treated as unset. Values larger thanreplicascollapse to "wait for all".
Instant Rollback
Retain the previous-generation pods after a rollout so traffic can be cut back to them instantly without rolling pods.
After a successful rollout, set the rollback annotation on the SpicepodSet to swap traffic back to the standby:
The operator switches the Service to the standby pods, clears the annotation, and tracks status.activeVersion / status.standbyVersion / status.standbyExpiresAt. Standby pods are torn down after retentionPeriodSeconds if no rollback occurs.
Combine instantRollback with BlueGreen for the canonical zero-downtime production rollout pattern.
Scaling and Pausing
Set replicas: 0 to pause the workload while retaining supporting resources (Service, ConfigMap, NetworkPolicy, ServiceAccount):
Network and DNS
Egress, ingress, DNS policy, and DNS config are nested under network:
Disable the Service entirely:
Annotations and Labels
Updating annotations or labels triggers a full pod rollout, even when no other configuration has changed:
Operator-reserved keys (e.g. spice.ai/app, spice.ai/spicepod, spice.ai/version, spice.ai/cluster, spice.ai/cluster-role, spice.ai/cluster-mtls, spice.ai/component, spice.ai/observed-generation, spice.ai/rollback, spice.ai/sidecar-injected, spice.ai/validation-level) are stripped from user-supplied annotations/labels.
Health Probes
Probes are only created when the Spiced HTTP server is enabled (i.e. for non-executor nodes). Cluster executors have no probes.
Pod Scheduling
Crashloop Protection
The operator monitors pods for repeated failures. When a SpicepodSet accumulates more dead pod observations than the threshold (default: 10), the operator pauses the workload (replicas → 0) and records a pauseReason of CrashLooping in status. Configure via the operator CLI flag --pause-crashlooping-pods-threshold (0 disables).
Status
Useful fields on status:
replicas
Formatted replica state for kubectl display, e.g. 2/5.
readyReplicas / totalReplicas
Numeric replica counts.
role
<none>, scheduler, or executor.
pauseReason
Set when paused (e.g. CrashLooping).
activeVersion
Spec SHA of the version currently receiving traffic (instant rollback).
standbyVersion
Spec SHA of the standby retained for instant rollback.
standbyExpiresAt
Epoch seconds when the standby pods will be reclaimed.
conditions
Standard Kubernetes Conditions describing reconciliation state.
Monitoring
Telemetry properties passed to the Spice runtime are configured at the operator level (Helm values), not on the SpicepodSet:
Last updated
Was this helpful?