Spark

Apache Spark Connector Documentation

import Tabs from '@theme/Tabs'; import TabItem from '@theme/TabItem';

Apache Spark as a connector for federated SQL query against a Spark Cluster using Spark Connect

datasets:
  - from: spark:spiceai.datasets.my_awesome_table
    name: my_table
    params:
      spark_remote: sc://my-spark-endpoint

Configuration

Limitations

  • Correlated scalar subqueries are only supported in filters, aggregations, projections, and UPDATE/MERGE/DELETE commands. Spark Docs

  • The Spark connector does not yet support streaming query results from Spark.

Last updated

Was this helpful?