Apache Spark SQL API

Spark module for structured data processing with DataFrame and Dataset APIs. Provides a SQL interface and supports various data sources including Parquet, ORC, JSON, CSV, JDBC, Hive, and Delta Lake. The Spark SQL API supports Scala, Python, Java, and R bindings.

API entry from apis.yml

apis.yml Raw ↑
aid: apache-spark:apache-spark-sql-api
name: Apache Spark SQL API
description: Spark module for structured data processing with DataFrame and Dataset APIs. Provides a SQL
  interface and supports various data sources including Parquet, ORC, JSON, CSV, JDBC, Hive, and Delta
  Lake. The Spark SQL API supports Scala, Python, Java, and R bindings.
humanURL: https://spark.apache.org/docs/latest/sql-programming-guide.html
tags:
- DataFrames
- SQL
- Structured Data
properties:
- type: Documentation
  url: https://spark.apache.org/docs/latest/sql-programming-guide.html
- type: SDK
  url: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/index.html
  title: Scala API Reference
- type: SDK
  url: https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/index.html
  title: Python API Reference
- type: SDK
  url: https://spark.apache.org/docs/latest/api/java/index.html?org/apache/spark/sql/package-summary.html
  title: Java API Reference