Apache SystemDS Python API

The SystemDS Python API (systemds) provides a Python interface for building end-to-end ML pipelines. It includes Matrix and Frame types for distributed data manipulation, built-in algorithms for preprocessing, feature engineering, classification, regression, and clustering, and federated learning capabilities. The Python API communicates with a SystemDS runtime backend via gRPC for distributed execution on Apache Spark.

API entry from apis.yml

apis.yml Raw ↑
aid: apache-systemds:apache-systemds-python-api
name: Apache SystemDS Python API
description: The SystemDS Python API (systemds) provides a Python interface for building end-to-end ML
  pipelines. It includes Matrix and Frame types for distributed data manipulation, built-in algorithms
  for preprocessing, feature engineering, classification, regression, and clustering, and federated learning
  capabilities. The Python API communicates with a SystemDS runtime backend via gRPC for distributed execution
  on Apache Spark.
humanURL: https://apache.github.io/systemds/api/python/
tags:
- Python
- Machine Learning
- Data Science
- Distributed Computing
properties:
- type: Documentation
  url: https://apache.github.io/systemds/api/python/
- type: SDK
  url: https://pypi.org/project/systemds/
  title: Python Package (PyPI)
- type: SDK
  url: https://search.maven.org/search?q=org.apache.systemds
  title: Java/Scala Maven Package