Skip to content

Python Delta Local Pipeline

This article provides a guide on how to execute a simple Delta Table copy locally without Spark using the RTDIP SDK. This pipeline was tested on an M2 Macbook Pro using VS Code in a Python (3.10) environment.

Prerequisites

This pipeline job requires the packages:

Components

Name Description
PythonDeltaSource Reads data from a Delta Table.
PythonDeltaDestination Writes to a Delta table.

Example

Below is an example of how to read from and write to Delta Tables locally without the need for Spark

from rtdip_sdk.pipelines.sources.python.delta import PythonDeltaSource
from rtdip_sdk.pipelines.destinations.python.delta import PythonDeltaDestination

source = PythonDeltaSource("{/path/to/source/table}").read_batch()

destination = PythonDeltaDestination(source, "{/path/to/destination/table}", mode="append").write_batch()