PySpark¶
Installation¶
Connection Parameters¶
Connect using the Database.from_pyspark()
method with parameters:
PARAMETER | DESCRIPTION |
---|---|
session
|
A SparkSession instance.
TYPE:
|
mode
|
Can be either "batch" or "streaming". If "batch", every source, sink, and query executed within this connection will be interpreted as a batch workload. If "streaming", every source, sink, and query executed within this connection will be interpreted as a streaming workload. Default is 'batch'.
TYPE:
|
match_schema
|
Regex pattern to filter schemas. Mutually exclusive with match_tables.
TYPE:
|
match_tables
|
Regex pattern to filter tables. Mutually exclusive with match_schema.
TYPE:
|
**kwargs
|
Additional keyword arguments used to configure the SparkSession.
TYPE:
|