ADLS Gen2 Connector
ADLSGen2Connector
Bases: Connector
Connector for Azure Data Lake Storage Gen2 using abfss:// URIs.
- Supports reading and writing Delta (default), Parquet, and CSV.
- Path must match abfss://<container>@<account>.dfs.core.windows.net/<path>.
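The path pattern above can be assembled from its three parts. The following is a minimal sketch, not part of the connector: `build_abfss_uri` is a hypothetical helper name, and stripping a leading slash from the path is an assumption made here for convenience.

```python
def build_abfss_uri(container: str, account: str, path: str) -> str:
    """Compose abfss://<container>@<account>.dfs.core.windows.net/<path>.

    Hypothetical helper for illustration; the connector itself only
    documents the pattern, not a builder function.
    """
    # Drop a leading slash so the result never contains "net//".
    relative = path.lstrip("/")
    return f"abfss://{container}@{account}.dfs.core.windows.net/{relative}"


uri = build_abfss_uri("raw", "myaccount", "/events/2024")
print(uri)
```

The container, account, and path values are placeholders; substitute the names from your own storage account.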
read
read(spark: SparkSession, source: Any, *, fmt: Optional[str] = None, schema: Optional[Any] = None, source_config: Optional[Mapping[str, Any]] = None, options: Optional[Mapping[str, Any]] = None, **kwargs: Any) -> DataFrame
Read a dataset from ADLS Gen2.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
spark | SparkSession | Active Spark session. | required |
source | Any | | required |
fmt | Optional[str] | Optional format override: Delta (default), Parquet, or CSV. | None |
schema | Optional[Any] | Optional schema for structured reads. | None |
source_config | Optional[Mapping[str, Any]] | Unused for ADLS, accepted for interface compatibility. | None |
options | Optional[Mapping[str, Any]] | Additional Spark read options. | None |
Source code in src/spark_fuse/io/azure_adls.py
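A call to read might look like the sketch below. It assumes an already constructed connector instance is passed in, because the constructor arguments are not documented here, and it assumes that options are forwarded unchanged to the Spark reader. `read_delta_snapshot` is a hypothetical wrapper name; `versionAsOf` is the Delta Lake time-travel read option.

```python
from typing import Any


def read_delta_snapshot(connector: Any, spark: Any, uri: str, version: int) -> Any:
    """Read one Delta table version through the connector's read().

    Hypothetical wrapper: `connector` is expected to be an
    ADLSGen2Connector instance and `spark` an active SparkSession.
    """
    # fmt is left unset, so the connector's documented default (Delta) applies.
    # Assumption: `options` reaches the underlying Spark reader as given.
    return connector.read(spark, uri, options={"versionAsOf": str(version)})
```

With a real session this would be called as `read_delta_snapshot(connector, spark, "abfss://<container>@<account>.dfs.core.windows.net/<path>", 3)`.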
validate_path
validate_path(path: str) -> bool
Return True if the path is a valid ADLS Gen2 abfss:// URI.
Source code in src/spark_fuse/io/azure_adls.py
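The actual source of validate_path is not reproduced on this page. As a rough illustration of the kind of check it describes, the sketch below matches a string against the documented abfss:// pattern with a regular expression. The function name `looks_like_abfss`, the exact regex, and the requirement that the path segment be non-empty are all assumptions, not the library's implementation.

```python
import re

# Assumed pattern: abfss://<container>@<account>.dfs.core.windows.net/<path>
_ABFSS_RE = re.compile(
    r"^abfss://[^@/\s]+@[^./@\s]+\.dfs\.core\.windows\.net/.+$"
)


def looks_like_abfss(path: str) -> bool:
    """Return True if `path` matches the assumed abfss:// URI pattern."""
    return _ABFSS_RE.match(path) is not None


print(looks_like_abfss("abfss://raw@myaccount.dfs.core.windows.net/events"))
print(looks_like_abfss("https://myaccount.blob.core.windows.net/raw/events"))
```

Checking paths up front this way lets a pipeline fail before any Spark job is submitted.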
write
write(df: DataFrame, path: str, *, fmt: Optional[str] = None, mode: str = 'error', **options: Any) -> None
Write a dataset to ADLS Gen2.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
df | DataFrame | DataFrame to write. | required |
path | str | | required |
fmt | Optional[str] | Optional format override: Delta (default), Parquet, or CSV. | None |
mode | str | Save mode. | 'error' |
**options | Any | Additional Spark write options. | {} |
Source code in src/spark_fuse/io/azure_adls.py
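A write call could be wrapped as in the sketch below. It assumes a constructed connector instance is supplied by the caller, that mode is passed through to Spark's writer (where "append" is a standard save mode), and that keyword options such as Delta's `mergeSchema` are forwarded as write options. `append_to_lake` is a hypothetical name.

```python
from typing import Any


def append_to_lake(connector: Any, df: Any, path: str) -> None:
    """Append `df` to the dataset at `path` via the connector's write().

    Hypothetical wrapper: `connector` is expected to be an
    ADLSGen2Connector instance and `df` a Spark DataFrame.
    """
    # fmt is left unset, so the documented default (Delta) applies.
    # Assumption: extra keyword arguments become Spark write options.
    connector.write(df, path, mode="append", mergeSchema="true")
```

The documented default mode is 'error'. In Spark that save mode raises if the target already exists, so an explicit mode is usually needed when writing to an existing dataset.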