ADLS Reader runtime considerations
ADLS Reader monitoring metrics
In addition to the standard metrics, the following monitoring metrics are published for ADLS Reader.
Metric | Description |
---|---|
| The name of the cloud object whose metadata was most recently fetched from the cloud. Frequency: every cloud object in a batch. |
| The name of the object whose metadata was most recently fetched from the cloud, along with its path in the ADLS container. Frequency: every cloud object in a batch. |
CLOUD_OBJECT_LAST_BATCH_COUNT | The number of cloud objects whose metadata was captured in the latest fetch cycle. Frequency: every batch fetched. Units: count (Long). |
EXTERNAL_IO_LATENCY | The time taken to capture the cloud metadata in the latest fetch cycle. Frequency: every batch fetched. Units: milliseconds (Long). |
CLOUD_OBJECT_STATS | Statistics about the cloud objects, which ADLS Reader captures under Cloud objects statistics. For example: { "Count of Objects metadata fetched": 1, "Downloaded count": 1, "Processed count": 0, "Missing count": 0, "Total objects size in MB": 0.001, "Total downloaded size in MB": 0.001, "Current Disk Utilization in MB": 0.001 } |
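
The metric values above can be combined into a few derived figures when building a dashboard or alert. The following Python sketch is illustrative only: it parses the sample CLOUD_OBJECT_STATS value shown in the table and derives a metadata fetch rate from CLOUD_OBJECT_LAST_BATCH_COUNT and EXTERNAL_IO_LATENCY. The function names `summarize_stats` and `fetch_rate`, and the interpretation of the derived values, are assumptions made for this example and are not part of ADLS Reader.

```python
import json

# Sample value copied from the CLOUD_OBJECT_STATS row in the table.
SAMPLE_STATS = """{
  "Count of Objects metadata fetched": 1,
  "Downloaded count": 1,
  "Processed count": 0,
  "Missing count": 0,
  "Total objects size in MB": 0.001,
  "Total downloaded size in MB": 0.001,
  "Current Disk Utilization in MB": 0.001
}"""


def summarize_stats(raw: str) -> dict:
    """Derive a few figures from a CLOUD_OBJECT_STATS value (hypothetical helper)."""
    stats = json.loads(raw)
    total_mb = stats["Total objects size in MB"]
    return {
        # Assumption: objects that were downloaded but not yet processed.
        "downloaded_not_processed": stats["Downloaded count"] - stats["Processed count"],
        "missing": stats["Missing count"],
        # Fraction of the total object size that has been downloaded so far.
        "downloaded_size_fraction": (
            stats["Total downloaded size in MB"] / total_mb if total_mb else None
        ),
    }


def fetch_rate(batch_count: int, latency_ms: int):
    """Objects per second for one metadata fetch cycle (hypothetical helper).

    batch_count is a CLOUD_OBJECT_LAST_BATCH_COUNT value and latency_ms is an
    EXTERNAL_IO_LATENCY value. Returns None when the latency is zero.
    """
    if latency_ms <= 0:
        return None
    return batch_count / (latency_ms / 1000.0)


if __name__ == "__main__":
    print(summarize_stats(SAMPLE_STATS))
    print(fetch_rate(50, 250))
```

A growing `downloaded_not_processed` value alongside a rising "Current Disk Utilization in MB" could indicate that downloads are outpacing processing, although that reading depends on how the counters are reset, which this page does not specify.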
ADLS Reader limitations
The following are limitations of ADLS Reader.
ADLS Reader cannot capture data from objects that are deleted from ADLS while data capture is in progress.
When using Parquet Parser, ADLS Gen2 Reader cannot download and process objects whose unique file name is longer than the maximum length supported by the operating system.
Among compression formats, ADLS Reader can read only objects compressed in gzip format.
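
The file name and compression limitations above can be checked before objects are placed in the container. The following Python sketch is one possible pre-flight check, not something ADLS Reader provides. It assumes that the local file name is at least as long as the object's base name (the unique name the reader generates may be longer, so treat a passing result as a lower bound), and it identifies gzip data by the two-byte magic number `1f 8b` defined by the gzip file format. The helper names are hypothetical.

```python
import os
import posixpath

# The first two bytes of every gzip member, per the gzip file format.
GZIP_MAGIC = b"\x1f\x8b"


def looks_like_gzip(first_bytes: bytes) -> bool:
    """Return True if the leading bytes of an object carry the gzip magic number."""
    return first_bytes[:2] == GZIP_MAGIC


def max_file_name_bytes(directory: str = ".") -> int:
    """Return the longest file name, in bytes, allowed in the given directory.

    Uses os.pathconf where available (POSIX systems). Falls back to 255,
    a common limit, when the call is unavailable or reports no limit.
    """
    try:
        limit = os.pathconf(directory, "PC_NAME_MAX")
    except (AttributeError, ValueError, OSError):
        return 255
    return limit if limit > 0 else 255


def name_fits(object_path: str, limit: int) -> bool:
    """Check the base name of an object path against a byte-length limit.

    Assumption: object paths use "/" as the separator.
    """
    name = posixpath.basename(object_path)
    return len(name.encode("utf-8")) <= limit


if __name__ == "__main__":
    limit = max_file_name_bytes()
    print(name_fits("landing/2024/orders.parquet", limit))
    print(looks_like_gzip(b"\x1f\x8b\x08\x00"))
```

Checking the magic number rather than the `.gz` extension avoids misclassifying objects that were renamed after compression.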