Skip to content

File indexes? #6

@josiahjohnston

Description

@josiahjohnston

Fun project!
I've needed file-level listings with indexes for a long time for delta tables. Typical usage is getting visibility into delta default settings, compaction outcomes, and understanding zorder or liquid clustering. I did work arounds for a while, but last week, I finally sorted out some alpha solutions for parsing delta logs robustly. delta-rs looked promising for high level library calls, but it is quite behind databricks default tables (deletion vectors & checkpoint v2 were the last incompatibilities).

Is it possible to use this library to get the file-level indexes? I expect this may be complicated by deletion vectors, but that is not an issue for my main use cases of append-only IoT data.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions