Best Archival Strategy for delta parquet in ADLS ?

Sagar, Sawan 0 Reputation points
2025-04-16T07:55:41.4+00:00

We have a situation where we will be loading SAP data into ADLS layer using SAP Datasphere in delta parquet format. Now, we want to implement a robust archival framework for this data. Can you please guide/help me with the correct/MS recommended approach to deal with this. I would really appreciate if you can help me with some documentations on this.

Thanks,

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,559 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Vinod Kumar Reddy Chilupuri 4,180 Reputation points Microsoft External Staff Moderator
    2025-04-16T10:36:44.73+00:00

    Hi Sagar, Sawan,

    Azure Blob Storage lifecycle management policies enable the automatic movement or deletion of data based on its age or usage patterns, making them particularly effective for efficiently archiving Delta Parquet files in Azure Data Lake Storage (ADLS).

    Automatic Tier Transitioning:

    • Older files can be automatically moved from Hot to Cool or Archive storage tiers based on their last modified or last accessed timestamps. This feature helps reduce storage costs while ensuring that data remains accessible when needed.

    Data Expiration & Deletion:

    • You can establish rules to delete obsolete files after a specified period. This capability is beneficial for compliance purposes and aids in optimizing storage usage.

    Metadata-Based Filtering:

    • Policies can be applied to specific files using prefixes or blob index tags, allowing for more granular management of different datasets. This feature facilitates the organization and handling of data according to specific criteria.

    By leveraging these lifecycle management features, organizations can implement a robust and cost-effective archival strategy for their Delta Parquet files in ADLS.

    Configure a lifecycle management policy

    Optimize costs by automatically managing the data lifecycle

    Hope the above suggestion helps! Please let us know do you have any further queries.

    Please do consider to “up-vote” wherever the information provided helps you, this can be beneficial to other community members. 

    1 person found this answer helpful.
    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.