Data Retention
3 TopicsData archiving of delta table in Azure Databricks
Hi all, Currently I am researching on data archiving for delta table data on Azure platform as there is data retention policy within the company. I have studied the documentation from Databricks official (https://docs.databricks.com/en/optimizations/archive-delta.html) which is about archival support in Databricks. It said "If you enable this setting without having lifecycle policies set for your cloud object storage, Databricks still ignores files based on this specified threshold, but no data is archived." Therefore, I am thinking how to configure the lifecycle policy in azure storage account. I have read the documentation on Microsoft official (https://learn.microsoft.com/en-us/azure/storage/blobs/lifecycle-management-overview) Let say the delta table data are stored in "test-container/sales" and there are lots of "part-xxxx.snappy.parquet" data file stored in that folder. Should I simply specify "tierToArchive", "daysAfterCreationGreaterThan: 1825", "prefixMatch: ["test-container/sales"]? However, I am worried that will this archive mechanism impact on normal delta table operation? Besides, I am worried that what if the parquet data file moved to archive tier contains both data created before 5 years and after 5 years, it is possible? Will it by chance move data earlier to archive tier before 5 years? Highly appreciate if someone could help me out with the questions above. Thanks in advance.146Views0likes1CommentCounters Frequency and costs
Hi all I need to understand a way of calculating each VM Performance Counter frequency and associated costs. I get that Log Analytics charges you for Data Ingestion, correct? So if I have one perf counter (e.g. Memory %) configured to probe each 10 seconds, I would save costs by modifying it to, lets say, 30 seconds. However I don't understand how to calculate how much space is one perf counter probe using. It's a silly question but I can't find any straightforward way of calculating how much I can save/expend by tweaking the frequency of the performance counters. Thank you in advance.979Views0likes1CommentMonitoring OnTap devices
Hello, did someone has any expirience with using OMS (now Azure Monitor) to monitor large set of NAS devices (all NetAPP). I did not find much about monitoring of such "appliances" so Im trying to understand if there is some already prepared solution, or the monitoring of such devices is possible just by using Custom Log feature and load into Log Analytics txt logs generated by NAS device.... thanks for any information in advance JK1.2KViews0likes1Comment