Commvault Unveils Clumio Backtrack - Near Instant Dataset Recovery in S3
ChatGPT has gained a lot of attention recently, opening up a whole new world of possibilities across knowledge, research, and collaboration. The growing capabilities of AI and the proliferation of AI-based tools is giving rise to new modalities of working.
While it’s been fun to play around with large language models, I’ve been following the discourse on their implications for higher education dissertations, creative copyrights, and building software. What I find most interesting, however, is their impending impact on businesses in highly regulated industries. Some examples:
All these industries are subject to stringent data compliance requirements around retention, encryption, storage, and privacy, including but not restricted to:
The underlying data used in training machine learning models are typically stored in data lakes. The data lake also serves source data for customer portals, support dashboards, development projects, and such. Naturally, as AI and LLMs proliferate, it’s never been more important to ensure that your data lake is protected at the source level.
One cannot talk about data lakes without talking about Amazon S3. S3 is the underlying platform for all major data lakes operating on AWS – Delta Lake, LakeFormation, Iceberg, etc. And while the infrastructure behind S3 is supremely resilient, the resident data is your responsibility. This includes the resilience, uptime, availability, and integrity of all the data in your data lake. And that means discovering important data in your data lake and backing it up.
On this World Backup Day, take a few minutes to review your data protection strategy for your critical data, especially as it relates to data lakes.
Every forward-leaning company today has two things in common — they are leveraging data lakes, and they are subject to regulation. And data resilience is essential to not just a company’s innovation, but its survival. It’s the new metric by which business health will be measured. As much data growth as we’ve seen in the last few years, the next few years will bring orders of magnitude more. AI is just one of the technology trends intertwined with data at scale. I can’t wait to see what’s next.