Blog Post

Azure High Performance Computing (HPC) Blog
4 MIN READ

Using Azure CycleCloud with Weka

anhoward's avatar
anhoward
Icon for Microsoft rankMicrosoft
Feb 11, 2025

This blog post will introduce the integration of Azure CycleCloud with the Azure WEKA cloud native storage platform. This integration is particularly useful for those looking to leverage Azure for use cases such as high-performance computing (HPC), artificial intelligence (AI), machine learning (ML), big data analytics, and other data-driven applications. Azure CycleCloud and Azure WEKA combine the best of Azure’s scalable CPU/GPU node deployment services with WEKA’s leading infinate scalable high performance storage solution. The combo is the perfect solution to run complex computations, simulations, or any other resource intensive task!

What is Azure CycleCloud?  

Azure CycleCloud is an enterprise-friendly tool for orchestrating and managing HPC environments on Azure. With Azure CycleCloud, users can provision infrastructure for HPC systems, deploy familiar HPC schedulers, and automatically scale the infrastructure to run jobs efficiently at any scale.  

Figure 1: The Azure CycleCloud User Interface

CycleCloud is used for running workloads like scientific simulations, rendering tasks, Genomics and Bionomics, Financial Modeling, Artificial Intelligence, Machine Learning and other data-intensive operations that require large amounts of compute power. 

CycleCloud supports GPU computing which is useful for the workloads described above. One of the strengths of Azure CycleCloud is its ability to automatically scale resources up or down based on demand. If your workload requires more GPU power (such as for deep learning training), CycleCloud can provision additional GPU-enabled instances as needed. 

The question remains – If the GPU’s provisioned by CycleCloud are waiting for storage I/O operations, not only is the performance of the application severely impacted, the GPU is also underutilized meaning you are not fully exploiting the resources you are paying for! 

This brings us to Weka.io. But before we talk about the problems Weka & CycleCloud solve, let's talk about what Weka is. 

What is WEKA? 

The WEKA® Data Platform was purpose-built to seamlessly and sustainably deliver speed, simplicity, and scale that meets the needs of modern enterprises and research organizations without compromise. Its advanced, software-defined architecture supports next-generation workloads in virtually any location with cloud simplicity and on-premises performance. 

At the heart of the WEKA® Data Platform is a modern fully distributed parallel filesystem, WekaFS™ which can span across 1,000’s of NVMe SSD spread across multiple hosts and seamlessly extend itself over S3 compatible object storage.  

You can deploy WEKA software on a cluster of Microsoft Azure LSv3 VMs with local SSD to create a high-performance storage layer. WEKA can also take advantage of Azure Blob Storage to scale your namespace at the lowest cost. You can automate your WEKA deployment through HashiCorp Terraform templates for fast easy installation. Data stored with your WEKA environment is accessible to applications in your environment through multiple protocols, including NFS, SMB, POSIX, and S3-compliant applications. 

 

Figure 2: WekaFS combines NVMe flash with cloud object storage in a single global namespace

Key components to WEKA Data Platform in Azure include: 

  • The Architecture is deployed directly in the customer Tenant within a subscription ID of the customers choosing. 
  • WEKA software is deployed across 6 or more Azure LSv3 VMs. The LSv3 VMs are clustered to act as one single device. 
  • The WekaFS™ namespace is extended transparently onto Azure Hot Blob 
  • Scale Up and Scale down functions are driven by Logic App’s and Function Apps  
  • All client secrets are kept in Azure Vault  
  • Deployment is fully automated using Terraform WEKA Templates  

What is the integration?  

Using the Weka-CycleCloud template available here, any compute nodes deployed via CycleCloud will automatically install the WEKA agent as well as automatically mount to the WEKA filesystem. Users can deploy 10, 100, even 1000’s of compute nodes and they will all mount to the fastest storage in Azure (WEKA) 

Figure 3: The CycleCloud and WEKA integration

Full integration steps are available here: WEKA/CycleCloud for SLUM Integration  

Benefits 

The combined solution of Weka combines the best of both worlds. With the CycleCloud / Weka template, customers will get:  

  • Simplified HPC management. With CycleCloud, you can provision clusters with a few clicks using preconfigured templates – and the clusters will all be mounted directly to WEKA.  
  • A High-Performance End to End Architecture. CycleCloud & WEKA allows users to combine the benefits of CPUs/GPUs with ultra fast storage. This is essential to ensure high throughput and low latency for computational workloads. The goal is to ensure that the storage subsystem can keep up with the high-speed demands of the CPU/GPU, especially in scenarios where you're running compute-heavy workloads like deep learning, scientific simulations, or large-scale data processing. 
  • Cost Optimization #1. Both CycleCloud and WEKA allow for autoscaling (up and down). Adjust the number of compute resources (CycleCloud) as well as the number of Storage backend nodes (WEKA) based on workload needs.  
  • Cost Optimization #2. WEKA.IO offers intelligent data tiering to help optimize performance and storage costs. The tiering system is designed to automatically move data between different storage classes based on access patterns, which maximizes efficiency while minimizing expenses.

Conclusion 

The CycleCloud & WEKA integration delivers a simplified HPC (AI/ML) cloud management platform, exceptional performance for data-intensive workloads, cost optimization via elastic scaling, flash optimization, & data tiering, all in one user Interface  

This enables organizations to achieve high throughput, low latency, and optimal CPU/GPU resource utilization for their most demanding applications and use cases. 

Try it today! 

Figure 4: The template

Special thanks to Raj Sharma and the WEKA team for their work on this integration!

Updated Feb 11, 2025
Version 1.0
No CommentsBe the first to comment