Forum Discussion
ITManager8815
Jan 24, 2025 | Copper Contributor
Determining sizing requirements for GPU enabled Azure VM
Greetings, We are trying to determine the correct VM sizing requirement for our AI workload, which is used for NLP processing. This workload does not require any training, but will only be used for ...
Kidd_Ip
Jan 25, 2025 | MVP
How about this:
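- CPU Requirements: The NC-series VMs provide plenty of vCPUs and memory for your multithreaded C# application. For instance, the NC24r size offers 24 vCPUs and 224 GB of RAM. Keep in mind that the original NC-series is an older K80-based generation that has been retired, so confirm what is still offered in your region, and that the NVv4 series is a separate family built on partitioned AMD GPUs aimed at visualization rather than CUDA-based inference.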
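- GPU Requirements: Given your need to support at least a 7B parameter model and potentially larger models, the NC A100 v4-series is a good fit. It provides NVIDIA A100 Tensor Core GPUs with 80 GB of VRAM each, which are well suited to AI inference workloads. A 7B model at 16-bit precision needs roughly 14 GB for its weights, so it fits comfortably on a single GPU. A 70B model needs roughly 140 GB at 16-bit precision, so it will only fit if you quantize it (about 35 GB at 4-bit) or spread it across multiple GPUs.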
- Budget Considerations: The NC A100 v4 sizes range from one GPU (NC24ads_A100_v4) to four GPUs (NC96ads_A100_v4), so you can start with the smallest size that holds your model and scale up only if needed. They also support NVIDIA's Multi-Instance GPU (MIG) technology, which lets you partition a GPU into isolated slices. MIG does not lower the price of the VM itself, but it can reduce costs by letting several workloads share one GPU.
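The VRAM point above can be sanity-checked with simple arithmetic: weight memory is roughly parameter count times bytes per parameter. Here is a rough sketch, assuming weights only (it ignores KV cache, activations, and runtime overhead). The function names and the 20% headroom figure are illustrative assumptions, not anything from Azure or NVIDIA.

```python
def weight_memory_gb(params_billions: float, bits_per_param: int) -> float:
    """Approximate memory for model weights alone, in GB (1 GB = 1e9 bytes)."""
    # billions of parameters * bytes per parameter = GB
    return params_billions * bits_per_param / 8


def fits_on_gpu(params_billions: float, bits_per_param: int,
                vram_gb: float, headroom: float = 0.2) -> bool:
    """True if the weights fit while leaving `headroom` of VRAM free.

    The 20% default headroom is an assumed safety margin for KV cache and
    runtime overhead; real requirements depend on context length and batch size.
    """
    usable_gb = vram_gb * (1 - headroom)
    return weight_memory_gb(params_billions, bits_per_param) <= usable_gb


if __name__ == "__main__":
    A100_VRAM_GB = 80  # per-GPU VRAM on the NC A100 v4-series
    for size in (7, 13, 70):
        for bits in (16, 8, 4):
            need = weight_memory_gb(size, bits)
            ok = fits_on_gpu(size, bits, A100_VRAM_GB)
            print(f"{size}B @ {bits}-bit: ~{need:.1f} GB weights, "
                  f"fits on one 80 GB GPU: {ok}")
```

Treat the result as a lower bound. Long prompts or many concurrent requests from a multithreaded client will add KV-cache memory on top of the weights.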
Whether your budget of about $1,000/month covers these sizes depends mainly on how many hours per month the VM actually runs. An A100 VM left running around the clock at pay-as-you-go rates will likely exceed that budget, so plan to deallocate it when idle and compare reserved and spot pricing. Use the Azure Pricing Calculator with your region and expected hours to confirm the numbers before committing.
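To make the budget check concrete, here is a small sketch that turns an hourly rate into a monthly cost and into the number of hours a budget can cover. The hourly rate below is a made-up placeholder, not a real Azure price; replace it with the figure the Azure Pricing Calculator shows for your size and region. The function names are also just illustrative.

```python
def monthly_cost(hourly_rate_usd: float, hours_per_day: float,
                 days_per_month: int = 30) -> float:
    """Cost of running a VM a fixed number of hours per day for a month."""
    return hourly_rate_usd * hours_per_day * days_per_month


def affordable_hours(budget_usd: float, hourly_rate_usd: float) -> float:
    """How many VM hours a monthly budget buys at a given hourly rate."""
    return budget_usd / hourly_rate_usd


if __name__ == "__main__":
    BUDGET_USD = 1000.0
    # PLACEHOLDER rate for illustration only; look up the real price
    # for your VM size and region in the Azure Pricing Calculator.
    PLACEHOLDER_RATE_USD = 4.00

    always_on = monthly_cost(PLACEHOLDER_RATE_USD, hours_per_day=24)
    hours = affordable_hours(BUDGET_USD, PLACEHOLDER_RATE_USD)
    print(f"Always-on cost at placeholder rate: ${always_on:,.2f}/month")
    print(f"Hours covered by budget: {hours:.0f} "
          f"(~{hours / 30:.1f} hours/day)")
```

If the affordable hours come out well below 24 per day, scheduling the VM to start and stop around your processing windows is usually the simplest way to stay within budget.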
ITManager8815
Jan 25, 2025 | Copper Contributor
Thank you for the information.