New Amazon EC2 P6-B200 situations powered by NVIDIA Blackwell GPUs to speed up AI improvements


Voiced by Polly

At the moment, we’re asserting the final availability of Amazon Elastic Compute Cloud (Amazon EC2) P6-B200 situations powered by NVIDIA B200 to handle buyer wants for prime efficiency and scalability in synthetic intelligence (AI), machine studying (ML), and excessive efficiency computing (HPC) functions.

Amazon EC2 P6-B200 situations speed up a broad vary of GPU-enabled workloads however are particularly well-suited for large-scale distributed AI coaching and inferencing for basis fashions (FMs) with reinforcement studying (RL) and distillation, multimodal coaching and inference, and HPC functions resembling local weather modeling, drug discovery, seismic evaluation, and insurance coverage danger modeling.

When mixed with Elastic Cloth Adapter (EFAv4) networking, hyperscale clustering by EC2 UltraClusters, and superior virtualization and safety capabilities by AWS Nitro System, you possibly can practice and serve FMs with elevated velocity, scale, and safety. These situations additionally ship as much as two instances the efficiency for AI coaching (time to coach) and inference (tokens/sec) in comparison with EC2 P5en situations.

You possibly can speed up time-to-market for coaching FMs and ship quicker inference throughput, which lowers inference price and helps enhance adoption of generative AI functions in addition to elevated processing efficiency for HPC functions.

EC2 P6-B200 situations specs
New EC2 P6-B200 situations present eight NVIDIA B200 GPUs with 1440 GB of excessive bandwidth GPU reminiscence, fifth Era Intel Xeon Scalable processors (Emerald Rapids), 2 TiB of system reminiscence, and 30 TB of native NVMe storage.

Listed below are the specs for EC2 P6-B200 situations:

Occasion measurement GPUs (NVIDIA B200) GPU
reminiscence (GB)
vCPUs GPU Peer to see (GB/s) Occasion storage (TB) Community bandwidth (Gbps) EBS bandwidth (Gbps)
P6-b200.48xlarge 8 1440 HBM3e 192 1800 8 x 3.84 NVMe SSD 8 x 400 100

These situations characteristic as much as 125 % enchancment in GPU TFLOPs, 27 % enhance in GPU reminiscence measurement, and 60 % enhance in GPU reminiscence bandwidth in comparison with P5en situations.

P6-B200 situations in motion
You need to use P6-B200 situations within the US West (Oregon) AWS Area via EC2 Capability Blocks for ML. To order your EC2 Capability Blocks, select Capability Reservations on the Amazon EC2 console.

Choose Buy Capability Blocks for ML after which select your complete capability and specify how lengthy you want the EC2 Capability Block for p6-b200.48xlarge situations. The overall variety of days you could reserve EC2 Capability Blocks is 1-14 days, 21 days, 28 days, or multiples of seven as much as 182 days. You possibly can select your earliest begin date for as much as 8 weeks prematurely.

Now, your EC2 Capability Block might be scheduled efficiently. The overall worth of an EC2 Capability Block is charged up entrance, and the worth doesn’t change after buy. The cost might be billed to your account inside 12 hours after you buy the EC2 Capability Blocks. To study extra, go to Capability Blocks for ML within the Amazon EC2 Consumer Information.

When launching P6-B200 situations, you should use AWS Deep Studying AMIs (DLAMI) to assist EC2 P6-B200 situations. DLAMI gives ML practitioners and researchers with the infrastructure and instruments to shortly construct scalable, safe, distributed ML functions in preconfigured environments.

To run situations, you should use AWS Administration Console, AWS Command Line Interface (AWS CLI) or AWS SDKs.

You possibly can combine EC2 P6-B200 situations seamlessly with varied AWS managed providers resembling Amazon Elastic Kubernetes Companies (Amazon EKS), Amazon Easy Storage Service (Amazon S3), and Amazon FSx for Lustre. Help for Amazon SageMaker HyperPod can be coming quickly.

Now accessible
Amazon EC2 P6-B200 situations can be found at the moment within the US West (Oregon) Area and will be bought as EC2 Capability blocks for ML.

Give Amazon EC2 P6-B200 situations a attempt within the Amazon EC2 console. To study extra, check with the Amazon EC2 P6 occasion web page and ship suggestions to AWS re:Publish for EC2 or via your ordinary AWS Help contacts.

Channy


How is the Information Weblog doing? Take this 1 minute survey!

(This survey is hosted by an exterior firm. AWS handles your data as described within the AWS Privateness Discover. AWS will personal the info gathered through this survey and won’t share the knowledge collected with survey respondents.)



Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles