> Each AMD EPYC 9v64H CPU physically have 96 Zen 4 cores
> 8 Zen 4 cores per EPYC 9v64H CPU
The two consecutive lines of text give two different core counts. I know initial reporting on this CPU has been unclear, with everyone initially saying 88 cores, then updating to 96. But the author could have spent a couple of words on what the extra 8 cores are used for (best I could find is "used as overhead").
I think 88 of the 96 cores per CPU are assigned to the VM and the rest are assigned to the underlying hypervisor. I remember seeing that somewhere.
When you're dedicating a whole system to a single VM, you need to have some spares for the underlying OS to keep it happy.
The OS needs cores and RAM to be able to keep the system up. Everything from network cards to services needs some spare capacity, otherwise things go very wrong and the experience for the tenant becomes very bad.
Or you need separate dedicated hardware for the hypervisor, like AWS has with Nitro.
The article says single-tenant, so that would be a waste of 32 cores for just one VM? Seems like a lot.
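To put that arithmetic in one place, here's a minimal sketch (assuming the figures quoted above: 96 physical cores per socket, 88 exposed to the guest, 4 sockets per system; these are the thread's numbers, not official ones):

    # Back-of-envelope: cores held back from the guest, assuming the
    # 96-physical / 88-exposed split and 4 sockets discussed above.
    sockets = 4
    physical_per_socket = 96
    exposed_per_socket = 88

    reserved_per_socket = physical_per_socket - exposed_per_socket  # 8
    reserved_total = sockets * reserved_per_socket                  # 32
    exposed_total = sockets * exposed_per_socket                    # 352

    print(f"reserved for host/hypervisor: {reserved_total} cores")
    print(f"visible to the single-tenant VM: {exposed_total} cores")

So roughly 8% of the machine, if the 88/96 split is right.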
That hypervisor, in the case of Azure, does more than usual... AWS seems to offload a LOT more to extra hardware. I've only ever seen MS mention hardware offload for networking... could be wrong, mind you...
Ok, I see, maybe these cores are managing things like I/O and hardware monitoring. Thanks for the explanation.
Well, Windows has a lot of svchost.exe processes running. /s
Isn't it 8 chiplets of 12 cores each?
That should have stated: yes, but one core per chiplet (or maybe an entire chiplet) is not used. I can speculate about why it's set aside, but some official information, or even a more educated guess, would have been very informative.
I wonder what inference on big LLMs might look like with that much cache and memory bandwidth. Not trivial to get a benchmark for that, but I wonder.
For LLMs you'd want to use the MI300X variant and benchmarks should already be available.
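For a very rough feel: decode on big LLMs tends to be memory-bandwidth-bound, so a crude upper bound is bandwidth divided by the bytes of weights streamed per token. A sketch under made-up assumptions (the 70B model size and fp16 weights are purely illustrative; the 6900 GB/s figure is the aggregate HBM number mentioned elsewhere in this thread):

    # Crude bandwidth-bound decode estimate: assume every generated token
    # streams all model weights from memory once. Everything here is an
    # illustrative assumption, not a measurement.
    hbm_bandwidth_gb_s = 6900      # aggregate HBM bandwidth cited in the thread
    model_params_billion = 70      # hypothetical 70B-parameter model
    bytes_per_param = 2            # fp16/bf16 weights

    weights_gb = model_params_billion * bytes_per_param   # ~140 GB of weights
    tokens_per_s_upper_bound = hbm_bandwidth_gb_s / weights_gb

    print(f"~{tokens_per_s_upper_bound:.0f} tokens/s (bandwidth-only upper bound)")

Real numbers would come in lower once compute, KV-cache traffic, and cross-socket NUMA effects enter the picture, but it gives a sense of why the HBM is interesting for CPU inference.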
I'd love to see the benchmarks for this with SMT on: 96x2x4 = 768 logical CPUs in one system, along with 512 GB of HBM at 6900 GB/s memory bandwidth, and then DDR5 on top.
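If someone does get access, a quick way to check what the guest actually exposes against that 96x2x4 = 768 figure (a Linux/x86-specific sketch, relying on the standard /proc/cpuinfo fields):

    import os

    # Logical CPUs the guest sees (SMT threads included).
    print("logical CPUs visible:", os.cpu_count())

    # Count unique (physical id, core id) pairs to separate SMT siblings
    # from physical cores. x86 Linux layout of /proc/cpuinfo assumed.
    cores = set()
    physical_id = core_id = None
    with open("/proc/cpuinfo") as f:
        for line in f:
            if line.startswith("physical id"):
                physical_id = line.split(":")[1].strip()
            elif line.startswith("core id"):
                core_id = line.split(":")[1].strip()
            elif not line.strip():           # blank line ends one CPU entry
                if physical_id is not None and core_id is not None:
                    cores.add((physical_id, core_id))
                physical_id = core_id = None
    if physical_id is not None and core_id is not None:
        cores.add((physical_id, core_id))

    print("physical cores visible:", len(cores))

(Per the 88-cores-per-socket discussion above, a guest might see 352 physical / 704 logical rather than the full 768, but that's exactly the sort of thing this would confirm.)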
That's a monster of a CPU, wow.