NVIDIA and Google infrastructure cuts AI inference costs
At the Google Cloud Next convention, Google and NVIDIA outlined their {hardware} roadmap designed to handle the price of AI inference at scale. The firms detailed the brand new A5X bare-metal situations, which run on NVIDIA Vera Rubin NVL72 rack-scale methods. Through {hardware} and software program codesign, this structure goals to ship as much as…
