Code

Nvidia NeMo Microservices For AI Agents Hits The Market

Last year, amid all the talk of the “Blackwell” datacenter GPUs that were launched at last year’s GPU Technicval Conference, Nvidia also introduced the idea of Nvidia Inference Microservices, or NIMs, which are prepackaged enterprise-grade generative AI software stacks that companies can use as virtual copilots to add custom AI

Compute

The Chips Are Definitely Not Down

The semiconductor manufacturing business is absolutely immense. To give the numbers some perspective, in 2024, chip makers generated revenues that were about three quarters of the size of the US defense budget and about two-thirds the size of the social services budget allocated by Congress. And spending on chip manufacturing

Sign up to our newsletter

Featuring highlights, analysis, and stories from the week directly from us to your inbox with nothing in between.
Subscribe now

More Analysis

AI

Freeing Developers From GenAI Deployment Nightmares

PARTNER CONTENT: “Developers have to build it, right, and their first concern is to make it work,” says CentML chief executive officer Gennady Pekhimenko. “After that, when it starts to work, it’s like, ‘OK, let’s deploy it.’ And that’s where, all of a sudden, they face challenges.” Launching a new

AI

Nvidia Sacrifices Profits To Preserve Revenues In The US

Making a graphics card for gamers is one thing, but manufacturing a rackscale supercomputer with over 600,000 components that burns 120 kilowatts of power, that has over 5,000 copper cables for an all-to-all interconnect mesh for 72 dual-chip compute engines, and that weighs over 3,000 pounds is another thing entirely.

AI

ABCI Evolves To Meet Japan’s Changing AI Needs

SPONSORED FEATURE: Back before there were AI factories, there were two generations of the AI Bridging Cloud Infrastructure supercomputer, built by the National Institute of Advanced Industrial Science and Technology (AIST) in Japan. AIST created what is arguably the first AI factory, merging accelerated computing platforms with cloud infrastructure software

Compute

Google Woos HPC Centers With Fast CPUs And Networks

The HPC centers of the world like fast networks and compute, but they are also always working under budget constraints unlike their AI peers out there in the enterprise, where money seems to be unlimited to what sometimes looks like an irrationally exuberant extent. They are also don’t have a