Microsoft’s new Maia 200 inference accelerator enters this overheated market with a chip that aims to cut the price ...
The next generation of inference platforms must evolve to address all three layers. The goal is not only to serve models ...
Nvidia is aiming to dramatically accelerate and optimize the deployment of generative AI large language models (LLMs) with a new approach to delivering models for rapid inference. At Nvidia GTC today, ...
AMD is strategically positioned to dominate the rapidly growing AI inference market, which could be 10x larger than training by 2030. The MI300X's memory advantage and ROCm's ecosystem progress make ...
You train the model once, but you run it every day. Making sure your model has business context and guardrails to ensure reliability is more valuable than fussing over the LLM itself. We’re years into the ...
Kubernetes has become the leading platform for deploying cloud-native applications and microservices, backed by an extensive community and comprehensive feature set for managing distributed systems.
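As a sketch of what that deployment model looks like in practice, here is a minimal example using the official Kubernetes Python client (pip install kubernetes). The Deployment name, labels, container image, and replica count are illustrative assumptions, not details from the article.

# Minimal sketch: declare a Deployment and let Kubernetes reconcile toward it.
from kubernetes import client, config

config.load_kube_config()  # reads cluster credentials from ~/.kube/config

deployment = client.V1Deployment(
    api_version="apps/v1",
    kind="Deployment",
    metadata=client.V1ObjectMeta(name="inference-service"),  # hypothetical name
    spec=client.V1DeploymentSpec(
        replicas=3,  # Kubernetes keeps three pods running and replaces failed ones
        selector=client.V1LabelSelector(match_labels={"app": "inference"}),
        template=client.V1PodTemplateSpec(
            metadata=client.V1ObjectMeta(labels={"app": "inference"}),
            spec=client.V1PodSpec(
                containers=[
                    client.V1Container(
                        name="model-server",
                        image="example.com/model-server:latest",  # placeholder image
                        ports=[client.V1ContainerPort(container_port=8080)],
                    )
                ]
            ),
        ),
    ),
)

# Submit the desired state; the control plane schedules and manages the pods.
client.AppsV1Api().create_namespaced_deployment(namespace="default", body=deployment)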
The small-form-factor HPE Edgeline EL8000 is designed for AI tasks such as computer vision and natural-language processing. Later this month, Hewlett Packard Enterprise will ship what looks to be the first server ...