AI Inferencing Everywhere: Scaling Enterprise AI from Core to Edge
As AI moves into production, enterprises must solve for distributed execution across core and edge environments. This conversation explores how infrastructure is evolving to support scalable, real-time AI inferencing.
AI is moving out of the lab and into real-world environments, and the challenge is no longer building models, it’s running them everywhere.
At NVIDIA GTC, hosts Patrick Moorhead and Daniel Newman sit down with Vlad Rozanovich of Lenovo and Jon Alexander of Akamai to explore how distributed infrastructure is enabling enterprise AI inferencing from the core to the edge.
The conversation unpacks how enterprises are shifting from centralized AI architectures to highly distributed environments where performance, consistency, and security must hold across locations. Through Lenovo’s collaboration with Akamai, AI workloads are being deployed on infrastructure that spans data centers to edge locations, redefining how organizations think about cloud, latency, and execution. As new performance metrics like time-to-first-token gain importance, the discussion highlights how infrastructure decisions directly impact real-world AI outcomes.
Key Takeaways
🔹 AI deployment is shifting from centralized models to distributed inferencing environments
🔹 Time-to-first-token is emerging as a critical performance metric for AI workloads
🔹 Unified infrastructure is key to maintaining consistency across core and edge
🔹 Lenovo and Akamai are redefining what “cloud” means in the AI era
🔹 Edge AI is enabling faster, more secure, real-time decision-making
As AI scales, the ability to execute models consistently across distributed environments will define enterprise success.
Watch the full conversation at sixfivemedia.com and subscribe to our Youtube channel for more insights from NVIDIA GTC 2026.
Listen to the audio here:
Disclaimer: Six Five On The Road is for information and entertainment purposes only. Over the course of this webcast, we may talk about companies that are publicly traded, and we may even reference that fact and their equity share price, but please do not take anything that we say as a recommendation about what you should do with your investment dollars. We are not investment advisors, and we ask that you do not treat us as such.
MORE VIDEOS

The Inference Inflection: MiTAC on Building Flexible AI Infrastructure for Enterprise Scale
As AI moves into production, infrastructure flexibility, orchestration, and data performance are becoming critical. MiTAC outlines how modular platforms and integrated partnerships are enabling scalable, high-performance AI deployments.
Other Categories
CYBERSECURITY

Threat Intelligence: Insights on Cybersecurity from Secureworks
Alex Rose from Secureworks joins Shira Rubinoff on the Cybersphere to share his insights on the critical role of threat intelligence in modern cybersecurity efforts, underscoring the importance of proactive, intelligence-driven defense mechanisms.
QUANTUM

Quantum in Action: Insights and Applications with Matt Kinsella
Quantum is no longer a technology of the future; the quantum opportunity is here now. During this keynote conversation, Infleqtion CEO, Matt Kinsella will explore the latest quantum developments and how organizations can best leverage quantum to their advantage.

Accelerating Breakthrough Quantum Applications with Neutral Atoms
Our planet needs major breakthroughs for a more sustainable future and quantum computing promises to provide a path to new solutions in a variety of industry segments. This talk will explore what it takes for quantum computers to be able to solve these significant computational challenges, and will show that the timeline to addressing valuable applications may be sooner than previously thought.



