{"id":85960,"date":"2026-03-25T09:58:19","date_gmt":"2026-03-25T07:58:19","guid":{"rendered":"https:\/\/gulftech-news.com\/en\/?p=85960"},"modified":"2026-03-25T09:58:20","modified_gmt":"2026-03-25T07:58:20","slug":"f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference","status":"publish","type":"post","link":"https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/","title":{"rendered":"F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference"},"content":{"rendered":"\n<p><strong>F5 BIG-IP Next for Kubernetes accelerated with BlueField DPUs improves token throughput, reduces cost per token, and enables secure multi-tenant AI infrastructure, transforming AI factories for the agentic era<\/strong><\/p>\n\n\n\n<p><strong><a href=\"https:\/\/www.f5.com\/\">F5<\/a><\/strong> (NASDAQ: FFIV), the global leader in delivering and securing every app and API, today announced expanded capabilities in its ongoing\u00a0<a href=\"https:\/\/www.f5.com\/partners\/technology-alliances\/nvidia\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>collaboration<\/strong><\/a>\u00a0with\u00a0NVIDIA\u00a0to accelerate and optimize AI inference infrastructures.<\/p>\n\n\n\n<p>The expanded integration combines&nbsp;<a href=\"https:\/\/www.f5.com\/products\/big-ip\/next\/big-ip-next-for-kubernetes\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>F5 BIG-IP Next for Kubernetes<\/strong><\/a>&nbsp;with&nbsp;NVIDIA BlueField-3 DPUs, creating an intelligent, telemetry-aware infrastructure layer that increases token throughput with better GPU utilization, reduces latency, and enables secure multi-tenant AI platforms at scale.<\/p>\n\n\n\n<p>In AI systems, tokens represent the measurable unit of AI output\u2014the words, symbols, or data fragments generated and processed during inference. 
The volume and velocity of token production ultimately determine user experience, infrastructure efficiency, and revenue per accelerator.<\/p>\n\n\n\n<p>As enterprises and GPUaaS providers race to monetize AI and move from AI experimentation to revenue-generating services, infrastructure efficiency has become a defining metric. Success is increasingly measured not simply by deployed GPU capacity, but by token economics, sustained token throughput, time to first token (TTFT), cost per token, and revenue per GPU accelerator. The F5 and NVIDIA joint solution is designed to directly address these metrics.<\/p>\n\n\n\n<p><strong>Optimizing tokenomics through intelligent AI infrastructure<\/strong><\/p>\n\n\n\n<p>The shift from application-centric inference to agent-driven AI workflows demands new architectural approaches to optimize token throughput and reduce costs. BIG-IP Next for Kubernetes now leverages NVIDIA NIM statistics, Dynamo runtime signals, and GPU telemetry to make inference-aware routing decisions before execution. By matching workloads to the most appropriate accelerators in real time, the solution increases sustained utilization while reducing latency and re-compute.<\/p>\n\n\n\n<p>\u201cAI infrastructure is no longer just about access to GPU or scaling their deployments. It has evolved into maximizing economic output per accelerator,\u201d said Kunal Anand, Chief Product Officer, F5. \u201cTogether with NVIDIA, we are enabling AI factories to treat token production as a measurable business metric. BIG-IP Next for Kubernetes provides the intelligence and governance required to increase GPU yield, reduce cost per token, and scale shared AI platforms confidently.\u201d<\/p>\n\n\n\n<p><strong>Validated infrastructure efficiency: A structural uplift<\/strong><\/p>\n\n\n\n<p>The performance numbers speak for themselves. 
In testing validated by The Tolly Group, BIG-IP Next for Kubernetes, accelerated by NVIDIA BlueField-3 DPUs, delivered up to a&nbsp;40% increase in token throughput, a&nbsp;61% faster time to first token (TTFT), and a&nbsp;34% reduction in overall request latency.<\/p>\n\n\n\n<p>These are not incremental gains. By offloading networking, TLS\/encryption, AI-aware load balancing, and traffic management to NVIDIA BlueField-3 DPUs, BIG-IP Next for Kubernetes preserves host CPU capacity and frees GPUs to do what they were built for: sustained, high-throughput inference at scale.<\/p>\n\n\n\n<p>The result is improved GPU utilization, reduced queuing delays, and increased token yield\u2014enabling lower cost per token within a fixed infrastructure footprint. Critically, no model modifications were required, making these gains immediately deployable across existing AI factory infrastructure. For enterprises and NeoCloud providers competing on token economics, this is the difference between infrastructure that constrains AI output and infrastructure that accelerates it.<\/p>\n\n\n\n<p>\u201cNVIDIA\u2019s accelerated computing infrastructure coupled with F5\u2019s AI-aware Application Delivery and Security Platform unlocks superior AI factory tokenomics\u2014delivering scalable and cost-effective inference without making any changes to the models,\u201d said Kevin Deierling, SVP, Networking, NVIDIA. \u201cTogether, F5 and NVIDIA are empowering enterprises to scale AI factory inference efficiently and economically.\u201d<\/p>\n\n\n\n<p><strong>Built for agent-driven AI and multi-tenant AI platforms<\/strong><\/p>\n\n\n\n<p>Modern AI workloads are increasingly agent-driven, persistent, and context-aware. They demand intelligent traffic control that traditional load balancing cannot provide. 
The enhanced BIG-IP Next for Kubernetes solution can now support:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Inference-aware routing for agentic AI workflows<\/li>\n\n\n\n<li>Integration with NVIDIA DOCA Platform Framework (DPF) to simplify NVIDIA BlueField DPU deployment and lifecycle management<\/li>\n\n\n\n<li>EVPN-VXLAN with dynamic VRFs for secure network-level multi-tenancy<\/li>\n\n\n\n<li>Integrated security, token governance, and observability within Kubernetes AI environments<\/li>\n<\/ul>\n\n\n\n<p>These capabilities enable enterprises and NeoCloud providers to securely share GPU infrastructure across business units or external customers while preserving performance isolation and predictable service levels.<\/p>\n\n\n\n<p><strong>A control plane for AI factory economics<\/strong><\/p>\n\n\n\n<p>F5 and NVIDIA provide enterprises with validated tools and best practices to optimize inference architecture. With these advancements, BIG-IP Next for Kubernetes is positioned to become a strategic control plane for AI factory economics, governing token consumption, optimizing traffic flows, and maximizing infrastructure return on investment.<\/p>\n\n\n\n<p>Rather than overprovisioning to compensate for inefficiencies, organizations can now extract greater economic value from every GPU already in production. The result is improved revenue per GPU, lower operational overhead, and scalable AI services built for sustained growth. 
By combining NVIDIA\u2019s infrastructure telemetry and DPU acceleration with F5\u2019s traffic intelligence and security capabilities, the companies are helping enterprises transform AI factories into efficient, monetizable platforms ready for the agentic era.<\/p>\n\n\n\n<p><strong>Supporting materials<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Blog:<\/strong>\u00a0<a href=\"https:\/\/www.f5.com\/company\/blog\/ai-factories-need-intelligent-infrastructure-new-results-from-the-tolly-group-show-why\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>AI factories need intelligent infrastructure. New results from The Tolly Group show why.<\/strong><\/a><\/li>\n\n\n\n<li><strong>Report:<\/strong>\u00a0<a href=\"https:\/\/www.f5.com\/go\/report\/your-ai-infrastructure-should-be-producing-more-tokens\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>Independent testing by Tolly: F5 BIG-IP Next for Kubernetes<\/strong><\/a><\/li>\n<\/ul>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>F5 BIG-IP Next for Kubernetes accelerated with BlueField DPUs improves token throughput, reduces cost per token, and enables secure multi-tenant AI infrastructure, transforming AI factories for the agentic era F5 (NASDAQ: FFIV), the global leader in delivering and securing every app and API, today announced expanded capabilities in its ongoing\u00a0collaboration\u00a0with\u00a0NVIDIA\u00a0to accelerate and optimize AI inference &hellip;<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[644],"tags":[227,3491,1985],"class_list":["post-85960","post","type-post","status-publish","format-standard","hentry","category-communications-technology","tag-ai","tag-f5","tag-nvidia"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v24.1 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>F5 and 
NVIDIA advance AI factory economics with new capabilities for accelerated AI inference - Gulf Tech News<\/title>\n<meta name=\"description\" content=\"F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference - Gulf Tech News\" \/>\n<meta property=\"og:description\" content=\"F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference\" \/>\n<meta property=\"og:url\" content=\"https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/\" \/>\n<meta property=\"og:site_name\" content=\"Gulf Tech News\" \/>\n<meta property=\"article:published_time\" content=\"2026-03-25T07:58:19+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-03-25T07:58:20+00:00\" \/>\n<meta name=\"author\" content=\"bessan helmi\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"bessan helmi\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/\",\"url\":\"https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/\",\"name\":\"F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference - Gulf Tech News\",\"isPartOf\":{\"@id\":\"https:\/\/gulftech-news.com\/en\/#website\"},\"datePublished\":\"2026-03-25T07:58:19+00:00\",\"dateModified\":\"2026-03-25T07:58:20+00:00\",\"author\":{\"@id\":\"https:\/\/gulftech-news.com\/en\/#\/schema\/person\/c033626e357b2f7e127eac0570ddc05c\"},\"description\":\"F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference\",\"breadcrumb\":{\"@id\":\"https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/gulftech-news.com\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI 
inference\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/gulftech-news.com\/en\/#website\",\"url\":\"https:\/\/gulftech-news.com\/en\/\",\"name\":\"Gulf Tech News\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/gulftech-news.com\/en\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/gulftech-news.com\/en\/#\/schema\/person\/c033626e357b2f7e127eac0570ddc05c\",\"name\":\"bessan helmi\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/gulftech-news.com\/en\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/bb1e09a6f094e0fa605073926f8ad9eb228a8b0aacd381fda782c562612428cf?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/bb1e09a6f094e0fa605073926f8ad9eb228a8b0aacd381fda782c562612428cf?s=96&d=mm&r=g\",\"caption\":\"bessan helmi\"},\"url\":\"https:\/\/gulftech-news.com\/en\/author\/bessan-helmi\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference - Gulf Tech News","description":"F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/","og_locale":"en_US","og_type":"article","og_title":"F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference - Gulf Tech News","og_description":"F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference","og_url":"https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/","og_site_name":"Gulf Tech News","article_published_time":"2026-03-25T07:58:19+00:00","article_modified_time":"2026-03-25T07:58:20+00:00","author":"bessan helmi","twitter_card":"summary_large_image","twitter_misc":{"Written by":"bessan helmi","Est. 
reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/","url":"https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/","name":"F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference - Gulf Tech News","isPartOf":{"@id":"https:\/\/gulftech-news.com\/en\/#website"},"datePublished":"2026-03-25T07:58:19+00:00","dateModified":"2026-03-25T07:58:20+00:00","author":{"@id":"https:\/\/gulftech-news.com\/en\/#\/schema\/person\/c033626e357b2f7e127eac0570ddc05c"},"description":"F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference","breadcrumb":{"@id":"https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/gulftech-news.com\/en\/2026\/03\/25\/f5-and-nvidia-advance-ai-factory-economics-with-new-capabilities-for-accelerated-ai-inference\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/gulftech-news.com\/en\/"},{"@type":"ListItem","position":2,"name":"F5 and NVIDIA advance AI factory economics with new capabilities for accelerated AI inference"}]},{"@type":"WebSite","@id":"https:\/\/gulftech-news.com\/en\/#website","url":"https:\/\/gulftech-news.com\/en\/","name":"Gulf Tech 
News","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/gulftech-news.com\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/gulftech-news.com\/en\/#\/schema\/person\/c033626e357b2f7e127eac0570ddc05c","name":"bessan helmi","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/gulftech-news.com\/en\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/bb1e09a6f094e0fa605073926f8ad9eb228a8b0aacd381fda782c562612428cf?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/bb1e09a6f094e0fa605073926f8ad9eb228a8b0aacd381fda782c562612428cf?s=96&d=mm&r=g","caption":"bessan helmi"},"url":"https:\/\/gulftech-news.com\/en\/author\/bessan-helmi\/"}]}},"_links":{"self":[{"href":"https:\/\/gulftech-news.com\/en\/wp-json\/wp\/v2\/posts\/85960","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/gulftech-news.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/gulftech-news.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/gulftech-news.com\/en\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/gulftech-news.com\/en\/wp-json\/wp\/v2\/comments?post=85960"}],"version-history":[{"count":1,"href":"https:\/\/gulftech-news.com\/en\/wp-json\/wp\/v2\/posts\/85960\/revisions"}],"predecessor-version":[{"id":85961,"href":"https:\/\/gulftech-news.com\/en\/wp-json\/wp\/v2\/posts\/85960\/revisions\/85961"}],"wp:attachment":[{"href":"https:\/\/gulftech-news.com\/en\/wp-json\/wp\/v2\/media?parent=85960"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/gulftech-news.com\/en\/wp-json\/wp\/v2\/categories?post=85960"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/gulftech-news.com\/en\/wp-json\/wp\/v2\/tags?post=85960"}],
"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}