Duality Technologies Enables Secure GenAI Workflows on NVIDIA GPUs

By Ricardo, November 14, 2025

Duality Technologies, a leader in privacy-enhancing technologies and secure data collaboration, today announced support for Google Cloud’s Confidential Computing portfolio, including NVIDIA GPU-powered confidential virtual machines on Google Cloud, enabling large-scale secured AI workloads such as LLM training and inference.

With this release, the Duality Platform now supports GPU-backed LLM inference and encrypted Retrieval-Augmented Generation (RAG) inside trusted execution environments (TEEs), a significant performance leap from the earlier CPU-only support.

Customers can run a secure generative AI workflow on NVIDIA GPUs with Google Cloud Confidential Computing, now featuring end-to-end protection against data leakage powered by Duality. They can also combine full-stack data confidentiality with NVIDIA H100 GPU performance, unlocking confidential AI use cases that were previously impractical due to latency and throughput constraints.

“This changes the game,” said Dr. Alon Kaufman, CEO and Co-Founder of Duality Technologies. “Our customers can now run privacy-preserving AI with LLMs at production scale. With GPU acceleration, the performance bottlenecks of secure computing are gone, making secure LLM training and inference practical.”

The new capability is built on Google Cloud’s Confidential Space and NVIDIA H100-powered confidential VMs, with support for Intel TDX and Cloud KMS integration. Duality has successfully validated the platform running a Mistral-7B model using encrypted vector RAG (via Faiss) in a fully confidential pipeline.

“With Confidential GPUs, organizations can process sensitive AI workloads entirely inside trusted execution environments without giving up performance,” said Nelly Porter, Director of Product Management, Google Cloud. “Pairing NVIDIA H100-powered confidential VMs with Duality’s encrypted workflows allows LLM training and inference to happen at scale, with end-to-end protection from data leakage.”

Key Highlights:

GPU Support for Confidential AI: Run secure LLMs and encrypted RAG on Confidential NVIDIA H100s
Scalable Performance: Orders-of-magnitude faster runtimes vs. CPU-only workloads
Enterprise-Ready: Meets the needs of regulated industries, defense, healthcare, and AI-native companies
Seamless Cloud Integration: Now available via Dynamic Workload Scheduler in Google Cloud’s Confidential Space

Until now, confidential AI was limited to CPU-only environments, which are suitable for basic testing but insufficient for the demands of large-scale AI. With the arrival of Confidential GPU as part of the confidential computing portfolio, Duality customers can now run both LLM training and inference securely inside Trusted Execution Environments. This breakthrough enables high-throughput, privacy-preserving AI workloads that were previously impossible to execute, unlocking new possibilities across industries and use cases.

This capability is initially available in preview on the Google Cloud Confidential A3 virtual machine type, with broader rollout expected later this year.

To learn more, visit dualitytech.com.

The post Duality Technologies Enables Secure GenAI Workflows on NVIDIA GPUs first appeared on AI-Tech Park.


