AI and Us | Governance, Regulation & Policy

OpenAI unveils open-weight AI safety models for developers

ByRicardo October 29, 2025October 29, 2025

Banner for AI & Big Data Expo by TechEx events.

OpenAI is placing extra safety controls instantly into the arms of AI developers with a brand new analysis preview of “safeguard” models. The new ‘gpt-oss-safeguard’ household of open-weight models is aimed squarely at customising content material classification.

The new providing will embody two models, gpt-oss-safeguard-120b and a smaller gpt-oss-safeguard-20b. Both are fine-tuned variations of the present gpt-oss household and will probably be accessible beneath the permissive Apache 2.0 license. This will permit any organisation to freely use, tweak, and deploy the models as they see match.

The actual distinction right here isn’t simply the open license; it’s the strategy. Rather than counting on a hard and fast algorithm baked into the mannequin, gpt-oss-safeguard makes use of its reasoning capabilities to interpret a developer’s personal coverage on the level of inference. This means AI developers utilizing OpenAI’s new mannequin can arrange their very own particular safety framework to categorise something from single person prompts to full chat histories. The developer, not the mannequin supplier, has the ultimate say on the ruleset and may tailor it to their particular use case.

This method has a few clear benefits:

Transparency: The models use a chain-of-thought course of, so a developer can really look beneath the bonnet and see the mannequin’s logic for a classification. That’s an enormous step up from the standard “black field” classifier.

Agility: Because the safety coverage isn’t completely skilled into OpenAI’s new mannequin, developers can iterate and revise their pointers on the fly with no need a whole retraining cycle. OpenAI, which initially constructed this technique for its inside groups, notes it is a way more versatile strategy to deal with safety than coaching a conventional classifier to not directly guess what a coverage implies.

Rather than counting on a one-size-fits-all safety layer from a platform holder, developers utilizing open-source AI models can now construct and implement their very own particular requirements.

While not reside as of writing, developers will be capable of entry OpenAI’s new open-weight AI safety models on the Hugging Face platform.

See additionally: OpenAI restructures, enters ‘next chapter’ of Microsoft partnership

Want to be taught extra about AI and large information from trade leaders? Check out AI & Big Data Expo going down in Amsterdam, California, and London. The complete occasion is a part of TechEx and is co-located with different main expertise occasions together with the Cyber Security Expo, click on here for extra info.

AI News is powered by TechForge Media. Explore different upcoming enterprise expertise occasions and webinars here.

The publish OpenAI unveils open-weight AI safety models for developers appeared first on AI News.

AI and Us AI in Action

FIFA is rebuilding world football operations on AI. The World Cup is just the first test
ByRicardo March 17, 2026

When Romy Gai, FIFA’s chief enterprise officer, described the operational problem of operating a 48-team World Cup throughout Canada, Mexico and the United States, he was not speaking about know-how. He was speaking about complexity. Previous World Cups relied on native organising committees to soak up a lot of the logistical load. For 2026, FIFA is…

Read More FIFA is rebuilding world football operations on AI. The World Cup is just the first test
AI and Us AI Business Strategy

Why security chiefs demand urgent regulation of AI like DeepSeek
ByRicardo August 18, 2025

Anxiety is growing among Chief Information Security Officers (CISOs) in security operation centres, particularly around Chinese AI giant DeepSeek. AI was heralded as a new dawn for business efficiency and innovation, but for the people on the front lines of corporate defence, it’s casting some very long and dark shadows. Four in five (81%) UK…

Read More Why security chiefs demand urgent regulation of AI like DeepSeek
AI and Us AI in Action

EU publishes its AI content labelling playbook ahead of the AI Act’s August deadline
ByRicardo June 16, 2026June 16, 2026

The European Union has published its AI content labelling playbook, a voluntary Code of Practice meant to assist corporations meet transparency guidelines that turn into regulation throughout the bloc on August 2 onwards. The European Commission launched the closing Code on 10 June, setting out sensible steps for the companies that construct and use generative AI to…

Read More EU publishes its AI content labelling playbook ahead of the AI Act’s August deadline
AI and Us AI Business Strategy

BBVA embeds AI into banking workflows using ChatGPT Enterprise
ByRicardo December 12, 2025

BBVA is embedding AI into core banking workflows using ChatGPT Enterprise to overtake danger and repair within the sector. For the banking business, the problem of generative AI is never about adoption; it’s about worth extraction. BBVA has addressed this by integrating OpenAI’s platform instantly into its operational spine, a call that can see the…

Read More BBVA embeds AI into banking workflows using ChatGPT Enterprise
AI and Us AI in Action

How Huawei is building agentic AI systems that make decisions independently
ByRicardo October 14, 2025

In a cement plant operated by Conch Group, an agentic AI system constructed on Huawei infrastructure now predicts the energy of clinker with over 90% accuracy and autonomously adjusts calcination parameters to chop coal consumption by 1%—decisions that beforehand required human experience gathered over many years This exemplifies how Huawei is creating agentic AI systems…

Read More How Huawei is building agentic AI systems that make decisions independently
AI and Us AI Business Strategy

AI Expo 2026 Day 2: Moving experimental pilots to AI production
ByRicardo February 9, 2026

The second day of the co-located AI & Big Data Expo and Digital Transformation Week in London showed a market in a clear transition. Early excitement over generative models is fading. Enterprise leaders now face the friction of fitting these tools into current stacks. Day two sessions focused less on large language models and more…

Read More AI Expo 2026 Day 2: Moving experimental pilots to AI production