U.S. government to test AI models, expand oversight

May 5 (UPI) — The Center for AI Standards and Innovation, part of a U.S.government agency, announced Tuesday that it will test artificial intelligence models from some top firms before release to vet them for security risks.
CAISI has deals with Microsoft, xAI and Google DeepMind for this testing and targeted research “to better assess frontier AI capabilities and advance the state of AI security,” it said in a release. The center is part of the U.S. Department of Commerce’s National Institute of Standards and Technology.
This follows similar deals in 2024, under the Biden administration, with prominent AI leaders OpenAI and Anthropic, which have been “renegotiated” to fit Trump administration directives, Politico reported.
The government has increasingly shown interest in matters of AI technology and security. CNBC also reported Tuesday that the Trump administration is considering an executive order to create a process for AI oversight by the White House.
Some of this interest has been heightened by the announcement last month of Anthropic’s new Mythos AI model. The company described the model as excelling “at identifying weaknesses and security flaws within software” and limited its initial use to certain companies. These companies, including Amazon and Microsoft, will use it as part of defensive security work and as part of Project Glasswing, a cybersecurity initiative, Anthropic said.
The announcement Tuesday from CAISI said that the center has completed more than 40 evaluations of AI models so far.
“Independent, vigorous measurement science is essential to understanding frontier AI and its national security implications,” CAISI director Chris Fell said in a statement. “These expanded industry collaborations help us scale our work in the public interest in a critical moment.”
China’s DeepSeek unveils latest models a year after upending global tech | Technology News
Chinese startup says DeepSeek-V4-Pro beats all rival open models for maths and coding.
Published On 24 Apr 2026
China’s DeepSeek has unveiled the latest versions of its signature artificial intelligence-powered chatbot, a year after its flagship model sent shockwaves through the global tech scene.
The Chinese startup launched preview versions of DeepSeek-V4-Pro and DeepSeek-V4-Flash on Friday as it touted its ability to go toe-to-toe with US rivals such as OpenAI and Google.
Recommended Stories
list of 4 itemsend of list
Like DeepSeek’s previous chatbots, V4-Pro and V4-Flash follow an open-source model, meaning developers are free to use and modify the source code at will.
DeepSeek-V4-Pro beats all rival open models for maths and coding, and trails only Google’s Gemini 3.1-Pro, a closed model, for world knowledge, DeepSeek said in an announcement on social media.
The “pro” version’s performance falls only “marginally short” of OpenAI’s GPT‑5.4 and Gemini 3.1-Pro, “suggesting a developmental trajectory that trails state-of-the-art frontier models by approximately 3 to 6 months,” the Hangzhou-based startup said.
The “flash” model has similar reasoning abilities to the “pro” version, while offering faster response times and “highly cost-effective” usage pricing, the firm said.
The release comes after DeepSeek-R1 stunned the tech sector upon its launch in January last year with capabilities broadly comparable with those of ChatGPT and Gemini.
Marc Andreessen, a prominent Silicon Valley venture capitalist with close ties to United States President Donald Trump, hailed the model’s release at the time as “AI’s Sputnik moment”.
The performance of the Chinese-developed model attracted particular attention as its developers claimed to have spent less than $6m on computing costs – a fraction of the multibillion-dollar budgets that are usual in Silicon Valley.
Some tech analysts challenged DeepSeek’s account of working with such scant resources, arguing that the startup most likely had access to greater funding and more advanced chips than acknowledged.
DeepSeek’s arrival on the scene prompted blowback in some countries amid concerns about data protection and Chinese government censorship.
Multiple US states, Australia, Taiwan, South Korea, Denmark and Italy introduced bans or other restrictions on DeepSeek-R1 shortly after its release, citing privacy and national security concerns.



