• Home 1
  • Privacy Policy
LSD News
  • Home
  • Business
  • Crypto News
  • Finance
  • Health
  • Politics
  • Sports
  • Stock
  • Tech
  • Travel
No Result
View All Result
  • Home
  • Business
  • Crypto News
  • Finance
  • Health
  • Politics
  • Sports
  • Stock
  • Tech
  • Travel
No Result
View All Result
LSD News
No Result
View All Result
Home Tech

China’s DeepSeek launches next-gen AI model. Here’s what makes it different

by
September 30, 2025
in Tech
0
China’s DeepSeek launches next-gen AI model. Here’s what makes it different
0
SHARES
0
VIEWS
Share on FacebookShare on Twitter


Anna Barclay | Getty Images News | Getty Images

Chinese startup DeepSeek’s latest experimental model promises to increase efficiency and improve AI’s ability to handle a lot of information at a fraction of the cost, but questions remain over how effective and safe the architecture is.  

DeepSeek sent Silicon Valley into a frenzy when it launched its first model R1 out of nowhere last year, showing that it’s possible to train large language models (LLMs) quickly, on less powerful chips, using fewer resources.

The company released DeepSeek-V3.2-Exp on Monday, an experimental version of its current model DeepSeek-V3.1-Terminus, which builds further on its mission to increase efficiency in AI systems, according to a post on the AI forum Hugging Face.

“DeepSeek V3.2 continues the focus on efficiency, cost reduction, and open-source sharing,” Adina Yakefu, Chinese community lead at Hugging Face, told CNBC. “The big improvement is a new feature called DSA (DeepSeek Sparse Attention), which makes the AI better at handling long documents and conversations. It also cuts the cost of running the AI in half compared to the previous version.”

“It’s significant because it should make the model faster and more cost-effective to use without a noticeable drop in performance,” said Nick Patience, vice president and practice lead for AI at The Futurum Group. “This makes powerful AI more accessible to developers, researchers, and smaller companies, potentially leading to a wave of new and innovative applications.”

The pros and cons of sparse attention 

An AI model makes decisions based on its training data and new information, such as a prompt. Say an airline wants to find the best route from A to B, while there are many options, not all are feasible. By filtering out the less viable routes, you dramatically reduce the amount of time, fuel and, ultimately, money, needed to make the journey. That is exactly sparse attention does, it only factors in data that it thinks is important given the task at hand, as opposed to other models thus far which have crunched all data in the model.

“So basically, you cut out things that you think are not important,” said Ekaterina Almasque, the cofounder and managing partner of new venture capital fund BlankPage Capital.

Sparse attention is a boon for efficiency and the ability to scale AI given fewer resources are needed, but one concern is that it could lead to a drop in how reliable models are due to the lack of oversight in how and why it discounts information.

“The reality is, they [sparse attention models] have lost a lot of nuances,” said Almasque, who was an early supporter of Dataiku and Darktrace, and an investor in Graphcore. “And then the real question is, did they have the right mechanism to exclude not important data, or is there a mechanism excluding really important data, and then the outcome will be much less relevant?”

This could be particularly problematic for AI safety and inclusivity, the investor noted, adding that it may not be “the optimal one or the safest” AI model to use compared with competitors or traditional architectures. 

DeepSeek, however, says the experimental model works on par with its V3.1-Terminus. Despite speculation of a bubble forming, AI remains at the centre of geopolitical competition with the U.S. and China vying for the winning spot. Yakefu noted that DeepSeek’s models work “right out of the box” with Chinese-made AI chips, such as Ascend and Cambricon, meaning they can run locally on domestic hardware without any extra setup.

DeepSeek also shared the actual programming code and tools needed to use the experimental model, she said. “This means other people can learn from it and build their own improvements.”

But for Almasque, the very nature of this means the tech may not be defensible. “The approach is not super new,” she said, noting the industry has been “talking about sparse models since 2015” and that DeepSeek is not able to patent its technology due to being open source. DeepSeek’s competitive edge, therefore, must lie in how it decides what information to include, she added.

The company itself acknowledges V3.2-Exp is an “intermediate step toward our next-generation architecture,” per the Hugging Face post.

As Patience pointed out, “this is DeepSeek’s value prop all over: efficiency is becoming as important as raw power.”

“DeepSeek is playing the long game to keep the community invested in their progress,” Yakefu added. “People will always go for what is cheap, reliable, and effective.”

Tags: Breaking News: Technologybusiness newsChinasdeepseekHereslaunchesModelNextGenTechnology
Previous Post

Bitcoin Buyers Step Back After Failed Push Beyond $115,000: Data

Next Post

How Georgia’s top accounting official uses technology and change management to champion a new era in government finance | Fortune

Next Post
How Georgia’s top accounting official uses technology and change management to champion a new era in government finance | Fortune

How Georgia's top accounting official uses technology and change management to champion a new era in government finance | Fortune

Stay Connected test

  • 139 Followers
  • 205k Subscribers
  • 23.9k Followers
  • 99 Subscribers
ADVERTISEMENT
  • Trending
  • Comments
  • Latest
As Binance works toward redemption, CEO says Trump has been ‘fantastic’ for crypto

As Binance works toward redemption, CEO says Trump has been ‘fantastic’ for crypto

March 23, 2025
Georgia realtor receives invitation to play the Masters by mistake | CNN

Georgia realtor receives invitation to play the Masters by mistake | CNN

July 18, 2023
What made Pelé so great | CNN

What made Pelé so great | CNN

July 19, 2023
Left-Wing Democrats Wait on AOC’s Decision as They Look to 2028 Election

Left-Wing Democrats Wait on AOC’s Decision as They Look to 2028 Election

March 23, 2025
Tech layoffs in Southeast Asia mount as unprofitable startups seek to extend their runways

Tech layoffs in Southeast Asia mount as unprofitable startups seek to extend their runways

5
Contact lens maker faces lawsuit after woman said the product resulted in her losing an eye

Contact lens maker faces lawsuit after woman said the product resulted in her losing an eye

5
Why Cristiano Ronaldo’s move to Saudi Arabia means so much for the Gulf monarchy’s sporting ambitions | CNN

Why Cristiano Ronaldo’s move to Saudi Arabia means so much for the Gulf monarchy’s sporting ambitions | CNN

3
Georgia realtor receives invitation to play the Masters by mistake | CNN

Georgia realtor receives invitation to play the Masters by mistake | CNN

1
Drone sightings disrupt Munich airport, halt flights and impact thousands

Drone sightings disrupt Munich airport, halt flights and impact thousands

October 3, 2025
FDA commissioner says TrumpRx is a

FDA commissioner says TrumpRx is a

October 3, 2025
Elon Musk is telling his followers to cancel Netflix subscriptions. Here’s what’s happening

Elon Musk is telling his followers to cancel Netflix subscriptions. Here’s what’s happening

October 3, 2025
Wall Street closes with records as tech support offsets labor, shutdown uncertainties

Wall Street closes with records as tech support offsets labor, shutdown uncertainties

October 3, 2025

Recent News

Drone sightings disrupt Munich airport, halt flights and impact thousands

Drone sightings disrupt Munich airport, halt flights and impact thousands

October 3, 2025
FDA commissioner says TrumpRx is a

FDA commissioner says TrumpRx is a

October 3, 2025
Elon Musk is telling his followers to cancel Netflix subscriptions. Here’s what’s happening

Elon Musk is telling his followers to cancel Netflix subscriptions. Here’s what’s happening

October 3, 2025
Wall Street closes with records as tech support offsets labor, shutdown uncertainties

Wall Street closes with records as tech support offsets labor, shutdown uncertainties

October 3, 2025

We bring the latest news from all over the world and get all time updated you

Follow Us

Browse by Category

  • Business
  • Crypto News
  • Finance
  • Health
  • Politics
  • Sports
  • Stock
  • Tech
  • Travel
  • Uncategorized

Recent News

Drone sightings disrupt Munich airport, halt flights and impact thousands

Drone sightings disrupt Munich airport, halt flights and impact thousands

October 3, 2025
FDA commissioner says TrumpRx is a

FDA commissioner says TrumpRx is a

October 3, 2025
No Result
View All Result
  • Home 1
  • Privacy Policy

© 2024 LSD News title="Jegtheme">Jegtheme.