Google researchers introduce ‘Internal RL,’ a technique that steers a model's hidden activations to solve long-horizon tasks ...
Baidu's ERNIE-5.0-0110 ranks #8 globally on LMArena, becoming the only Chinese model in the top 10 while outperforming ...
DeepSeek founder Liang Wenfeng has published a new paper with a research team from Peking University, outlining key technical ...
A new technical paper titled “Solving sparse finite element problems on neuromorphic hardware” was published by researchers ...
Why pay more for the same specs? In the ever-evolving world of AI hardware, where performance and efficiency reign supreme, it’s not uncommon to see devices with nearly identical specifications priced ...
What does it take to outshine giants in the fiercely competitive world of artificial intelligence? For years, proprietary systems like GPT-5 and Gemini Pro have dominated the landscape, setting ...
OpenAI experiment finds that sparse models could give AI builders the tools to debug neural networks
OpenAI researchers are experimenting with a new approach to designing neural networks, with the aim of making AI models easier to understand, debug, and govern. Sparse models can provide enterprises ...
Department of Cell and Developmental Biology and Mass Spectrometry Research Center, Vanderbilt University, Nashville, Tennessee 37232, United States ...
DeepSeek-V3.2-Exp Launches with Sparse Attention for Faster AI Model Training and 50% API Price Drop
According to DeepSeek (@deepseek_ai), the company has launched DeepSeek-V3.2-Exp, an experimental AI model built on the V3.1-Terminus architecture. This release introduces DeepSeek Sparse Attention ...