Gradient Angle CSS - Search News

Compressed Gradient Tracking Algorithm for Distributed Aggregative Optimization

Abstract: This article is devoted to addressing the distributed aggregative optimization (DAO) problem via compressed gradient tracking algorithms, where the cost function of each agent relies on the ...

GitHub

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective (ACL'25 Oral)

This is the repo for the Layer_Gradient project, in which we try to understand the layer-wise gradient behaviors when LLMs are finetuned on Fast vs. Slow Thinking. What makes a difference in the ...
