Dark Hacker News
new
|
best
|
ask
|
show
|
jobs
Post-Training Layer Scaling Prevents Forgetting and Enhances Model Merging | Dark Hacker News
Post-Training Layer Scaling Prevents Forgetting and Enhances Model Merging
(arxiv.org)
1 points
by
veryluckyxyz
354 days ago
| 0 comments
No comments yet