Post-Training Layer Scaling Prevents Forgetting and Enhances Model Merging(arxiv.org)1 points by veryluckyxyz 1 year ago | 0 commentsNo comments yet