Dark Hacker News
new
|
best
|
ask
|
show
|
jobs
Model Merging in Pre-Training of Large Language Models | Dark Hacker News
Model Merging in Pre-Training of Large Language Models
(arxiv.org)
2 points
by
veryluckyxyz
148 days ago
| 0 comments
No comments yet