Dark Hacker News
new
|
best
|
ask
|
show
|
jobs
Understanding RL for model training, and future directions with GRAPE | Dark Hacker News
Understanding RL for model training, and future directions with GRAPE
(arxiv.org)
33 points
by
sonabinu
111 days ago
| 1 comment