Dark Hacker News
new
|
best
|
ask
|
show
|
jobs
Gradient Descent on Token Input Embeddings
(lesswrong.com)
3 points
by
kp1197
303 days ago
| 1 comment
Gradient Descent on Token Input Embeddings | Dark Hacker News
kp1197
303 days ago
|
next
[−]
Does performing gradient descent on token input embeddings lead to interpretable results? And if not, why?