Dark Hacker News
new
|
best
|
ask
|
show
|
jobs
Token-Count-Based Batching: Faster, Cheaper Embedding Inference for Queries | Dark Hacker News
Token-Count-Based Batching: Faster, Cheaper Embedding Inference for Queries
(mongodb.com)
1 points
by
fzliu
23 days ago
| 0 comments
No comments yet