Deploying inference endpoints with PD disaggregation on AMD GPUs(dstack.ai)2 points by cheptsov 42 days ago | 0 commentsNo comments yet