Cost-efficient and pluggable infrastructure components for GenAI inference