New inference engine faster than vLLM, SGLang, TRT-LLM | Dark Hacker News