Scaling TensorFlow inference to unlimited items per request with bounded latency | Dark Hacker News