Intel reduces latencies of chat LLM app using quantisation(community.intel.com)5 points by mariarmestre 2 years ago | 5 comments