Universal LLM Deployment Engine with ML Compilation(blog.mlc.ai) |
Universal LLM Deployment Engine with ML Compilation(blog.mlc.ai) |
Any ideas on how those edge and cloud models collaborate on compound tasks (e.g. the compound ai systems: https://bair.berkeley.edu/blog/2024/02/18/compound-ai-system...)
It comes with full OpenAI-compatible API that runs directly with Python, iOS, Android, browsers. Supporting deploying latest large language models such as Qwen2, Phi3, and more.