Stanford: Physically Grounded Vision-Language Models for Robotic Manipulation (arxiv.org)
1 point by socratic1 2 years ago | 0 comments
No comments yet