According to Golden Finance, at 3 a.m. today Microsoft open-sourced Magma, a foundation model for multimodal AI agents, on its official website. Compared with traditional agents, Magma has multimodal capabilities that span the digital and physical worlds, and it can automatically process different types of data such as images, video, and text. For example, Magma can be used to place e-commerce orders and check the weather automatically; it can also operate a physical robot or provide help during a real-world game of chess. In addition, Magma has a built-in psychological prediction capability that enhances its understanding of spatiotemporal dynamics in future video frames, allowing it to infer the intentions and future behavior of people or objects in a video.