Google has launched Gemini 2.0 and announced that it powers Project Mariner, the company's first-ever AI agent. Project Mariner can move the cursor, click buttons, browse the web, and carry out certain web-based tasks autonomously within the Chrome browser. It works by taking screenshots of the browser window (users must first agree to this) and sending them to the cloud for processing; Gemini then sends instructions back to the computer to navigate the web page or perform the desired action.
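To make the screenshot-and-instruction cycle described above more concrete, here is a minimal Python sketch of that kind of loop. It is not Google's code or API: `FakeBrowser`, `Action`, `run_agent`, and `scripted_planner` are all hypothetical names invented for illustration, and the "cloud model" is replaced by a scripted stand-in so the example runs on its own.

```python
from dataclasses import dataclass


@dataclass
class Action:
    """One instruction sent back to the browser (hypothetical format)."""
    kind: str          # "navigate", "click", "type", or "done"
    target: str = ""   # URL or element the action applies to
    text: str = ""     # text to enter, for "type" actions


class FakeBrowser:
    """Stand-in for a real browser; it only records what the agent does."""

    def __init__(self):
        self.url = "about:blank"
        self.log = []

    def screenshot(self) -> bytes:
        # A real agent would capture pixels; here we fake the image bytes.
        return f"screenshot of {self.url}".encode()

    def apply(self, action: Action) -> None:
        if action.kind == "navigate":
            self.url = action.target
        self.log.append(action)


def scripted_planner(actions):
    """Stand-in for the cloud model: replays a fixed list of actions."""
    remaining = iter(actions)

    def plan(_screenshot: bytes) -> Action:
        return next(remaining, Action("done"))

    return plan


def run_agent(browser, plan_next_action, consent: bool, max_steps: int = 10) -> int:
    """Screenshot, ask the planner for an instruction, apply it, repeat."""
    if not consent:
        # Screenshots are only taken after the user has agreed.
        raise PermissionError("user has not agreed to screenshot capture")
    steps = 0
    for _ in range(max_steps):
        action = plan_next_action(browser.screenshot())
        if action.kind == "done":
            break
        browser.apply(action)
        steps += 1
    return steps


if __name__ == "__main__":
    browser = FakeBrowser()
    planner = scripted_planner([
        Action("navigate", "https://example.com"),
        Action("click", "#search"),
    ])
    print(run_agent(browser, planner, consent=True), browser.url)
```

The `max_steps` cap and the explicit `consent` flag are design assumptions of this sketch, chosen to mirror the user-agreement step mentioned above and to keep the loop from running indefinitely.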