OpenAI, the creator of ChatGPT, unveiled a new service called Operator. This generative AI service acts like an agent and performs tasks for you. Using its dedicated browser, Operator views web pages and automatically types, clicks and searches without your intervention.
The service will be rolled out gradually and will first be available to ChatGPT Pro subscribers in the US.
Operator has the ability to perform repetitive tasks in the browser and, according to OpenAI, it can fill out forms, make online purchases, and even create internet memes. The service uses the same user interfaces and tools that humans work with, which will create new opportunities for businesses to interact.
This service is designed by a new model called CUA (Computer Usage Agent). This model combines the vision capabilities of GPT-4o with advanced enhanced reasoning. CUA is trained to interact with graphical user interfaces (GUIs)—including buttons, menus, and text fields.
When the Operator encounters a problem or needs help, it returns control to you. It also requires your manual input to enter sensitive information such as passwords or confirmation forms.
Operator can work with services like Doordash, Etsy, Booking.com, Uber and Instacart and use partners like Associated Press and Reuters for research.
RCO NEWS