OpenAI has launched a new AI program named "Operator" that is designed to manage online tasks like placing orders or completing forms.
According to OpenAI, Operator can search for web pages and engage with them by typing, clicking, or scrolling, much like a human user would.
In an online post, OpenAI explained that Operator is capable of handling various repetitive browser activities, such as filling out forms, ordering groceries, and even making memes..
"The ability to use the same interfaces and tools that humans interact with on a daily basis broadens the utility of AI, helping people save time on everyday tasks while opening up new engagement opportunities for businesses."
An AI "agent," the latest Silicon Valley trend, is a digital helper that is supposed to sense surroundings, make decisions, and take actions to achieve specific goals.
Google in December announced agent capabilities with the launch of Gemini 2.0, its most advanced artificial intelligence model to date.
AI race rival Anthropic two months earlier added a "computer use" feature to its Claude frontier AI model in an experimental public beta phase.
"Developers can direct Claude to use computers the way people do—by looking at a screen, moving a cursor, clicking buttons, and typing text," Anthropic said in a post at the time, cautioning that it was a work in progress.
OpenAI described Operator as one of its first AI agents capable of doing work for people independently, designed to complete tasks it is given.
Operator is available only to US users who pay for Pro subscriptions to the OpenAI service "to ensure a safe and iterative rollout," OpenAI said.
"If it encounters challenges or makes mistakes, Operator can leverage its reasoning capabilities to self-correct," OpenAI said.
Source: Yahoo Tech
BDST: 1435 HRS, JAN 25, 2025
SMS