Openai Gives Its Agents a Voice – Now a ‘Medieval Knight’ Can Read Your Work Emails

Openai Gives Its Agents a Voice – Now a ‘Medieval Knight’ Can Read Your Work Emails

Openai Gives Its Agents a Voice – Now a ‘Medieval Knight’ Can Read Your Work Emails
Image: serhiibobyk/envato elements

Openai is expanding its controversial stable of ai Voices to include agentic models. Agentic models are the hot trend in generative ai, enabling two-step processes Specifically, the new models include:

  • GPT-4O-TRANSCRIBE and GPT-4O-Mini-TRANSCRIBE, bot of which are speech-to-text models.
  • GPT-4O-Mini-Tts, A Text-to -Speech Model.

Developers can access them on the openai api and integrate them with the agents sdk. Adding text-to-moneych and speech-to-text to the api allows them to be used in a variety of ai applications, including agentic tools,

Advanced Synthetic Voices Can Make SCAMS More Convinking

The Company Wants to Enable “Deeper, More Intritical Interactions with agents beyond just text,” but adding flexibility and great models rayses rayses rayses the possibility of more convincing SCAM BOTS.

“We’re Continuing⁠ to Engage in Conversations with Policymakers, Researchers, Developers, and Creatives Around the Challenges and Opportunities Syntic Voices Can Press,” According to A News release,

See: Have some spare cash? You’ll need it for Openai’s new api

Models have been tuned for accuracy, reliability, and realism

On March 21, Openai released new speech-to-text and text-to-speech audio tools in the api. The models have been tuned for accuracy and reliability, particularly in conversations “Accents, noisy environments, and varying speech speeds.” The models are intended for customer call centers or transaction meetings.

They can also be Instructed to Spec in Specific Ways, From Intential Specific to Dramaatic or Cheerful. Openai envisions some of these Ai models Being used for “Expressive narration for creative storytelling experiences.” I can imagine this being used at theme parks or theatrical events – use cases that raise the speaker of ai replacing creative professions. Example Voices Openai Suggessts Include “Bedtime Story,” “Surfer,” “True Crime Buff,” and “Medieval Knight.”

GPT-4O-Transcribe and GPT-4O-Min-Transcribe are designed to transmit speech more acuretely, particularly in conversions with accents, background noise, Background Noise, Or Varying Spend Spends.

GPT-4O-Mini-Tts can follow instructions to match tone or take on personas. Openai is careful to point that all of the text-to-speech voices on the api are “artificial, preset voices”-definitely not Scarlett johanssonWho has accused the company of Mimicking Her Voice Without Consent.

Agentic video ai may be on its way

Next, Openai Said Developers will be removed “custom voices” for “personalized experiences in ways that align with our safety standards.” The company is also Pursuing Ways to Use Video in Agentic Ai Experiences.

Leave a Reply

Your email address will not be published. Required fields are marked *