After criticism and delays, OpenAI has released ChatGPT Advanced Voice Mode, though for now only to a select group of alpha users. The feature is built into the ChatGPT app for both iOS and Android and offers a more natural, human-like spoken conversation.

One of the most intriguing aspects of the Advanced Voice Mode is its ability to handle vision and audio inputs and outputs without relying on other specialized models. Users have shared examples of it engaging in a range of activities, from interactive language instruction to translating what is on a screen viewed through the phone’s camera.

Italian-American AI writer Cristiano Giardina shared several tests of the Advanced Voice Mode, including one in which he asked it to count to 50 and the model paused to catch its breath towards the end. Giardina highlighted the model’s ability to mimic natural speaking patterns, breathing pauses included, which makes interactions with the AI feel more human-like.

Startup founder Ethan Sutin demonstrated how he got ChatGPT Advanced Voice Mode to beatbox convincingly, showing the range of realistic vocal output the model can generate. Users have also explored its storytelling capabilities, with ChatGPT narrating stories complete with AI-generated sound effects such as thunder and footsteps.

Additionally, the Advanced Voice Mode can reproduce distinct accents, including a variety of regional British ones, and can mimic fictional characters and soccer commentators across languages. This ability to adapt to different accents and speech patterns makes conversations feel more immersive and interactive.

OpenAI plans to roll out the Advanced Voice Mode to ChatGPT Plus subscribers in the fall, but the practical applications of the feature remain to be seen. While the mode offers a range of fun and engaging experiences, it is not yet clear whether it will make ChatGPT more useful and appealing to a wider audience. That should become clearer as the company expands access.