Google recently unveiled a slew of new hardware, including the Pixel 9 smartphones and wireless earbuds. However, the standout feature behind all this new technology is Gemini, Google’s AI assistant. Launched earlier this year, Gemini is now the default assistant on the Pixel 9 series and is already available on millions of Android phones globally. But there’s a new way to interact with this chatbot that’s making waves: Gemini Live.
Gemini Live is Google’s response to OpenAI’s GPT-4o, offering users a more natural and fluid way to communicate with the assistant. The feature is currently rolling out to subscribers of Gemini Advanced, which costs $20 per month, and can be accessed by tapping the Live button in the Gemini app. It is initially available only in English, with support for the iOS app and additional languages to follow soon.
Sissie Hsiao, Google’s vice president of Gemini experiences, explains that Gemini Live is not simply a rehashed version of Google Assistant. Instead, it has been completely rebuilt using generative AI technology. Hsiao highlights that users have consistently requested a more seamless and natural assistant that can assist with complex problem-solving, not just basic tasks.
How Gemini Live Works
When you launch Gemini Live, you are greeted with a blank screen illuminated by an ethereal glow from the bottom. You can converse with the assistant even if your phone is locked or the screen is off. Gemini Live also works with Google’s new Pixel Buds Pro 2 wireless earbuds, allowing hands-free conversation while your phone stays in your bag. You can choose from 10 voices with varying tones, accents, and styles.
After a session with Gemini Live ends, users can view a transcript of the entire conversation in the Gemini app and refer back to it at any time. Unlike traditional voice assistants, Gemini Live can be interrupted without derailing the conversation. This is particularly useful because Gemini tends to give lengthy responses; interrupting lets users steer the conversation in a direction that suits their needs.
Google plans to integrate Gemini Live with other apps through extensions, although many of these extensions are not yet available. For instance, users will be able to ask Gemini Live to retrieve a party invitation from their Gmail and then ask about the event details. Similarly, they will be able to search for a recipe and ask Gemini Live to add the ingredients to a shopping list in Google Keep. These extensions are expected to roll out in the coming weeks.
In the future, Google intends to enhance Gemini Live with Project Astra, a computer vision technology teased at Google’s I/O developer conference earlier this year. This will let users point their phone’s camera at objects, have them identified in real time, and ask Gemini Live about them. For example, a user could point the camera at a concert poster and ask Gemini Live to save the event dates to their calendar and set a reminder to purchase tickets.
Personalized Interactions with Gemini Live
The traditional approach to using voice assistants has been largely transactional, focusing on specific tasks or inquiries. However, interacting with Gemini Live offers a more conversational experience that allows for continuous engagement. Users can initiate discussions on various topics and explore new information through ongoing conversations with the assistant.
Hsiao describes using Gemini Live during her commute home, when she had a conversation about the Paris Olympics and Celine Dion’s performance at the opening ceremony. The assistant told her about the origins of the song Dion performed, its songwriter, and its significance. The exchange illustrates how Gemini Live can encourage curiosity and exploration through natural conversation.
During a demonstration, Gemini Live was asked for dinner recommendations, prompting a dialogue about whether the user preferred a light or hearty meal. The assistant suggested a shrimp dish, and the user then pretended to have a shrimp allergy. In response, Gemini Live recommended a salmon dish and ultimately proposed a grilled chicken salad recipe. This back-and-forth lets users explore options and receive recommendations tailored to what they say along the way.
While Gemini Live offers a novel way to engage with technology, it also raises questions about the accuracy and sourcing of the information it provides. Hsiao says users can check the assistant’s answers by tapping the “G” icon beneath the transcribed text in Gemini Live. Users can also run their own Google searches to cross-reference what the assistant tells them.
Integration of Google Assistant and Gemini
With the introduction of Gemini and Gemini Live, users may wonder where Google Assistant fits in. Gemini is positioned as a personal assistant tailored to an individual user’s needs, while Google Assistant serves as a communal assistant for shared household use. In other words, Gemini focuses on personalized interactions, and Google Assistant handles broader household tasks.
Google’s strategy involves integrating Gemini’s large language models into Google Assistant to enhance its capabilities and provide a more natural voice experience. This integration aims to improve features such as analyzing video feeds from devices like Nest cameras to provide detailed information on detected activities. For instance, Google Assistant could notify users if a delivery person from FedEx arrives at their doorstep based on video footage from a connected camera.
As Google continues to develop Gemini and Google Assistant, there may be some overlap in functionalities between the two assistants. Hsiao acknowledges that the branding and positioning of these assistants are still in the early stages of development. Google aims to ensure that users receive optimal assistance based on their preferences and usage scenarios, whether through their personal devices or shared household gadgets.
In conclusion, the introduction of Gemini Live marks a significant advancement in the realm of voice assistants, offering users a more natural and engaging way to interact with AI technology. With its conversational capabilities and personalized recommendations, Gemini Live provides a glimpse into the future of seamless and intuitive assistance. As Google continues to refine its assistant offerings and integrate innovative technologies, users can look forward to a more interactive and personalized AI experience across various devices and platforms.