Gemini App Now Supports Audio Files: Listen & Analyze

▼ Summary
– Google’s Gemini app now accepts audio files, with free users limited to 10 minutes and five daily prompts, while paid users can upload up to three hours of audio.
, – Google Search’s AI Mode has expanded to support five new languages, Hindi, Indonesian, Japanese, Korean, and Brazilian Portuguese, enabling more complex queries in users’ preferred languages.
– NotebookLM has been updated to generate reports in over 80 languages, offering formats like study guides, blog posts, quizzes, and flashcards based on user-uploaded content.
– Audio file compatibility was the most requested feature for the Gemini app, as noted by Google’s vice president of Google Labs and Gemini.
– Google has recently introduced multiple AI features, including Gemini recalling user details, free access to Workspace’s Vids, and Photos upgrades for video generation.
Google’s Gemini app has taken a significant leap forward by introducing audio file support, a feature users have been eagerly requesting. This enhancement allows individuals to upload audio clips directly into the app for analysis and interaction, opening up new possibilities for content engagement and accessibility. Alongside this, Google has expanded language capabilities in Search and introduced versatile reporting tools in NotebookLM, reinforcing its commitment to making AI more intuitive and widely usable.
According to a recent announcement, audio file compatibility ranked as the most sought-after addition to the Gemini platform. Free users can now upload audio files up to ten minutes in length and utilize five prompts per day, while subscribers to the AI Pro or AI Ultra tiers enjoy substantially more flexibility, with support for audio lasting up to three hours. The system accepts a wide range of file types, including compressed ZIP archives, and permits up to ten uploads per prompt.
In a parallel development, Google Search has integrated Gemini 2.5 to bring AI Mode to five new languages: Hindi, Indonesian, Japanese, Korean, and Brazilian Portuguese. This expansion enables a broader global audience to pose intricate questions and explore search results in their native tongue, deepening the utility of AI-assisted web navigation.
NotebookLM, another product in the Gemini ecosystem, has also received a meaningful upgrade. It now offers customized report generation in over 80 languages, drawing from documents, media, and other materials provided by the user. Available formats include study guides, briefing documents, and blog posts, with additional options like flashcards and quizzes for educational or professional use. Users retain control over the tone, style, and structure of these AI-generated reports, with full rollout expected by the end of the week.
It’s worth noting that while audio analysis is new to the main Gemini app, NotebookLM already provided similar functionality as part of its research-oriented toolkit. This distinction highlights Google’s tailored approach to deploying features across its suite of AI products.
These updates are part of a broader pattern of rapid innovation from Google. Recent months have seen the introduction of conversational memory in Gemini, the rollout of video generation tools in Workspace, and upgrades to Photos with Veo 3 for creating short video clips from still images. Together, these advancements illustrate a clear investment in making AI tools more dynamic, personalized, and integrated into everyday digital experiences.
(Source: The Verge)





