{"id":9741,"date":"2025-12-14T17:04:22","date_gmt":"2025-12-14T17:04:22","guid":{"rendered":"https:\/\/techtrendfeed.com\/?p=9741"},"modified":"2025-12-14T17:04:22","modified_gmt":"2025-12-14T17:04:22","slug":"gemini-2-5-native-audio-improve-plus-text-to-speech-mannequin-updates","status":"publish","type":"post","link":"https:\/\/techtrendfeed.com\/?p=9741","title":{"rendered":"Gemini 2.5 Native Audio improve, plus text-to-speech mannequin updates"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<h3 data-block-key=\"61bfj\">What prospects are saying<\/h3>\n<p data-block-key=\"c8rur\"><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/cloud.google.com\/blog\/products\/ai-machine-learning\/gemini-live-api-available-on-vertex-ai\">Google Cloud prospects<\/a> are already utilizing Gemini\u2019s native audio capabilities to drive actual enterprise outcomes, from mortgage processing to buyer calls.<\/p>\n<ul>\n<li data-block-key=\"f9d5j\"><i>\u201cCustomers typically neglect they\u2019re speaking to AI inside a minute of utilizing Sidekick, and in some circumstances have thanked the bot after an extended chat\u2026New Reside API AI capabilities provided via Gemini [2.5 Flash Native Audio] empower our retailers to win.\u201d<\/i> \u2013 David Wurtz, VP of Product, Shopify<\/li>\n<li data-block-key=\"cpoqr\"><i>&#8220;By integrating the Gemini 2.5 Flash Native Audio mannequin\u2026we have considerably enhanced Mia&#8217;s capabilities since launching in Could 2025. This highly effective mixture has enabled us to generate over 14,000 loans for our dealer companions.<\/i>&#8221; \u2013 Jason Bressler, Chief Expertise Officer, United Wholesale Mortgage (UWM)<\/li>\n<li data-block-key=\"5gvgc\"><i>\u201cWorking with the Gemini 2.5 Flash Native Audio mannequin via Vertex AI permits Newo.ai AI Receptionists to attain unmatched conversational intelligence &#8230; .They&#8217;ll determine the primary speaker even in noisy settings, swap languages mid-conversation, and sound remarkably pure and emotionally expressive.\u201d<\/i> \u2013 David Yang, Co-founder, Newo.ai<\/li>\n<\/ul>\n<h2 data-block-key=\"7lcen\">Reside Speech Translation<\/h2>\n<p data-block-key=\"9k6cr\">Gemini now natively helps new dwell speech-to-speech translation capabilities designed to deal with each steady listening and two-way dialog.<\/p>\n<p data-block-key=\"38f3\">With steady listening, Gemini robotically interprets speech in a number of languages right into a single goal language. This lets you put headphones in and listen to the world round you in your language.<\/p>\n<p data-block-key=\"eq38s\">For 2-way dialog, Gemini\u2019s dwell speech translation handles translation between two languages in real-time, robotically switching the output language based mostly on who&#8217;s talking. For instance, if you happen to communicate English and need to chat with a Hindi speaker, you\u2019ll hear English translations in real-time in your headphones, whereas your telephone broadcasts Hindi whenever you\u2019re achieved talking.<\/p>\n<p data-block-key=\"86q6c\">Gemini\u2019s dwell speech translation has various key capabilities that assist in the true world:<\/p>\n<ul>\n<li data-block-key=\"2afoq\"><b>Language protection<\/b>: Interprets speech in over 70 languages and 2000 language pairs by combining Gemini mannequin\u2019s world information and multilingual capabilities with its native audio capabilities<\/li>\n<li data-block-key=\"di844\"><b>Model switch:<\/b> Captures the nuance of human speech, preserving the speaker\u2019s intonation, pacing and pitch so the interpretation sounds pure.<\/li>\n<li data-block-key=\"bdrko\"><b>Multilingual enter:<\/b> Understands a number of languages concurrently in a single session, serving to you observe multilingual conversations while not having to fiddle round with language settings.<\/li>\n<li data-block-key=\"alfj9\"><b>Auto detection:<\/b> Identifies the spoken language and begins translation, so that you don\u2019t even have to know what language is being spoken to start out translating.<\/li>\n<li data-block-key=\"4j5i0\"><b>Noise robustness<\/b>: Filters out ambient noise so you&#8217;ll be able to converse comfortably even in loud, outside environments.<\/li>\n<\/ul>\n<\/div>\n\n","protected":false},"excerpt":{"rendered":"<p>What prospects are saying Google Cloud prospects are already utilizing Gemini\u2019s native audio capabilities to drive actual enterprise outcomes, from mortgage processing to buyer calls. \u201cCustomers typically neglect they\u2019re speaking to AI inside a minute of utilizing Sidekick, and in some circumstances have thanked the bot after an extended chat\u2026New Reside API AI capabilities provided [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":9743,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[55],"tags":[2781,295,358,3084,6485,614,3787],"class_list":["post-9741","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-machine-learning","tag-audio","tag-gemini","tag-model","tag-native","tag-texttospeech","tag-updates","tag-upgrade"],"_links":{"self":[{"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/posts\/9741","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=9741"}],"version-history":[{"count":1,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/posts\/9741\/revisions"}],"predecessor-version":[{"id":9742,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/posts\/9741\/revisions\/9742"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/media\/9743"}],"wp:attachment":[{"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=9741"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=9741"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=9741"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}<!-- This website is optimized by Airlift. Learn more: https://airlift.net. Template:. Learn more: https://airlift.net. Template: 69d9690a190636c2e0989534. Config Timestamp: 2026-04-10 21:18:02 UTC, Cached Timestamp: 2026-05-06 16:39:50 UTC -->