{"id":10446,"date":"2026-01-05T01:54:11","date_gmt":"2026-01-05T01:54:11","guid":{"rendered":"https:\/\/techtrendfeed.com\/?p=10446"},"modified":"2026-01-05T01:54:11","modified_gmt":"2026-01-05T01:54:11","slug":"unigen-1-5-enhancing-picture-technology-and-enhancing-by-way-of-reward-unification-in-reinforcement-studying","status":"publish","type":"post","link":"https:\/\/techtrendfeed.com\/?p=10446","title":{"rendered":"UniGen-1.5: Enhancing Picture Technology and Enhancing by way of Reward Unification in Reinforcement Studying"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<p>We current UniGen-1.5, a unified multimodal massive language mannequin (MLLM) for superior picture understanding, technology and enhancing. Constructing upon UniGen, we comprehensively improve the mannequin structure and coaching pipeline to strengthen the picture understanding and technology capabilities whereas unlocking robust picture enhancing capacity. Particularly, we suggest a unified Reinforcement Studying (RL) technique that improves each picture technology and picture enhancing collectively by way of shared reward fashions. To additional improve picture enhancing efficiency, we suggest a lightweight Edit Instruction Alignment stage that considerably improves the enhancing instruction comprehension that&#8217;s important for the success of the RL coaching. Experimental outcomes present that UniGen-1.5 demonstrates aggressive understanding and technology efficiency. Particularly, UniGen-1.5 achieves 0.89 and 4.31 general scores on GenEval and ImgEdit that surpass the state-of-the-art fashions akin to BAGEL and reaching efficiency corresponding to proprietary fashions akin to GPT-Picture-1.<\/p>\n<ul class=\"links-stacked\">\n<li>\u2020 Institute of Reliable Embodied AI, Fudan College<\/li>\n<li>\u2021 Mission lead<\/li>\n<li>\u00a7 Corresponding authors<\/li>\n<\/ul>\n<\/div>\n\n","protected":false},"excerpt":{"rendered":"<p>We current UniGen-1.5, a unified multimodal massive language mannequin (MLLM) for superior picture understanding, technology and enhancing. Constructing upon UniGen, we comprehensively improve the mannequin structure and coaching pipeline to strengthen the picture understanding and technology capabilities whereas unlocking robust picture enhancing capacity. Particularly, we suggest a unified Reinforcement Studying (RL) technique that improves each [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":10448,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[55],"tags":[4996,7254,615,182,136,1855,4865,7255,7253],"class_list":["post-10446","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-machine-learning","tag-editing","tag-enhancing","tag-generation","tag-image","tag-learning","tag-reinforcement","tag-reward","tag-unification","tag-unigen1-5"],"_links":{"self":[{"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/posts\/10446","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=10446"}],"version-history":[{"count":1,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/posts\/10446\/revisions"}],"predecessor-version":[{"id":10447,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/posts\/10446\/revisions\/10447"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/media\/10448"}],"wp:attachment":[{"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=10446"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=10446"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=10446"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}<!-- This website is optimized by Airlift. Learn more: https://airlift.net. Template:. Learn more: https://airlift.net. Template: 69d9690a190636c2e0989534. Config Timestamp: 2026-04-10 21:18:02 UTC, Cached Timestamp: 2026-06-06 20:01:19 UTC -->