{"id":2688,"date":"2025-05-21T13:12:14","date_gmt":"2025-05-21T13:12:14","guid":{"rendered":"https:\/\/techtrendfeed.com\/?p=2688"},"modified":"2025-05-21T13:12:15","modified_gmt":"2025-05-21T13:12:15","slug":"entangled-ai-how-neural-networks-can-study-from-every-different-with-out-sharing-knowledge-by-mehmet-ozel-could-2025","status":"publish","type":"post","link":"https:\/\/techtrendfeed.com\/?p=2688","title":{"rendered":"Entangled AI: How Neural Networks Can Study From Every Different With out Sharing Knowledge. | by Mehmet \u00d6zel | Could, 2025"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<div>\n<div>\n<div class=\"speechify-ignore ac cp\">\n<div class=\"speechify-ignore bh m\">\n<div class=\"ac hx hy hz ia ib ic id ie if ig ih\">\n<div class=\"ac r ih\">\n<div class=\"ac ii\">\n<div>\n<div class=\"bm\" aria-hidden=\"false\"><a rel=\"nofollow\" target=\"_blank\" rel=\"noopener follow\" href=\"https:\/\/medium.com\/@mehmet.ozel2701?source=post_page---byline--beeeeadba451---------------------------------------\"><\/p>\n<div class=\"m ij ik bx il im\">\n<div class=\"m fl\"><img decoding=\"async\" alt=\"Mehmet \u00d6zel\" class=\"m fd bx by bz cx\" src=\"https:\/\/miro.medium.com\/v2\/da:true\/resize:fill:64:64\/0*HwS8HAvvR0WbMxjF\" width=\"32\" height=\"32\" loading=\"lazy\" data-testid=\"authorPhoto\"\/><\/div>\n<\/div>\n<p><\/a><\/div>\n<\/div>\n<\/div>\n<p><span class=\"bf b bg ab bk\"\/><\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p id=\"cf8b\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\"><strong class=\"mm gx\"># Entangled Studying Between Neural Architectures by way of Output Alignment<\/strong><\/p>\n<p id=\"d681\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\"><em class=\"ni\">*By Mehmet \u00d6zel*<\/em><\/p>\n<p id=\"5193\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">\u2014 &#8211;<\/p>\n<p id=\"0969\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\"><strong class=\"mm gx\">## Introduction<\/strong><\/p>\n<p id=\"97c3\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">As synthetic intelligence fashions turn out to be more and more specialised, the necessity for collaboration between heterogeneous architectures grows. On this challenge, we suggest a novel framework: <strong class=\"mm gx\">**entangled studying**<\/strong> \u2014 a system the place fashions with totally different architectures be taught collaboratively by aligning their output distributions. Impressed by the idea of quantum entanglement, our methodology permits fashions to enhance one another\u2019s studying course of with out sharing information or inner parameters.<\/p>\n<p id=\"390b\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">\u2014 &#8211;<\/p>\n<p id=\"2fe3\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\"><strong class=\"mm gx\">## Methodology<\/strong><\/p>\n<p id=\"2d0d\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">We implement entangled studying utilizing two fashions:<\/p>\n<p id=\"eb04\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">&#8211; <strong class=\"mm gx\">**Mannequin A:**<\/strong> A Convolutional Neural Community (CNN)<\/p>\n<p id=\"150a\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">&#8211; <strong class=\"mm gx\">**Mannequin B:**<\/strong> A Multi-Layer Perceptron (MLP)<\/p>\n<p id=\"a205\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">Each fashions are educated on the MNIST dataset however be taught not solely from floor reality labels, but in addition from one another\u2019s predictions. The important thing concept is to <strong class=\"mm gx\">**penalize divergence between mannequin outputs**<\/strong>, encouraging alignment over time.<\/p>\n<p id=\"1afe\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">The entire loss operate for every mannequin is:<\/p>\n<p id=\"9b65\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">&#8220;`<\/p>\n<p id=\"8af0\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">Loss_total = CategoricalCrossentropy(y_true, y_pred) + \u03bb * KL(y_pred_self || y_pred_other)<\/p>\n<p id=\"cd2b\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">&#8220;`<\/p>\n<p id=\"fe1f\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">The place `\u03bb` is a dynamically rising entanglement coefficient.<\/p>\n<p id=\"e7e1\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">\u2014 &#8211;<\/p>\n<p id=\"6c6c\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\"><strong class=\"mm gx\">## Implementation Highlights<\/strong><\/p>\n<p id=\"8469\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">&#8211; <strong class=\"mm gx\">**Dynamic Entanglement Weight (\u03bb):**<\/strong> Begins at 0 and will increase linearly to 0.05 over 30 epochs.<\/p>\n<p id=\"fe10\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">&#8211; <strong class=\"mm gx\">**Entangled Loss:**<\/strong> Combines normal classification loss with KL divergence between predictions.<\/p>\n<p id=\"3c8b\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">&#8211; <strong class=\"mm gx\">**Shared Process:**<\/strong> Each fashions carry out digit classification on MNIST inputs.<\/p>\n<p id=\"84dc\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">\u2014 &#8211;<\/p>\n<p id=\"7f58\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\"><strong class=\"mm gx\">## Outcomes<\/strong><\/p>\n<p id=\"5946\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">| Mannequin | Structure | Accuracy | Loss |<\/p>\n<p id=\"4eef\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">| \u2014 \u2014 \u2014 -| \u2014 \u2014 \u2014 \u2014 \u2014 \u2014 \u2014 | \u2014 \u2014 \u2014 \u2014 \u2014 | \u2014 \u2014 \u2014 |<\/p>\n<p id=\"d418\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">| A | CNN | 99.64% | 0.0318 |<\/p>\n<p id=\"c866\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">| B | MLP | 98.74% | 0.0659 |<\/p>\n<p id=\"4146\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">These outcomes present that even a weaker mannequin (MLP) considerably advantages from entangled coaching with a stronger mannequin (CNN).<\/p>\n<p id=\"ec27\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">\u2014 &#8211;<\/p>\n<p id=\"fbce\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\"><strong class=\"mm gx\">## Experiment Visualization<\/strong><\/p>\n<p id=\"8c85\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">&#8211; <strong class=\"mm gx\">**Lambda Development Over Time:**<\/strong> \u03bb will increase from 0 to 0.05<\/p>\n<p id=\"d013\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">&#8211; <strong class=\"mm gx\">**Synchronized Studying:**<\/strong> Loss values for each fashions converge steadily<\/p>\n<p id=\"3a2f\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">&#8211; <strong class=\"mm gx\">**Output Alignment:**<\/strong> Prediction distributions turn out to be extra comparable over epochs<\/p>\n<p id=\"643f\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">\u2014 &#8211;<\/p>\n<p id=\"4e65\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\"><strong class=\"mm gx\">## Dialogue<\/strong><\/p>\n<p id=\"c4e9\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">Entangled studying mimics human collaborative studying:<\/p>\n<p id=\"b410\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">&#8211; It permits <strong class=\"mm gx\">**oblique information switch**<\/strong><\/p>\n<p id=\"8b20\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">&#8211; It maintains <strong class=\"mm gx\">**modular, non-public architectures**<\/strong><\/p>\n<p id=\"f25e\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">&#8211; It scales to a number of fashions and duties<\/p>\n<p id=\"cef0\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">This opens the door to <strong class=\"mm gx\">**privacy-preserving AI collaboration**<\/strong>, <strong class=\"mm gx\">**multi-agent methods**<\/strong>, and even <strong class=\"mm gx\">**federated entangled studying**<\/strong>.<\/p>\n<p id=\"c044\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">\u2014 &#8211;<\/p>\n<p id=\"f67c\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\"><strong class=\"mm gx\">Conclusion<\/strong><\/p>\n<p id=\"adec\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">This challenge offers a proof-of-concept for output-aligned entangled coaching. Our outcomes present that heterogeneous AI methods can be taught higher collectively \u2014 not by sharing information, however by sharing <em class=\"ni\">*instinct*<\/em> by means of their predictions.<\/p>\n<p id=\"1a5d\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">\u2014 &#8211;<\/p>\n<p id=\"269f\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\"><em class=\"ni\">*Supply code &amp; full experiment out there on GitHub:*<\/em><\/p>\n<p id=\"e1fd\" class=\"pw-post-body-paragraph mk ml gw mm b mn mo mp mq mr ms mt mu mv mw mx my mz na nb nc nd ne nf ng nh gp bk\">[Entangled AI Learners Repository](https:\/\/github.com\/madara88645\/entangled-ai-learners)<\/p>\n<\/div>\n\n","protected":false},"excerpt":{"rendered":"<p># Entangled Studying Between Neural Architectures by way of Output Alignment *By Mehmet \u00d6zel* \u2014 &#8211; ## Introduction As synthetic intelligence fashions turn out to be more and more specialised, the necessity for collaboration between heterogeneous architectures grows. On this challenge, we suggest a novel framework: **entangled studying** \u2014 a system the place fashions with [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":2690,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[55],"tags":[157,2615,976,2617,667,298,2618,2616],"class_list":["post-2688","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-machine-learning","tag-data","tag-entangled","tag-learn","tag-mehmet","tag-networks","tag-neural","tag-ozel","tag-sharing"],"_links":{"self":[{"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/posts\/2688","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=2688"}],"version-history":[{"count":1,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/posts\/2688\/revisions"}],"predecessor-version":[{"id":2689,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/posts\/2688\/revisions\/2689"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=\/wp\/v2\/media\/2690"}],"wp:attachment":[{"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=2688"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=2688"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/techtrendfeed.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=2688"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}<!-- This website is optimized by Airlift. Learn more: https://airlift.net. Template:. Learn more: https://airlift.net. Template: 69d9690a190636c2e0989534. Config Timestamp: 2026-04-10 21:18:02 UTC, Cached Timestamp: 2026-05-14 09:41:58 UTC -->