<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:psc="http://podlove.org/simple-chapters" xmlns:podcast="https://podcastindex.org/namespace/1.0"><channel><title><![CDATA[The Information Bottleneck]]></title><description><![CDATA[<p>Two AI Researchers - Ravid Shwartz Ziv, and Allen Roush, discuss the latest trends, news, and research within Generative AI, LLMs, GPUs, and Cloud Systems.</p>]]></description><link>https://www.the-information-bottleneck.com</link><generator>Riverside.fm (https://riverside.com)</generator><lastBuildDate>Tue, 30 Jun 2026 07:15:44 GMT</lastBuildDate><atom:link href="https://api.riverside.com/hosting/VGaMOITx.rss" rel="self" type="application/rss+xml"/><author><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></author><pubDate>Thu, 21 Aug 2025 04:04:07 GMT</pubDate><copyright><![CDATA[2025 Ravid Shwartz-Ziv & Allen Roush]]></copyright><language><![CDATA[en]]></language><ttl>60</ttl><category><![CDATA[Technology]]></category><category><![CDATA[Science]]></category><itunes:author>Ravid Shwartz-Ziv &amp; Allen Roush</itunes:author><itunes:summary>&lt;p&gt;Two AI Researchers - Ravid Shwartz Ziv, and Allen Roush, discuss the latest trends, news, and research within Generative AI, LLMs, GPUs, and Cloud Systems.&lt;/p&gt;</itunes:summary><itunes:type>episodic</itunes:type><itunes:owner><itunes:name>Ravid Shwartz-Ziv &amp; Allen Roush</itunes:name><itunes:email>ravid.ziv@mail.huji.ac.il</itunes:email></itunes:owner><itunes:explicit>no</itunes:explicit><itunes:category text="Technology"/><itunes:category text="Science"/><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/logos/71320ada-c62f-4607-9641-81e40c066e46.png"/><item><title><![CDATA[AI for Science with Qichao Hu (Molecular Universe / SES AI)]]></title><description><![CDATA[<hr /><p><b>Most AI-for-science companies are selling shovels. Qichao Hu wants the gold.</b></p><p>In this episode, we talk with Qichao, the founder and CEO of Molecular Universe, the AI-for-science platform that grew out of SES AI, a high-energy-density battery developer he's run for fourteen years. His core distinction is that companies from the AI world build tools, such as foundation models that predict properties, while companies from the science world care about the final product, such as the new battery or material that actually ships. Molecular Universe sits firmly on the science side, and the difference shows up everywhere from what they publish to what they refuse to.</p><p>We get into the actual workflow of materials discovery and where AI compresses it. A single trial in a traditional lab can take a year with maybe a 40% success rate; the goal is to run a thousand candidates in parallel and turn that year into a week. Qichao walks through improving low-temperature fast-charging for EV batteries:  from hypothesis generation through molecule-, material-, and device-level property prediction, down to autonomous labs that synthesize and test the top candidates without a human touching a pipette.</p><p>The hardest problem, it turns out, isn't predicting molecular properties or measuring device performance, but it's the black box connecting the two. In batteries, that's the solid-electrolyte interface, which the field has been hand-waving about since the seventies. And the thing standing in the way of cracking it isn't a clever training trick but data: companies sitting on twenty years of records are finding it too messy, incomplete, and poorly labeled to train on, and are having to start collecting from scratch with new protocols and robots.</p><hr /><p><b>Timeline</b></p><ul><li><b>00:13</b> — Intro and welcome;</li><li><b>01:19</b> — Shovel vs. gold</li><li><b>05:18</b> — Why the world's smartest scientist doesn't automatically give you a better battery</li><li><b>07:25</b> — The discovery workflow</li><li><b>09:37</b> — Exploration vs. exploitation</li><li><b>11:54</b> — Safety and filtering: screening novel molecules against banned and toxic-substance lists</li><li><b>17:55</b> — How hypotheses get generated, and where frontier LLMs help</li><li><b>20:29</b> — From hypothesis to ~400 formulations: property prediction, ranking, and handing off to autonomous labs</li><li><b>26:37</b> — "A foundation model for everything" — and the black box between molecular properties and device performance</li><li><b>30:01</b> — World models and physics</li><li><b>33:09</b> — The great unknown in batteries</li><li><b>37:08</b> — Simulation vs. reality: calibrating massive simulated datasets with a sliver of experimental data</li><li><b>41:47</b> — Lab robotics: how fast the hardware has caught up, and what a floor of autonomous labs looks like</li><li><b>43:50</b> — The real bottlenecks</li><li><b>50:21</b> — Pre-training from scratch vs. post-training LLMs, and why training tricks haven't reduced the need for good data</li><li><b>52:42</b> — Evaluation</li><li><b>55:42</b> — Publish the B+ model, keep the A model</li><li><b>58:05</b> — Five years out</li><li><b>1:00:37</b> — Closing thoughts and wrap</li></ul><hr /><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li></ul><hr /><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">d624b924-5740-47ac-ac51-ee8bf32ba5c6</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 29 Jun 2026 03:52:34 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/44fb942ee22cbe64372151a515c37b53bf56eb4466ae12b3dd497fc8a0d8ac43/eyJlcGlzb2RlSWQiOiJkNjI0YjkyNC01NzQwLTQ3YWMtYWM1MS1lZThiZjMyYmE1YzYiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNmE0MWU2NmI3MDVjZWFhYTcxNDUwMzBkL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi02LTI5X181LTI4LTQzLm1wMyJ9.mp3" length="116997790" type="audio/mpeg"/><itunes:summary>&lt;hr /&gt;&lt;p&gt;&lt;b&gt;Most AI-for-science companies are selling shovels. Qichao Hu wants the gold.&lt;/b&gt;&lt;/p&gt;&lt;p&gt;In this episode, we talk with Qichao, the founder and CEO of Molecular Universe, the AI-for-science platform that grew out of SES AI, a high-energy-density battery developer he&apos;s run for fourteen years. His core distinction is that companies from the AI world build tools, such as foundation models that predict properties, while companies from the science world care about the final product, such as the new battery or material that actually ships. Molecular Universe sits firmly on the science side, and the difference shows up everywhere from what they publish to what they refuse to.&lt;/p&gt;&lt;p&gt;We get into the actual workflow of materials discovery and where AI compresses it. A single trial in a traditional lab can take a year with maybe a 40% success rate; the goal is to run a thousand candidates in parallel and turn that year into a week. Qichao walks through improving low-temperature fast-charging for EV batteries:  from hypothesis generation through molecule-, material-, and device-level property prediction, down to autonomous labs that synthesize and test the top candidates without a human touching a pipette.&lt;/p&gt;&lt;p&gt;The hardest problem, it turns out, isn&apos;t predicting molecular properties or measuring device performance, but it&apos;s the black box connecting the two. In batteries, that&apos;s the solid-electrolyte interface, which the field has been hand-waving about since the seventies. And the thing standing in the way of cracking it isn&apos;t a clever training trick but data: companies sitting on twenty years of records are finding it too messy, incomplete, and poorly labeled to train on, and are having to start collecting from scratch with new protocols and robots.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;b&gt;Timeline&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;00:13&lt;/b&gt; — Intro and welcome;&lt;/li&gt;&lt;li&gt;&lt;b&gt;01:19&lt;/b&gt; — Shovel vs. gold&lt;/li&gt;&lt;li&gt;&lt;b&gt;05:18&lt;/b&gt; — Why the world&apos;s smartest scientist doesn&apos;t automatically give you a better battery&lt;/li&gt;&lt;li&gt;&lt;b&gt;07:25&lt;/b&gt; — The discovery workflow&lt;/li&gt;&lt;li&gt;&lt;b&gt;09:37&lt;/b&gt; — Exploration vs. exploitation&lt;/li&gt;&lt;li&gt;&lt;b&gt;11:54&lt;/b&gt; — Safety and filtering: screening novel molecules against banned and toxic-substance lists&lt;/li&gt;&lt;li&gt;&lt;b&gt;17:55&lt;/b&gt; — How hypotheses get generated, and where frontier LLMs help&lt;/li&gt;&lt;li&gt;&lt;b&gt;20:29&lt;/b&gt; — From hypothesis to ~400 formulations: property prediction, ranking, and handing off to autonomous labs&lt;/li&gt;&lt;li&gt;&lt;b&gt;26:37&lt;/b&gt; — &quot;A foundation model for everything&quot; — and the black box between molecular properties and device performance&lt;/li&gt;&lt;li&gt;&lt;b&gt;30:01&lt;/b&gt; — World models and physics&lt;/li&gt;&lt;li&gt;&lt;b&gt;33:09&lt;/b&gt; — The great unknown in batteries&lt;/li&gt;&lt;li&gt;&lt;b&gt;37:08&lt;/b&gt; — Simulation vs. reality: calibrating massive simulated datasets with a sliver of experimental data&lt;/li&gt;&lt;li&gt;&lt;b&gt;41:47&lt;/b&gt; — Lab robotics: how fast the hardware has caught up, and what a floor of autonomous labs looks like&lt;/li&gt;&lt;li&gt;&lt;b&gt;43:50&lt;/b&gt; — The real bottlenecks&lt;/li&gt;&lt;li&gt;&lt;b&gt;50:21&lt;/b&gt; — Pre-training from scratch vs. post-training LLMs, and why training tricks haven&apos;t reduced the need for good data&lt;/li&gt;&lt;li&gt;&lt;b&gt;52:42&lt;/b&gt; — Evaluation&lt;/li&gt;&lt;li&gt;&lt;b&gt;55:42&lt;/b&gt; — Publish the B+ model, keep the A model&lt;/li&gt;&lt;li&gt;&lt;b&gt;58:05&lt;/b&gt; — Five years out&lt;/li&gt;&lt;li&gt;&lt;b&gt;1:00:37&lt;/b&gt; — Closing thoughts and wrap&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:00:56</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/d624b924-5740-47ac-ac51-ee8bf32ba5c6/images/7dc2b801-6840-44bd-9392-eddf1997a90d.png"/><itunes:season>1</itunes:season><itunes:episode>50</itunes:episode><itunes:title>AI for Science with Qichao Hu (Molecular Universe / SES AI)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[Infrastructure for AI at Scale - With Benny Chen (Fireworks AI)]]></title><description><![CDATA[<p>We talk a lot on this show about RL, agents, and the move between pre-training and post-training, but not enough about the layer everything actually runs on. Benny Chen, co-founder of Fireworks AI, one of the largest inference platforms around, walks us through what it takes to serve models at scale: sourcing GPUs, writing the kernels, the runtime, and the routing layer that lets a customer hit one endpoint and forget the rest.</p><p>We talk why the real bottleneck is power, not chips, and why that favors Nvidia and Google. Why MoE keeps winning even when dense models look better on paper and why he'd rather run fungible capacity at 95% than specialized chips at 60%. We also talk about quantization limits, where RL efficiency has to go next, and his case that AI is still <i>under</i>-hyped. We also get into cross-region training, sparse autoencoders and why interpretability hasn't taken off in open source, whether open models can close the gap, and a frank read on Anthropic's go-to-market.</p><hr /><p></p><p><b>Timeline</b></p><ul><li>00:00 — Intro: the part of AI nobody talks about</li><li>01:20 — What "infrastructure for AI" actually means: the layers, from GPUs up to routing</li><li>02:59 — Why not just buy your own GPUs and do it yourself?</li><li>05:17 — The scale Fireworks runs at</li><li>06:35 — Hardware inflation, GPU costs, and the real risk hiding in commit duration</li><li>10:14 — Nvidia vs AMD vs TPUs, and why power is the bottleneck</li><li>11:57 — Mixing GPU types and generations; fungibility vs. specialization</li><li>14:22 — Once you have the GPUs, what's the next layer to build?</li><li>17:04 — Dense vs. MoE, and why the hardware picks the winner</li><li>21:07 — Quantization: is FP4 the floor? TurboQuant and INT vs. FP</li><li>24:28 — How tied are the algorithms to the hardware?</li><li>25:12 — DeepSeek, DeepGEMM, and next-token prediction as reconstruction loss</li><li>28:50 — Why RL is still wildly inefficient compared to pre-training</li><li>30:08 — Speculative decoding, AI-generated kernels, and auto-research</li><li>34:00 — The AGI question: why text gets automated but vision may stay expensive</li><li>37:07 — Hype check: why Benny thinks AI is still under-hyped</li><li>41:28 — Training vs. inference at the infrastructure level</li><li>44:12 — Scaling across data centers: cross-region training with Cursor</li><li>45:40 — Sparse autoencoders, interpretability, and why open source is human-constrained</li><li>49:04 — Will open models catch up — on quality and on compute?</li><li>51:41 — Are we plateauing? Opus 4.7 vs. 4.6 and the coming data wars</li><li>54:41 — Physical limits, HBM, and whether chips keep getting faster</li><li>58:17 — The belief about inference everyone gets wrong</li><li>59:31 — Anthropic, mythos, and a frank take on go-to-market</li><li>1:04:41 — Wrap-up<hr /><p></p></li></ul><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li></ul><p></p><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">4b9e44af-69c7-40df-a014-02c2c7624fc6</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Wed, 24 Jun 2026 04:03:28 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/586a654cd098b15bf5c1f14b0e9827da9ac83c898f68a0d218abda05ef3144de/eyJlcGlzb2RlSWQiOiI0YjllNDRhZi02OWM3LTQwZGYtYTAxNC0wMmMyYzc2MjRmYzYiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNmEzYjJmNWU5M2IwNjk5NWRjNGVhMDBlL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi02LTI0X18zLTE0LTYubXAzIn0=.mp3" length="126406887" type="audio/mpeg"/><itunes:summary>&lt;p&gt;We talk a lot on this show about RL, agents, and the move between pre-training and post-training, but not enough about the layer everything actually runs on. Benny Chen, co-founder of Fireworks AI, one of the largest inference platforms around, walks us through what it takes to serve models at scale: sourcing GPUs, writing the kernels, the runtime, and the routing layer that lets a customer hit one endpoint and forget the rest.&lt;/p&gt;&lt;p&gt;We talk why the real bottleneck is power, not chips, and why that favors Nvidia and Google. Why MoE keeps winning even when dense models look better on paper and why he&apos;d rather run fungible capacity at 95% than specialized chips at 60%. We also talk about quantization limits, where RL efficiency has to go next, and his case that AI is still &lt;i&gt;under&lt;/i&gt;-hyped. We also get into cross-region training, sparse autoencoders and why interpretability hasn&apos;t taken off in open source, whether open models can close the gap, and a frank read on Anthropic&apos;s go-to-market.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Timeline&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;00:00 — Intro: the part of AI nobody talks about&lt;/li&gt;&lt;li&gt;01:20 — What &quot;infrastructure for AI&quot; actually means: the layers, from GPUs up to routing&lt;/li&gt;&lt;li&gt;02:59 — Why not just buy your own GPUs and do it yourself?&lt;/li&gt;&lt;li&gt;05:17 — The scale Fireworks runs at&lt;/li&gt;&lt;li&gt;06:35 — Hardware inflation, GPU costs, and the real risk hiding in commit duration&lt;/li&gt;&lt;li&gt;10:14 — Nvidia vs AMD vs TPUs, and why power is the bottleneck&lt;/li&gt;&lt;li&gt;11:57 — Mixing GPU types and generations; fungibility vs. specialization&lt;/li&gt;&lt;li&gt;14:22 — Once you have the GPUs, what&apos;s the next layer to build?&lt;/li&gt;&lt;li&gt;17:04 — Dense vs. MoE, and why the hardware picks the winner&lt;/li&gt;&lt;li&gt;21:07 — Quantization: is FP4 the floor? TurboQuant and INT vs. FP&lt;/li&gt;&lt;li&gt;24:28 — How tied are the algorithms to the hardware?&lt;/li&gt;&lt;li&gt;25:12 — DeepSeek, DeepGEMM, and next-token prediction as reconstruction loss&lt;/li&gt;&lt;li&gt;28:50 — Why RL is still wildly inefficient compared to pre-training&lt;/li&gt;&lt;li&gt;30:08 — Speculative decoding, AI-generated kernels, and auto-research&lt;/li&gt;&lt;li&gt;34:00 — The AGI question: why text gets automated but vision may stay expensive&lt;/li&gt;&lt;li&gt;37:07 — Hype check: why Benny thinks AI is still under-hyped&lt;/li&gt;&lt;li&gt;41:28 — Training vs. inference at the infrastructure level&lt;/li&gt;&lt;li&gt;44:12 — Scaling across data centers: cross-region training with Cursor&lt;/li&gt;&lt;li&gt;45:40 — Sparse autoencoders, interpretability, and why open source is human-constrained&lt;/li&gt;&lt;li&gt;49:04 — Will open models catch up — on quality and on compute?&lt;/li&gt;&lt;li&gt;51:41 — Are we plateauing? Opus 4.7 vs. 4.6 and the coming data wars&lt;/li&gt;&lt;li&gt;54:41 — Physical limits, HBM, and whether chips keep getting faster&lt;/li&gt;&lt;li&gt;58:17 — The belief about inference everyone gets wrong&lt;/li&gt;&lt;li&gt;59:31 — Anthropic, mythos, and a frank take on go-to-market&lt;/li&gt;&lt;li&gt;1:04:41 — Wrap-up&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:05:50</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/4b9e44af-69c7-40df-a014-02c2c7624fc6/images/3b74e70e-1be7-4cd9-b1db-581ec5f5bb39.png"/><itunes:season>1</itunes:season><itunes:episode>49</itunes:episode><itunes:title>Infrastructure for AI at Scale - With Benny Chen (Fireworks AI)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[Broken Peer Review, AI, and Worms — with Oded Rechavi]]></title><description><![CDATA[<p>Oded Rechavi is a biologist at Tel Aviv University and the co-founder of QED, a company building AI to review scientific work. He's also spent years studying worms.</p><p>We start with what's wrong with peer review and grant funding: why it takes years to publish, why reviewers are often your own competitors, and why the whole thing is locked to an economic model that rewards publishing more papers, not better ones. Oded explains why he doesn't call QED "peer review" at all, and what it would take to actually validate science instead of just stamping it.</p><p>Then we get into the biology. C. elegans has exactly 959 cells, every one of them named, and a fully mapped brain. Oded's lab studies how a worm's experiences get passed to its offspring through RNA rather than DNA — meaning what happens to a worm in its lifetime can change its descendants. We also talk about using ancient DNA to reassemble the Dead Sea Scrolls, what AI can and can't do for biology, and why he wants to build an "Ironman suit" for researchers rather than replace them.</p><hr /><p>00:00  Intro</p><p>01:35  Why scientific publishing is broken</p><p>04:02  Years to publish, and what it costs science</p><p>07:20  Bad reviewers, conflicts of interest, and the money</p><p>10:47  Why preprints don't fix it</p><p>15:37  How AI conferences handle review</p><p>22:07  Conferences vs. journals — does slow review help?</p><p>25:22  Building QED: review, not peer review</p><p>30:02  Tracking a paper from idea to submission</p><p>33:11  What writing a grant actually involves</p><p>35:00  The ERC reviewer crisis</p><p>37:06  Tailoring feedback to your field</p><p>41:48  Switching to biology</p><p>44:30  Every cell has a name: inside C. elegans</p><p>46:28  Inheritance without DNA</p><p>48:16  What the worm "thinks" changes its offspring</p><p>51:58  Reassembling the Dead Sea Scrolls with ancient DNA</p><p>56:07  Psychedelics and worms</p><p>58:36  Can AI run the research itself?</p><p>1:04:49  Automation vs. validation</p><p>1:07:12  The origin of life</p><p>1:08:49  Why people reject AI-written work</p><p>1:16:18  Will humans still have a role?</p><p>1:17:39  Wrap-up</p><hr /><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li></ul><hr /><p></p><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p><p></p><p></p>]]></description><guid isPermaLink="false">669aceac-f918-4476-a23e-b06d1525ca15</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Sun, 21 Jun 2026 03:53:01 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/86fc02cf9892c91dfdf6705ae85a78efa2e2288e792d3f3add62e449b2583445/eyJlcGlzb2RlSWQiOiI2NjlhY2VhYy1mOTE4LTQ0NzYtYTIzZS1iMDZkMTUyNWNhMTUiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNmEzNzVlMDM3MTQ3ODczMDQ0YzdlY2NiL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi02LTIxX181LTQ0LTIubXAzIn0=.mp3" length="149891178" type="audio/mpeg"/><itunes:summary>&lt;p&gt;Oded Rechavi is a biologist at Tel Aviv University and the co-founder of QED, a company building AI to review scientific work. He&apos;s also spent years studying worms.&lt;/p&gt;&lt;p&gt;We start with what&apos;s wrong with peer review and grant funding: why it takes years to publish, why reviewers are often your own competitors, and why the whole thing is locked to an economic model that rewards publishing more papers, not better ones. Oded explains why he doesn&apos;t call QED &quot;peer review&quot; at all, and what it would take to actually validate science instead of just stamping it.&lt;/p&gt;&lt;p&gt;Then we get into the biology. C. elegans has exactly 959 cells, every one of them named, and a fully mapped brain. Oded&apos;s lab studies how a worm&apos;s experiences get passed to its offspring through RNA rather than DNA — meaning what happens to a worm in its lifetime can change its descendants. We also talk about using ancient DNA to reassemble the Dead Sea Scrolls, what AI can and can&apos;t do for biology, and why he wants to build an &quot;Ironman suit&quot; for researchers rather than replace them.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;00:00  Intro&lt;/p&gt;&lt;p&gt;01:35  Why scientific publishing is broken&lt;/p&gt;&lt;p&gt;04:02  Years to publish, and what it costs science&lt;/p&gt;&lt;p&gt;07:20  Bad reviewers, conflicts of interest, and the money&lt;/p&gt;&lt;p&gt;10:47  Why preprints don&apos;t fix it&lt;/p&gt;&lt;p&gt;15:37  How AI conferences handle review&lt;/p&gt;&lt;p&gt;22:07  Conferences vs. journals — does slow review help?&lt;/p&gt;&lt;p&gt;25:22  Building QED: review, not peer review&lt;/p&gt;&lt;p&gt;30:02  Tracking a paper from idea to submission&lt;/p&gt;&lt;p&gt;33:11  What writing a grant actually involves&lt;/p&gt;&lt;p&gt;35:00  The ERC reviewer crisis&lt;/p&gt;&lt;p&gt;37:06  Tailoring feedback to your field&lt;/p&gt;&lt;p&gt;41:48  Switching to biology&lt;/p&gt;&lt;p&gt;44:30  Every cell has a name: inside C. elegans&lt;/p&gt;&lt;p&gt;46:28  Inheritance without DNA&lt;/p&gt;&lt;p&gt;48:16  What the worm &quot;thinks&quot; changes its offspring&lt;/p&gt;&lt;p&gt;51:58  Reassembling the Dead Sea Scrolls with ancient DNA&lt;/p&gt;&lt;p&gt;56:07  Psychedelics and worms&lt;/p&gt;&lt;p&gt;58:36  Can AI run the research itself?&lt;/p&gt;&lt;p&gt;1:04:49  Automation vs. validation&lt;/p&gt;&lt;p&gt;1:07:12  The origin of life&lt;/p&gt;&lt;p&gt;1:08:49  Why people reject AI-written work&lt;/p&gt;&lt;p&gt;1:16:18  Will humans still have a role?&lt;/p&gt;&lt;p&gt;1:17:39  Wrap-up&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:18:04</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/669aceac-f918-4476-a23e-b06d1525ca15/images/f44b5950-91c2-4512-a552-a9c4752dec6d.png"/><itunes:season>1</itunes:season><itunes:episode>48</itunes:episode><itunes:title>Broken Peer Review, AI, and Worms — with Oded Rechavi</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[Will AI Take Our Jobs? With Alex Imas (Google/University of Chicago)]]></title><description><![CDATA[<p>Will AI take our jobs? We put the question to Alex Imas, the new Director of AGI Economics at Google DeepMind and a professor at Chicago Booth, whose entire job now is studying how frontier AI reshapes the economy. His short answer: probably some of them, but the popular story is mostly wrong about which jobs and how fast.</p><p>Alex makes the case that a job is a bundle of tasks, not a single thing AI either does or doesn't do, and that the number of people who should actually care about is how much consumer demand responds to falling prices. Get that wrong and you predict mass layoffs. Get it right and you sometimes predict more hiring. We get into why the automation panic is two centuries old, why he thinks blue-collar work is in more danger than white-collar, and why the people already winning are the ones adopting AI fastest.</p><p>We also cover the AGI versus ASI distinction and why it changes everything for the economy, what happens when there's no moat and open models stay six to eight months behind, the three-tier pricing future he sees coming after the 2026 compute crunch, and what any of this means if you're deciding whether to send your kids to college.</p><p></p><ul><li>The episode was recorded before Alex joined Google</li></ul><hr /><p><b>Timestamps</b></p><p>00:00 Meeting Alex Imas</p><p>00:44 Will AI take our jobs?</p><p>03:35 Is this an AI question or an economics question?</p><p>06:18 The economy is already behind the AI we have</p><p>07:43 Why AI adoption is K-shaped</p><p>12:51 Was Andrew Yang right?</p><p>13:45 The automation panic is 200 years old</p><p>16:46 Dario's six-month claim, and why we don't see it yet</p><p>17:22 A job is not a task</p><p>22:38 The three numbers that actually predict the labor market</p><p>22:42 The chess engine analogy and the centaur phase</p><p>25:45 Recursive self-improvement and the hamburger problem</p><p>30:06 Should AI labs be the ones answering alignment questions?</p><p>31:17 The "invisible hand wave" and why nobody wants fully autonomous AI</p><p>33:27 AGI vs ASI, and why the difference is everything</p><p>35:28 Commodities vs relational goods</p><p>41:14 Star Trek, replicators, and predicting with sci-fi</p><p>45:20 Inequality and the Upper West Side VCs</p><p>46:21 Your money manager was automated in the 1960s</p><p>50:47 Are OpenAI and Anthropic overvalued? The moat problem</p><p>54:29 What has to be true for the losses to make sense</p><p>55:43 Cognitive atrophy and monopoly fears</p><p>57:00 The 2026 compute crunch and the three-tier pricing future</p><p>1:01:52 The Apple vs Android analogy</p><p>1:03:54 A rich-country perspective</p><p>1:04:16 Protecting the skills that actually matter</p><p>1:07:02 Will not using AI become a status symbol?</p><p>1:08:53 Does capitalism even survive?</p><p>1:13:44 Redistribution becomes the political battleground</p><p>1:18:16 Blue collar vs white collar: who's really at risk</p><p>1:21:18 Advice for parents in an AI world</p><p>1:22:43 Saving for retirement when the Valley says don't</p><p>1:25:06 Will non-elite colleges survive?</p><hr /><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.<hr /><p></p></li></ul><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">33d246f4-f4e9-4663-82e3-c6a31670f701</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Tue, 16 Jun 2026 14:49:38 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/3594400f85cad684bb81aecf52b3ced4af4f96c80313c08e6baea80d8958a394/eyJlcGlzb2RlSWQiOiIzM2QyNDZmNC1mNGU5LTQ2NjMtODJlMy1jNmEzMTY3MGY3MDEiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNmEzMGQ1YmQ3NjI3OGJkZGIzMDk0MTU0L3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi02LTE2X182LTQ5LTAubXAzIn0=.mp3" length="170901986" type="audio/mpeg"/><itunes:summary>&lt;p&gt;Will AI take our jobs? We put the question to Alex Imas, the new Director of AGI Economics at Google DeepMind and a professor at Chicago Booth, whose entire job now is studying how frontier AI reshapes the economy. His short answer: probably some of them, but the popular story is mostly wrong about which jobs and how fast.&lt;/p&gt;&lt;p&gt;Alex makes the case that a job is a bundle of tasks, not a single thing AI either does or doesn&apos;t do, and that the number of people who should actually care about is how much consumer demand responds to falling prices. Get that wrong and you predict mass layoffs. Get it right and you sometimes predict more hiring. We get into why the automation panic is two centuries old, why he thinks blue-collar work is in more danger than white-collar, and why the people already winning are the ones adopting AI fastest.&lt;/p&gt;&lt;p&gt;We also cover the AGI versus ASI distinction and why it changes everything for the economy, what happens when there&apos;s no moat and open models stay six to eight months behind, the three-tier pricing future he sees coming after the 2026 compute crunch, and what any of this means if you&apos;re deciding whether to send your kids to college.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;The episode was recorded before Alex joined Google&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;&lt;b&gt;Timestamps&lt;/b&gt;&lt;/p&gt;&lt;p&gt;00:00 Meeting Alex Imas&lt;/p&gt;&lt;p&gt;00:44 Will AI take our jobs?&lt;/p&gt;&lt;p&gt;03:35 Is this an AI question or an economics question?&lt;/p&gt;&lt;p&gt;06:18 The economy is already behind the AI we have&lt;/p&gt;&lt;p&gt;07:43 Why AI adoption is K-shaped&lt;/p&gt;&lt;p&gt;12:51 Was Andrew Yang right?&lt;/p&gt;&lt;p&gt;13:45 The automation panic is 200 years old&lt;/p&gt;&lt;p&gt;16:46 Dario&apos;s six-month claim, and why we don&apos;t see it yet&lt;/p&gt;&lt;p&gt;17:22 A job is not a task&lt;/p&gt;&lt;p&gt;22:38 The three numbers that actually predict the labor market&lt;/p&gt;&lt;p&gt;22:42 The chess engine analogy and the centaur phase&lt;/p&gt;&lt;p&gt;25:45 Recursive self-improvement and the hamburger problem&lt;/p&gt;&lt;p&gt;30:06 Should AI labs be the ones answering alignment questions?&lt;/p&gt;&lt;p&gt;31:17 The &quot;invisible hand wave&quot; and why nobody wants fully autonomous AI&lt;/p&gt;&lt;p&gt;33:27 AGI vs ASI, and why the difference is everything&lt;/p&gt;&lt;p&gt;35:28 Commodities vs relational goods&lt;/p&gt;&lt;p&gt;41:14 Star Trek, replicators, and predicting with sci-fi&lt;/p&gt;&lt;p&gt;45:20 Inequality and the Upper West Side VCs&lt;/p&gt;&lt;p&gt;46:21 Your money manager was automated in the 1960s&lt;/p&gt;&lt;p&gt;50:47 Are OpenAI and Anthropic overvalued? The moat problem&lt;/p&gt;&lt;p&gt;54:29 What has to be true for the losses to make sense&lt;/p&gt;&lt;p&gt;55:43 Cognitive atrophy and monopoly fears&lt;/p&gt;&lt;p&gt;57:00 The 2026 compute crunch and the three-tier pricing future&lt;/p&gt;&lt;p&gt;1:01:52 The Apple vs Android analogy&lt;/p&gt;&lt;p&gt;1:03:54 A rich-country perspective&lt;/p&gt;&lt;p&gt;1:04:16 Protecting the skills that actually matter&lt;/p&gt;&lt;p&gt;1:07:02 Will not using AI become a status symbol?&lt;/p&gt;&lt;p&gt;1:08:53 Does capitalism even survive?&lt;/p&gt;&lt;p&gt;1:13:44 Redistribution becomes the political battleground&lt;/p&gt;&lt;p&gt;1:18:16 Blue collar vs white collar: who&apos;s really at risk&lt;/p&gt;&lt;p&gt;1:21:18 Advice for parents in an AI world&lt;/p&gt;&lt;p&gt;1:22:43 Saving for retirement when the Valley says don&apos;t&lt;/p&gt;&lt;p&gt;1:25:06 Will non-elite colleges survive?&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:29:01</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/33d246f4-f4e9-4663-82e3-c6a31670f701/images/3ac5658d-072f-495d-85db-99844d5da4cb.png"/><itunes:season>1</itunes:season><itunes:episode>47</itunes:episode><itunes:title>Will AI Take Our Jobs? With Alex Imas (Google/University of Chicago)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[Why AI Benchmarks Are Lying to You - with Wenhu Chen (Meta/University of Waterloo)]]></title><description><![CDATA[<p><b>In this episode, we sit down with Wenhu Chen,</b> research scientist at Meta MSL, assistant professor at the University of Waterloo, and the person behind MMLU-Pro and MMMU. If you've read a frontier model release in the last two years, you've seen his benchmarks. That makes him one of the best people to answer the question everyone dances around: when a model jumps from 40% to 90% on your benchmark, how much of that is real? In this episode, we dig into why benchmarks have become the loss function of the entire field - design a bad one, and thousands of brilliant researchers will spend months hill-climbing in the wrong direction. Wenhu is surprisingly candid about the limits of his own creations: contamination is everywhere, saturation turns frontier benchmarks into unit tests, and popular alternatives, such as LM Arena, mostly measure tone and length rather than capability. His answer is to evaluate models where they've never been: private codebases, hospital data, and the messy, live internet.</p><p>We also talk about ClawBench, his new benchmark that deploys agents to over 140 real production websites to do things people actually want done, such, such as ordering food, booking tickets, and applying for jobs. The best model in the world completes about a third of these tasks. We unpack why: bot detection, models that refuse to click "pay," agents that give up the moment an environment doesn't match their training, and harnesses that can swing results by 20% without changing the model at all.</p><p>Along the way, we cover the overlooked science of evaluating pre-training, data flywheels, and synthetic environments for agent training, and whether RL teaches models to reason or just surfaces what's already there. We close with Wenhu's predictions: exploration and adaptability will improve rapidly, but security will become the field's hardest problem as agents gain real permissions in the real world.</p><hr /><p></p><p><b>Timestamps</b></p><p>00:00 – Intro<br />00:55 – What good evaluation means, and how it's changed since the early GPT days<br />03:35 – Benchmarks as the field's loss function<br />05:50 – Contamination: the problem nobody fully solves<br />08:08 – MMLU-Pro scores: real progress or training on the test set?<br />11:05 – Can you measure creativity?<br />12:34 – Why human judges and arenas are unreliable — and what to use instead<br />19:22 – What a good benchmark actually looks like<br />22:34 – Chain of thought: signal or scratchpad?<br />26:01 – Auto-research and hill-climbing agents<br />28:52 – Harnesses: 20% swings without touching the model<br />32:28 – Safety, model release, and an "FDA for models"<br />36:53 – The overlooked science of pre-training evaluation<br />43:49 – Designing pre-training benchmarks when one run costs a billion dollars<br />49:45 – ClawBench: agents on 140+ live websites, and why the best model gets 33%<br />54:42 – How MMLU-Pro and MMMU-Pro were born from public complaints<br />59:16 – Pixel agents vs. APIs: will MCP kill computer use?<br />1:02:11 – Training agents: data flywheels and synthetic environments<br />1:05:43 – SFT vs. RL, and does RL teach reasoning or reveal it?<br />1:09:21 – What gets solved next year — and what doesn't<br />1:14:32 – Undervalued ideas, and what's next for ClawBench</p><hr /><p></p><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.<hr /><p></p></li></ul><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">fb1c30e1-e667-4b79-99e0-d0d4b8d3d6af</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Sat, 13 Jun 2026 20:05:34 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/541ccd831ecdf87cd81baf304cc41b9f7a97046950609d594b2845d10714fce9/eyJlcGlzb2RlSWQiOiJmYjFjMzBlMS1lNjY3LTRiNzktOTllMC1kMGQ0YjhkM2Q2YWYiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNmEyZGI0NDE3MmI0OTBmMzQzY2RhYTgyL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi02LTEzX18yMS00OS0yMS5tcDMifQ==.mp3" length="151790384" type="audio/mpeg"/><itunes:summary>&lt;p&gt;&lt;b&gt;In this episode, we sit down with Wenhu Chen,&lt;/b&gt; research scientist at Meta MSL, assistant professor at the University of Waterloo, and the person behind MMLU-Pro and MMMU. If you&apos;ve read a frontier model release in the last two years, you&apos;ve seen his benchmarks. That makes him one of the best people to answer the question everyone dances around: when a model jumps from 40% to 90% on your benchmark, how much of that is real? In this episode, we dig into why benchmarks have become the loss function of the entire field - design a bad one, and thousands of brilliant researchers will spend months hill-climbing in the wrong direction. Wenhu is surprisingly candid about the limits of his own creations: contamination is everywhere, saturation turns frontier benchmarks into unit tests, and popular alternatives, such as LM Arena, mostly measure tone and length rather than capability. His answer is to evaluate models where they&apos;ve never been: private codebases, hospital data, and the messy, live internet.&lt;/p&gt;&lt;p&gt;We also talk about ClawBench, his new benchmark that deploys agents to over 140 real production websites to do things people actually want done, such, such as ordering food, booking tickets, and applying for jobs. The best model in the world completes about a third of these tasks. We unpack why: bot detection, models that refuse to click &quot;pay,&quot; agents that give up the moment an environment doesn&apos;t match their training, and harnesses that can swing results by 20% without changing the model at all.&lt;/p&gt;&lt;p&gt;Along the way, we cover the overlooked science of evaluating pre-training, data flywheels, and synthetic environments for agent training, and whether RL teaches models to reason or just surfaces what&apos;s already there. We close with Wenhu&apos;s predictions: exploration and adaptability will improve rapidly, but security will become the field&apos;s hardest problem as agents gain real permissions in the real world.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Timestamps&lt;/b&gt;&lt;/p&gt;&lt;p&gt;00:00 – Intro&lt;br /&gt;00:55 – What good evaluation means, and how it&apos;s changed since the early GPT days&lt;br /&gt;03:35 – Benchmarks as the field&apos;s loss function&lt;br /&gt;05:50 – Contamination: the problem nobody fully solves&lt;br /&gt;08:08 – MMLU-Pro scores: real progress or training on the test set?&lt;br /&gt;11:05 – Can you measure creativity?&lt;br /&gt;12:34 – Why human judges and arenas are unreliable — and what to use instead&lt;br /&gt;19:22 – What a good benchmark actually looks like&lt;br /&gt;22:34 – Chain of thought: signal or scratchpad?&lt;br /&gt;26:01 – Auto-research and hill-climbing agents&lt;br /&gt;28:52 – Harnesses: 20% swings without touching the model&lt;br /&gt;32:28 – Safety, model release, and an &quot;FDA for models&quot;&lt;br /&gt;36:53 – The overlooked science of pre-training evaluation&lt;br /&gt;43:49 – Designing pre-training benchmarks when one run costs a billion dollars&lt;br /&gt;49:45 – ClawBench: agents on 140+ live websites, and why the best model gets 33%&lt;br /&gt;54:42 – How MMLU-Pro and MMMU-Pro were born from public complaints&lt;br /&gt;59:16 – Pixel agents vs. APIs: will MCP kill computer use?&lt;br /&gt;1:02:11 – Training agents: data flywheels and synthetic environments&lt;br /&gt;1:05:43 – SFT vs. RL, and does RL teach reasoning or reveal it?&lt;br /&gt;1:09:21 – What gets solved next year — and what doesn&apos;t&lt;br /&gt;1:14:32 – Undervalued ideas, and what&apos;s next for ClawBench&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:19:03</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/fb1c30e1-e667-4b79-99e0-d0d4b8d3d6af/images/af24af9c-b5e4-4f84-a37b-cc654b561e42.png"/><itunes:season>1</itunes:season><itunes:episode>46</itunes:episode><itunes:title>Why AI Benchmarks Are Lying to You - with Wenhu Chen (Meta/University of Waterloo)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[Jürgen Schmidhuber - Part 2: JEPA, the Road to AGI, and Who Really Invented Modern AI]]></title><description><![CDATA[<p>In the second half of our conversation with Jürgen Schmidhuber, we focus on the key ideas he's pursued since the early 1990s and discuss why he believes these concepts are only now being rediscovered.</p><p></p><p>We start with JEPA. Jürgen argues that the method LeCun named in 2022 is the same family he published in 1992 as Predictability Maximization. From there he traces the adversarial lineage back further still, to his 1990 world-model paper and 1991 Predictability Minimization  -  the curiosity-driven minimax games he sees as the real origins of GANs.</p><p>We also talk about why these ideas took thirty years to land, why today's trillion-dollar data-center buildout is driven by AGI fear, and why he thinks Apple may come out ahead.</p><p></p><p>The back half turns to what he sees as the real frontier: physical AI. Today's systems are superhuman behind the screen but helpless at a leaky pipe, and until a robot can use human tools, there's no AGI. He discusses self-replicating, self-improving machines as "a new kind of life," reframes continual learning and test-time training as ideas from his 1991 fast-weight work, and detours through Solomonoff's universal prior, Hutter's AIXI, and the Gödel machine.</p><p></p><p>We close on the subject Jürgen is famous for: scientific credit. He makes his case for rigorous attribution, casts himself as a "speaker for the dead" championing forgotten pioneers like Ivakhnenko, and reflects candidly on whether the fights are personal.</p><hr /><p></p><p><b>Timeline</b></p><p></p><p>00:30 — What JEPA is, and the 1992 Predictability Maximization story </p><p>04:54 — Implementing PMAX: autoencoders, Siamese networks, Infomax </p><p>09:10 — Predictability Minimization, factorial codes, and the roots of GANs </p><p>16:00 — Why it took 30 years: the economics of compute </p><p>20:52 — Data, the web, and 1990 as the origin point </p><p>23:09 — Hardware inflation, the trillion-dollar buildout, and the coming crash </p><p>34:05 — Physical AI: the plumber problem and self-replicating machines </p><p>41:14 — Which 90s ideas are being scaled right now </p><p>45:26 — Continual learning and test-time training as "old hats" </p><p>55:19 — Measuring intelligence: Solomonoff, AIXI, and the Gödel machine </p><p>1:05:26 — Self-replication and von Neumann </p><p>1:09:51 — Will he see AGI in his lifetime? </p><p>1:10:42 — Credit, integrity, and being a "speaker for the dead" </p><hr /><p></p><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed</li><li><hr /><p></p></li></ul><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning. </p>]]></description><guid isPermaLink="false">ec403c43-6758-4944-b4d6-26e031df48e3</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Sun, 07 Jun 2026 18:13:28 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/e56f6d5fa8089ac6a71024d0e85417926734c0115a2108b9d2cbff7050ff8b84/eyJlcGlzb2RlSWQiOiJlYzQwM2M0My02NzU4LTQ5NDQtYjRkNi0yNmUwMzFkZjQ4ZTMiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNmExMGNkZDc1NzA2ZGZiMTQzMWU5YTBjL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi01LTIyX18yMy00Mi00Ny5tcDMifQ==.mp3" length="171824840" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/ec403c43-6758-4944-b4d6-26e031df48e3/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;In the second half of our conversation with Jürgen Schmidhuber, we focus on the key ideas he&apos;s pursued since the early 1990s and discuss why he believes these concepts are only now being rediscovered.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;We start with JEPA. Jürgen argues that the method LeCun named in 2022 is the same family he published in 1992 as Predictability Maximization. From there he traces the adversarial lineage back further still, to his 1990 world-model paper and 1991 Predictability Minimization  -  the curiosity-driven minimax games he sees as the real origins of GANs.&lt;/p&gt;&lt;p&gt;We also talk about why these ideas took thirty years to land, why today&apos;s trillion-dollar data-center buildout is driven by AGI fear, and why he thinks Apple may come out ahead.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;The back half turns to what he sees as the real frontier: physical AI. Today&apos;s systems are superhuman behind the screen but helpless at a leaky pipe, and until a robot can use human tools, there&apos;s no AGI. He discusses self-replicating, self-improving machines as &quot;a new kind of life,&quot; reframes continual learning and test-time training as ideas from his 1991 fast-weight work, and detours through Solomonoff&apos;s universal prior, Hutter&apos;s AIXI, and the Gödel machine.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;We close on the subject Jürgen is famous for: scientific credit. He makes his case for rigorous attribution, casts himself as a &quot;speaker for the dead&quot; championing forgotten pioneers like Ivakhnenko, and reflects candidly on whether the fights are personal.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Timeline&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;00:30 — What JEPA is, and the 1992 Predictability Maximization story &lt;/p&gt;&lt;p&gt;04:54 — Implementing PMAX: autoencoders, Siamese networks, Infomax &lt;/p&gt;&lt;p&gt;09:10 — Predictability Minimization, factorial codes, and the roots of GANs &lt;/p&gt;&lt;p&gt;16:00 — Why it took 30 years: the economics of compute &lt;/p&gt;&lt;p&gt;20:52 — Data, the web, and 1990 as the origin point &lt;/p&gt;&lt;p&gt;23:09 — Hardware inflation, the trillion-dollar buildout, and the coming crash &lt;/p&gt;&lt;p&gt;34:05 — Physical AI: the plumber problem and self-replicating machines &lt;/p&gt;&lt;p&gt;41:14 — Which 90s ideas are being scaled right now &lt;/p&gt;&lt;p&gt;45:26 — Continual learning and test-time training as &quot;old hats&quot; &lt;/p&gt;&lt;p&gt;55:19 — Measuring intelligence: Solomonoff, AIXI, and the Gödel machine &lt;/p&gt;&lt;p&gt;1:05:26 — Self-replication and von Neumann &lt;/p&gt;&lt;p&gt;1:09:51 — Will he see AGI in his lifetime? &lt;/p&gt;&lt;p&gt;1:10:42 — Credit, integrity, and being a &quot;speaker for the dead&quot; &lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;/li&gt;&lt;li&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning. &lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:29:29</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/ec403c43-6758-4944-b4d6-26e031df48e3/images/6c386f71-02af-4467-826a-d9e3467675f3.png"/><itunes:season>1</itunes:season><itunes:episode>44</itunes:episode><itunes:title>Jürgen Schmidhuber - Part 2: JEPA, the Road to AGI, and Who Really Invented Modern AI</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[Jürgen Schmidhuber  -  World Models, RL, and the Year that changed AI (Part 1)]]></title><description><![CDATA[<p>In this episode, we host Jürgen Schmidhuber  -  the man, the legend, one of the godfathers of modern AI. His lab worked out many ideas behind today’s systems (LSTM, world models, artificial curiosity, Transformer variants, and even GAN-style setups) decades before they became fashionable, and he’s just as well known for making sure people remember who did what first. This is the first of two conversations with him.</p><p>We go back to his lab in the early 90s and ask how one small group came up with so many of the ideas that are now being scaled to a thousand billion dollars, back when compute was ten million times more expensive. A lot of the episode comes down to one distinction he keeps making: prediction vs. decision-making. His take is that LLMs are very good prediction machines that imitate the web, but that’s only half the problem. To actually act in the world, you need a controller that uses a world model to plan. He talks about his 1990 work on world models and artificial curiosity, where the controller gets rewarded for running experiments that improve its own model (an adversarial setup years before GANs), why planning millisecond by millisecond doesn’t scale, and why you need sub-goals instead.</p><p>We also talk about compression as the core of understanding, from falling apples to Kepler to Einstein, and why we still don’t have a robot that can do what a plumber does, even though the AI behind the screen keeps getting better. Then the conversation moves to credit assignment: how “to Schmidhuber” became a verb, what he thinks is broken about the award system, and a long exchange on PMAX vs. JEPA. He ends on the real origins of deep learning and a prediction about self-replicating machines in space.</p><hr /><p></p><p><b>Timeline</b></p><p>00:00  Intro<br />00:55  1991 in Munich, and why that lab mattered<br />02:38  "I'm not very smart"  and why compute getting 10× cheaper every 5 years changed everything<br />04:25  Chess as an AI proxy<br />08:27  Artificial curiosity in the 90s vs. today's RL exploration<br />09:10  Why RL is harder than supervised learning<br />20:48  Coding agents vs. robots, and how a baby learns its own hands<br />26:20  Compression as understanding<br />33:40  What's actually missing on the road to AGI<br />37:30  Why millisecond-by-millisecond planning is stupid<br />47:44  Convergence to LLMs, GPUs, and how far we still are from the Bremermann limit<br />51:49  Unsupervised learning, factorial codes, and predictability minimization<br />58:12  Credit assignment: the fights with LeCun and the Nobel critique<br />1:02:13  On his last name becoming a verb<br />1:05:17  The award system's missing peer review<br />1:07:03  Closed labs and the decline of open research<br />1:13:23  Audience questions<br />1:34:02  Closing: who really invented deep learning?</p><p></p><hr /><p></p><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed<hr /><p></p></li></ul><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">ff8b060a-0ffa-4481-9345-366fe1f47679</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Thu, 04 Jun 2026 12:59:25 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/18bdc6e4e65b7f1e7f030d92287a4033601db1ed48ce81b421ee7f29dc146f82/eyJlcGlzb2RlSWQiOiJmZjhiMDYwYS0wZmZhLTQ0ODEtOTM0NS0zNjZmZTFmNDc2NzkiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNmExMGM1YzQwY2Y2MzM4NDJkN2Y5MjBhL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi01LTIyX18yMy04LTIwLm1wMyJ9.mp3" length="188024102" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/ff8b060a-0ffa-4481-9345-366fe1f47679/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;In this episode, we host Jürgen Schmidhuber  -  the man, the legend, one of the godfathers of modern AI. His lab worked out many ideas behind today’s systems (LSTM, world models, artificial curiosity, Transformer variants, and even GAN-style setups) decades before they became fashionable, and he’s just as well known for making sure people remember who did what first. This is the first of two conversations with him.&lt;/p&gt;&lt;p&gt;We go back to his lab in the early 90s and ask how one small group came up with so many of the ideas that are now being scaled to a thousand billion dollars, back when compute was ten million times more expensive. A lot of the episode comes down to one distinction he keeps making: prediction vs. decision-making. His take is that LLMs are very good prediction machines that imitate the web, but that’s only half the problem. To actually act in the world, you need a controller that uses a world model to plan. He talks about his 1990 work on world models and artificial curiosity, where the controller gets rewarded for running experiments that improve its own model (an adversarial setup years before GANs), why planning millisecond by millisecond doesn’t scale, and why you need sub-goals instead.&lt;/p&gt;&lt;p&gt;We also talk about compression as the core of understanding, from falling apples to Kepler to Einstein, and why we still don’t have a robot that can do what a plumber does, even though the AI behind the screen keeps getting better. Then the conversation moves to credit assignment: how “to Schmidhuber” became a verb, what he thinks is broken about the award system, and a long exchange on PMAX vs. JEPA. He ends on the real origins of deep learning and a prediction about self-replicating machines in space.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Timeline&lt;/b&gt;&lt;/p&gt;&lt;p&gt;00:00  Intro&lt;br /&gt;00:55  1991 in Munich, and why that lab mattered&lt;br /&gt;02:38  &quot;I&apos;m not very smart&quot;  and why compute getting 10× cheaper every 5 years changed everything&lt;br /&gt;04:25  Chess as an AI proxy&lt;br /&gt;08:27  Artificial curiosity in the 90s vs. today&apos;s RL exploration&lt;br /&gt;09:10  Why RL is harder than supervised learning&lt;br /&gt;20:48  Coding agents vs. robots, and how a baby learns its own hands&lt;br /&gt;26:20  Compression as understanding&lt;br /&gt;33:40  What&apos;s actually missing on the road to AGI&lt;br /&gt;37:30  Why millisecond-by-millisecond planning is stupid&lt;br /&gt;47:44  Convergence to LLMs, GPUs, and how far we still are from the Bremermann limit&lt;br /&gt;51:49  Unsupervised learning, factorial codes, and predictability minimization&lt;br /&gt;58:12  Credit assignment: the fights with LeCun and the Nobel critique&lt;br /&gt;1:02:13  On his last name becoming a verb&lt;br /&gt;1:05:17  The award system&apos;s missing peer review&lt;br /&gt;1:07:03  Closed labs and the decline of open research&lt;br /&gt;1:13:23  Audience questions&lt;br /&gt;1:34:02  Closing: who really invented deep learning?&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:37:56</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/ff8b060a-0ffa-4481-9345-366fe1f47679/images/5a817e8c-dc14-4db9-b40a-916feba11cb9.png"/><itunes:season>1</itunes:season><itunes:episode>43</itunes:episode><itunes:title>Jürgen Schmidhuber  -  World Models, RL, and the Year that changed AI (Part 1)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[AI for Science and the Thermodynamics of Generative AI - with Max Welling (UvA, CuspAI)]]></title><description><![CDATA[<p>In this episode, we sit with Max Welling, Professor of Machine Learning at the University of Amsterdam, co-founder and CTO of CuspAI, and a foundational figure behind variational autoencoders (VAEs), equivariant networks, and Bayesian deep learning. We talk about AI for science, the physics underneath generative models, and what's still missing on the road to real intelligence.</p><p>Max starts with what impresses him and what worries him about the LLM era, then makes the case that the next leaps will come from physical AI and from science itself. We dig into how machine learning actually works in the lab, world models and whether priors like geometry and symmetry should be built in or simply learned, and whether transformers will still rule a decade from now. At the end, we talk about CuspAI's climate mission, AI risk and regulation, Max’s new book, and where neuroscience might inspire the next wave of ML.</p><p></p><hr /><p><b>Timeline</b></p><ul><li><b>00:00</b> — Intro</li><li><b>00:47</b> — Are we happy with the LLM era?</li><li><b>03:14</b> — Embodiment and physical AI</li><li><b>08:05</b> — Does "AGI" even matter as a term?</li><li><b>11:34</b> — Verifiers, RL, and why math/coding are tractable</li><li><b>13:17</b> — What actually shifted to make materials discovery work</li><li><b>14:42</b> — From molecules to biology and wet labs</li><li><b>16:26</b> — Working with real labs: timescales, friction, and the "Mira" agent</li><li><b>20:29</b> — Balancing simulators vs. experiments: the exploration–exploitation trade-off</li><li><b>23:44</b> — Active learning for experimental design</li><li><b>24:23</b> — Why active learning hasn't been central to LLMs</li><li><b>25:24</b> — A general loop for ML-for-science across domains</li><li><b>27:10</b> — Foundation models for chemistry: a "mother ship" plus a zoo of fine-tuned models</li><li><b>30:04</b> — Quantum mechanics, interpretation, and AI as a creative theorist</li><li><b>31:54</b> — World models and Yann LeCun's view; priors vs. learning</li><li><b>34:57</b> — Should world knowledge be explicit? (responding to Stefano Ermon)</li><li><b>36:41</b> — Vision: equivariance vs. transformers, and the role of optimization</li><li><b>40:32</b> — Best model for molecular properties in 10 years? Will transformers survive?</li><li><b>43:16</b> — CuspAI's climate focus and what motivated it</li><li><b>47:10</b> — One platform for every material class — what transfers and what doesn't</li><li><b>48:42</b> — Where does the risk of human extinction really come from?</li><li><b>51:06</b> — The "pause AI" debate and the arms-race reality</li><li><b>52:40</b> — Regulating powerful models: government vs. self-regulation</li><li><b>55:16</b> — Who should design AI regulation? </li><li><b>56:29</b> — The new book</li><li><b>1:00:31</b> — Compression, the information bottleneck, and renormalization</li><li><b>1:03:30</b> — The role of foundational principles in modern AI</li><li><b>1:04:06</b> — Waves in computing, the brain, and the next wave of innovation</li><li><b>1:07:11</b> — Neuroscience and ML: are we in a better position now?</li><li><b>1:09:17</b> — Conferences, the ICLR keynote, and finding the right people<hr /><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed</li></ul><hr /><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p></li></ul>]]></description><guid isPermaLink="false">805768a5-3626-4efd-bd6a-8269937c9039</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Fri, 29 May 2026 03:58:30 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/441e7fa3fbe6b85a8c7e4a275f99e16c456a433d3f09c114ef168302ef06c03f/eyJlcGlzb2RlSWQiOiI4MDU3NjhhNS0zNjI2LTRlZmQtYmQ2YS04MjY5OTM3YzkwMzkiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNmExODZhODA1NDFhN2QxOTk5NjI2OTNhL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi01LTI4X18xOC0xNy00Lm1wMyJ9.mp3" length="141646515" type="audio/mpeg"/><itunes:summary>&lt;p&gt;In this episode, we sit with Max Welling, Professor of Machine Learning at the University of Amsterdam, co-founder and CTO of CuspAI, and a foundational figure behind variational autoencoders (VAEs), equivariant networks, and Bayesian deep learning. We talk about AI for science, the physics underneath generative models, and what&apos;s still missing on the road to real intelligence.&lt;/p&gt;&lt;p&gt;Max starts with what impresses him and what worries him about the LLM era, then makes the case that the next leaps will come from physical AI and from science itself. We dig into how machine learning actually works in the lab, world models and whether priors like geometry and symmetry should be built in or simply learned, and whether transformers will still rule a decade from now. At the end, we talk about CuspAI&apos;s climate mission, AI risk and regulation, Max’s new book, and where neuroscience might inspire the next wave of ML.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;b&gt;Timeline&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;00:00&lt;/b&gt; — Intro&lt;/li&gt;&lt;li&gt;&lt;b&gt;00:47&lt;/b&gt; — Are we happy with the LLM era?&lt;/li&gt;&lt;li&gt;&lt;b&gt;03:14&lt;/b&gt; — Embodiment and physical AI&lt;/li&gt;&lt;li&gt;&lt;b&gt;08:05&lt;/b&gt; — Does &quot;AGI&quot; even matter as a term?&lt;/li&gt;&lt;li&gt;&lt;b&gt;11:34&lt;/b&gt; — Verifiers, RL, and why math/coding are tractable&lt;/li&gt;&lt;li&gt;&lt;b&gt;13:17&lt;/b&gt; — What actually shifted to make materials discovery work&lt;/li&gt;&lt;li&gt;&lt;b&gt;14:42&lt;/b&gt; — From molecules to biology and wet labs&lt;/li&gt;&lt;li&gt;&lt;b&gt;16:26&lt;/b&gt; — Working with real labs: timescales, friction, and the &quot;Mira&quot; agent&lt;/li&gt;&lt;li&gt;&lt;b&gt;20:29&lt;/b&gt; — Balancing simulators vs. experiments: the exploration–exploitation trade-off&lt;/li&gt;&lt;li&gt;&lt;b&gt;23:44&lt;/b&gt; — Active learning for experimental design&lt;/li&gt;&lt;li&gt;&lt;b&gt;24:23&lt;/b&gt; — Why active learning hasn&apos;t been central to LLMs&lt;/li&gt;&lt;li&gt;&lt;b&gt;25:24&lt;/b&gt; — A general loop for ML-for-science across domains&lt;/li&gt;&lt;li&gt;&lt;b&gt;27:10&lt;/b&gt; — Foundation models for chemistry: a &quot;mother ship&quot; plus a zoo of fine-tuned models&lt;/li&gt;&lt;li&gt;&lt;b&gt;30:04&lt;/b&gt; — Quantum mechanics, interpretation, and AI as a creative theorist&lt;/li&gt;&lt;li&gt;&lt;b&gt;31:54&lt;/b&gt; — World models and Yann LeCun&apos;s view; priors vs. learning&lt;/li&gt;&lt;li&gt;&lt;b&gt;34:57&lt;/b&gt; — Should world knowledge be explicit? (responding to Stefano Ermon)&lt;/li&gt;&lt;li&gt;&lt;b&gt;36:41&lt;/b&gt; — Vision: equivariance vs. transformers, and the role of optimization&lt;/li&gt;&lt;li&gt;&lt;b&gt;40:32&lt;/b&gt; — Best model for molecular properties in 10 years? Will transformers survive?&lt;/li&gt;&lt;li&gt;&lt;b&gt;43:16&lt;/b&gt; — CuspAI&apos;s climate focus and what motivated it&lt;/li&gt;&lt;li&gt;&lt;b&gt;47:10&lt;/b&gt; — One platform for every material class — what transfers and what doesn&apos;t&lt;/li&gt;&lt;li&gt;&lt;b&gt;48:42&lt;/b&gt; — Where does the risk of human extinction really come from?&lt;/li&gt;&lt;li&gt;&lt;b&gt;51:06&lt;/b&gt; — The &quot;pause AI&quot; debate and the arms-race reality&lt;/li&gt;&lt;li&gt;&lt;b&gt;52:40&lt;/b&gt; — Regulating powerful models: government vs. self-regulation&lt;/li&gt;&lt;li&gt;&lt;b&gt;55:16&lt;/b&gt; — Who should design AI regulation? &lt;/li&gt;&lt;li&gt;&lt;b&gt;56:29&lt;/b&gt; — The new book&lt;/li&gt;&lt;li&gt;&lt;b&gt;1:00:31&lt;/b&gt; — Compression, the information bottleneck, and renormalization&lt;/li&gt;&lt;li&gt;&lt;b&gt;1:03:30&lt;/b&gt; — The role of foundational principles in modern AI&lt;/li&gt;&lt;li&gt;&lt;b&gt;1:04:06&lt;/b&gt; — Waves in computing, the brain, and the next wave of innovation&lt;/li&gt;&lt;li&gt;&lt;b&gt;1:07:11&lt;/b&gt; — Neuroscience and ML: are we in a better position now?&lt;/li&gt;&lt;li&gt;&lt;b&gt;1:09:17&lt;/b&gt; — Conferences, the ICLR keynote, and finding the right people&lt;hr /&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:13:46</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/805768a5-3626-4efd-bd6a-8269937c9039/images/3d04d4af-af9e-48bb-a8b0-d7f35fafe757.png"/><itunes:season>1</itunes:season><itunes:episode>42</itunes:episode><itunes:title>AI for Science and the Thermodynamics of Generative AI - with Max Welling (UvA, CuspAI)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[After Math Falls, What's Next?  with Julia Kempe (NYU/Meta)]]></title><description><![CDATA[<p><b>Julia Kempe on Why Math Will Fall Next, Superhuman Provers, and the Return of the Renaissance Researcher</b></p><p></p><p>In this episode, we sit down with Julia Kempe, a Professor at NYU's Center for Data Science and researcher at Meta FAIR's Foundations of Reasoning team,  for a wide-ranging conversation on the future of AI research.</p><p>We dig into why verifiable domains like mathematics may be on track to "fall" the way Go did. With formal verification through Lean and the Mathlib infrastructure, LLM agents can now generate and check proofs at scale, and Julia makes the case that a new industry of automated mathematical discovery is closer than most mathematicians believe. We explore why Erdős problems are already falling, what's still missing for harder fields like analysis and physics, and how synthetic data, curation, and verification fit together.</p><p>From there we get into the energy and scaling limits of frontier models, the case for academic research that big labs can't pursue, how to advise PhD students when Claude can already do their first-year work, the rise of AI safety and security as research priorities, and Julia's optimistic argument that AI tools are bringing back the Renaissance generalist  -  the researcher who can finally work fluently across math, biology, and beyond.</p><hr /><p><b>Timeline</b></p><ul><li><b>00:00</b> — Introductions</li><li><b>01:00</b> — Defining reasoning and verifiable domains</li><li><b>04:00</b> — Lean, Mathlib, and the formalization of mathematics</li><li><b>10:00</b> — Constructive proofs, Erdős problems, and the new wave of "AI mathematicians"</li><li><b>14:00</b> — Will math be "solved"? Art, photography, and the changing nature of creative work</li><li><b>18:00</b> — Why physics is harder than math</li><li><b>22:00</b> — Moravec's paradox, evolution, and why robotics lags behind language</li><li><b>27:00</b> — The Renaissance is back: generalist researchers in the age of AI</li><li><b>29:00</b> — Advising students: math, programming, and what core education still matters</li><li><b>32:00</b> — Teaching and assessment when GPT can do the homework</li><li><b>35:00</b> — Anti-AI backlash, energy costs, and the security threat</li><li><b>40:00</b> — Scaling vs. efficiency</li><li><b>42:00</b> — Model collapse, synthetic data, and what's left to squeeze from the internet</li><li><b>44:00</b> — What's exciting next: AI for science, safety, robotics, memory, and planning</li><li><b>47:00</b> — Annotation costs as a proxy</li><li><b>50:00</b> — Superhuman models and what security even means against them</li><li><b>52:00</b> — AlphaGo as precedent for verifiable superhuman performance</li><li><b>54:00</b> — Hallucination, the Mirage paper, and whether these are solvable problems</li><li><b>56:00</b> — Why coding isn't fully solved yet</li><li><b>58:00</b> — Agent security, prompt injection, and the Wild West of deployed agents</li><li><b>1:01:00</b> — Regulation: what's needed and what's possible</li><li><b>1:04:00</b> — Advice for PhD students and what research academia should pursue</li><li><b>1:09:00</b> — Startup opportunities: robotics, security, and AI for finance</li><li><b>1:12:00</b> — Closing thoughts: use the tools, and build grassroots AI for good</li></ul><hr /><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed</li></ul><hr /><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">dc4d65b9-0abf-49f7-90f2-11f45f76f9ce</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 25 May 2026 02:10:56 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/18c286ab10bf9fe6029f42086c58458089b6a551a4335831ffa0d7fb78fd698e/eyJlcGlzb2RlSWQiOiJkYzRkNjViOS0wYWJmLTQ5ZjctOTBmMi0xMWY0NWY3NmY5Y2UiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNmExM2FiZTkxYzQzZjAwMWZjMDExZGUwL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi01LTI1X18zLTU0LTQ5Lm1wMyJ9.mp3" length="143465473" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/dc4d65b9-0abf-49f7-90f2-11f45f76f9ce/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;&lt;b&gt;Julia Kempe on Why Math Will Fall Next, Superhuman Provers, and the Return of the Renaissance Researcher&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;In this episode, we sit down with Julia Kempe, a Professor at NYU&apos;s Center for Data Science and researcher at Meta FAIR&apos;s Foundations of Reasoning team,  for a wide-ranging conversation on the future of AI research.&lt;/p&gt;&lt;p&gt;We dig into why verifiable domains like mathematics may be on track to &quot;fall&quot; the way Go did. With formal verification through Lean and the Mathlib infrastructure, LLM agents can now generate and check proofs at scale, and Julia makes the case that a new industry of automated mathematical discovery is closer than most mathematicians believe. We explore why Erdős problems are already falling, what&apos;s still missing for harder fields like analysis and physics, and how synthetic data, curation, and verification fit together.&lt;/p&gt;&lt;p&gt;From there we get into the energy and scaling limits of frontier models, the case for academic research that big labs can&apos;t pursue, how to advise PhD students when Claude can already do their first-year work, the rise of AI safety and security as research priorities, and Julia&apos;s optimistic argument that AI tools are bringing back the Renaissance generalist  -  the researcher who can finally work fluently across math, biology, and beyond.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;b&gt;Timeline&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;00:00&lt;/b&gt; — Introductions&lt;/li&gt;&lt;li&gt;&lt;b&gt;01:00&lt;/b&gt; — Defining reasoning and verifiable domains&lt;/li&gt;&lt;li&gt;&lt;b&gt;04:00&lt;/b&gt; — Lean, Mathlib, and the formalization of mathematics&lt;/li&gt;&lt;li&gt;&lt;b&gt;10:00&lt;/b&gt; — Constructive proofs, Erdős problems, and the new wave of &quot;AI mathematicians&quot;&lt;/li&gt;&lt;li&gt;&lt;b&gt;14:00&lt;/b&gt; — Will math be &quot;solved&quot;? Art, photography, and the changing nature of creative work&lt;/li&gt;&lt;li&gt;&lt;b&gt;18:00&lt;/b&gt; — Why physics is harder than math&lt;/li&gt;&lt;li&gt;&lt;b&gt;22:00&lt;/b&gt; — Moravec&apos;s paradox, evolution, and why robotics lags behind language&lt;/li&gt;&lt;li&gt;&lt;b&gt;27:00&lt;/b&gt; — The Renaissance is back: generalist researchers in the age of AI&lt;/li&gt;&lt;li&gt;&lt;b&gt;29:00&lt;/b&gt; — Advising students: math, programming, and what core education still matters&lt;/li&gt;&lt;li&gt;&lt;b&gt;32:00&lt;/b&gt; — Teaching and assessment when GPT can do the homework&lt;/li&gt;&lt;li&gt;&lt;b&gt;35:00&lt;/b&gt; — Anti-AI backlash, energy costs, and the security threat&lt;/li&gt;&lt;li&gt;&lt;b&gt;40:00&lt;/b&gt; — Scaling vs. efficiency&lt;/li&gt;&lt;li&gt;&lt;b&gt;42:00&lt;/b&gt; — Model collapse, synthetic data, and what&apos;s left to squeeze from the internet&lt;/li&gt;&lt;li&gt;&lt;b&gt;44:00&lt;/b&gt; — What&apos;s exciting next: AI for science, safety, robotics, memory, and planning&lt;/li&gt;&lt;li&gt;&lt;b&gt;47:00&lt;/b&gt; — Annotation costs as a proxy&lt;/li&gt;&lt;li&gt;&lt;b&gt;50:00&lt;/b&gt; — Superhuman models and what security even means against them&lt;/li&gt;&lt;li&gt;&lt;b&gt;52:00&lt;/b&gt; — AlphaGo as precedent for verifiable superhuman performance&lt;/li&gt;&lt;li&gt;&lt;b&gt;54:00&lt;/b&gt; — Hallucination, the Mirage paper, and whether these are solvable problems&lt;/li&gt;&lt;li&gt;&lt;b&gt;56:00&lt;/b&gt; — Why coding isn&apos;t fully solved yet&lt;/li&gt;&lt;li&gt;&lt;b&gt;58:00&lt;/b&gt; — Agent security, prompt injection, and the Wild West of deployed agents&lt;/li&gt;&lt;li&gt;&lt;b&gt;1:01:00&lt;/b&gt; — Regulation: what&apos;s needed and what&apos;s possible&lt;/li&gt;&lt;li&gt;&lt;b&gt;1:04:00&lt;/b&gt; — Advice for PhD students and what research academia should pursue&lt;/li&gt;&lt;li&gt;&lt;b&gt;1:09:00&lt;/b&gt; — Startup opportunities: robotics, security, and AI for finance&lt;/li&gt;&lt;li&gt;&lt;b&gt;1:12:00&lt;/b&gt; — Closing thoughts: use the tools, and build grassroots AI for good&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:14:43</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/dc4d65b9-0abf-49f7-90f2-11f45f76f9ce/images/09c1af14-0114-4575-83f5-43e8356ee119.png"/><itunes:season>1</itunes:season><itunes:episode>41</itunes:episode><itunes:title>After Math Falls, What&apos;s Next?  with Julia Kempe (NYU/Meta)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[Language, Cognition, and the Limits of LLMs - with Tal Linzen (NYU/Google) ]]></title><description><![CDATA[<p>We host Tal Linzen, Associate Professor at NYU and Research Scientist at Google, for a conversation on the intersection of cognitive science and large language models.</p><p>We discussed why children can learn language from around 100 million words while LLMs need trillions, and the surprising finding that as models get better at predicting the next word, they become <i>worse</i> models of how humans actually process language. Tal walked us through how his lab uses eye-tracking and reading-time data to compare model behavior to human behavior, and what that reveals about prediction, working memory, and the limits of current architectures.</p><p>We also got into nature versus nurture and how inductive biases can be instilled by pre-training on synthetic languages, world models and whether transformers actually <i>use</i> the geometric structure they encode, the BabyLM challenge and data-efficient language learning, and what mechanistic interpretability can offer cognitive science beyond just fixing model bugs. The conversation closed on academia versus industry, the role of PhDs in the current AI moment, and how AI coding tools are changing the way Tal teaches and evaluates students at NYU.</p><hr /><p></p><p><b>Timeline</b></p><ul><li>00:13 — Intro and what cognitive science means</li><li>02:16 — Using computational simulations to understand how humans learn language</li><li>05:26 — How children learn language vs. how LLMs are pre-trained</li><li>07:53 — Why mainstream LLMs are not good models of humans </li><li>10:07 — Comparing humans and models with eye-tracking and reading behavior</li><li>13:52 — Sensory modalities, smell, and how much you can learn from language alone</li><li>16:03 — Animal cognition and decoding animal communication</li><li>17:00 — Nature vs. nurture, inductive biases, and what transformers can and can't learn</li><li>21:21 — Instilling inductive biases through synthetic languages </li><li>27:34 — The bouba/kiki effect and cross-linguistic sound symbolism</li><li>28:33 — Latent causal structure in language and whether models discover it</li><li>31:13 — Does knowing linguistics help build better models?</li><li>35:07 — World models: what they mean, and why transformers encode geometry but don't use it</li><li>39:13 — Tokenization, and why Tal doesn't like it</li><li>41:35 — Scaling laws and the inverse-U curve of model quality vs. human fit</li><li>44:34 — Where the human–model mismatch comes from: architecture, memory, and data</li><li>47:08 — Diffusion language models and sentence planning</li><li>48:21 — Data quality, synthetic data, and curriculum effects</li><li>50:54 — Comparing models at different training stages to human development; BabyLM</li><li>54:40 — What level of the model should we actually probe? Representations vs. behavior</li><li>1:01:04 — Mechanistic interpretability, Deep Dream, and human dreaming</li><li>1:02:11 — Cognitive neuroscience, intracranial recordings, and working memory</li><li>1:10:31 — Should you still do a PhD in 2026?</li><li>1:12:31 — Will software engineers lose their jobs to AI?</li><li>1:17:43 — Teaching in the age of coding agents: what changes in the classroom</li><li>1:20:54 — What's next: human-like LLMs as user simulators, and recruiting<hr /><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed</li></ul><hr /><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p></li></ul>]]></description><guid isPermaLink="false">494415b1-5cd5-4e8f-97bc-992af0c6fbf0</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Sun, 17 May 2026 00:58:22 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/ba5473a9ef4aba29f6b856195cc726681c141d753bcfda8dbd505010f4c7cb56/eyJlcGlzb2RlSWQiOiI0OTQ0MTViMS01Y2Q1LTRlOGYtOTdiYy05OTJhZjBjNmZiZjAiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNmEwOTA5MzFlNGQ1NTUxOTM2ZGVlYTlkL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi01LTE3X18yLTE3LTUzLm1wMyJ9.mp3" length="160177153" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/494415b1-5cd5-4e8f-97bc-992af0c6fbf0/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;We host Tal Linzen, Associate Professor at NYU and Research Scientist at Google, for a conversation on the intersection of cognitive science and large language models.&lt;/p&gt;&lt;p&gt;We discussed why children can learn language from around 100 million words while LLMs need trillions, and the surprising finding that as models get better at predicting the next word, they become &lt;i&gt;worse&lt;/i&gt; models of how humans actually process language. Tal walked us through how his lab uses eye-tracking and reading-time data to compare model behavior to human behavior, and what that reveals about prediction, working memory, and the limits of current architectures.&lt;/p&gt;&lt;p&gt;We also got into nature versus nurture and how inductive biases can be instilled by pre-training on synthetic languages, world models and whether transformers actually &lt;i&gt;use&lt;/i&gt; the geometric structure they encode, the BabyLM challenge and data-efficient language learning, and what mechanistic interpretability can offer cognitive science beyond just fixing model bugs. The conversation closed on academia versus industry, the role of PhDs in the current AI moment, and how AI coding tools are changing the way Tal teaches and evaluates students at NYU.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Timeline&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;00:13 — Intro and what cognitive science means&lt;/li&gt;&lt;li&gt;02:16 — Using computational simulations to understand how humans learn language&lt;/li&gt;&lt;li&gt;05:26 — How children learn language vs. how LLMs are pre-trained&lt;/li&gt;&lt;li&gt;07:53 — Why mainstream LLMs are not good models of humans &lt;/li&gt;&lt;li&gt;10:07 — Comparing humans and models with eye-tracking and reading behavior&lt;/li&gt;&lt;li&gt;13:52 — Sensory modalities, smell, and how much you can learn from language alone&lt;/li&gt;&lt;li&gt;16:03 — Animal cognition and decoding animal communication&lt;/li&gt;&lt;li&gt;17:00 — Nature vs. nurture, inductive biases, and what transformers can and can&apos;t learn&lt;/li&gt;&lt;li&gt;21:21 — Instilling inductive biases through synthetic languages &lt;/li&gt;&lt;li&gt;27:34 — The bouba/kiki effect and cross-linguistic sound symbolism&lt;/li&gt;&lt;li&gt;28:33 — Latent causal structure in language and whether models discover it&lt;/li&gt;&lt;li&gt;31:13 — Does knowing linguistics help build better models?&lt;/li&gt;&lt;li&gt;35:07 — World models: what they mean, and why transformers encode geometry but don&apos;t use it&lt;/li&gt;&lt;li&gt;39:13 — Tokenization, and why Tal doesn&apos;t like it&lt;/li&gt;&lt;li&gt;41:35 — Scaling laws and the inverse-U curve of model quality vs. human fit&lt;/li&gt;&lt;li&gt;44:34 — Where the human–model mismatch comes from: architecture, memory, and data&lt;/li&gt;&lt;li&gt;47:08 — Diffusion language models and sentence planning&lt;/li&gt;&lt;li&gt;48:21 — Data quality, synthetic data, and curriculum effects&lt;/li&gt;&lt;li&gt;50:54 — Comparing models at different training stages to human development; BabyLM&lt;/li&gt;&lt;li&gt;54:40 — What level of the model should we actually probe? Representations vs. behavior&lt;/li&gt;&lt;li&gt;1:01:04 — Mechanistic interpretability, Deep Dream, and human dreaming&lt;/li&gt;&lt;li&gt;1:02:11 — Cognitive neuroscience, intracranial recordings, and working memory&lt;/li&gt;&lt;li&gt;1:10:31 — Should you still do a PhD in 2026?&lt;/li&gt;&lt;li&gt;1:12:31 — Will software engineers lose their jobs to AI?&lt;/li&gt;&lt;li&gt;1:17:43 — Teaching in the age of coding agents: what changes in the classroom&lt;/li&gt;&lt;li&gt;1:20:54 — What&apos;s next: human-like LLMs as user simulators, and recruiting&lt;hr /&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:23:26</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/494415b1-5cd5-4e8f-97bc-992af0c6fbf0/images/0fb01731-ca7b-41df-aa0e-78ff7b1c4796.png"/><itunes:season>1</itunes:season><itunes:episode>39</itunes:episode><itunes:title>Language, Cognition, and the Limits of LLMs - with Tal Linzen (NYU/Google) </itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[Intelligence in an Open World - with Mengye Ren (NYU)]]></title><description><![CDATA[<p>We talk with <b>Mengye Ren</b>, Assistant Professor at NYU's Center for Data Science, about what intelligence actually means once you step outside a benchmark, and why scaling a single centralized model isn't the whole story.</p><p>We get into why intelligence has to be defined in open environments, not closed ones, and what that means for how we measure progress. We push on the creativity question: today's models sample bottom-up from a softmax or a Gaussian, with no internal loop of consideration, and as Mengye puts it, we haven't understood creativity yet and we're already prepared to hand it over.</p><p>We also talk about what's missing for the next paradigm: continual learning, memory, embodied grounding, and smaller models that actually accumulate experience instead of re-deriving everything from scratch each call. Along the way, we get into JEPA and latent variables, biology as inspiration vs. blueprint, why frontier labs don't lean on explicit latents, the limits of synthetic data and world models, agent-to-agent communication, model uncertainty and forecasting, and whether ML education still matters when AI writes the experiments.</p><p>A grounded, contrarian conversation about where AI research should be looking next, beyond benchmarks, beyond scale.</p><hr /><h3>Timeline</h3><p><b>00:00</b> — Intro and welcome</p><p><b>01:24</b> — What is intelligence? Defining it relative to objectives and open environments</p><p><b>04:19</b> — Is intelligence really the path to human flourishing, or is it productivity?</p><p><b>04:57</b> — Safety, scalable oversight, and whether stronger models help or hurt</p><p><b>06:09</b> — What does "alignment" actually mean?</p><p><b>07:18</b> — Centralized vs. decentralized models: objectivity vs. personal meaning</p><p><b>08:50</b> — Hinton vs. LeCun: where Mengye stands on AI risk</p><p><b>10:29</b> — Bottom-up vs. top-down architectures and feedback loops</p><p><b>21:28</b> — Biology and AI: inspiration, not blueprint</p><p><b>24:14</b> — Biological plausibility, spiking nets, and where the analogy breaks</p><p><b>25:39</b> — JEPA, Mamba, and architectures beyond the transformer</p><p><b>27:31</b> — Language as a special modality: abstraction built for communication</p><p><b>29:04</b> — Are we too locked into the current paradigm? Risk of creativity collapse</p><p><b>30:09</b> — Synthetic data, simulation, and the brain's own generative models</p><p><b>31:43</b> — World models and physical AI: how babies actually learn <b>33:03</b> — The case for smaller, continually learning models</p><p><b>37:02</b> — The role of academic research in a frontier-lab world</p><p><b>39:47</b> — Why LLMs aren't funny: the creativity gap</p><p><b>40:35</b> — What research areas matter most: embodiment, continual learning, creativity</p><p><b>42:05</b> — Creativity is bounded by experience — and why bottom-up sampling isn't enough</p><p><b>45:35</b> — Agent-to-agent communication and the limits of sub-agents</p><p><b>46:39</b> — Model confidence, epistemic uncertainty, and forecasting</p><p><b>49:44</b> — Tokenization, static vs. dynamic worlds, and always-learning systems</p><p><b>52:20</b> — Latent variables, JEPA, and why frontier models skip them</p><p><b>53:40</b> — The future of ML education when AI writes the experiments</p><hr /><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed</li></ul><hr /><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">fb76810f-7e3f-4ca5-869c-86600f7e0fd9</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Wed, 20 May 2026 13:00:00 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/1de7258a887ba0408d1fdf497ec23725bedefebb25262021d4de6de71cb21f29/eyJlcGlzb2RlSWQiOiJmYjc2ODEwZi03ZTNmLTRjYTUtODY5Yy04NjYwMGY3ZTBmZDkiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNmEwOGZiNDNiMjU0ZjY3ODI1MThiYjg1L3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi01LTE3X18xLTE4LTI3Lm1wMyJ9.mp3" length="113787863" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/fb76810f-7e3f-4ca5-869c-86600f7e0fd9/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;We talk with &lt;b&gt;Mengye Ren&lt;/b&gt;, Assistant Professor at NYU&apos;s Center for Data Science, about what intelligence actually means once you step outside a benchmark, and why scaling a single centralized model isn&apos;t the whole story.&lt;/p&gt;&lt;p&gt;We get into why intelligence has to be defined in open environments, not closed ones, and what that means for how we measure progress. We push on the creativity question: today&apos;s models sample bottom-up from a softmax or a Gaussian, with no internal loop of consideration, and as Mengye puts it, we haven&apos;t understood creativity yet and we&apos;re already prepared to hand it over.&lt;/p&gt;&lt;p&gt;We also talk about what&apos;s missing for the next paradigm: continual learning, memory, embodied grounding, and smaller models that actually accumulate experience instead of re-deriving everything from scratch each call. Along the way, we get into JEPA and latent variables, biology as inspiration vs. blueprint, why frontier labs don&apos;t lean on explicit latents, the limits of synthetic data and world models, agent-to-agent communication, model uncertainty and forecasting, and whether ML education still matters when AI writes the experiments.&lt;/p&gt;&lt;p&gt;A grounded, contrarian conversation about where AI research should be looking next, beyond benchmarks, beyond scale.&lt;/p&gt;&lt;hr /&gt;&lt;h3&gt;Timeline&lt;/h3&gt;&lt;p&gt;&lt;b&gt;00:00&lt;/b&gt; — Intro and welcome&lt;/p&gt;&lt;p&gt;&lt;b&gt;01:24&lt;/b&gt; — What is intelligence? Defining it relative to objectives and open environments&lt;/p&gt;&lt;p&gt;&lt;b&gt;04:19&lt;/b&gt; — Is intelligence really the path to human flourishing, or is it productivity?&lt;/p&gt;&lt;p&gt;&lt;b&gt;04:57&lt;/b&gt; — Safety, scalable oversight, and whether stronger models help or hurt&lt;/p&gt;&lt;p&gt;&lt;b&gt;06:09&lt;/b&gt; — What does &quot;alignment&quot; actually mean?&lt;/p&gt;&lt;p&gt;&lt;b&gt;07:18&lt;/b&gt; — Centralized vs. decentralized models: objectivity vs. personal meaning&lt;/p&gt;&lt;p&gt;&lt;b&gt;08:50&lt;/b&gt; — Hinton vs. LeCun: where Mengye stands on AI risk&lt;/p&gt;&lt;p&gt;&lt;b&gt;10:29&lt;/b&gt; — Bottom-up vs. top-down architectures and feedback loops&lt;/p&gt;&lt;p&gt;&lt;b&gt;21:28&lt;/b&gt; — Biology and AI: inspiration, not blueprint&lt;/p&gt;&lt;p&gt;&lt;b&gt;24:14&lt;/b&gt; — Biological plausibility, spiking nets, and where the analogy breaks&lt;/p&gt;&lt;p&gt;&lt;b&gt;25:39&lt;/b&gt; — JEPA, Mamba, and architectures beyond the transformer&lt;/p&gt;&lt;p&gt;&lt;b&gt;27:31&lt;/b&gt; — Language as a special modality: abstraction built for communication&lt;/p&gt;&lt;p&gt;&lt;b&gt;29:04&lt;/b&gt; — Are we too locked into the current paradigm? Risk of creativity collapse&lt;/p&gt;&lt;p&gt;&lt;b&gt;30:09&lt;/b&gt; — Synthetic data, simulation, and the brain&apos;s own generative models&lt;/p&gt;&lt;p&gt;&lt;b&gt;31:43&lt;/b&gt; — World models and physical AI: how babies actually learn &lt;b&gt;33:03&lt;/b&gt; — The case for smaller, continually learning models&lt;/p&gt;&lt;p&gt;&lt;b&gt;37:02&lt;/b&gt; — The role of academic research in a frontier-lab world&lt;/p&gt;&lt;p&gt;&lt;b&gt;39:47&lt;/b&gt; — Why LLMs aren&apos;t funny: the creativity gap&lt;/p&gt;&lt;p&gt;&lt;b&gt;40:35&lt;/b&gt; — What research areas matter most: embodiment, continual learning, creativity&lt;/p&gt;&lt;p&gt;&lt;b&gt;42:05&lt;/b&gt; — Creativity is bounded by experience — and why bottom-up sampling isn&apos;t enough&lt;/p&gt;&lt;p&gt;&lt;b&gt;45:35&lt;/b&gt; — Agent-to-agent communication and the limits of sub-agents&lt;/p&gt;&lt;p&gt;&lt;b&gt;46:39&lt;/b&gt; — Model confidence, epistemic uncertainty, and forecasting&lt;/p&gt;&lt;p&gt;&lt;b&gt;49:44&lt;/b&gt; — Tokenization, static vs. dynamic worlds, and always-learning systems&lt;/p&gt;&lt;p&gt;&lt;b&gt;52:20&lt;/b&gt; — Latent variables, JEPA, and why frontier models skip them&lt;/p&gt;&lt;p&gt;&lt;b&gt;53:40&lt;/b&gt; — The future of ML education when AI writes the experiments&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>00:59:16</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/fb76810f-7e3f-4ca5-869c-86600f7e0fd9/images/56187eac-6554-45ee-8d00-b67d62b2043b.png"/><itunes:season>1</itunes:season><itunes:episode>40</itunes:episode><itunes:title>Intelligence in an Open World - with Mengye Ren (NYU)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[The Principles of Diffusion Models -  with Jesse Lai (Sony AI)]]></title><description><![CDATA[<p>We host Chieh-Hsin (Jesse) Lai, Staff Research Scientist at Sony AI and visiting professor at National Yang Ming Chiao Tung University, Taiwan, for a conversation about diffusion models, the technology behind tools like Stable Diffusion, and most of the AI image and video generators you've seen in the last few years. Jesse recently co-authored <i>The Principles of Diffusion Models</i> with Stefano Ermon, and the book is quickly becoming a go-to reference in the field.</p><p>We start with what a generative model actually is, and what it means to "generate" an image or a sound. Jesse explains the core idea behind diffusion in plain terms. You start with pure noise, and a neural network gradually cleans it up, step by step, until a realistic image emerges.</p><p>From there, we talk about why diffusion has come to dominate so much of generative AI. Because the model builds an image gradually, you can guide it along the way, nudging the output toward what you actually want, refining details, or combining it with other controls. We also discuss the common critique that diffusion is slow and how the field has largely addressed it through new techniques.</p><p>We zoom out to the bigger picture, too. Jesse shares his view on world models and whether diffusion is the right foundation for them. We talk about what makes a generative model genuinely good versus just good at gaming benchmarks, and why evaluating creativity and realism is so much harder than scoring a multiple-choice test.</p><hr /><p></p><p><b>Timeline</b></p><p>00:12 — Intro and welcoming Jesse</p><p>00:47 — Why Jesse wrote the book, and who it's for</p><p>03:29 — The three families of diffusion models, and why they're really one idea</p><p>05:14 — What makes a good generative model</p><p>07:39 — How do you even measure if a generated image is good</p><p>08:59 — Why diffusion beats autoregressive models for images</p><p>10:33 — Is diffusion still slow? How fast generation got fast</p><p>11:12 — A simple intuition for what a "score" is</p><p>14:12 — How the different flavors of diffusion connect under the hood</p><p>14:42 — Diffusion for text and proteins</p><p>17:12 — Consistency models and the push for one-step generation</p><p>22:12 — Diffusion for world models: simulating reality in real time</p><p>26:12 — Do world models need to understand language</p><p>35:12 — Is diffusion the right tool, or just a convenient one</p><p>38:12 — What benchmarks actually tell us, and what they miss</p><p>46:12 — Closing thoughts and where to find the book</p><hr /><p></p><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed</li></ul><hr /><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">50aae256-f497-43b1-9419-f80874493954</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Sun, 10 May 2026 16:09:47 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/2e078bbad18454af2356ce7ea03cf70e32def73e9d8688071e52f0ed82f5df78/eyJlcGlzb2RlSWQiOiI1MGFhZTI1Ni1mNDk3LTQzYjEtOTQxOS1mODA4NzQ0OTM5NTQiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjlmZmE4OGZlY2NhMzE1ZGE0MjRlZmVmL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi01LTlfXzIzLTM1LTExLm1wMyJ9.mp3" length="107269372" type="audio/mpeg"/><itunes:summary>&lt;p&gt;We host Chieh-Hsin (Jesse) Lai, Staff Research Scientist at Sony AI and visiting professor at National Yang Ming Chiao Tung University, Taiwan, for a conversation about diffusion models, the technology behind tools like Stable Diffusion, and most of the AI image and video generators you&apos;ve seen in the last few years. Jesse recently co-authored &lt;i&gt;The Principles of Diffusion Models&lt;/i&gt; with Stefano Ermon, and the book is quickly becoming a go-to reference in the field.&lt;/p&gt;&lt;p&gt;We start with what a generative model actually is, and what it means to &quot;generate&quot; an image or a sound. Jesse explains the core idea behind diffusion in plain terms. You start with pure noise, and a neural network gradually cleans it up, step by step, until a realistic image emerges.&lt;/p&gt;&lt;p&gt;From there, we talk about why diffusion has come to dominate so much of generative AI. Because the model builds an image gradually, you can guide it along the way, nudging the output toward what you actually want, refining details, or combining it with other controls. We also discuss the common critique that diffusion is slow and how the field has largely addressed it through new techniques.&lt;/p&gt;&lt;p&gt;We zoom out to the bigger picture, too. Jesse shares his view on world models and whether diffusion is the right foundation for them. We talk about what makes a generative model genuinely good versus just good at gaming benchmarks, and why evaluating creativity and realism is so much harder than scoring a multiple-choice test.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Timeline&lt;/b&gt;&lt;/p&gt;&lt;p&gt;00:12 — Intro and welcoming Jesse&lt;/p&gt;&lt;p&gt;00:47 — Why Jesse wrote the book, and who it&apos;s for&lt;/p&gt;&lt;p&gt;03:29 — The three families of diffusion models, and why they&apos;re really one idea&lt;/p&gt;&lt;p&gt;05:14 — What makes a good generative model&lt;/p&gt;&lt;p&gt;07:39 — How do you even measure if a generated image is good&lt;/p&gt;&lt;p&gt;08:59 — Why diffusion beats autoregressive models for images&lt;/p&gt;&lt;p&gt;10:33 — Is diffusion still slow? How fast generation got fast&lt;/p&gt;&lt;p&gt;11:12 — A simple intuition for what a &quot;score&quot; is&lt;/p&gt;&lt;p&gt;14:12 — How the different flavors of diffusion connect under the hood&lt;/p&gt;&lt;p&gt;14:42 — Diffusion for text and proteins&lt;/p&gt;&lt;p&gt;17:12 — Consistency models and the push for one-step generation&lt;/p&gt;&lt;p&gt;22:12 — Diffusion for world models: simulating reality in real time&lt;/p&gt;&lt;p&gt;26:12 — Do world models need to understand language&lt;/p&gt;&lt;p&gt;35:12 — Is diffusion the right tool, or just a convenient one&lt;/p&gt;&lt;p&gt;38:12 — What benchmarks actually tell us, and what they miss&lt;/p&gt;&lt;p&gt;46:12 — Closing thoughts and where to find the book&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>00:55:52</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/50aae256-f497-43b1-9419-f80874493954/images/f59b8c74-c015-4d0e-9249-6c04a5978e30.png"/><itunes:season>1</itunes:season><itunes:episode>38</itunes:episode><itunes:title>The Principles of Diffusion Models -  with Jesse Lai (Sony AI)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[Inside xAI, and the Bet on AI Math - with Christian Szegedy (Math Inc) ]]></title><description><![CDATA[<p>We talked with Christian Szegedy, co-inventor of Inception and Batch Normalization, founding scientist at xAI, now at Math Inc, about what it takes to build a frontier lab, and why he left xAI to work on formal mathematics. Christian thinks Lean and auto-formalization are the missing piece for trustworthy AI: a machine-checkable layer underneath all reasoning, where proofs are guaranteed correct without anyone having to read them.</p><p>We got into his bet with François Chollet that AI will hit superhuman mathematician level by 2026, and what that actually unlocks beyond math itself: verified software instead of vibe-coded apps that break when you refactor, AI systems you can actually trust because their reasoning is checkable, and a path to handling protein folding, chemistry, and parts of biology with real guarantees instead of hand-waving. Christian also walked us through how Math Inc's Gauss system pulled off a proof in two weeks that human experts had estimated would take another year.</p><p>We also covered xAI's first 12-person year, why Christian no longer buys the original batch normalization story, why he's sure transformers won't be the dominant architecture in five years, what mathematicians do in a world of cheap proofs, and his take on whether humanity will handle AI well. He distrusts humanity more than he distrusts AI.</p><hr /><h2>Timeline</h2><p>00:12 — Intros: Christian's background (Inception, Batch Norm, xAI, Math Inc)</p><p>01:29 — Building a frontier lab from scratch: the first 12 people at xAI</p><p>04:15 — Hiring for proven track records when 200K GPUs are at stake</p><p>06:07 — Elon's "dependency graph" and balancing long-term vision with investor demos</p><p>07:28 — Gauss formalizes the strong prime number theorem in 2 weeks</p><p>12:25 — What "formalization" actually means (and why it's not what most people think)</p><p>14:39 — Why Lean gives 100% certainty and why that matters for RL</p><p>15:26 — ProofBridge and joint embeddings across mathematical subfields 18:07 — Does math formalization transfer to coding and other fields?</p><p>21:44 — Can every domain be mathematized? </p><p>23:14 — Verified software, chip design, and why vibe-coded apps are dangerous</p><p>26:35 — Scaling Mathlib by 100–1000x</p><p>28:27 — Artisan formalizers vs. invisible machine-language formalists</p><p>33:26 — Can verification generalize?</p><p>45:19 — Revisiting Batch Norm: covariate shift, loss landscape, and what really happens</p><p>48:22 — Is normalization even necessary? </p><p>50:10 — What's actually fundamental in modern AI architectures</p><p>51:41 — Why Christian thinks transformers won't last 5 years</p><p>52:38 — The 2026 superhuman AI mathematician bet</p><p>55:15 — What's missing: better verification + a much larger formalized math repository</p><p>56:13 — Lean vs. Coq vs. HOL Light -  does the proof assistant actually matter?</p><p>59:26 — The role of mathematicians in 5–10 years</p><p>1:02:00 — A human element to mathematics: Newton, Leibniz, and competitive proving</p><p>1:03:25 — The telescope analogy: AI as the instrument that lets us see the math universe</p><p>1:05:19 — Job apocalypse or Jevons paradox? </p><p>1:08:41 — Advice for students</p><p>1:09:50 — Can we formally verify AI alignment? </p><p>1:11:52 — Closing thanks</p><hr /><p></p><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed</li></ul><hr /><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">509aaae0-0ef8-4b36-ad08-bf35730dba6f</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 04 May 2026 12:45:04 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/f710a8c682972a222afa44fe6f292cc8988cd6ba2606e83706c7e07d5f25df4c/eyJlcGlzb2RlSWQiOiI1MDlhYWFlMC0wZWY4LTRiMzYtYWQwOC1iZjM1NzMwZGJhNmYiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjlmODNhMWEyMzhlMWNiMjVmYWI0NGJiL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi01LTRfXzgtMTgtMi5tcDMifQ==.mp3" length="139276686" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/509aaae0-0ef8-4b36-ad08-bf35730dba6f/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;We talked with Christian Szegedy, co-inventor of Inception and Batch Normalization, founding scientist at xAI, now at Math Inc, about what it takes to build a frontier lab, and why he left xAI to work on formal mathematics. Christian thinks Lean and auto-formalization are the missing piece for trustworthy AI: a machine-checkable layer underneath all reasoning, where proofs are guaranteed correct without anyone having to read them.&lt;/p&gt;&lt;p&gt;We got into his bet with François Chollet that AI will hit superhuman mathematician level by 2026, and what that actually unlocks beyond math itself: verified software instead of vibe-coded apps that break when you refactor, AI systems you can actually trust because their reasoning is checkable, and a path to handling protein folding, chemistry, and parts of biology with real guarantees instead of hand-waving. Christian also walked us through how Math Inc&apos;s Gauss system pulled off a proof in two weeks that human experts had estimated would take another year.&lt;/p&gt;&lt;p&gt;We also covered xAI&apos;s first 12-person year, why Christian no longer buys the original batch normalization story, why he&apos;s sure transformers won&apos;t be the dominant architecture in five years, what mathematicians do in a world of cheap proofs, and his take on whether humanity will handle AI well. He distrusts humanity more than he distrusts AI.&lt;/p&gt;&lt;hr /&gt;&lt;h2&gt;Timeline&lt;/h2&gt;&lt;p&gt;00:12 — Intros: Christian&apos;s background (Inception, Batch Norm, xAI, Math Inc)&lt;/p&gt;&lt;p&gt;01:29 — Building a frontier lab from scratch: the first 12 people at xAI&lt;/p&gt;&lt;p&gt;04:15 — Hiring for proven track records when 200K GPUs are at stake&lt;/p&gt;&lt;p&gt;06:07 — Elon&apos;s &quot;dependency graph&quot; and balancing long-term vision with investor demos&lt;/p&gt;&lt;p&gt;07:28 — Gauss formalizes the strong prime number theorem in 2 weeks&lt;/p&gt;&lt;p&gt;12:25 — What &quot;formalization&quot; actually means (and why it&apos;s not what most people think)&lt;/p&gt;&lt;p&gt;14:39 — Why Lean gives 100% certainty and why that matters for RL&lt;/p&gt;&lt;p&gt;15:26 — ProofBridge and joint embeddings across mathematical subfields 18:07 — Does math formalization transfer to coding and other fields?&lt;/p&gt;&lt;p&gt;21:44 — Can every domain be mathematized? &lt;/p&gt;&lt;p&gt;23:14 — Verified software, chip design, and why vibe-coded apps are dangerous&lt;/p&gt;&lt;p&gt;26:35 — Scaling Mathlib by 100–1000x&lt;/p&gt;&lt;p&gt;28:27 — Artisan formalizers vs. invisible machine-language formalists&lt;/p&gt;&lt;p&gt;33:26 — Can verification generalize?&lt;/p&gt;&lt;p&gt;45:19 — Revisiting Batch Norm: covariate shift, loss landscape, and what really happens&lt;/p&gt;&lt;p&gt;48:22 — Is normalization even necessary? &lt;/p&gt;&lt;p&gt;50:10 — What&apos;s actually fundamental in modern AI architectures&lt;/p&gt;&lt;p&gt;51:41 — Why Christian thinks transformers won&apos;t last 5 years&lt;/p&gt;&lt;p&gt;52:38 — The 2026 superhuman AI mathematician bet&lt;/p&gt;&lt;p&gt;55:15 — What&apos;s missing: better verification + a much larger formalized math repository&lt;/p&gt;&lt;p&gt;56:13 — Lean vs. Coq vs. HOL Light -  does the proof assistant actually matter?&lt;/p&gt;&lt;p&gt;59:26 — The role of mathematicians in 5–10 years&lt;/p&gt;&lt;p&gt;1:02:00 — A human element to mathematics: Newton, Leibniz, and competitive proving&lt;/p&gt;&lt;p&gt;1:03:25 — The telescope analogy: AI as the instrument that lets us see the math universe&lt;/p&gt;&lt;p&gt;1:05:19 — Job apocalypse or Jevons paradox? &lt;/p&gt;&lt;p&gt;1:08:41 — Advice for students&lt;/p&gt;&lt;p&gt;1:09:50 — Can we formally verify AI alignment? &lt;/p&gt;&lt;p&gt;1:11:52 — Closing thanks&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:12:32</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/509aaae0-0ef8-4b36-ad08-bf35730dba6f/images/bce0dcfe-3987-49aa-a5b8-6e3f481270f7.png"/><itunes:season>1</itunes:season><itunes:episode>37</itunes:episode><itunes:title>Inside xAI, and the Bet on AI Math - with Christian Szegedy (Math Inc) </itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[Reasoning Models and Planning - with Rao Kambhampati (Arizona State)]]></title><description><![CDATA[<p>We sat down with Rao Kambhampati, a Professor of CS at Arizona State University and former President of AAAI, to talk about reasoning models: what they are, when they work, and when they break.</p><p>Rao has been working on planning and decision-making since long before deep learning, which makes him one of the most grounded voices on what today's reasoning systems actually do. We start with definitions of what reasoning is, why planning is the hard subset of it, and what changed when systems like o1 and DeepSeek R1 moved the verifier from inference into post-training. From there we get into where these models generalize, where they don't, and why benchmarks can be misleading about both.</p><p>A big chunk of the conversation is on chain-of-thought: what intermediate tokens are actually doing, why they help the model more than they help the reader, and what outcome-based RL does to whatever semantic content was there to begin with. We also cover world models and why Rao thinks the video-only framing is the wrong bet, the difference between agentic safety and existential risk, and what the planning community figured out decades ago that the LLM community keeps rediscovering.</p><ul><li><hr /></li></ul><h3>Timeline</h3><ul><li>(00:12) Intros</li><li>(01:32) Defining "reasoning" and the System 1 / System 2 framing</li><li>(04:12) Blocksworld vs Sokoban, and non-ergodicity</li><li>(06:42) Pre-o1: PlanBench and "LLMs are zero-shot X" papers</li><li>(07:42) LLM-Modulo and moving the verifier into post-training</li><li>(10:12) Is RL post-training reasoning, or case-based retrieval?</li><li>(13:12) τ-Bench and benchmarks that avoid action interactions</li><li>(14:12) OOD generalization and what we don't know about post-training data</li><li>(19:02) Does it matter how they work if they answer the questions we care about?</li><li>(21:27) Architecture lotteries and why no one tries different designs</li><li>(23:42) Intermediate tokens and the "reduce thinking effort" cottage industry</li><li>(26:12) The 30×30 maze experiment</li><li>(27:42) Sokoban, NetHack, and Mystery Blocksworld</li><li>(34:58) Stop Anthropomorphizing Intermediate Tokens — the swapped-trace experiment</li><li>(46:12) Latent reasoning, Coconut, and why R0 beat R1</li><li>(50:12) How outcome-based RL erodes CoT semantics</li><li>(52:12) Dot-dot-dot and Anthropic's CoT monitoring paper</li><li>(53:42) Safety: Hinton, Bengio, LeCun</li><li>(57:12) Existential risk vs real safety work</li><li>(59:42) World models, transition models, and video-only approaches</li><li>(1:03:12) Why linguistic abstractions matter — pick and roll</li><li>(1:05:42) What the planning community knew in 2005</li><li>(1:08:12) Multi-agent LLMs</li><li>(1:09:57) Closing thoughts: the bridge analogy<hr /><p></p></li></ul><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed</li></ul><hr /><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">343b07ae-872e-43b8-a92e-c277e0069a6c</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Wed, 29 Apr 2026 15:18:11 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/00a73e9b33ea15f5787b5987809139a34c6e1a99ca8a852a49ef3e0b5e966cb4/eyJlcGlzb2RlSWQiOiIzNDNiMDdhZS04NzJlLTQzYjgtYTkyZS1jMjc3ZTAwNjlhNmMiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjlmMjE4ZTQ1ZmRiNjE3ZTlmYjcwOWFhL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi00LTI5X18xNi00Mi00NC5tcDMifQ==.mp3" length="138032840" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/343b07ae-872e-43b8-a92e-c277e0069a6c/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;We sat down with Rao Kambhampati, a Professor of CS at Arizona State University and former President of AAAI, to talk about reasoning models: what they are, when they work, and when they break.&lt;/p&gt;&lt;p&gt;Rao has been working on planning and decision-making since long before deep learning, which makes him one of the most grounded voices on what today&apos;s reasoning systems actually do. We start with definitions of what reasoning is, why planning is the hard subset of it, and what changed when systems like o1 and DeepSeek R1 moved the verifier from inference into post-training. From there we get into where these models generalize, where they don&apos;t, and why benchmarks can be misleading about both.&lt;/p&gt;&lt;p&gt;A big chunk of the conversation is on chain-of-thought: what intermediate tokens are actually doing, why they help the model more than they help the reader, and what outcome-based RL does to whatever semantic content was there to begin with. We also cover world models and why Rao thinks the video-only framing is the wrong bet, the difference between agentic safety and existential risk, and what the planning community figured out decades ago that the LLM community keeps rediscovering.&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&lt;hr /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;h3&gt;Timeline&lt;/h3&gt;&lt;ul&gt;&lt;li&gt;(00:12) Intros&lt;/li&gt;&lt;li&gt;(01:32) Defining &quot;reasoning&quot; and the System 1 / System 2 framing&lt;/li&gt;&lt;li&gt;(04:12) Blocksworld vs Sokoban, and non-ergodicity&lt;/li&gt;&lt;li&gt;(06:42) Pre-o1: PlanBench and &quot;LLMs are zero-shot X&quot; papers&lt;/li&gt;&lt;li&gt;(07:42) LLM-Modulo and moving the verifier into post-training&lt;/li&gt;&lt;li&gt;(10:12) Is RL post-training reasoning, or case-based retrieval?&lt;/li&gt;&lt;li&gt;(13:12) τ-Bench and benchmarks that avoid action interactions&lt;/li&gt;&lt;li&gt;(14:12) OOD generalization and what we don&apos;t know about post-training data&lt;/li&gt;&lt;li&gt;(19:02) Does it matter how they work if they answer the questions we care about?&lt;/li&gt;&lt;li&gt;(21:27) Architecture lotteries and why no one tries different designs&lt;/li&gt;&lt;li&gt;(23:42) Intermediate tokens and the &quot;reduce thinking effort&quot; cottage industry&lt;/li&gt;&lt;li&gt;(26:12) The 30×30 maze experiment&lt;/li&gt;&lt;li&gt;(27:42) Sokoban, NetHack, and Mystery Blocksworld&lt;/li&gt;&lt;li&gt;(34:58) Stop Anthropomorphizing Intermediate Tokens — the swapped-trace experiment&lt;/li&gt;&lt;li&gt;(46:12) Latent reasoning, Coconut, and why R0 beat R1&lt;/li&gt;&lt;li&gt;(50:12) How outcome-based RL erodes CoT semantics&lt;/li&gt;&lt;li&gt;(52:12) Dot-dot-dot and Anthropic&apos;s CoT monitoring paper&lt;/li&gt;&lt;li&gt;(53:42) Safety: Hinton, Bengio, LeCun&lt;/li&gt;&lt;li&gt;(57:12) Existential risk vs real safety work&lt;/li&gt;&lt;li&gt;(59:42) World models, transition models, and video-only approaches&lt;/li&gt;&lt;li&gt;(1:03:12) Why linguistic abstractions matter — pick and roll&lt;/li&gt;&lt;li&gt;(1:05:42) What the planning community knew in 2005&lt;/li&gt;&lt;li&gt;(1:08:12) Multi-agent LLMs&lt;/li&gt;&lt;li&gt;(1:09:57) Closing thoughts: the bridge analogy&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:11:53</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/343b07ae-872e-43b8-a92e-c277e0069a6c/images/a730824a-2c16-4d65-9d00-72442a4244b5.png"/><itunes:season>1</itunes:season><itunes:episode>36</itunes:episode><itunes:title>Reasoning Models and Planning - with Rao Kambhampati (Arizona State)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[What Actually Matters in AI? - with Zhuang Liu (Princeton)]]></title><description><![CDATA[<p>In this episode, we hosted <b>Zhuang Liu</b>, Assistant Professor at Princeton and former researcher at Meta, for a conversation about what actually matters in modern AI and what turns out to be a historical accident.</p><p>Zhuang is behind some of the most important papers in recent years (with more than 100k citations): ConvNeXt (showing ConvNets can match Transformers if you get the details right), Transformers Without Normalization (replacing LayerNorm with dynamic tanh), ImageBind, Eyes Wide Shut on CLIP's blind spots, the dataset bias work showing that even our biggest "diverse" datasets are still distinguishable from each other, and more.</p><p>We got into whether architecture research is even worth doing anymore, what "good data" actually means, why vision is the natural bridge across modalities but language drove the adoption wave, whether we need per-lab RL environments or better continual learning, whether LLMs have world models (and for which tasks you'd need one), why LLM outputs carry fingerprints that survive paraphrasing, and where coding agents like Claude Code fit into research workflows today and where they still fall short.</p><hr /><p></p><p><b>Timeline</b></p><p>00:13 — Intro</p><p>01:15 — ConvNeXt and whether architecture still matters</p><p>06:35 — What actually drove the jump from GPT-1 to  GPT-3</p><p>08:24 — Setting the bar for architecture papers today</p><p>11:14 — Dataset bias: why "diverse" datasets still aren't</p><p>22:52 — What good data actually looks like</p><p>26:49 — ImageBind and vision as the bridge across modalities</p><p>29:09 — Why language drove the adoption wave, not vision</p><p>32:24 — Eyes Wide Shut: CLIP's blind spots</p><p>34:57 — RL environments, continual learning, and memory as the real bottleneck</p><p>43:06 — Are inductive biases just historical accidents?</p><p>44:30 — Do LLMs have world models?</p><p>48:15 — Which tasks actually need a vision world model</p><p>50:14 — Idiosyncrasy in LLMs: pre-training vs post-training fingerprints</p><p>53:39 — The future of pre-training, mid-training, and post-training</p><p>57:57 — Claude Code, Codex, and coding agents in research</p><p>59:11 — Do we still need students in the age of autonomous research?</p><p>1:04:19 — Transformers Without Normalization and the four pillars that survived</p><p>1:06:53 — MetaMorph: Does generation help understanding, or the other way around?</p><p>1:09:17 — Wrap</p><hr /><p></p><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed</li></ul><hr /><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">54263932-abdb-4e57-bcb4-baf0b5775215</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Fri, 24 Apr 2026 18:21:23 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/a47a8d080f8624b33ce80cb181a137bacaacb115e04dad1018fcaba16b8fbc26/eyJlcGlzb2RlSWQiOiI1NDI2MzkzMi1hYmRiLTRlNTctYmNiNC1iYWYwYjU3NzUyMTUiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjllYmFjNGViOTQ5NTlkZGMwMTM4ZjEyL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi00LTI0X18xOS00NS01MC5tcDMifQ==.mp3" length="134268699" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/54263932-abdb-4e57-bcb4-baf0b5775215/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;In this episode, we hosted &lt;b&gt;Zhuang Liu&lt;/b&gt;, Assistant Professor at Princeton and former researcher at Meta, for a conversation about what actually matters in modern AI and what turns out to be a historical accident.&lt;/p&gt;&lt;p&gt;Zhuang is behind some of the most important papers in recent years (with more than 100k citations): ConvNeXt (showing ConvNets can match Transformers if you get the details right), Transformers Without Normalization (replacing LayerNorm with dynamic tanh), ImageBind, Eyes Wide Shut on CLIP&apos;s blind spots, the dataset bias work showing that even our biggest &quot;diverse&quot; datasets are still distinguishable from each other, and more.&lt;/p&gt;&lt;p&gt;We got into whether architecture research is even worth doing anymore, what &quot;good data&quot; actually means, why vision is the natural bridge across modalities but language drove the adoption wave, whether we need per-lab RL environments or better continual learning, whether LLMs have world models (and for which tasks you&apos;d need one), why LLM outputs carry fingerprints that survive paraphrasing, and where coding agents like Claude Code fit into research workflows today and where they still fall short.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Timeline&lt;/b&gt;&lt;/p&gt;&lt;p&gt;00:13 — Intro&lt;/p&gt;&lt;p&gt;01:15 — ConvNeXt and whether architecture still matters&lt;/p&gt;&lt;p&gt;06:35 — What actually drove the jump from GPT-1 to  GPT-3&lt;/p&gt;&lt;p&gt;08:24 — Setting the bar for architecture papers today&lt;/p&gt;&lt;p&gt;11:14 — Dataset bias: why &quot;diverse&quot; datasets still aren&apos;t&lt;/p&gt;&lt;p&gt;22:52 — What good data actually looks like&lt;/p&gt;&lt;p&gt;26:49 — ImageBind and vision as the bridge across modalities&lt;/p&gt;&lt;p&gt;29:09 — Why language drove the adoption wave, not vision&lt;/p&gt;&lt;p&gt;32:24 — Eyes Wide Shut: CLIP&apos;s blind spots&lt;/p&gt;&lt;p&gt;34:57 — RL environments, continual learning, and memory as the real bottleneck&lt;/p&gt;&lt;p&gt;43:06 — Are inductive biases just historical accidents?&lt;/p&gt;&lt;p&gt;44:30 — Do LLMs have world models?&lt;/p&gt;&lt;p&gt;48:15 — Which tasks actually need a vision world model&lt;/p&gt;&lt;p&gt;50:14 — Idiosyncrasy in LLMs: pre-training vs post-training fingerprints&lt;/p&gt;&lt;p&gt;53:39 — The future of pre-training, mid-training, and post-training&lt;/p&gt;&lt;p&gt;57:57 — Claude Code, Codex, and coding agents in research&lt;/p&gt;&lt;p&gt;59:11 — Do we still need students in the age of autonomous research?&lt;/p&gt;&lt;p&gt;1:04:19 — Transformers Without Normalization and the four pillars that survived&lt;/p&gt;&lt;p&gt;1:06:53 — MetaMorph: Does generation help understanding, or the other way around?&lt;/p&gt;&lt;p&gt;1:09:17 — Wrap&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:09:56</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/54263932-abdb-4e57-bcb4-baf0b5775215/images/b1868c30-27ce-4f17-af32-5927d1d87906.png"/><itunes:season>1</itunes:season><itunes:episode>35</itunes:episode><itunes:title>What Actually Matters in AI? - with Zhuang Liu (Princeton)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[The Future of Coding Agents with Sasha Rush (Cursor/Cornell)]]></title><description><![CDATA[<p>We talked with <b>Sasha Rush</b>, researcher at Cursor and professor at Cornell, about what it actually feels like to we in the heart of the AI revolution and build coding agents right now. Sasha shared how these systems are changing day-to-day work and how it feels to develop these systems.</p><p>A big part of the conversation was about why coding has become such a powerful setting for these tools. We discussed what makes code different from other domains, why agents seem to work especially well there, and how much of today’s progress comes not just from better models, but from better ways of using them. Sasha also gave an inside look at how Cursor thinks about training coding models, long-running agents, context limits, bug finding, and the balance between autonomy and human oversight.</p><p>We also talked about the broader shift happening in software engineering. Are developers moving to a higher level of abstraction? Is this just a phase where we “babysit” models, or the beginning of a deeper change in how software gets built? Sasha had a very thoughtful perspective here, including what he’s seeing from students, researchers, and engineers who are growing up native to these tools.</p><p>More broadly, this episode is about what it means to do serious technical work in a moment when the tools are changing incredibly fast. Sasha brought both optimism and skepticism to the discussion, and that made this a really grounded conversation about where coding agents are today, what they are already surprisingly good at, and where all of this might be going next.</p><hr /><p></p><p><b>Timeline</b><br /><b>00:00</b> Intro and Sasha joins us<br /><b>01:11</b> What “coding agents” actually mean<br /><b>02:34</b> Why coding became the breakout use case<br /><b>08:56</b> Long-running agents and autonomous workflows<br /><b>15:08</b> How these tools are changing the work of engineers<br /><b>17:15</b> Are people just babysitting models right now?<br /><b>22:11</b> How Cursor builds its coding models<br /><b>26:29</b> Rewards, training, and what makes agents work<br /><b>34:53</b> Memory, continual learning, and agent communication<br /><b>38:00</b> How context compaction works in practice<br /><b>41:29</b> Why coding agents recently got much better<br /><b>50:31</b> Refactoring, maintenance, and self-improving codebases<br /><b>52:16</b> Bug finding, oversight, and verification<br /><b>54:43</b> Will this pace of progress continue?<br /><b>56:42</b> Can this spread beyond coding?<br /><b>58:27</b> The future of Cursor and coding agents<br /><b>1:03:08</b> Model architectures beyond standard transformers<br /><b>1:05:37</b> World models, diffusion, and what may come next</p><hr /><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed</li></ul><hr /><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">f732544b-6a0b-48b8-89a2-588ae4a4254d</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Wed, 15 Apr 2026 16:57:16 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/85d6e38a0852644678ccc46de720fa71983085096320d1de62a1359e6273fa34/eyJlcGlzb2RlSWQiOiJmNzMyNTQ0Yi02YTBiLTQ4YjgtODlhMi01ODhhZTRhNDI1NGQiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjlkZmJlNDJjYTA5MGM4Mjk4MzkwZTBiL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi00LTE1X18xOC0zNS0xNC5tcDMifQ==.mp3" length="122207442" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/f732544b-6a0b-48b8-89a2-588ae4a4254d/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;We talked with &lt;b&gt;Sasha Rush&lt;/b&gt;, researcher at Cursor and professor at Cornell, about what it actually feels like to we in the heart of the AI revolution and build coding agents right now. Sasha shared how these systems are changing day-to-day work and how it feels to develop these systems.&lt;/p&gt;&lt;p&gt;A big part of the conversation was about why coding has become such a powerful setting for these tools. We discussed what makes code different from other domains, why agents seem to work especially well there, and how much of today’s progress comes not just from better models, but from better ways of using them. Sasha also gave an inside look at how Cursor thinks about training coding models, long-running agents, context limits, bug finding, and the balance between autonomy and human oversight.&lt;/p&gt;&lt;p&gt;We also talked about the broader shift happening in software engineering. Are developers moving to a higher level of abstraction? Is this just a phase where we “babysit” models, or the beginning of a deeper change in how software gets built? Sasha had a very thoughtful perspective here, including what he’s seeing from students, researchers, and engineers who are growing up native to these tools.&lt;/p&gt;&lt;p&gt;More broadly, this episode is about what it means to do serious technical work in a moment when the tools are changing incredibly fast. Sasha brought both optimism and skepticism to the discussion, and that made this a really grounded conversation about where coding agents are today, what they are already surprisingly good at, and where all of this might be going next.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Timeline&lt;/b&gt;&lt;br /&gt;&lt;b&gt;00:00&lt;/b&gt; Intro and Sasha joins us&lt;br /&gt;&lt;b&gt;01:11&lt;/b&gt; What “coding agents” actually mean&lt;br /&gt;&lt;b&gt;02:34&lt;/b&gt; Why coding became the breakout use case&lt;br /&gt;&lt;b&gt;08:56&lt;/b&gt; Long-running agents and autonomous workflows&lt;br /&gt;&lt;b&gt;15:08&lt;/b&gt; How these tools are changing the work of engineers&lt;br /&gt;&lt;b&gt;17:15&lt;/b&gt; Are people just babysitting models right now?&lt;br /&gt;&lt;b&gt;22:11&lt;/b&gt; How Cursor builds its coding models&lt;br /&gt;&lt;b&gt;26:29&lt;/b&gt; Rewards, training, and what makes agents work&lt;br /&gt;&lt;b&gt;34:53&lt;/b&gt; Memory, continual learning, and agent communication&lt;br /&gt;&lt;b&gt;38:00&lt;/b&gt; How context compaction works in practice&lt;br /&gt;&lt;b&gt;41:29&lt;/b&gt; Why coding agents recently got much better&lt;br /&gt;&lt;b&gt;50:31&lt;/b&gt; Refactoring, maintenance, and self-improving codebases&lt;br /&gt;&lt;b&gt;52:16&lt;/b&gt; Bug finding, oversight, and verification&lt;br /&gt;&lt;b&gt;54:43&lt;/b&gt; Will this pace of progress continue?&lt;br /&gt;&lt;b&gt;56:42&lt;/b&gt; Can this spread beyond coding?&lt;br /&gt;&lt;b&gt;58:27&lt;/b&gt; The future of Cursor and coding agents&lt;br /&gt;&lt;b&gt;1:03:08&lt;/b&gt; Model architectures beyond standard transformers&lt;br /&gt;&lt;b&gt;1:05:37&lt;/b&gt; World models, diffusion, and what may come next&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:24:52</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/f732544b-6a0b-48b8-89a2-588ae4a4254d/images/83b98ca4-ff7e-41e4-b321-ad33a855aeec.png"/><itunes:season>1</itunes:season><itunes:episode>34</itunes:episode><itunes:title>The Future of Coding Agents with Sasha Rush (Cursor/Cornell)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[The Hidden Engine of Vision with Peyman Milanfar (Google)]]></title><description><![CDATA[<p><b>How Denoising Secretly Powers Everything in AI</b></p><p></p><p>Peyman Milanfar is a Distinguished Scientist at Google, leading its Computational Imaging team. He's a member of the National Academy of Engineering, an IEEE Fellow, and one of the key people behind the Pixel camera pipeline. Before Google, he was a professor at UC Santa Cruz for 15 years and helped build the imaging pipeline for Google Glass at Google X. Over 35,000 citations.</p><p>Peyman makes a provocative case that denoising, long dismissed as a boring cleanup task, is actually one of the most fundamental operations in modern ML, on par with SGD and backprop. Knowing how to remove noise from a signal basically means you have a map of the manifold that signals live on, and that insight connects everything from classical inverse problems to diffusion models.</p><p>We go from early patch-based denoisers to his 2010 "Is Denoising Dead?" paper, and then to the question that redirected his research: if denoising is nearly solved, what else can denoisers do? That led to Regularization by Denoising (RED), which, if you unroll it, looks a lot like a diffusion process, years before diffusion models existed. We also cover how his team shipped a one-step diffusion model on the Pixel phone for 100x ProRes Zoom, the perception-distortion-authenticity tradeoff in generative imaging, and a new paper on why diffusion models don't actually need noise conditioning. The conversation wraps with a debate on why language has dominated the AI spotlight while vision lags, and Peyman's argument that visual intelligence, grounded in physics and robotics, is coming next.</p><hr /><p></p><p>Timeline</p><p>0:00 Intro and Peyman's background</p><p>1:22 Why denoising matters more than you think Sensor diversity and Tesla's vision-only bet</p><p>15:04 BM3D and why it was secretly an MMSE estimator</p><p>17:02 "Is Denoising Dead?" then what else can denoisers do?</p><p>18:07 Plug-and-play methods and Regularization by Denoising (RED)</p><p>26:18 Denoising, manifolds, and the compression connection</p><p>28:12 Energy-based models vs. diffusion: "The Geometry of Noise"</p><p>31:40 Natural gradient descent and why flow models work</p><p>34:48 Gradient-free optimization and high-dimensional noise</p><p>45:13 Image quality and the perception-distortion tradeoff</p><p>48:39 Information theory, rate-distortion, and generative models</p><p>52:57 Denoising vs. editing</p><p>54:25 The changing role of theory</p><p>57:07 Hobbyist tools vs. shipping consumer products</p><p>59:40 Coding agents, vibe coding, and domain expertise</p><p>1:05:00 Vision and more complex-dimensional signals</p><p>1:09:31 Do models need to interact with the physical world?</p><p>1:11:28 Continual learning and novelty-driven updates</p><p>1:13:00 On-device learning and privacy</p><p>1:15:01 Why has language dominated AI? Is vision next?</p><p>1:17:14 How kids learn: vision first, language later</p><p>1:19:36 Academia vs. industry</p><p>1:22:28 10,000 citations vs. shipping to millions, why choose?</p><hr /><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed</li></ul><hr /><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">f967123c-9862-4c01-b557-6f03d9ffddec</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Fri, 10 Apr 2026 14:13:34 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/326d147d9c9c0dd5e997b60814f15e0c03ed307484a65dd4af7ec34c147fda3a/eyJlcGlzb2RlSWQiOiJmOTY3MTIzYy05ODYyLTRjMDEtYjU1Ny02ZjAzZDlmZmRkZWMiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjlkODY5MjExNjA3NWE0NDZlYjg4MWQzL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi00LTEwX181LTYtOC5tcDMifQ==.mp3" length="121570472" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/f967123c-9862-4c01-b557-6f03d9ffddec/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;&lt;b&gt;How Denoising Secretly Powers Everything in AI&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Peyman Milanfar is a Distinguished Scientist at Google, leading its Computational Imaging team. He&apos;s a member of the National Academy of Engineering, an IEEE Fellow, and one of the key people behind the Pixel camera pipeline. Before Google, he was a professor at UC Santa Cruz for 15 years and helped build the imaging pipeline for Google Glass at Google X. Over 35,000 citations.&lt;/p&gt;&lt;p&gt;Peyman makes a provocative case that denoising, long dismissed as a boring cleanup task, is actually one of the most fundamental operations in modern ML, on par with SGD and backprop. Knowing how to remove noise from a signal basically means you have a map of the manifold that signals live on, and that insight connects everything from classical inverse problems to diffusion models.&lt;/p&gt;&lt;p&gt;We go from early patch-based denoisers to his 2010 &quot;Is Denoising Dead?&quot; paper, and then to the question that redirected his research: if denoising is nearly solved, what else can denoisers do? That led to Regularization by Denoising (RED), which, if you unroll it, looks a lot like a diffusion process, years before diffusion models existed. We also cover how his team shipped a one-step diffusion model on the Pixel phone for 100x ProRes Zoom, the perception-distortion-authenticity tradeoff in generative imaging, and a new paper on why diffusion models don&apos;t actually need noise conditioning. The conversation wraps with a debate on why language has dominated the AI spotlight while vision lags, and Peyman&apos;s argument that visual intelligence, grounded in physics and robotics, is coming next.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Timeline&lt;/p&gt;&lt;p&gt;0:00 Intro and Peyman&apos;s background&lt;/p&gt;&lt;p&gt;1:22 Why denoising matters more than you think Sensor diversity and Tesla&apos;s vision-only bet&lt;/p&gt;&lt;p&gt;15:04 BM3D and why it was secretly an MMSE estimator&lt;/p&gt;&lt;p&gt;17:02 &quot;Is Denoising Dead?&quot; then what else can denoisers do?&lt;/p&gt;&lt;p&gt;18:07 Plug-and-play methods and Regularization by Denoising (RED)&lt;/p&gt;&lt;p&gt;26:18 Denoising, manifolds, and the compression connection&lt;/p&gt;&lt;p&gt;28:12 Energy-based models vs. diffusion: &quot;The Geometry of Noise&quot;&lt;/p&gt;&lt;p&gt;31:40 Natural gradient descent and why flow models work&lt;/p&gt;&lt;p&gt;34:48 Gradient-free optimization and high-dimensional noise&lt;/p&gt;&lt;p&gt;45:13 Image quality and the perception-distortion tradeoff&lt;/p&gt;&lt;p&gt;48:39 Information theory, rate-distortion, and generative models&lt;/p&gt;&lt;p&gt;52:57 Denoising vs. editing&lt;/p&gt;&lt;p&gt;54:25 The changing role of theory&lt;/p&gt;&lt;p&gt;57:07 Hobbyist tools vs. shipping consumer products&lt;/p&gt;&lt;p&gt;59:40 Coding agents, vibe coding, and domain expertise&lt;/p&gt;&lt;p&gt;1:05:00 Vision and more complex-dimensional signals&lt;/p&gt;&lt;p&gt;1:09:31 Do models need to interact with the physical world?&lt;/p&gt;&lt;p&gt;1:11:28 Continual learning and novelty-driven updates&lt;/p&gt;&lt;p&gt;1:13:00 On-device learning and privacy&lt;/p&gt;&lt;p&gt;1:15:01 Why has language dominated AI? Is vision next?&lt;/p&gt;&lt;p&gt;1:17:14 How kids learn: vision first, language later&lt;/p&gt;&lt;p&gt;1:19:36 Academia vs. industry&lt;/p&gt;&lt;p&gt;1:22:28 10,000 citations vs. shipping to millions, why choose?&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:24:25</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/f967123c-9862-4c01-b557-6f03d9ffddec/images/9c5b43e8-dbb4-466c-ac57-670a3ac39a07.png"/><itunes:season>1</itunes:season><itunes:episode>33</itunes:episode><itunes:title>The Hidden Engine of Vision with Peyman Milanfar (Google)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[Reinventing AI From Scratch with Yaroslav Bulatov]]></title><description><![CDATA[<p>Yaroslav Bulatov helped build the AI era from the inside, as one of the earliest researchers at both OpenAI and Google Brain. Now he wants to tear it all down and start over. Modern deep learning, he argues, is up to 100x more wasteful than it needs to be  -  a Frankenstein of hacks designed for the wrong hardware. With a power wall approaching in two years, Yaroslav is leading an open effort to reinvent AI from scratch: no backprop, no legacy assumptions, just the benefit of hindsight and AI agents that compress decades of research into months. Along the way, we dig into why AGI is a "religious question," how a sales guy with no ML background became one of his most productive contributors, and why the Muon optimizer, one of the biggest recent breakthroughs, could only have been discovered by a non-expert.</p><hr /><p><b>Timeline</b></p><p>00:12 — Introduction and Yaroslav's background at OpenAI and Google Brain</p><p>01:16 — Why deep learning isn't such a good idea</p><p>02:03 — The three definitions of AGI: religious, financial, and vibes-based</p><p>07:52 — The SAI framework: do we need the term AGI at all?</p><p>10:58 — What matters more than AGI: efficiency and refactoring the AI stack</p><p>13:28 — Jevons paradox and the coming energy wall</p><p>14:49 — The recipe: replaying 70 years of AI with hindsight</p><p>17:23 — Memory, energy, and gradient checkpointing</p><p>18:34 — Why you can't just optimize the current stack (the recurrent laryngeal nerve analogy)</p><p>21:05 — What a redesigned AI might look like: hierarchical message passing</p><p>22:31 — Can a small team replicate decades of research?</p><p>24:23 — Why non-experts outperform domain specialists</p><p>27:42 — The GPT-2 benchmark: what success looks like</p><p>29:01 — Ian Goodfellow, Theano, and the origins of TensorFlow</p><p>30:12 — The Muon optimizer origin story and beating Google on ImageNet</p><p>36:16 — AI coding agents for software engineering and research</p><p>40:12 — 10-year outlook and the voice-first workflow</p><p>42:23 — Why start with text over multimodality</p><p>45:13 — Are AI labs like SSI on the right track?</p><p>48:52 — Getting rid of backprop — and maybe math itself</p><p>53:57 — The state of ML academia and NeurIPS culture</p><p>56:41 — The Sutra group challenge: inventing better learning algorithms</p><hr /><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed</li></ul><hr /><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">1f05b554-23f6-40f5-91db-e84a44d9343f</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 30 Mar 2026 23:22:20 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/4e51ae2005f42859e8ff25d3ce88eeeb942eeabaf8c359b8f9c68dc87ece09de/eyJlcGlzb2RlSWQiOiIxZjA1YjU1NC0yM2Y2LTQwZjUtOTFkYi1lODRhNDRkOTM0M2YiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjljYWZiYzFlNTUyNjEwZDFmYWJkNWI3L3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi0zLTMxX18wLTQwLTAubXAzIn0=.mp3" length="83197431" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/1f05b554-23f6-40f5-91db-e84a44d9343f/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;Yaroslav Bulatov helped build the AI era from the inside, as one of the earliest researchers at both OpenAI and Google Brain. Now he wants to tear it all down and start over. Modern deep learning, he argues, is up to 100x more wasteful than it needs to be  -  a Frankenstein of hacks designed for the wrong hardware. With a power wall approaching in two years, Yaroslav is leading an open effort to reinvent AI from scratch: no backprop, no legacy assumptions, just the benefit of hindsight and AI agents that compress decades of research into months. Along the way, we dig into why AGI is a &quot;religious question,&quot; how a sales guy with no ML background became one of his most productive contributors, and why the Muon optimizer, one of the biggest recent breakthroughs, could only have been discovered by a non-expert.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;b&gt;Timeline&lt;/b&gt;&lt;/p&gt;&lt;p&gt;00:12 — Introduction and Yaroslav&apos;s background at OpenAI and Google Brain&lt;/p&gt;&lt;p&gt;01:16 — Why deep learning isn&apos;t such a good idea&lt;/p&gt;&lt;p&gt;02:03 — The three definitions of AGI: religious, financial, and vibes-based&lt;/p&gt;&lt;p&gt;07:52 — The SAI framework: do we need the term AGI at all?&lt;/p&gt;&lt;p&gt;10:58 — What matters more than AGI: efficiency and refactoring the AI stack&lt;/p&gt;&lt;p&gt;13:28 — Jevons paradox and the coming energy wall&lt;/p&gt;&lt;p&gt;14:49 — The recipe: replaying 70 years of AI with hindsight&lt;/p&gt;&lt;p&gt;17:23 — Memory, energy, and gradient checkpointing&lt;/p&gt;&lt;p&gt;18:34 — Why you can&apos;t just optimize the current stack (the recurrent laryngeal nerve analogy)&lt;/p&gt;&lt;p&gt;21:05 — What a redesigned AI might look like: hierarchical message passing&lt;/p&gt;&lt;p&gt;22:31 — Can a small team replicate decades of research?&lt;/p&gt;&lt;p&gt;24:23 — Why non-experts outperform domain specialists&lt;/p&gt;&lt;p&gt;27:42 — The GPT-2 benchmark: what success looks like&lt;/p&gt;&lt;p&gt;29:01 — Ian Goodfellow, Theano, and the origins of TensorFlow&lt;/p&gt;&lt;p&gt;30:12 — The Muon optimizer origin story and beating Google on ImageNet&lt;/p&gt;&lt;p&gt;36:16 — AI coding agents for software engineering and research&lt;/p&gt;&lt;p&gt;40:12 — 10-year outlook and the voice-first workflow&lt;/p&gt;&lt;p&gt;42:23 — Why start with text over multimodality&lt;/p&gt;&lt;p&gt;45:13 — Are AI labs like SSI on the right track?&lt;/p&gt;&lt;p&gt;48:52 — Getting rid of backprop — and maybe math itself&lt;/p&gt;&lt;p&gt;53:57 — The state of ML academia and NeurIPS culture&lt;/p&gt;&lt;p&gt;56:41 — The Sutra group challenge: inventing better learning algorithms&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>00:57:46</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/1f05b554-23f6-40f5-91db-e84a44d9343f/images/eb3a03e3-f34c-4443-b658-6c0cee1f23d3.png"/><itunes:season>1</itunes:season><itunes:episode>32</itunes:episode><itunes:title>Reinventing AI From Scratch with Yaroslav Bulatov</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[Why Healthcare Is AI's Hardest and Most Important Problem with Kyunghyun Cho (NYU) ]]></title><description><![CDATA[<p>We talk with Kyunghyun Cho, who is a Professor of Health Statistics and a Professor of Computer Science and Data Science at New York University, and a former <a rel="noopener noreferrer nofollow" href="https://www.linkedin.com/company/2276/" target="_blank">Executive Director</a> at Genentech, about why healthcare might be the most important and most difficult domain for AI to transform. Kyunghyun shares his vision for a future where patients own their own medical records, proposes a provocative idea for running continuous society-level clinical trials by having doctors "toss a coin" between plausible diagnoses, and explains why drug discovery's stage-wise pipeline has hit a wall that only end-to-end AI thinking can break through. We also get into GLP-1 drugs and why they're more mysterious than people realize, the brutal economics of antibiotic research, how language models trained across scientific literature and clinical data could compress 50 years of drug development into five, and what Kyunghyun would do with $10 billion (spoiler: buy a hospital network in the Midwest). We wrap up with a great discussion on the rise of professor-founded "neo-labs," why academia got spoiled during the deep learning boom, and an encouraging message for PhD students who feel lost right now.</p><hr /><p></p><p><b>Timeline:</b></p><p><b>(00:00)</b> Intro and welcome</p><p><b>(01:25)</b> Why healthcare is uniquely hard</p><p><b>(04:46)</b> Who owns your medical records? — The case for patient-controlled data and tapping your phone at the doctor's office</p><p><b>(06:43)</b> Centralized vs. decentralized healthcare — comparing Israel, Korea, and the US</p><p><b>(13:19)</b> Why most existing health data isn't as useful as we think — selection bias and the lack of randomization</p><p><b>(16:53)</b> The "toss a coin" proposal — continuous clinical trials through automated randomization, and the surprising connection to LLM sampling.</p><p><b>(23:07)</b> Drug discovery's broken pipeline — why stage-wise optimization is failing, and we need end-to-end thinking</p><p><b>(28:30)</b> Why the current system is already failing society — wearables, preventive care, and the case for urgency</p><p><b>(31:13)</b> Allen's personal healthcare journey and the GLP-1 conversation</p><p><b>(33:13)</b> GLP-1 deep dive — 40 years from discovery to weight loss drugs, brain receptors, and embracing uncertainty</p><p><b>(36:28)</b> Why antibiotic R&amp;D is "economic suicide" and how AI can help</p><p><b>(42:52)</b> Language models in the clinic and the lab — from clinical notes to back-propagating clinical outcomes, all the way to molecular design</p><p><b>(48:04)</b> Do you need domain expertise, or can you throw compute at it?</p><p><b>(54:30)</b> The $10 billion question — distributed GPU clouds and a patient-in-the-loop drug discovery system</p><p><b>(58:28)</b> Vertical scaling vs. horizontal scaling for healthcare AI</p><p><b>(1:01:06)</b> AI regulation — who's missing from the conversation and why regulation should follow deployment</p><p></p><p><b>(1:06:52)</b> Professors as founders and the "neo-lab" phenomenon — how Ilya cracked the code</p><p><b>(1:11:18)</b> Can neo-labs actually ship products? Why researchers should do research</p><p><b>(1:13:09)</b> Academia got spoiled — the deep learning anomaly is ending, and that's okay</p><p><b>(1:16:07)</b> Closing message — why it's a great time to be a PhD student and researcher</p><hr /><p></p><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed<p></p><hr /><p></p></li></ul><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">716148aa-fd15-4e7f-8cdf-2faa380ab084</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Tue, 24 Mar 2026 05:11:26 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/3a22483b854be3650ccb922c28b0dc4b402c4995693e91ef1ad23597df7d0b99/eyJlcGlzb2RlSWQiOiI3MTYxNDhhYS1mZDE1LTRlN2YtOGNkZi0yZmFhMzgwYWIwODQiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjljMjEyZTQ2MDBiN2VhNWUwYTc4ZGYyL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi0zLTI0X181LTI4LTE5Lm1wMyJ9.mp3" length="112763237" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/716148aa-fd15-4e7f-8cdf-2faa380ab084/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;We talk with Kyunghyun Cho, who is a Professor of Health Statistics and a Professor of Computer Science and Data Science at New York University, and a former &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://www.linkedin.com/company/2276/&quot; target=&quot;_blank&quot;&gt;Executive Director&lt;/a&gt; at Genentech, about why healthcare might be the most important and most difficult domain for AI to transform. Kyunghyun shares his vision for a future where patients own their own medical records, proposes a provocative idea for running continuous society-level clinical trials by having doctors &quot;toss a coin&quot; between plausible diagnoses, and explains why drug discovery&apos;s stage-wise pipeline has hit a wall that only end-to-end AI thinking can break through. We also get into GLP-1 drugs and why they&apos;re more mysterious than people realize, the brutal economics of antibiotic research, how language models trained across scientific literature and clinical data could compress 50 years of drug development into five, and what Kyunghyun would do with $10 billion (spoiler: buy a hospital network in the Midwest). We wrap up with a great discussion on the rise of professor-founded &quot;neo-labs,&quot; why academia got spoiled during the deep learning boom, and an encouraging message for PhD students who feel lost right now.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Timeline:&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;(00:00)&lt;/b&gt; Intro and welcome&lt;/p&gt;&lt;p&gt;&lt;b&gt;(01:25)&lt;/b&gt; Why healthcare is uniquely hard&lt;/p&gt;&lt;p&gt;&lt;b&gt;(04:46)&lt;/b&gt; Who owns your medical records? — The case for patient-controlled data and tapping your phone at the doctor&apos;s office&lt;/p&gt;&lt;p&gt;&lt;b&gt;(06:43)&lt;/b&gt; Centralized vs. decentralized healthcare — comparing Israel, Korea, and the US&lt;/p&gt;&lt;p&gt;&lt;b&gt;(13:19)&lt;/b&gt; Why most existing health data isn&apos;t as useful as we think — selection bias and the lack of randomization&lt;/p&gt;&lt;p&gt;&lt;b&gt;(16:53)&lt;/b&gt; The &quot;toss a coin&quot; proposal — continuous clinical trials through automated randomization, and the surprising connection to LLM sampling.&lt;/p&gt;&lt;p&gt;&lt;b&gt;(23:07)&lt;/b&gt; Drug discovery&apos;s broken pipeline — why stage-wise optimization is failing, and we need end-to-end thinking&lt;/p&gt;&lt;p&gt;&lt;b&gt;(28:30)&lt;/b&gt; Why the current system is already failing society — wearables, preventive care, and the case for urgency&lt;/p&gt;&lt;p&gt;&lt;b&gt;(31:13)&lt;/b&gt; Allen&apos;s personal healthcare journey and the GLP-1 conversation&lt;/p&gt;&lt;p&gt;&lt;b&gt;(33:13)&lt;/b&gt; GLP-1 deep dive — 40 years from discovery to weight loss drugs, brain receptors, and embracing uncertainty&lt;/p&gt;&lt;p&gt;&lt;b&gt;(36:28)&lt;/b&gt; Why antibiotic R&amp;amp;D is &quot;economic suicide&quot; and how AI can help&lt;/p&gt;&lt;p&gt;&lt;b&gt;(42:52)&lt;/b&gt; Language models in the clinic and the lab — from clinical notes to back-propagating clinical outcomes, all the way to molecular design&lt;/p&gt;&lt;p&gt;&lt;b&gt;(48:04)&lt;/b&gt; Do you need domain expertise, or can you throw compute at it?&lt;/p&gt;&lt;p&gt;&lt;b&gt;(54:30)&lt;/b&gt; The $10 billion question — distributed GPU clouds and a patient-in-the-loop drug discovery system&lt;/p&gt;&lt;p&gt;&lt;b&gt;(58:28)&lt;/b&gt; Vertical scaling vs. horizontal scaling for healthcare AI&lt;/p&gt;&lt;p&gt;&lt;b&gt;(1:01:06)&lt;/b&gt; AI regulation — who&apos;s missing from the conversation and why regulation should follow deployment&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;(1:06:52)&lt;/b&gt; Professors as founders and the &quot;neo-lab&quot; phenomenon — how Ilya cracked the code&lt;/p&gt;&lt;p&gt;&lt;b&gt;(1:11:18)&lt;/b&gt; Can neo-labs actually ship products? Why researchers should do research&lt;/p&gt;&lt;p&gt;&lt;b&gt;(1:13:09)&lt;/b&gt; Academia got spoiled — the deep learning anomaly is ending, and that&apos;s okay&lt;/p&gt;&lt;p&gt;&lt;b&gt;(1:16:07)&lt;/b&gt; Closing message — why it&apos;s a great time to be a PhD student and researcher&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;p&gt;&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:18:18</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/716148aa-fd15-4e7f-8cdf-2faa380ab084/images/861f286e-93c5-47cf-b527-61e821d432c7.png"/><itunes:season>1</itunes:season><itunes:episode>31</itunes:episode><itunes:title>Why Healthcare Is AI&apos;s Hardest and Most Important Problem with Kyunghyun Cho (NYU) </itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[Diffusion LLM & Why the Future of AI Won't Be Autoregressive -  Stefano Ermon (Stanford /Inception)]]></title><description><![CDATA[<p></p><p>In this episode, we talk with Stefano Ermon,  Stanford professor, co-founder &amp; CEO of Inception AI, and co-inventor of DDIM, FlashAttention, DPO, and score-based/diffusion models, about why diffusion-based language models may overtake the autoregressive paradigm that dominates today's LLMs.</p><p></p><p>We start with the fundamental topics, such as what diffusion models actually are, and why iterative refinement (starting from noise, progressively denoising) offers structural advantages over autoregressive generation.</p><p>From there,  we dive into the technical core of diffusion LLMs. Stefano explains how discrete diffusion works on text, why masking is just one of many possible noise processes, and how the mathematics of score matching carries over from the continuous image setting with surprising elegance.</p><p>A major theme is the inference advantage. Because diffusion models produce multiple tokens in parallel, they can be dramatically faster than autoregressive models at inference time. Stefano argues this fundamentally changes the cost-quality Pareto frontier, and becomes especially powerful in RL-based post-training.</p><p>We also discuss Inception AI's Mercury II model, which Stefano describes as best-in-class for latency-constrained tasks like voice agents and code completion.</p><p>In the final part, we get into broader questions  - why transformers work so well, research advice for PhD students, whether recursive self-improvement is imminent, the real state of AI coding tools, and Stefano's journey from academia to startup founder.</p><hr /><p></p><p>TIMESTAMPS</p><p>0:12 – Introduction<br />1:08 – Origins of diffusion models: from GANs to score-based models in 2019<br />3:13 – Diffusion vs. autoregressive: the typewriter vs. editor analogy<br />4:43 – Speed, creativity, and quality trade-offs between the two approaches<br />7:44 – Temperature and sampling in diffusion LLMs — why it's more subtle than you think<br />9:56 – Can diffusion LLMs scale? Inception AI and Gemini Diffusion as proof points<br />11:50 – State space models and hybrid transformer architectures<br />13:03 – Scaling laws for diffusion: pre-training, post-training, and test-time compute<br />14:33 – Ecosystem and tooling: what transfers and what doesn't<br />16:58 – From images to text: how discrete diffusion actually works<br />19:59 – Theory vs. practice in deep learning<br />21:50 – Loss functions and scoring rules for generative models<br />23:12 – Mercury II and where diffusion LLMs already win<br />26:20 – Creativity, slop, and output diversity in parallel generation<br />28:43 – Hardware for diffusion models: why current GPUs favor autoregressive workloads<br />30:56 – Optimization algorithms and managing technical risk at a startup<br />32:46 – Why do transformers work so well?<br />33:30 – Research advice for PhD students: focus on inference<br />34:57 – Recursive self-improvement and AGI timelines<br />35:56 – Will AI replace software engineers? Real-world experience at Inception<br />37:54 – Professor vs. startup founder: different execution, similar mission<br />39:56 – The founding story of Inception AI — from ICML Best Paper to company<br />42:30 – The researcher-to-founder pipeline and big funding rounds<br />45:02 – PhD vs. industry in 2026: the widening financial gap<br />47:30 – The industry in 5-10 years: Stefano's outlook</p><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>Changes: trimmed</li></ul><hr /><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">897858a7-274a-4998-8fd2-69c97319d15c</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Thu, 19 Mar 2026 01:41:21 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/f65c362876a6a353881eadbca136db2efbe4f11780e78bba7ae9339f9fb38404/eyJlcGlzb2RlSWQiOiI4OTc4NThhNy0yNzRhLTQ5OTgtOGZkMi02OWM5NzMxOWQxNWMiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjliOGMyOWYxM2Y1MzdjZTAwZTQyM2FiL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi0zLTE3X18zLTU1LTI3Lm1wMyJ9.mp3" length="70995948" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/897858a7-274a-4998-8fd2-69c97319d15c/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;&lt;/p&gt;&lt;p&gt;In this episode, we talk with Stefano Ermon,  Stanford professor, co-founder &amp;amp; CEO of Inception AI, and co-inventor of DDIM, FlashAttention, DPO, and score-based/diffusion models, about why diffusion-based language models may overtake the autoregressive paradigm that dominates today&apos;s LLMs.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;We start with the fundamental topics, such as what diffusion models actually are, and why iterative refinement (starting from noise, progressively denoising) offers structural advantages over autoregressive generation.&lt;/p&gt;&lt;p&gt;From there,  we dive into the technical core of diffusion LLMs. Stefano explains how discrete diffusion works on text, why masking is just one of many possible noise processes, and how the mathematics of score matching carries over from the continuous image setting with surprising elegance.&lt;/p&gt;&lt;p&gt;A major theme is the inference advantage. Because diffusion models produce multiple tokens in parallel, they can be dramatically faster than autoregressive models at inference time. Stefano argues this fundamentally changes the cost-quality Pareto frontier, and becomes especially powerful in RL-based post-training.&lt;/p&gt;&lt;p&gt;We also discuss Inception AI&apos;s Mercury II model, which Stefano describes as best-in-class for latency-constrained tasks like voice agents and code completion.&lt;/p&gt;&lt;p&gt;In the final part, we get into broader questions  - why transformers work so well, research advice for PhD students, whether recursive self-improvement is imminent, the real state of AI coding tools, and Stefano&apos;s journey from academia to startup founder.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;TIMESTAMPS&lt;/p&gt;&lt;p&gt;0:12 – Introduction&lt;br /&gt;1:08 – Origins of diffusion models: from GANs to score-based models in 2019&lt;br /&gt;3:13 – Diffusion vs. autoregressive: the typewriter vs. editor analogy&lt;br /&gt;4:43 – Speed, creativity, and quality trade-offs between the two approaches&lt;br /&gt;7:44 – Temperature and sampling in diffusion LLMs — why it&apos;s more subtle than you think&lt;br /&gt;9:56 – Can diffusion LLMs scale? Inception AI and Gemini Diffusion as proof points&lt;br /&gt;11:50 – State space models and hybrid transformer architectures&lt;br /&gt;13:03 – Scaling laws for diffusion: pre-training, post-training, and test-time compute&lt;br /&gt;14:33 – Ecosystem and tooling: what transfers and what doesn&apos;t&lt;br /&gt;16:58 – From images to text: how discrete diffusion actually works&lt;br /&gt;19:59 – Theory vs. practice in deep learning&lt;br /&gt;21:50 – Loss functions and scoring rules for generative models&lt;br /&gt;23:12 – Mercury II and where diffusion LLMs already win&lt;br /&gt;26:20 – Creativity, slop, and output diversity in parallel generation&lt;br /&gt;28:43 – Hardware for diffusion models: why current GPUs favor autoregressive workloads&lt;br /&gt;30:56 – Optimization algorithms and managing technical risk at a startup&lt;br /&gt;32:46 – Why do transformers work so well?&lt;br /&gt;33:30 – Research advice for PhD students: focus on inference&lt;br /&gt;34:57 – Recursive self-improvement and AGI timelines&lt;br /&gt;35:56 – Will AI replace software engineers? Real-world experience at Inception&lt;br /&gt;37:54 – Professor vs. startup founder: different execution, similar mission&lt;br /&gt;39:56 – The founding story of Inception AI — from ICML Best Paper to company&lt;br /&gt;42:30 – The researcher-to-founder pipeline and big funding rounds&lt;br /&gt;45:02 – PhD vs. industry in 2026: the widening financial gap&lt;br /&gt;47:30 – The industry in 5-10 years: Stefano&apos;s outlook&lt;/p&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>00:49:18</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/897858a7-274a-4998-8fd2-69c97319d15c/images/aa95473d-de64-468e-9f0d-790794b6ba8d.png"/><itunes:season>1</itunes:season><itunes:episode>30</itunes:episode><itunes:title>Diffusion LLM &amp; Why the Future of AI Won&apos;t Be Autoregressive -  Stefano Ermon (Stanford /Inception)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[Training Is Nothing Like Learning with Naomi Saphra (Harvard)]]></title><description><![CDATA[<p>Naomi Saphra, Kempner Research Fellow at Harvard and incoming Assistant Professor at Boston University, joins us to explain why you can't do interpretability without understanding training dynamics,  in the same way you can't do biology without evolution.</p><p>Naomi argues that many structures researchers find inside trained models are vestigial, they mattered early in training but are meaningless by the end. Grokking is one case of a broader phenomenon: models go through multiple consecutive phase transitions during training, driven by symmetry breaking and head specialization, but the smooth loss curve hides all of it. We talk about why training is nothing like human learning, and why our intuitions about what's hard for models are consistently wrong  -  code in pretraining helps language reasoning, tokenization drives behaviors people attribute to deeper cognition, and language already encodes everything humans care about. We also get into why SAEs are basically topic models, the Platonic representation hypothesis, using AI to decode animal communication, and why non-determinism across training runs is a real problem that RL and MoE might be making worse.</p><hr /><p>Timeline: </p><p>(00:12) Introduction and guest welcome </p><p>(01:01) Why training dynamics matter -  the evolutionary biology analogy </p><p>(03:05) Jennifer Aniston neurons and the danger of biological parallels </p><p>(04:48) What is grokking and why it's one instance of a broader phenomenon </p><p>(08:25) Phase transitions, symmetry breaking, and head specialization </p><p>(11:53) Double descent, overfitting, and the death of classical train-test splits </p><p>(15:10) Training is nothing like learning </p><p>(16:08) Scaling axes -  data, model size, compute, and why they're not interchangeable </p><p>(19:29) Data quality, code as reasoning fuel, and GPT-2's real contribution </p><p>(20:43) Multilingual models and the interlingua hypothesis </p><p>(25:58) The Platonic representation hypothesis and why image classification was always multimodal </p><p>(29:12) Sparse autoencoders, interpretability, and Marr's levels </p><p>(37:32) Can we ever truly understand what models know? </p><p>(43:59) The language modality chauvinist argument </p><p>(51:55) Vision, redundancy, and self-supervised learning </p><p>(57:18) World models -  measurable capabilities over philosophical definitions </p><p>(1:00:14) Is coding really a solved task? </p><p>(1:04:18) Non-determinism, scaling laws, and why one training run isn't enough </p><p>(1:10:12) Naomi's new lab at BU and recruiting</p><hr /><p></p><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0. </li><li>Changes: trimmed<p></p></li></ul><hr /><p>About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">5bcda987-d014-45bd-8e88-65f04004169e</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Fri, 13 Mar 2026 20:15:48 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/6a9c81f42203dcd057d07d9743fb4361002002743170487f536295b12ff23533/eyJlcGlzb2RlSWQiOiI1YmNkYTk4Ny1kMDE0LTQ1YmQtOGU4OC02NWYwNDAwNDE2OWUiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjliNDA3YjhiM2NlNTA2OTVhYmY1ZWUxL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi0zLTEzX18xMy00OC01Ni5tcDMifQ==.mp3" length="103047566" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/5bcda987-d014-45bd-8e88-65f04004169e/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;Naomi Saphra, Kempner Research Fellow at Harvard and incoming Assistant Professor at Boston University, joins us to explain why you can&apos;t do interpretability without understanding training dynamics,  in the same way you can&apos;t do biology without evolution.&lt;/p&gt;&lt;p&gt;Naomi argues that many structures researchers find inside trained models are vestigial, they mattered early in training but are meaningless by the end. Grokking is one case of a broader phenomenon: models go through multiple consecutive phase transitions during training, driven by symmetry breaking and head specialization, but the smooth loss curve hides all of it. We talk about why training is nothing like human learning, and why our intuitions about what&apos;s hard for models are consistently wrong  -  code in pretraining helps language reasoning, tokenization drives behaviors people attribute to deeper cognition, and language already encodes everything humans care about. We also get into why SAEs are basically topic models, the Platonic representation hypothesis, using AI to decode animal communication, and why non-determinism across training runs is a real problem that RL and MoE might be making worse.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;Timeline: &lt;/p&gt;&lt;p&gt;(00:12) Introduction and guest welcome &lt;/p&gt;&lt;p&gt;(01:01) Why training dynamics matter -  the evolutionary biology analogy &lt;/p&gt;&lt;p&gt;(03:05) Jennifer Aniston neurons and the danger of biological parallels &lt;/p&gt;&lt;p&gt;(04:48) What is grokking and why it&apos;s one instance of a broader phenomenon &lt;/p&gt;&lt;p&gt;(08:25) Phase transitions, symmetry breaking, and head specialization &lt;/p&gt;&lt;p&gt;(11:53) Double descent, overfitting, and the death of classical train-test splits &lt;/p&gt;&lt;p&gt;(15:10) Training is nothing like learning &lt;/p&gt;&lt;p&gt;(16:08) Scaling axes -  data, model size, compute, and why they&apos;re not interchangeable &lt;/p&gt;&lt;p&gt;(19:29) Data quality, code as reasoning fuel, and GPT-2&apos;s real contribution &lt;/p&gt;&lt;p&gt;(20:43) Multilingual models and the interlingua hypothesis &lt;/p&gt;&lt;p&gt;(25:58) The Platonic representation hypothesis and why image classification was always multimodal &lt;/p&gt;&lt;p&gt;(29:12) Sparse autoencoders, interpretability, and Marr&apos;s levels &lt;/p&gt;&lt;p&gt;(37:32) Can we ever truly understand what models know? &lt;/p&gt;&lt;p&gt;(43:59) The language modality chauvinist argument &lt;/p&gt;&lt;p&gt;(51:55) Vision, redundancy, and self-supervised learning &lt;/p&gt;&lt;p&gt;(57:18) World models -  measurable capabilities over philosophical definitions &lt;/p&gt;&lt;p&gt;(1:00:14) Is coding really a solved task? &lt;/p&gt;&lt;p&gt;(1:04:18) Non-determinism, scaling laws, and why one training run isn&apos;t enough &lt;/p&gt;&lt;p&gt;(1:10:12) Naomi&apos;s new lab at BU and recruiting&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0. &lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:11:34</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/5bcda987-d014-45bd-8e88-65f04004169e/images/6d40da61-7972-4b5a-99cd-545c996eb050.png"/><itunes:season>1</itunes:season><itunes:episode>29</itunes:episode><itunes:title>Training Is Nothing Like Learning with Naomi Saphra (Harvard)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP28: How to Control a Stochastic Agent with Stefano Soatto (VP AWS/ Pro. UCLA)]]></title><description><![CDATA[<p>Stefano Soatto, VP for AI at AWS and Professor at UCLA,  the person responsible for agentic AI at AWS, joins us to explain why building reliable AI agents is fundamentally a control theory problem.</p><p>Stefano sees LLMs as stochastic dynamical systems that need to be controlled, not just prompted. He introduces "strands coding," a new framework AWS is building that sits between vibe coding and spec coding, you write a skeleton with AI functions constrained by pre- and post-conditions, verifying intent before a single line of code is generated. The surprising part: even as AI coding adoption goes up, developer trust in the output is going down.</p><p>We go deep into the philosophy of models and the world. Stefano argues that the dichotomy between "language models" and "world models" doesn't really exist, where a reasoning engine trained on rich enough data <i>is</i> a world model. He walks us through why naive realism is indefensible, how reverse diffusion was originally intended to show that models can't be identical to reality, and why that matters now.</p><p>We also discuss three types of information, Shannon, algorithmic, and conceptual, and why algorithmic information is the one that actually matters to agents. Synthetic data doesn't add Shannon information, but it adds algorithmic information, which is why it works. Intelligence isn't about scaling to Solomonov's universal induction; it's about learning to solve new problems fast.</p><hr /><p></p><p>Takeaways:</p><ul><li>Vibe coding is local feedback control with high cognitive load; spec coding is open-loop global control with silent failures, neither scales well alone.</li><li>Trust in AI-generated code is declining even as adoption rises.</li><li>The distinction between next-token prediction and world model is mostly nomenclature - reasoning engines operating on multimodal data are world models.</li><li>Algorithmic information, not Shannon information, is what matters in the agentic setting.</li><li>Intelligence isn't minimizing inference uncertainty - it's minimizing time to solve unforeseen tasks.</li><li>The intent gap between user and model cannot be fully automated or delegated.<hr /><p></p></li></ul><p>Timeline</p><p>(00:13) Introduction and guest welcome</p><p>(01:12) How the agentic era changed machine learning</p><p>(06:11) Vibe coding one year later</p><p>(07:23) Vibe vs. spec vs. strands coding</p><p>(14:30) Why English is not a programming language</p><p>(16:36) Constrained generation and agent choreography</p><p>(20:44) Diffusion models vs. autoregressive models (25:59) The platonic representation hypothesis and naive realism</p><p>(31:14) Synthetic data and the information bottleneck</p><p>(36:22) Three types of information: Shannon, algorithmic, conceptual</p><p>(38:47) Scaling laws and Solomonov induction</p><p>(42:14) World models and the Goethian vs. Marrian approach</p><p>(49:00) Encoding vs. generation and JEPA-style training</p><p>(55:50) Are language models already world models?</p><p>(59:13) Closing thoughts on trust, education, and responsibility.</p><hr /><p></p><p>Music:</p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0. Changes: trimmed</li></ul><hr /><p>About</p><p>The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">983a748a-8e18-4bcb-8db9-942b20b586bf</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Fri, 06 Mar 2026 14:04:28 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/8dd19f3f56b7f0b1e65a892f97ed8442837d510119927cb7ab4e0a75fc097b1e/eyJlcGlzb2RlSWQiOiI5ODNhNzQ4YS04ZTE4LTRiY2ItOGRiOS05NDJiMjBiNTg2YmYiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjlhYTQyODM4NWQ5Y2MzYTY3YWVlZGM1L3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi0zLTZfXzMtNTctNy5tcDMifQ==.mp3" length="90003478" type="audio/mpeg"/><podcast:transcript url="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/983a748a-8e18-4bcb-8db9-942b20b586bf/transcripts.txt" type="text/plain"/><itunes:summary>&lt;p&gt;Stefano Soatto, VP for AI at AWS and Professor at UCLA,  the person responsible for agentic AI at AWS, joins us to explain why building reliable AI agents is fundamentally a control theory problem.&lt;/p&gt;&lt;p&gt;Stefano sees LLMs as stochastic dynamical systems that need to be controlled, not just prompted. He introduces &quot;strands coding,&quot; a new framework AWS is building that sits between vibe coding and spec coding, you write a skeleton with AI functions constrained by pre- and post-conditions, verifying intent before a single line of code is generated. The surprising part: even as AI coding adoption goes up, developer trust in the output is going down.&lt;/p&gt;&lt;p&gt;We go deep into the philosophy of models and the world. Stefano argues that the dichotomy between &quot;language models&quot; and &quot;world models&quot; doesn&apos;t really exist, where a reasoning engine trained on rich enough data &lt;i&gt;is&lt;/i&gt; a world model. He walks us through why naive realism is indefensible, how reverse diffusion was originally intended to show that models can&apos;t be identical to reality, and why that matters now.&lt;/p&gt;&lt;p&gt;We also discuss three types of information, Shannon, algorithmic, and conceptual, and why algorithmic information is the one that actually matters to agents. Synthetic data doesn&apos;t add Shannon information, but it adds algorithmic information, which is why it works. Intelligence isn&apos;t about scaling to Solomonov&apos;s universal induction; it&apos;s about learning to solve new problems fast.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Takeaways:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Vibe coding is local feedback control with high cognitive load; spec coding is open-loop global control with silent failures, neither scales well alone.&lt;/li&gt;&lt;li&gt;Trust in AI-generated code is declining even as adoption rises.&lt;/li&gt;&lt;li&gt;The distinction between next-token prediction and world model is mostly nomenclature - reasoning engines operating on multimodal data are world models.&lt;/li&gt;&lt;li&gt;Algorithmic information, not Shannon information, is what matters in the agentic setting.&lt;/li&gt;&lt;li&gt;Intelligence isn&apos;t minimizing inference uncertainty - it&apos;s minimizing time to solve unforeseen tasks.&lt;/li&gt;&lt;li&gt;The intent gap between user and model cannot be fully automated or delegated.&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;Timeline&lt;/p&gt;&lt;p&gt;(00:13) Introduction and guest welcome&lt;/p&gt;&lt;p&gt;(01:12) How the agentic era changed machine learning&lt;/p&gt;&lt;p&gt;(06:11) Vibe coding one year later&lt;/p&gt;&lt;p&gt;(07:23) Vibe vs. spec vs. strands coding&lt;/p&gt;&lt;p&gt;(14:30) Why English is not a programming language&lt;/p&gt;&lt;p&gt;(16:36) Constrained generation and agent choreography&lt;/p&gt;&lt;p&gt;(20:44) Diffusion models vs. autoregressive models (25:59) The platonic representation hypothesis and naive realism&lt;/p&gt;&lt;p&gt;(31:14) Synthetic data and the information bottleneck&lt;/p&gt;&lt;p&gt;(36:22) Three types of information: Shannon, algorithmic, conceptual&lt;/p&gt;&lt;p&gt;(38:47) Scaling laws and Solomonov induction&lt;/p&gt;&lt;p&gt;(42:14) World models and the Goethian vs. Marrian approach&lt;/p&gt;&lt;p&gt;(49:00) Encoding vs. generation and JEPA-style training&lt;/p&gt;&lt;p&gt;(55:50) Are language models already world models?&lt;/p&gt;&lt;p&gt;(59:13) Closing thoughts on trust, education, and responsibility.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0. Changes: trimmed&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;About&lt;/p&gt;&lt;p&gt;The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:02:30</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/983a748a-8e18-4bcb-8db9-942b20b586bf/images/5e16fdce-1ccd-46e6-b138-9ef37a22a23e.png"/><itunes:season>1</itunes:season><itunes:episode>28</itunes:episode><itunes:title>EP28: How to Control a Stochastic Agent with Stefano Soatto (VP AWS/ Pro. UCLA)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP27: Medical Foundation Models - with Tanishq Abraham (Sophont.AI)]]></title><description><![CDATA[<p><b>Tanishq Abraham</b>, CEO and co-founder of <a rel="noopener noreferrer nofollow" href="http://Sophont.ai" target="_blank">Sophont.ai</a>, joins us to talk about building foundation models specifically for medicine.</p><p>Sophont is trying to be something like an OpenAI or Anthropic but for healthcare  - training models across pathology, neuroimaging, and clinical text, to eventually fuse them into one multimodal system. The surprising part: their pathology model trained on 12,000 public slides performs on par with models trained on millions of private ones. Data quality beats data quantity.</p><p>We talk about what actually excites Tanishq, which is not replacing doctors, but finding things doctors can't see. AI predicting gene mutations from a tissue slide, or cardiovascular risk from an eye scan.</p><p>We also talk about the regulation and how the picture is less scary than people assume. Text-based clinical decision support can ship without FDA approval. Pharma partnerships offer near-term impact. The five-to-ten-year timeline people fear is really about drug discovery, not all of medical AI.</p><p></p><p><b>Takeaways:</b></p><ul><li>The real promise of medical AI is finding hidden signals in existing data, not just automating doctors</li><li>Small, curated public datasets can rival massive private ones</li><li>Multimodal fusion is the goal, but you need strong individual encoders first</li><li>AI research itself might get automated sooner than biology or chemistry</li><li>FDA regulation has more flexibility than most people think</li></ul><hr /><p></p><p><b>Timeline</b></p><p>(00:12) Introduction and guest welcome</p><p>(02:32) Anthropic's ad about ChatGPT ads</p><p>(07:26) XAI merging into SpaceX</p><p>(13:32) Vibe coding one year later</p><p>(17:00) Claude Code and agentic workflows</p><p>(21:52) Can AI automate AI research?</p><p>(26:57) What is medical AI</p><p>(31:06) Sofont as a frontier medical AI lab</p><p>(33:52) Public vs. private data - 12K slides vs. millions</p><p>(36:43) Domain expertise vs. scaling</p><p>(41:54) Cancer, diabetes, and personal stakes</p><p>(47:52) Classification vs. prediction in medicine</p><p>(50:36) When doctors disagree</p><p>(54:43) Quackery and AI</p><p>(57:15) Uncertainty in medical AI</p><p>(1:03:11) Will AI replace doctors?</p><p>(1:07:24) Self-supervised learning on sleep data</p><p>(1:10:10) Aligning modalities</p><p>(1:13:17) FDA regulation</p><p>(1:22:28) Closing </p><hr /><p></p><p><b>Music:</b></p><ul><li>"Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li><li>"Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.</li></ul><p>Changes: trimmed</p><hr /><p></p><p><b>About</b></p><p> The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">7c0e949c-407e-435c-bc06-30084dda3097</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 02 Mar 2026 04:42:56 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/1657cd170b00a1a04a238150e746f81d53750f98896c598f5a255678d01d4779/eyJlcGlzb2RlSWQiOiI3YzBlOTQ5Yy00MDdlLTQzNWMtYmMwNi0zMDA4NGRkYTMwOTciLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjlhNTBmZTNjZDE4NzE5MWRiM2Y0ZDdmL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi0zLTJfXzUtMTktNDYubXAzIn0=.mp3" length="123256938" type="audio/mpeg"/><itunes:summary>&lt;p&gt;&lt;b&gt;Tanishq Abraham&lt;/b&gt;, CEO and co-founder of &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;http://Sophont.ai&quot; target=&quot;_blank&quot;&gt;Sophont.ai&lt;/a&gt;, joins us to talk about building foundation models specifically for medicine.&lt;/p&gt;&lt;p&gt;Sophont is trying to be something like an OpenAI or Anthropic but for healthcare  - training models across pathology, neuroimaging, and clinical text, to eventually fuse them into one multimodal system. The surprising part: their pathology model trained on 12,000 public slides performs on par with models trained on millions of private ones. Data quality beats data quantity.&lt;/p&gt;&lt;p&gt;We talk about what actually excites Tanishq, which is not replacing doctors, but finding things doctors can&apos;t see. AI predicting gene mutations from a tissue slide, or cardiovascular risk from an eye scan.&lt;/p&gt;&lt;p&gt;We also talk about the regulation and how the picture is less scary than people assume. Text-based clinical decision support can ship without FDA approval. Pharma partnerships offer near-term impact. The five-to-ten-year timeline people fear is really about drug discovery, not all of medical AI.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Takeaways:&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;The real promise of medical AI is finding hidden signals in existing data, not just automating doctors&lt;/li&gt;&lt;li&gt;Small, curated public datasets can rival massive private ones&lt;/li&gt;&lt;li&gt;Multimodal fusion is the goal, but you need strong individual encoders first&lt;/li&gt;&lt;li&gt;AI research itself might get automated sooner than biology or chemistry&lt;/li&gt;&lt;li&gt;FDA regulation has more flexibility than most people think&lt;/li&gt;&lt;/ul&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Timeline&lt;/b&gt;&lt;/p&gt;&lt;p&gt;(00:12) Introduction and guest welcome&lt;/p&gt;&lt;p&gt;(02:32) Anthropic&apos;s ad about ChatGPT ads&lt;/p&gt;&lt;p&gt;(07:26) XAI merging into SpaceX&lt;/p&gt;&lt;p&gt;(13:32) Vibe coding one year later&lt;/p&gt;&lt;p&gt;(17:00) Claude Code and agentic workflows&lt;/p&gt;&lt;p&gt;(21:52) Can AI automate AI research?&lt;/p&gt;&lt;p&gt;(26:57) What is medical AI&lt;/p&gt;&lt;p&gt;(31:06) Sofont as a frontier medical AI lab&lt;/p&gt;&lt;p&gt;(33:52) Public vs. private data - 12K slides vs. millions&lt;/p&gt;&lt;p&gt;(36:43) Domain expertise vs. scaling&lt;/p&gt;&lt;p&gt;(41:54) Cancer, diabetes, and personal stakes&lt;/p&gt;&lt;p&gt;(47:52) Classification vs. prediction in medicine&lt;/p&gt;&lt;p&gt;(50:36) When doctors disagree&lt;/p&gt;&lt;p&gt;(54:43) Quackery and AI&lt;/p&gt;&lt;p&gt;(57:15) Uncertainty in medical AI&lt;/p&gt;&lt;p&gt;(1:03:11) Will AI replace doctors?&lt;/p&gt;&lt;p&gt;(1:07:24) Self-supervised learning on sleep data&lt;/p&gt;&lt;p&gt;(1:10:10) Aligning modalities&lt;/p&gt;&lt;p&gt;(1:13:17) FDA regulation&lt;/p&gt;&lt;p&gt;(1:22:28) Closing &lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Music:&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;Changes: trimmed&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;About&lt;/b&gt;&lt;/p&gt;&lt;p&gt; The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:25:36</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/7c0e949c-407e-435c-bc06-30084dda3097/images/1500ae21-ddc6-49b1-ac1b-725e6391eac4.png"/><itunes:season>1</itunes:season><itunes:episode>27</itunes:episode><itunes:title>EP27: Medical Foundation Models - with Tanishq Abraham (Sophont.AI)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP26: Measuring Intelligence in the Wild -  Arena and the Future of AI Evaluation]]></title><description><![CDATA[<p><b>Anastasios Angelopoulos</b>, Co-Founder and CEO of <b>Arena AI </b>(formerly LMArena), joins us to talk about why static benchmarks are failing, how human preference data actually works under the hood, and what it takes to be the "gold standard" of AI evaluation.</p><p>Anastasios sits at a fascinating intersection -   a theoretical statistician running the platform that every major lab watches when they release a model. We talk about the messiness of AI-generated code slop (yes, he hides Claude's commits too), then dig into the statistical machinery that powers Arena's leaderboards and why getting evaluation right is harder than most people think.</p><p>We explore why style control is both necessary and philosophically tricky, where you can regress away markdown headers and response length, but separating style from substance is a genuinely unsolved causal inference problem. We also get into why users are surprisingly good judges of model quality, how Arena serves as a pre-release testing ground for labs shipping stealth models under codenames, and whether the fragmentation of the AI market (Anthropic going enterprise, OpenAI going consumer, everyone going multimodal) is actually a feature, not a bug. Plus, we discuss the role of rigorous statistics in the age of "just run it again," why structured decoding can hurt model performance, and what Arena's 2026 roadmap looks like.</p><hr /><p></p><p><b>Timeline:</b></p><p>(00:12) Introduction and Anastasios's Background</p><p>(00:55) What Arena Does and Why Static Benchmarks Aren't Enough</p><p>(02:26) Coverage of Use Cases - Is There Enough?</p><p>(04:22) Style Control and the Bradley-Terry Methodology</p><p>(08:35) Can You Actually Separate Style from Substance?</p><p>(10:24) Measuring Slop - And the Anti-Slop Paper Plug</p><p>(11:52) Can Users Judge Factual Correctness?</p><p>(13:31) Tool Use and Agentic Evaluation on Arena</p><p>(14:14) Intermediate Feedback Signals Beyond Final Preference</p><p>(15:30) Tool Calling Accuracy and Code Arena</p><p>(17:42) AI-Generated Code Slop and Hiding Claude's Commits</p><p>(19:49) Do We Need Separate Code Streams for Humans and LLMs?</p><p>(20:01) RL Flywheels and Arena's Preference Data</p><p>(21:16) Focus as a Startup - Being the Evaluation Company</p><p>(22:16) Structured vs. Unconstrained Generation</p><p>(25:00) The Role of Rigorous Statistics in the Age of AI</p><p>(29:23) LLM Sampling Parameters and Evaluation Complexity</p><p>(30:56) Model Versioning and the Frequentist Approach to Fairness</p><p>(32:12) Quantization and Its Effects on Model Quality</p><p>(33:10) Pre-Release Testing and Stealth Models (34:23) Transparency - What to Share with the Public vs. Labs</p><p>(36:27) When Winning Models Don't Get Released</p><p>(36:59) Why Users Keep Coming Back to Arena</p><p>(38:19) Market Fragmentation and Arena's Future Value</p><p>(39:37) Custom Evaluation Frameworks for Specific Users</p><p>(40:03) Arena's 2026 Roadmap - Science, Methodology, and New Paradigms</p><p>(42:15) The Economics of Free Inference</p><p>(43:13) Hiring and Closing Thoughts</p><hr /><p></p><p><b>Music:</b></p><ul><li>"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</li><li>"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</li><li>Changes: trimmed<hr /><p></p></li></ul><p><b>About:</b> </p><p>The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">aad57d3d-46c2-4df3-8fa9-077d7187a5d6</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Tue, 24 Feb 2026 16:15:27 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/6ef1498e5ad6bf6af5aed159881a9ff69f5a047355ca25975240d754e78dd719/eyJlcGlzb2RlSWQiOiJhYWQ1N2QzZC00NmMyLTRkZjMtOGZhOS0wNzdkNzE4N2E1ZDYiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjk5ZDJmZmIwOTQ3OGYwODIyMmYxMDc1L3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi0yLTI0X181LTU4LTM1Lm1wMyJ9.mp3" length="64489578" type="audio/mpeg"/><itunes:summary>&lt;p&gt;&lt;b&gt;Anastasios Angelopoulos&lt;/b&gt;, Co-Founder and CEO of &lt;b&gt;Arena AI &lt;/b&gt;(formerly LMArena), joins us to talk about why static benchmarks are failing, how human preference data actually works under the hood, and what it takes to be the &quot;gold standard&quot; of AI evaluation.&lt;/p&gt;&lt;p&gt;Anastasios sits at a fascinating intersection -   a theoretical statistician running the platform that every major lab watches when they release a model. We talk about the messiness of AI-generated code slop (yes, he hides Claude&apos;s commits too), then dig into the statistical machinery that powers Arena&apos;s leaderboards and why getting evaluation right is harder than most people think.&lt;/p&gt;&lt;p&gt;We explore why style control is both necessary and philosophically tricky, where you can regress away markdown headers and response length, but separating style from substance is a genuinely unsolved causal inference problem. We also get into why users are surprisingly good judges of model quality, how Arena serves as a pre-release testing ground for labs shipping stealth models under codenames, and whether the fragmentation of the AI market (Anthropic going enterprise, OpenAI going consumer, everyone going multimodal) is actually a feature, not a bug. Plus, we discuss the role of rigorous statistics in the age of &quot;just run it again,&quot; why structured decoding can hurt model performance, and what Arena&apos;s 2026 roadmap looks like.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Timeline:&lt;/b&gt;&lt;/p&gt;&lt;p&gt;(00:12) Introduction and Anastasios&apos;s Background&lt;/p&gt;&lt;p&gt;(00:55) What Arena Does and Why Static Benchmarks Aren&apos;t Enough&lt;/p&gt;&lt;p&gt;(02:26) Coverage of Use Cases - Is There Enough?&lt;/p&gt;&lt;p&gt;(04:22) Style Control and the Bradley-Terry Methodology&lt;/p&gt;&lt;p&gt;(08:35) Can You Actually Separate Style from Substance?&lt;/p&gt;&lt;p&gt;(10:24) Measuring Slop - And the Anti-Slop Paper Plug&lt;/p&gt;&lt;p&gt;(11:52) Can Users Judge Factual Correctness?&lt;/p&gt;&lt;p&gt;(13:31) Tool Use and Agentic Evaluation on Arena&lt;/p&gt;&lt;p&gt;(14:14) Intermediate Feedback Signals Beyond Final Preference&lt;/p&gt;&lt;p&gt;(15:30) Tool Calling Accuracy and Code Arena&lt;/p&gt;&lt;p&gt;(17:42) AI-Generated Code Slop and Hiding Claude&apos;s Commits&lt;/p&gt;&lt;p&gt;(19:49) Do We Need Separate Code Streams for Humans and LLMs?&lt;/p&gt;&lt;p&gt;(20:01) RL Flywheels and Arena&apos;s Preference Data&lt;/p&gt;&lt;p&gt;(21:16) Focus as a Startup - Being the Evaluation Company&lt;/p&gt;&lt;p&gt;(22:16) Structured vs. Unconstrained Generation&lt;/p&gt;&lt;p&gt;(25:00) The Role of Rigorous Statistics in the Age of AI&lt;/p&gt;&lt;p&gt;(29:23) LLM Sampling Parameters and Evaluation Complexity&lt;/p&gt;&lt;p&gt;(30:56) Model Versioning and the Frequentist Approach to Fairness&lt;/p&gt;&lt;p&gt;(32:12) Quantization and Its Effects on Model Quality&lt;/p&gt;&lt;p&gt;(33:10) Pre-Release Testing and Stealth Models (34:23) Transparency - What to Share with the Public vs. Labs&lt;/p&gt;&lt;p&gt;(36:27) When Winning Models Don&apos;t Get Released&lt;/p&gt;&lt;p&gt;(36:59) Why Users Keep Coming Back to Arena&lt;/p&gt;&lt;p&gt;(38:19) Market Fragmentation and Arena&apos;s Future Value&lt;/p&gt;&lt;p&gt;(39:37) Custom Evaluation Frameworks for Specific Users&lt;/p&gt;&lt;p&gt;(40:03) Arena&apos;s 2026 Roadmap - Science, Methodology, and New Paradigms&lt;/p&gt;&lt;p&gt;(42:15) The Economics of Free Inference&lt;/p&gt;&lt;p&gt;(43:13) Hiring and Closing Thoughts&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Music:&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;b&gt;About:&lt;/b&gt; &lt;/p&gt;&lt;p&gt;The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>00:44:47</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/aad57d3d-46c2-4df3-8fa9-077d7187a5d6/images/9541e656-3e4d-4634-84f9-b5ad7f6c69bd.png"/><itunes:title>EP26: Measuring Intelligence in the Wild -  Arena and the Future of AI Evaluation</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP25: Personalization, Data, and the Chaos of Fine-Tuning with Fred Sala (UW-Madison / Snorkel AI)]]></title><description><![CDATA[<p>Fred Sala, Assistant Professor at UW-Madison and Chief Scientist at Snorkel AI, joins us to talk about why personalization might be the next frontier for LLMs, why data still matters more than architecture, and how weak supervision refuses to die.</p><p>Fred sits at a rare intersection,  building the theory of data-centric AI in academia while shipping it to enterprise clients at Snorkel. We talk about the chaos of OpenClaw (the personal AI assistant that's getting people hacked the old-fashioned way, via open ports), then focus on one of the most important questions: how do you make a model truly yours?</p><p>We dig into why prompting your preferences doesn't scale, why even LoRA might be too expensive for per-user personalization, and why activation steering methods like REFT could be the sweet spot. We also explore self-distillation for continual learning, the unsolved problem of building realistic personas for evaluation, and Fred's take on the data vs. architecture debate (spoiler: data is still undervalued). Plus, we discuss why the internet's "Ouroboros effect" might not doom pre-training as much as people fear, and what happens when models become smarter than the humans who generate their training data.</p><hr /><p></p><p>Takeaways:</p><ul><li>Personalization requires ultra-efficient methods - even one LoRA per user is probably too expensive. Activation steering is the promising middle ground.</li><li>The "pink elephant problem" makes prompt-based personalization fundamentally limited - telling a model what not to do often makes it do it more.</li><li>Self-distillation can enable on-policy continual learning without expensive RL reward functions, dramatically reducing catastrophic forgetting.</li><li>Data is still undervalued relative to architecture and compute, especially high-quality post-training data, which is actually improving, not getting worse.</li><li>Weak supervision principles are alive and well inside modern LLM data pipelines, even if people don't call it that anymore.<hr /><p></p></li></ul><p>Timeline:</p><p>(00:13) Introduction and Fred's Background</p><p>(00:39) OpenClaw — The Personal AI Assistant Taking Over Macs</p><p>(03:43) Agent Security Risks and the Privacy Problem</p><p>(05:13) Cloud Code, Permissions, and Living Dangerously</p><p>(07:47) AI Social Media and Agents Talking to Each Other</p><p>(08:56) AI Persuasion and Competitive Debate</p><p>(09:51) Self-Distillation for Continual Learning</p><p>(12:43) What Does Continual Learning Actually Mean?</p><p>(14:12) Updating Weights on the Fly — A Grand Challenge</p><p>(15:09) The Personalization Problem — Motivation and Use Cases</p><p>(17:41) The Pink Elephant Problem with Prompt-Based Personalization</p><p>(19:58) Taxonomy of Personalization — Preferences vs. Tone vs. Style</p><p>(21:31) Activation Steering, REFT, and Parameter-Efficient Fine-Tuning</p><p>(27:00) Evaluating Personalization — Benchmarks and Personas</p><p>(31:14) Unlearning and Un-Personalization</p><p>(31:51) Cultural Alignment as Group-Level Personalization</p><p>(41:00) Can LLM Personas Replace Surveys and Polling?</p><p>(44:32) Is Continued Pre-Training Still Relevant?</p><p>(46:28) Data vs. Architecture — What Matters More?</p><p>(52:25) Multi-Epoch Training — Is It Over?</p><p>(54:53) What Makes Good Data? Matching Real-World Usage</p><p>(59:23) Decomposing Uncertainty for Better Data Selection</p><p>(1:01:52) Mapping Human Difficulty to Model Difficulty</p><p>(1:04:49) Scaling Small Ideas — From Academic Proof to Frontier Models</p><p>(1:12:01) What Happens When Models Surpass Human Training Data?</p><p>(1:15:24) Closing Thoughts</p><hr /><p>Music:</p><ul><li>"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</li><li>"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</li><li>Changes: trimmed</li></ul>]]></description><guid isPermaLink="false">2975aa34-a255-403e-9995-7b5ec668b2ba</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Tue, 17 Feb 2026 05:16:39 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/0347ab8352c6c487d05fbed64d3beeef069d8fff240f27cc3d69c91d80ac3d70/eyJlcGlzb2RlSWQiOiIyOTc1YWEzNC1hMjU1LTQwM2UtOTk5NS03YjVlYzY2OGIyYmEiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjk5M2YzM2ZkNzJkNzU2MDU3Yjc1OWRiL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi0yLTE3X181LTQ5LTMubXAzIn0=.mp3" length="109249245" type="audio/mpeg"/><itunes:summary>&lt;p&gt;Fred Sala, Assistant Professor at UW-Madison and Chief Scientist at Snorkel AI, joins us to talk about why personalization might be the next frontier for LLMs, why data still matters more than architecture, and how weak supervision refuses to die.&lt;/p&gt;&lt;p&gt;Fred sits at a rare intersection,  building the theory of data-centric AI in academia while shipping it to enterprise clients at Snorkel. We talk about the chaos of OpenClaw (the personal AI assistant that&apos;s getting people hacked the old-fashioned way, via open ports), then focus on one of the most important questions: how do you make a model truly yours?&lt;/p&gt;&lt;p&gt;We dig into why prompting your preferences doesn&apos;t scale, why even LoRA might be too expensive for per-user personalization, and why activation steering methods like REFT could be the sweet spot. We also explore self-distillation for continual learning, the unsolved problem of building realistic personas for evaluation, and Fred&apos;s take on the data vs. architecture debate (spoiler: data is still undervalued). Plus, we discuss why the internet&apos;s &quot;Ouroboros effect&quot; might not doom pre-training as much as people fear, and what happens when models become smarter than the humans who generate their training data.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Takeaways:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Personalization requires ultra-efficient methods - even one LoRA per user is probably too expensive. Activation steering is the promising middle ground.&lt;/li&gt;&lt;li&gt;The &quot;pink elephant problem&quot; makes prompt-based personalization fundamentally limited - telling a model what not to do often makes it do it more.&lt;/li&gt;&lt;li&gt;Self-distillation can enable on-policy continual learning without expensive RL reward functions, dramatically reducing catastrophic forgetting.&lt;/li&gt;&lt;li&gt;Data is still undervalued relative to architecture and compute, especially high-quality post-training data, which is actually improving, not getting worse.&lt;/li&gt;&lt;li&gt;Weak supervision principles are alive and well inside modern LLM data pipelines, even if people don&apos;t call it that anymore.&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;Timeline:&lt;/p&gt;&lt;p&gt;(00:13) Introduction and Fred&apos;s Background&lt;/p&gt;&lt;p&gt;(00:39) OpenClaw — The Personal AI Assistant Taking Over Macs&lt;/p&gt;&lt;p&gt;(03:43) Agent Security Risks and the Privacy Problem&lt;/p&gt;&lt;p&gt;(05:13) Cloud Code, Permissions, and Living Dangerously&lt;/p&gt;&lt;p&gt;(07:47) AI Social Media and Agents Talking to Each Other&lt;/p&gt;&lt;p&gt;(08:56) AI Persuasion and Competitive Debate&lt;/p&gt;&lt;p&gt;(09:51) Self-Distillation for Continual Learning&lt;/p&gt;&lt;p&gt;(12:43) What Does Continual Learning Actually Mean?&lt;/p&gt;&lt;p&gt;(14:12) Updating Weights on the Fly — A Grand Challenge&lt;/p&gt;&lt;p&gt;(15:09) The Personalization Problem — Motivation and Use Cases&lt;/p&gt;&lt;p&gt;(17:41) The Pink Elephant Problem with Prompt-Based Personalization&lt;/p&gt;&lt;p&gt;(19:58) Taxonomy of Personalization — Preferences vs. Tone vs. Style&lt;/p&gt;&lt;p&gt;(21:31) Activation Steering, REFT, and Parameter-Efficient Fine-Tuning&lt;/p&gt;&lt;p&gt;(27:00) Evaluating Personalization — Benchmarks and Personas&lt;/p&gt;&lt;p&gt;(31:14) Unlearning and Un-Personalization&lt;/p&gt;&lt;p&gt;(31:51) Cultural Alignment as Group-Level Personalization&lt;/p&gt;&lt;p&gt;(41:00) Can LLM Personas Replace Surveys and Polling?&lt;/p&gt;&lt;p&gt;(44:32) Is Continued Pre-Training Still Relevant?&lt;/p&gt;&lt;p&gt;(46:28) Data vs. Architecture — What Matters More?&lt;/p&gt;&lt;p&gt;(52:25) Multi-Epoch Training — Is It Over?&lt;/p&gt;&lt;p&gt;(54:53) What Makes Good Data? Matching Real-World Usage&lt;/p&gt;&lt;p&gt;(59:23) Decomposing Uncertainty for Better Data Selection&lt;/p&gt;&lt;p&gt;(1:01:52) Mapping Human Difficulty to Model Difficulty&lt;/p&gt;&lt;p&gt;(1:04:49) Scaling Small Ideas — From Academic Proof to Frontier Models&lt;/p&gt;&lt;p&gt;(1:12:01) What Happens When Models Surpass Human Training Data?&lt;/p&gt;&lt;p&gt;(1:15:24) Closing Thoughts&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;Changes: trimmed&lt;/li&gt;&lt;/ul&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:15:52</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/2975aa34-a255-403e-9995-7b5ec668b2ba/images/de4fd5fe-4a0f-4255-aaa5-84b4bcbf04b9.png"/><itunes:title>EP25: Personalization, Data, and the Chaos of Fine-Tuning with Fred Sala (UW-Madison / Snorkel AI)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP24: Can AI Learn to Think About Money? -  with Bayan Bruss (Capital One)]]></title><description><![CDATA[<p>Bayan Bruss, VP of Applied AI at Capital One, joins us to talk about building AI systems that can make autonomous financial decisions, and why money might be the hardest problem in machine learning.</p><p>Bayan leads Capital One's AI Foundations team, where they're working toward a destination most people don't associate with banking: getting AI systems to perceive financial ecosystems, form beliefs about the future, and take actions based on those beliefs. It's a framework that sounds simple until you realize you're asking a model to predict whether someone will pay back a loan over 30 years while the world changes around them.</p><p>We get into why LLMs are a bad fit for ingesting 5,000 credit card transactions, why synthetic data works surprisingly well for time series, and the tension between end-to-end learning and regulatory requirements that demand you know exactly what your model learned. We also discuss reasoning in language vs. in latent space - if you wouldn't trust a self-driving car that translated images to words before deciding to turn, should you trust a financial system that does all its reasoning in token space?</p><hr /><p><b>Takeaways:</b></p><ul><li>Money is a behavioral science problem - AI in finance requires understanding people, not just numbers.</li><li>Foundation models pre-trained on web text don't outperform purpose-built models for financial tasks. You're better off building a standalone encoder for financial data.</li><li>Synthetic data works surprisingly well for time series - possibly because real-world time series lives on a simpler manifold than we assume.</li><li>Explainability in ML is fundamentally unsatisfying because people want causality from non-causal models.</li><li>Financial AI needs world models that can imagine alternative futures, not just fit historical data.<hr /><p></p></li></ul><p><b>Timeline:</b></p><p>(00:24) Introduction and Bayan's Background</p><p>(00:42) Claude Code, Vibe Coding - Hype or AGI?</p><p>(05:59) The Future of Software Engineering and Abstraction</p><p>(11:20) Abstraction Layers and Karpathy's Take</p><p>(13:54) Hamming, Kuhn, and Scientific Revolutions in AI</p><p>(19:24) Stack Overflow's Decline and Proof of Humanity</p><p>(23:07) Why We Still Trust Humans Over LLMs</p><p>(30:45) Deep Dive: AI in Banking and Consumer Finance</p><p>(34:17) Are Markets Efficient? Behavioral Economics vs. Classical Views</p><p>(37:14) The Components of a Financial Decision: Perception, Belief, Action</p><p>(42:15) Protected Variables, Proxy Features, and Fairness in Lending</p><p>(45:05) Explainability: Roller Skating on Marbles</p><p>(47:55) Sparse Autoencoders, Interpretability, and Turtles All the Way Down</p><p>(51:57) Foundation Models for Finance — Web Text vs. Purpose-Built</p><p>(53:09) Time Series, Synthetic Data, and TabPFN</p><p>(59:44) Feeding Tabular Data to VLMs - Graphs Beat Raw Numbers</p><p>(1:03:35) Reasoning in Language vs. Latent Space</p><p>(1:08:24) Is Language the Optimal Representation? Chinese Compression and Information Density</p><p>(1:13:37) Personalization and Predicting Human Behavior</p><p>(1:21:36) World Models, Uncertainty, and Professional Worrying</p><p>(1:24:07) Prediction Markets and Insider Betting</p><p>(1:26:33) Can LLMs Predict Stocks?</p><p>(1:29:11) Multi-Agent Systems for Financial Decisions</p><hr /><p></p><p><b>Music:</b></p><ul><li>"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</li><li>"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0. Changes: trimmed<hr /><p></p></li></ul><p><b>About:</b> The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">f1617f27-13be-4e03-a1ef-1f9b5e9ecde0</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Sun, 08 Feb 2026 19:36:51 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/a93f6ffcaf5c6553c6b851863289d009e632f3534fc6bd5c89e6ed2115a827f8/eyJlcGlzb2RlSWQiOiJmMTYxN2YyNy0xM2JlLTRlMDMtYTFlZi0xZjliNWU5ZWNkZTAiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjk4OGUxMTVkNGFlYjEyNThhOGRjMWI5L3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi0yLThfXzIwLTE2LTM3Lm1wMyJ9.mp3" length="131931890" type="audio/mpeg"/><itunes:summary>&lt;p&gt;Bayan Bruss, VP of Applied AI at Capital One, joins us to talk about building AI systems that can make autonomous financial decisions, and why money might be the hardest problem in machine learning.&lt;/p&gt;&lt;p&gt;Bayan leads Capital One&apos;s AI Foundations team, where they&apos;re working toward a destination most people don&apos;t associate with banking: getting AI systems to perceive financial ecosystems, form beliefs about the future, and take actions based on those beliefs. It&apos;s a framework that sounds simple until you realize you&apos;re asking a model to predict whether someone will pay back a loan over 30 years while the world changes around them.&lt;/p&gt;&lt;p&gt;We get into why LLMs are a bad fit for ingesting 5,000 credit card transactions, why synthetic data works surprisingly well for time series, and the tension between end-to-end learning and regulatory requirements that demand you know exactly what your model learned. We also discuss reasoning in language vs. in latent space - if you wouldn&apos;t trust a self-driving car that translated images to words before deciding to turn, should you trust a financial system that does all its reasoning in token space?&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;b&gt;Takeaways:&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Money is a behavioral science problem - AI in finance requires understanding people, not just numbers.&lt;/li&gt;&lt;li&gt;Foundation models pre-trained on web text don&apos;t outperform purpose-built models for financial tasks. You&apos;re better off building a standalone encoder for financial data.&lt;/li&gt;&lt;li&gt;Synthetic data works surprisingly well for time series - possibly because real-world time series lives on a simpler manifold than we assume.&lt;/li&gt;&lt;li&gt;Explainability in ML is fundamentally unsatisfying because people want causality from non-causal models.&lt;/li&gt;&lt;li&gt;Financial AI needs world models that can imagine alternative futures, not just fit historical data.&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;b&gt;Timeline:&lt;/b&gt;&lt;/p&gt;&lt;p&gt;(00:24) Introduction and Bayan&apos;s Background&lt;/p&gt;&lt;p&gt;(00:42) Claude Code, Vibe Coding - Hype or AGI?&lt;/p&gt;&lt;p&gt;(05:59) The Future of Software Engineering and Abstraction&lt;/p&gt;&lt;p&gt;(11:20) Abstraction Layers and Karpathy&apos;s Take&lt;/p&gt;&lt;p&gt;(13:54) Hamming, Kuhn, and Scientific Revolutions in AI&lt;/p&gt;&lt;p&gt;(19:24) Stack Overflow&apos;s Decline and Proof of Humanity&lt;/p&gt;&lt;p&gt;(23:07) Why We Still Trust Humans Over LLMs&lt;/p&gt;&lt;p&gt;(30:45) Deep Dive: AI in Banking and Consumer Finance&lt;/p&gt;&lt;p&gt;(34:17) Are Markets Efficient? Behavioral Economics vs. Classical Views&lt;/p&gt;&lt;p&gt;(37:14) The Components of a Financial Decision: Perception, Belief, Action&lt;/p&gt;&lt;p&gt;(42:15) Protected Variables, Proxy Features, and Fairness in Lending&lt;/p&gt;&lt;p&gt;(45:05) Explainability: Roller Skating on Marbles&lt;/p&gt;&lt;p&gt;(47:55) Sparse Autoencoders, Interpretability, and Turtles All the Way Down&lt;/p&gt;&lt;p&gt;(51:57) Foundation Models for Finance — Web Text vs. Purpose-Built&lt;/p&gt;&lt;p&gt;(53:09) Time Series, Synthetic Data, and TabPFN&lt;/p&gt;&lt;p&gt;(59:44) Feeding Tabular Data to VLMs - Graphs Beat Raw Numbers&lt;/p&gt;&lt;p&gt;(1:03:35) Reasoning in Language vs. Latent Space&lt;/p&gt;&lt;p&gt;(1:08:24) Is Language the Optimal Representation? Chinese Compression and Information Density&lt;/p&gt;&lt;p&gt;(1:13:37) Personalization and Predicting Human Behavior&lt;/p&gt;&lt;p&gt;(1:21:36) World Models, Uncertainty, and Professional Worrying&lt;/p&gt;&lt;p&gt;(1:24:07) Prediction Markets and Insider Betting&lt;/p&gt;&lt;p&gt;(1:26:33) Can LLMs Predict Stocks?&lt;/p&gt;&lt;p&gt;(1:29:11) Multi-Agent Systems for Financial Decisions&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Music:&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0. Changes: trimmed&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;b&gt;About:&lt;/b&gt; The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:31:37</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/f1617f27-13be-4e03-a1ef-1f9b5e9ecde0/images/bc3360cb-20bd-464f-8c29-f81a54c9a585.png"/><itunes:season>1</itunes:season><itunes:episode>24</itunes:episode><itunes:title>EP24: Can AI Learn to Think About Money? -  with Bayan Bruss (Capital One)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP23: Building Open Source AI Frameworks: David Mezzetti on TxtAI and Local-First AI]]></title><description><![CDATA[<p><b>David Mezzetti</b>, creator of TxtAI, joins us to talk about building open source AI frameworks as a solo developer - and why local-first AI still matters in the age of API-everything.</p><p></p><p>David's path from running a 50-person IT company through acquisition to building one of the most well-regarded AI orchestration libraries tells you how sometimes constraints breed better design. TextAI started during COVID when he was doing coronavirus literature research and realized semantic search could transform how we find information.</p><p></p><p>We get into the evolution of the AI framework landscape - from the early days of vector embeddings to RAG to LLM orchestration. David was initially stubborn about not supporting OpenAI's API, wanting to keep everything local. He admits that probably cost him some early traction compared to LangChain, but it also shaped TextAI's philosophy: you shouldn't need permission to build with AI.</p><p>We also talk about small models and some genuinely practical insights: a 20-million parameter model running on CPU might be all you need. On the future of coding with AI, David's come around on "vibe coding" and notes that well-documented frameworks with lots of examples are perfectly positioned for this new world.</p><p></p><p>Takeaways:</p><ul><li>Local-first AI gives you control, reproducibility, and often better performance for your domain</li><li>Small models (even 20M parameters) can solve real problems on CPU</li><li>Good documentation and examples make your framework AI-coding friendly</li><li>Open source should mean actually contributing - not just publishing code</li><li>Solo developers can compete by staying focused and being willing to evolve<hr /><p></p></li></ul><p><b>Timeline:</b></p><p><b>(00:14)</b> Introduction and David's Background</p><p><b>(07:44)</b> TextAI History and Evolution</p><p><b>(12:04)</b> Framework Landscape: LangChain, LlamaIndex, Haystack</p><p><b>(15:16)</b> Can AI Re-implement Frameworks?</p><p><b>(24:14)</b> API Specs: OpenAI vs Anthropic</p><p><b>(26:46)</b> Running an Open Source Consulting Business</p><p><b>(32:51)</b> Origin Story: COVID, Kaggle, and Medical Literature</p><p><b>(43:08)</b> Open Source Philosophy and Giving Back</p><p><b>(47:16)</b> Ethics of Local AI and Developer Freedom</p><p><b>(01:06:44)</b> Human in the Loop and AI-Generated Code</p><p><b>(01:09:31)</b> The Future of Work and Automation</p><hr /><p></p><p><b>Music:</b></p><ul><li>"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</li><li>"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0. Changes: trimmed<hr /><p></p></li></ul><p><b>About:</b></p><p>The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p>]]></description><guid isPermaLink="false">041f5ce2-e212-4ff3-baed-8506b25353b4</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Sun, 01 Feb 2026 00:23:19 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/b5d6f56a6754a1a939b91e4b9f119aa729103485e5ca2690266a3cffe6f031c6/eyJlcGlzb2RlSWQiOiIwNDFmNWNlMi1lMjEyLTRmZjMtYmFlZC04NTA2YjI1MzUzYjQiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjk3ZTk2YTNhNzIzNTBiZDNmNDRiNzE0L3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi0yLTFfXzAtNTYtMTkubXAzIn0=.mp3" length="59475515" type="audio/mpeg"/><itunes:summary>&lt;p&gt;&lt;b&gt;David Mezzetti&lt;/b&gt;, creator of TxtAI, joins us to talk about building open source AI frameworks as a solo developer - and why local-first AI still matters in the age of API-everything.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;David&apos;s path from running a 50-person IT company through acquisition to building one of the most well-regarded AI orchestration libraries tells you how sometimes constraints breed better design. TextAI started during COVID when he was doing coronavirus literature research and realized semantic search could transform how we find information.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;We get into the evolution of the AI framework landscape - from the early days of vector embeddings to RAG to LLM orchestration. David was initially stubborn about not supporting OpenAI&apos;s API, wanting to keep everything local. He admits that probably cost him some early traction compared to LangChain, but it also shaped TextAI&apos;s philosophy: you shouldn&apos;t need permission to build with AI.&lt;/p&gt;&lt;p&gt;We also talk about small models and some genuinely practical insights: a 20-million parameter model running on CPU might be all you need. On the future of coding with AI, David&apos;s come around on &quot;vibe coding&quot; and notes that well-documented frameworks with lots of examples are perfectly positioned for this new world.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Takeaways:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Local-first AI gives you control, reproducibility, and often better performance for your domain&lt;/li&gt;&lt;li&gt;Small models (even 20M parameters) can solve real problems on CPU&lt;/li&gt;&lt;li&gt;Good documentation and examples make your framework AI-coding friendly&lt;/li&gt;&lt;li&gt;Open source should mean actually contributing - not just publishing code&lt;/li&gt;&lt;li&gt;Solo developers can compete by staying focused and being willing to evolve&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;b&gt;Timeline:&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;(00:14)&lt;/b&gt; Introduction and David&apos;s Background&lt;/p&gt;&lt;p&gt;&lt;b&gt;(07:44)&lt;/b&gt; TextAI History and Evolution&lt;/p&gt;&lt;p&gt;&lt;b&gt;(12:04)&lt;/b&gt; Framework Landscape: LangChain, LlamaIndex, Haystack&lt;/p&gt;&lt;p&gt;&lt;b&gt;(15:16)&lt;/b&gt; Can AI Re-implement Frameworks?&lt;/p&gt;&lt;p&gt;&lt;b&gt;(24:14)&lt;/b&gt; API Specs: OpenAI vs Anthropic&lt;/p&gt;&lt;p&gt;&lt;b&gt;(26:46)&lt;/b&gt; Running an Open Source Consulting Business&lt;/p&gt;&lt;p&gt;&lt;b&gt;(32:51)&lt;/b&gt; Origin Story: COVID, Kaggle, and Medical Literature&lt;/p&gt;&lt;p&gt;&lt;b&gt;(43:08)&lt;/b&gt; Open Source Philosophy and Giving Back&lt;/p&gt;&lt;p&gt;&lt;b&gt;(47:16)&lt;/b&gt; Ethics of Local AI and Developer Freedom&lt;/p&gt;&lt;p&gt;&lt;b&gt;(01:06:44)&lt;/b&gt; Human in the Loop and AI-Generated Code&lt;/p&gt;&lt;p&gt;&lt;b&gt;(01:09:31)&lt;/b&gt; The Future of Work and Automation&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Music:&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&quot;Kid Kodi&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0. Changes: trimmed&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;b&gt;About:&lt;/b&gt;&lt;/p&gt;&lt;p&gt;The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:14:56</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/041f5ce2-e212-4ff3-baed-8506b25353b4/images/2321c1b6-6912-482b-bbfe-aab1839d1b4a.png"/><itunes:title>EP23: Building Open Source AI Frameworks: David Mezzetti on TxtAI and Local-First AI</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP22: Data Curation for LLMs with Cody Blakeney (Datology AI)]]></title><description><![CDATA[<p><b>Cody Blakeney</b> from Datology AI joins us to talk about data curation - the unglamorous but critical work of figuring out what to actually train models on.</p><p>Cody's path from writing CUDA kernels to spending his days staring at weird internet text tells you something important: data quality can account for half or more of a model's final performance. That's on par with major architectural breakthroughs.</p><p>We get into the differences between pre-training, mid-training, and post-training data. Mid-training in particular has become a key technique for squeezing value out of rare, high-quality datasets. Cody's team stumbled onto it while solving a practical problem: how do you figure out if a 5-billion-token dataset is actually useful when you can't afford hundreds of experimental runs?</p><p>We also talk about data filtering and some genuinely surprising findings: the documents that make the best training data are often short and dense with information. Those nicely written blog posts with personal anecdotes? Turns out models don't learn as well from them.</p><p>On synthetic data, Cody thinks pre-training is still in its early days, where most techniques are variations on a few core ideas, but there's huge potential. He's excited about connecting RL failures back to mid-training: when models fail at tasks, use that signal to generate targeted training data.</p><hr /><p></p><p><b>Takeaways:</b></p><ul><li>Data work is high-leverage but underappreciated</li><li>Mid-training helps extract signal from small, valuable datasets</li><li>Good filters favor dense, factual text over polished prose.</li><li>Synthetic data for pre-training works surprisingly well, but remains primitive.</li><li>Optimal data mixtures depend on model scale, where smaller models need more aggressive distribution shifts.<hr /><h2>Timeline</h2><p></p><p><b>(00:12)</b> Introduction to Data Correlation in LLMs</p><p><b>(05:14)</b> The Importance of Data Quality</p><p><b>(10:15)</b> Pre-training vs Post-training Data</p><p><b>(15:22)</b> Strategies for Effective Data Utilization</p><p><b>(20:15)</b> Benchmarking and Model Evaluation</p><p><b>(28:28)</b> Maximizing Perplexity and Coherence</p><p><b>(30:27)</b> Measuring Quality in Data</p><p><b>(32:56)</b> The Role of Filters in Data Selection</p><p><b>(34:19)</b> Understanding High-Quality Data</p><p><b>(39:15)</b> Mid-Training and Its Importance</p><p><b>(46:51)</b> Future of Data Sources</p><p><b>(48:13)</b> Synthetic Data's Role in Pre-Training</p><p><b>(53:10)</b> Creating Effective Synthetic Data</p><p><b>(57:39)</b> The Debate on Pure Synthetic Data</p><p><b>(01:00:25)</b> Navigating AI Training and Legal Challenges</p><p><b>(01:02:34)</b> The Controversy of AI in the Art Community</p><p><b>(01:05:29)</b> Exploring Synthetic Data and Its Efficiency</p><p><b>(01:11:21)</b> The Future of Domain-Specific vs. General Models</p><p><b>(01:22:06)</b> Bias in Pre-trained Models and Data Selection</p><p><b>(01:28:27)</b> The Potential of Synthetic Data Over Human Data</p><hr /><p><b>Music:</b></p></li><li>"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</li><li>"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</li></ul><p>Changes: trimmed</p><hr /><h3><b>About</b></h3><p><b>The Information Bottleneck</b> is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p><p></p>]]></description><guid isPermaLink="false">7848b3bd-8b7d-477f-911a-b3e7349c0752</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Tue, 20 Jan 2026 18:34:23 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/849a0ea421c17c18db3ea9b1deb3dd68db73d9af13ea4e73097bc38ce03cd205/eyJlcGlzb2RlSWQiOiI3ODQ4YjNiZC04YjdkLTQ3N2YtOTExYS1iM2U3MzQ5YzA3NTIiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjk2ZmM1MjczMWQ4NzAxZDMyZTRhYWMyL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi0xLTIwX18xOS0xMC00Ny5tcDMifQ==.mp3" length="58360062" type="audio/mpeg"/><itunes:summary>&lt;p&gt;&lt;b&gt;Cody Blakeney&lt;/b&gt; from Datology AI joins us to talk about data curation - the unglamorous but critical work of figuring out what to actually train models on.&lt;/p&gt;&lt;p&gt;Cody&apos;s path from writing CUDA kernels to spending his days staring at weird internet text tells you something important: data quality can account for half or more of a model&apos;s final performance. That&apos;s on par with major architectural breakthroughs.&lt;/p&gt;&lt;p&gt;We get into the differences between pre-training, mid-training, and post-training data. Mid-training in particular has become a key technique for squeezing value out of rare, high-quality datasets. Cody&apos;s team stumbled onto it while solving a practical problem: how do you figure out if a 5-billion-token dataset is actually useful when you can&apos;t afford hundreds of experimental runs?&lt;/p&gt;&lt;p&gt;We also talk about data filtering and some genuinely surprising findings: the documents that make the best training data are often short and dense with information. Those nicely written blog posts with personal anecdotes? Turns out models don&apos;t learn as well from them.&lt;/p&gt;&lt;p&gt;On synthetic data, Cody thinks pre-training is still in its early days, where most techniques are variations on a few core ideas, but there&apos;s huge potential. He&apos;s excited about connecting RL failures back to mid-training: when models fail at tasks, use that signal to generate targeted training data.&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Takeaways:&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Data work is high-leverage but underappreciated&lt;/li&gt;&lt;li&gt;Mid-training helps extract signal from small, valuable datasets&lt;/li&gt;&lt;li&gt;Good filters favor dense, factual text over polished prose.&lt;/li&gt;&lt;li&gt;Synthetic data for pre-training works surprisingly well, but remains primitive.&lt;/li&gt;&lt;li&gt;Optimal data mixtures depend on model scale, where smaller models need more aggressive distribution shifts.&lt;hr /&gt;&lt;h2&gt;Timeline&lt;/h2&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;(00:12)&lt;/b&gt; Introduction to Data Correlation in LLMs&lt;/p&gt;&lt;p&gt;&lt;b&gt;(05:14)&lt;/b&gt; The Importance of Data Quality&lt;/p&gt;&lt;p&gt;&lt;b&gt;(10:15)&lt;/b&gt; Pre-training vs Post-training Data&lt;/p&gt;&lt;p&gt;&lt;b&gt;(15:22)&lt;/b&gt; Strategies for Effective Data Utilization&lt;/p&gt;&lt;p&gt;&lt;b&gt;(20:15)&lt;/b&gt; Benchmarking and Model Evaluation&lt;/p&gt;&lt;p&gt;&lt;b&gt;(28:28)&lt;/b&gt; Maximizing Perplexity and Coherence&lt;/p&gt;&lt;p&gt;&lt;b&gt;(30:27)&lt;/b&gt; Measuring Quality in Data&lt;/p&gt;&lt;p&gt;&lt;b&gt;(32:56)&lt;/b&gt; The Role of Filters in Data Selection&lt;/p&gt;&lt;p&gt;&lt;b&gt;(34:19)&lt;/b&gt; Understanding High-Quality Data&lt;/p&gt;&lt;p&gt;&lt;b&gt;(39:15)&lt;/b&gt; Mid-Training and Its Importance&lt;/p&gt;&lt;p&gt;&lt;b&gt;(46:51)&lt;/b&gt; Future of Data Sources&lt;/p&gt;&lt;p&gt;&lt;b&gt;(48:13)&lt;/b&gt; Synthetic Data&apos;s Role in Pre-Training&lt;/p&gt;&lt;p&gt;&lt;b&gt;(53:10)&lt;/b&gt; Creating Effective Synthetic Data&lt;/p&gt;&lt;p&gt;&lt;b&gt;(57:39)&lt;/b&gt; The Debate on Pure Synthetic Data&lt;/p&gt;&lt;p&gt;&lt;b&gt;(01:00:25)&lt;/b&gt; Navigating AI Training and Legal Challenges&lt;/p&gt;&lt;p&gt;&lt;b&gt;(01:02:34)&lt;/b&gt; The Controversy of AI in the Art Community&lt;/p&gt;&lt;p&gt;&lt;b&gt;(01:05:29)&lt;/b&gt; Exploring Synthetic Data and Its Efficiency&lt;/p&gt;&lt;p&gt;&lt;b&gt;(01:11:21)&lt;/b&gt; The Future of Domain-Specific vs. General Models&lt;/p&gt;&lt;p&gt;&lt;b&gt;(01:22:06)&lt;/b&gt; Bias in Pre-trained Models and Data Selection&lt;/p&gt;&lt;p&gt;&lt;b&gt;(01:28:27)&lt;/b&gt; The Potential of Synthetic Data Over Human Data&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;b&gt;Music:&lt;/b&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&quot;Kid Kodi&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/li&gt;&lt;li&gt;&quot;Palms Down&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;Changes: trimmed&lt;/p&gt;&lt;hr /&gt;&lt;h3&gt;&lt;b&gt;About&lt;/b&gt;&lt;/h3&gt;&lt;p&gt;&lt;b&gt;The Information Bottleneck&lt;/b&gt; is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:25:58</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/7848b3bd-8b7d-477f-911a-b3e7349c0752/images/330cf18d-0cb4-4f59-8812-46fc6c9ee22c.png"/><itunes:season>1</itunes:season><itunes:episode>22</itunes:episode><itunes:title>EP22: Data Curation for LLMs with Cody Blakeney (Datology AI)</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP21: Privacy in the Age of Agents with Niloofar Mireshghallah  ]]></title><description><![CDATA[<p><b>Guest: Niloofar Mireshghallah </b>(Incoming Assistant Professor at CMU, Member of Technical Staff at Humans and AI)</p><p></p><p>In this episode, we dive into AI privacy, frontier model capabilities, and why academia still matters.</p><p>We kick off by discussing GPT-5.2 and whether models rely more on parametric knowledge or context. Niloofar shares how reasoning models actually defer to context, even accepting obviously false information to "roll with it."</p><p>On privacy, Niloofar challenges conventional wisdom: memorization isn't the problem anymore. The real threats are aggregation attacks (finding someone's pet name in HTML metadata), inference attacks (models are expert geoguessers), and input-output leakage in agentic workflows.</p><p>We also explore linguistic colonialism in AI, or how models fail for non-English languages, sometimes inventing cultural traditions.</p><p>The episode wraps with a call for researchers to tackle problems industry ignores: AI for science, education tools that preserve the struggle of learning, and privacy-preserving collaboration between small local models and large commercial ones.</p><p></p><hr /><h2>Timeline</h2><p><b>[0:00]</b> Intro</p><p><b>[1:03]</b> GPT-5.2 first impressions and skepticism about the data cutoff claims</p><p><b>[4:17]</b> Parametric vs. context memory—when do models trust training vs. the prompt?</p><p><b>[9:28]</b> The messy problem of memory, weights, and online learning</p><p><b>[16:12]</b> Tool use changes model behavior in unexpected ways</p><p><b>[17:15]</b> OpenAI's "Advances in Sciences" paper and human-AI collaboration</p><p><b>[24:17]</b> Why deep research is getting less useful</p><p><b>[28:17]</b> Pre-training vs. post-training—which matters more?</p><p><b>[30:35]</b> Non-English languages and AI failures</p><p><b>[33:23]</b> Hilarious Farsi bugs: "I'll get back to you in a few days" and invented traditions</p><p><b>[37:56]</b> Linguistic colonialism—ChatGPT changed how we write</p><p><b>[41:20]</b> Why memorization isn't the real privacy threat</p><p><b>[47:14]</b> The three actual privacy problems: inference, aggregation, input-output leakage</p><p><b>[54:33]</b> Deep research stalking experiment—finding a cat's name in HTML</p><p><b>[1:01:13]</b> Privacy solutions for agentic systems</p><p><b>[1:03:23]</b> What Niloofar's excited about: AI for scientists, small models, niche problems</p><p><b>[1:08:31]</b> AI for education without killing the learning process</p><p><b>[1:09:15]</b> Closing: underrated life advice on health and sustainable habits</p><hr /><p></p><p><b>Music:</b></p><p>"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>Changes: trimmed</p><hr /><h3><b>About</b></h3><p><b>The Information Bottleneck</b> is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p><p></p>]]></description><guid isPermaLink="false">2d8ec0ea-a8b0-417e-bc4a-e39c52bbd8bf</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Wed, 07 Jan 2026 03:46:48 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/8b567f8f6496405665934acfe13c0b0304bed7302569862f6bfbd7fd612fcc7d/eyJlcGlzb2RlSWQiOiIyZDhlYzBlYS1hOGIwLTQxN2UtYmM0YS1lMzljNTJiYmQ4YmYiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjk1ZGQyYzNjNjk4ZGY3ZGRjMDRmYmEwL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNi0xLTdfXzQtMjgtMy5tcDMifQ==.mp3" length="51294950" type="audio/mpeg"/><itunes:summary>&lt;p&gt;&lt;b&gt;Guest: Niloofar Mireshghallah &lt;/b&gt;(Incoming Assistant Professor at CMU, Member of Technical Staff at Humans and AI)&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;In this episode, we dive into AI privacy, frontier model capabilities, and why academia still matters.&lt;/p&gt;&lt;p&gt;We kick off by discussing GPT-5.2 and whether models rely more on parametric knowledge or context. Niloofar shares how reasoning models actually defer to context, even accepting obviously false information to &quot;roll with it.&quot;&lt;/p&gt;&lt;p&gt;On privacy, Niloofar challenges conventional wisdom: memorization isn&apos;t the problem anymore. The real threats are aggregation attacks (finding someone&apos;s pet name in HTML metadata), inference attacks (models are expert geoguessers), and input-output leakage in agentic workflows.&lt;/p&gt;&lt;p&gt;We also explore linguistic colonialism in AI, or how models fail for non-English languages, sometimes inventing cultural traditions.&lt;/p&gt;&lt;p&gt;The episode wraps with a call for researchers to tackle problems industry ignores: AI for science, education tools that preserve the struggle of learning, and privacy-preserving collaboration between small local models and large commercial ones.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;hr /&gt;&lt;h2&gt;Timeline&lt;/h2&gt;&lt;p&gt;&lt;b&gt;[0:00]&lt;/b&gt; Intro&lt;/p&gt;&lt;p&gt;&lt;b&gt;[1:03]&lt;/b&gt; GPT-5.2 first impressions and skepticism about the data cutoff claims&lt;/p&gt;&lt;p&gt;&lt;b&gt;[4:17]&lt;/b&gt; Parametric vs. context memory—when do models trust training vs. the prompt?&lt;/p&gt;&lt;p&gt;&lt;b&gt;[9:28]&lt;/b&gt; The messy problem of memory, weights, and online learning&lt;/p&gt;&lt;p&gt;&lt;b&gt;[16:12]&lt;/b&gt; Tool use changes model behavior in unexpected ways&lt;/p&gt;&lt;p&gt;&lt;b&gt;[17:15]&lt;/b&gt; OpenAI&apos;s &quot;Advances in Sciences&quot; paper and human-AI collaboration&lt;/p&gt;&lt;p&gt;&lt;b&gt;[24:17]&lt;/b&gt; Why deep research is getting less useful&lt;/p&gt;&lt;p&gt;&lt;b&gt;[28:17]&lt;/b&gt; Pre-training vs. post-training—which matters more?&lt;/p&gt;&lt;p&gt;&lt;b&gt;[30:35]&lt;/b&gt; Non-English languages and AI failures&lt;/p&gt;&lt;p&gt;&lt;b&gt;[33:23]&lt;/b&gt; Hilarious Farsi bugs: &quot;I&apos;ll get back to you in a few days&quot; and invented traditions&lt;/p&gt;&lt;p&gt;&lt;b&gt;[37:56]&lt;/b&gt; Linguistic colonialism—ChatGPT changed how we write&lt;/p&gt;&lt;p&gt;&lt;b&gt;[41:20]&lt;/b&gt; Why memorization isn&apos;t the real privacy threat&lt;/p&gt;&lt;p&gt;&lt;b&gt;[47:14]&lt;/b&gt; The three actual privacy problems: inference, aggregation, input-output leakage&lt;/p&gt;&lt;p&gt;&lt;b&gt;[54:33]&lt;/b&gt; Deep research stalking experiment—finding a cat&apos;s name in HTML&lt;/p&gt;&lt;p&gt;&lt;b&gt;[1:01:13]&lt;/b&gt; Privacy solutions for agentic systems&lt;/p&gt;&lt;p&gt;&lt;b&gt;[1:03:23]&lt;/b&gt; What Niloofar&apos;s excited about: AI for scientists, small models, niche problems&lt;/p&gt;&lt;p&gt;&lt;b&gt;[1:08:31]&lt;/b&gt; AI for education without killing the learning process&lt;/p&gt;&lt;p&gt;&lt;b&gt;[1:09:15]&lt;/b&gt; Closing: underrated life advice on health and sustainable habits&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Music:&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&quot;Kid Kodi&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;&quot;Palms Down&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;Changes: trimmed&lt;/p&gt;&lt;hr /&gt;&lt;h3&gt;&lt;b&gt;About&lt;/b&gt;&lt;/h3&gt;&lt;p&gt;&lt;b&gt;The Information Bottleneck&lt;/b&gt; is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:11:47</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/2d8ec0ea-a8b0-417e-bc4a-e39c52bbd8bf/images/8a87222d-42ca-4c2c-8265-6e0686211aa4.png"/><itunes:season>1</itunes:season><itunes:episode>21</itunes:episode><itunes:title>EP21: Privacy in the Age of Agents with Niloofar Mireshghallah  </itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP20: Yann LeCun ]]></title><description><![CDATA[<h2><b>Yann LeCun – Why LLMs Will Never Get Us to AGI</b></h2><p><i>"The path to superintelligence - just train up the LLMs, train on more synthetic data, hire thousands of people to school your system in post-training, invent new tweaks on RL-I think is complete bullshit. It's just never going to work."</i></p><p></p><p>After 12 years at Meta, Turing Award winner Yann LeCun is betting his legacy on a radically different vision of AI. In this conversation, he explains why Silicon Valley's obsession with scaling language models is a dead end, why the hardest problem in AI is reaching dog-level intelligence (not human-level), and why his new company AMI is building world models that predict in abstract representation space rather than generating pixels.</p><p></p><hr /><p></p><h3><b>Timestamps</b></h3><p>(00:00:14) – Intro and welcome</p><p>(00:01:12) – AMI: Why start a company now?</p><p>(00:04:46) – Will AMI do research in the open?</p><p>(00:06:44) – World models vs LLMs</p><p>(00:09:44) – History of self-supervised learning</p><p>(00:16:55) – Siamese networks and contrastive learning</p><p>(00:25:14) – JEPA and learning in representation space</p><p>(00:30:14) – Abstraction hierarchies in physics and AI</p><p>(00:34:01) – World models as abstract simulators</p><p>(00:38:14) – Object permanence and learning basic physics</p><p>(00:40:35) – Game AI: Why NetHack is still impossible</p><p>(00:44:22) – Moravec's Paradox and chess</p><p>(00:55:14) – AI safety by construction, not fine-tuning</p><p>(01:02:52) – Constrained generation techniques</p><p>(01:04:20) – Meta's reorganization and FAIR's future</p><p>(01:07:31) – SSI, Physical Intelligence, and Wayve</p><p>(01:10:14) – Silicon Valley's "LLM-pilled" monoculture</p><p>(01:15:56) – China vs US: The open source paradox</p><p>(01:18:14) – Why start a company at 65?</p><p>(01:25:14) – The AGI hype cycle has happened 6 times before</p><p>(01:33:18) – Family and personal background</p><p>(01:36:13) – Career advice: Learn things with a long shelf life</p><p>(01:40:14) – Neuroscience and machine learning connections</p><p>(01:48:17) – Continual learning: Is catastrophic forgetting solved?</p><p></p><hr /><p><b>Music:</b></p><p>"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>Changes: trimmed</p><hr /><h3><b>About</b></h3><p><b>The Information Bottleneck</b> is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.</p><p><br /></p>]]></description><guid isPermaLink="false">dffd2103-84ce-4a22-b7a8-33985f3c2f72</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 15 Dec 2025 18:54:53 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/a44683b5ab2dda98d68a5a65520926cba6a95fba9591bef9817d05b470c7e2b7/eyJlcGlzb2RlSWQiOiJkZmZkMjEwMy04NGNlLTRhMjItYjdhOC0zMzk4NWYzYzJmNzIiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjk0MDUyYzA3OWE4NDAxYmY2ZDg3OTAyL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS0xMi0xNV9fMTktMjYtOC5tcDMifQ==.mp3" length="78190319" type="audio/mpeg"/><itunes:summary>&lt;h2&gt;&lt;b&gt;Yann LeCun – Why LLMs Will Never Get Us to AGI&lt;/b&gt;&lt;/h2&gt;&lt;p&gt;&lt;i&gt;&quot;The path to superintelligence - just train up the LLMs, train on more synthetic data, hire thousands of people to school your system in post-training, invent new tweaks on RL-I think is complete bullshit. It&apos;s just never going to work.&quot;&lt;/i&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;After 12 years at Meta, Turing Award winner Yann LeCun is betting his legacy on a radically different vision of AI. In this conversation, he explains why Silicon Valley&apos;s obsession with scaling language models is a dead end, why the hardest problem in AI is reaching dog-level intelligence (not human-level), and why his new company AMI is building world models that predict in abstract representation space rather than generating pixels.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;h3&gt;&lt;b&gt;Timestamps&lt;/b&gt;&lt;/h3&gt;&lt;p&gt;(00:00:14) – Intro and welcome&lt;/p&gt;&lt;p&gt;(00:01:12) – AMI: Why start a company now?&lt;/p&gt;&lt;p&gt;(00:04:46) – Will AMI do research in the open?&lt;/p&gt;&lt;p&gt;(00:06:44) – World models vs LLMs&lt;/p&gt;&lt;p&gt;(00:09:44) – History of self-supervised learning&lt;/p&gt;&lt;p&gt;(00:16:55) – Siamese networks and contrastive learning&lt;/p&gt;&lt;p&gt;(00:25:14) – JEPA and learning in representation space&lt;/p&gt;&lt;p&gt;(00:30:14) – Abstraction hierarchies in physics and AI&lt;/p&gt;&lt;p&gt;(00:34:01) – World models as abstract simulators&lt;/p&gt;&lt;p&gt;(00:38:14) – Object permanence and learning basic physics&lt;/p&gt;&lt;p&gt;(00:40:35) – Game AI: Why NetHack is still impossible&lt;/p&gt;&lt;p&gt;(00:44:22) – Moravec&apos;s Paradox and chess&lt;/p&gt;&lt;p&gt;(00:55:14) – AI safety by construction, not fine-tuning&lt;/p&gt;&lt;p&gt;(01:02:52) – Constrained generation techniques&lt;/p&gt;&lt;p&gt;(01:04:20) – Meta&apos;s reorganization and FAIR&apos;s future&lt;/p&gt;&lt;p&gt;(01:07:31) – SSI, Physical Intelligence, and Wayve&lt;/p&gt;&lt;p&gt;(01:10:14) – Silicon Valley&apos;s &quot;LLM-pilled&quot; monoculture&lt;/p&gt;&lt;p&gt;(01:15:56) – China vs US: The open source paradox&lt;/p&gt;&lt;p&gt;(01:18:14) – Why start a company at 65?&lt;/p&gt;&lt;p&gt;(01:25:14) – The AGI hype cycle has happened 6 times before&lt;/p&gt;&lt;p&gt;(01:33:18) – Family and personal background&lt;/p&gt;&lt;p&gt;(01:36:13) – Career advice: Learn things with a long shelf life&lt;/p&gt;&lt;p&gt;(01:40:14) – Neuroscience and machine learning connections&lt;/p&gt;&lt;p&gt;(01:48:17) – Continual learning: Is catastrophic forgetting solved?&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;b&gt;Music:&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&quot;Kid Kodi&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;&quot;Palms Down&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;Changes: trimmed&lt;/p&gt;&lt;hr /&gt;&lt;h3&gt;&lt;b&gt;About&lt;/b&gt;&lt;/h3&gt;&lt;p&gt;&lt;b&gt;The Information Bottleneck&lt;/b&gt; is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:50:06</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/dffd2103-84ce-4a22-b7a8-33985f3c2f72/images/d3eef02b-9e6a-4b9f-9c95-c6b6ffa56b07.png"/><itunes:title>EP20: Yann LeCun </itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP19: AI in Finance and Symbolic AI with Atlas Wang]]></title><description><![CDATA[<p>Atlas Wang (UT Austin faculty, XTX Research Director) joins us to explore two fascinating frontiers: the foundations of symbolic AI and the practical challenges of building AI systems for quantitative finance.</p><p>On the symbolic AI side, Atlas shares his recent work proving that neural networks can learn symbolic equations through gradient descent, a surprising result given that gradient descent is continuous while symbolic structures are discrete. We talked about why neural nets learn clean, compositional mathematical structures at all, what the mathematical tools involved are, and the broader implications for understanding reasoning in AI systems.</p><p>The conversation then turns to neuro-symbolic approaches in practice: agents that discover rules through continued learning, propose them symbolically, verify them against domain-specific checkers, and refine their understanding.</p><p>On the finance side, Atlas pulls back the curtain on what AI research looks like at a high-frequency trading firm. The core problem sounds simple (predict future prices from past data). Still, the challenge is extreme: markets are dominated by noise, predictions hover near zero correlation, and success means eking out tiny margins across astronomical numbers of trades. He explains why synthetic data techniques that work elsewhere don't translate easily, and why XTX is building time series foundation models rather than adapting language models.</p><p>We also discuss the convergence of hiring between frontier AI labs and quantitative finance, and why this is an exceptional moment for ML researchers to consider the finance industry.</p><p><b>Links</b>:</p><ul><li>Why Neural Network Can Discover Symbolic Structures with Gradient-based Training: An Algebraic and Geometric Foundation for Neurosymbolic Reasoning - <a rel="noopener noreferrer nofollow" href="http://arxiv.org/abs/2506.21797" target="_blank">arxiv.org/abs/2506.21797</a></li><li>Atlas website - <a rel="noopener noreferrer nofollow" href="https://www.vita-group.space/" target="_blank">https://www.vita-group.space/</a></li></ul><p><b>Guest:</b> Atlas Wang (UT Austin / XTX)</p><p><b>Hosts:</b> Ravid Shwartz-Ziv &amp; Allen Roush</p><p><b>Music: </b>“Kid Kodi” — Blue Dot Sessions. Source: Free Music Archive. Licensed CC BY-NC 4.0.</p>]]></description><guid isPermaLink="false">ffa6d122-48f8-4136-93e3-449edd89ea8e</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Wed, 10 Dec 2025 20:09:28 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/5cde049cc0e742fc0624bfa69cb3b187bad46e3b3bc7169538119c5aef0173e8/eyJlcGlzb2RlSWQiOiJmZmE2ZDEyMi00OGY4LTQxMzYtOTNlMy00NDllZGQ4OWVhOGUiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjkzOWNkMTMxYWI0MWZhZjRlYWI3OTlkL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS0xMi0xMF9fMjAtNDItMTEubXAzIn0=.mp3" length="55434522" type="audio/mpeg"/><itunes:summary>&lt;p&gt;Atlas Wang (UT Austin faculty, XTX Research Director) joins us to explore two fascinating frontiers: the foundations of symbolic AI and the practical challenges of building AI systems for quantitative finance.&lt;/p&gt;&lt;p&gt;On the symbolic AI side, Atlas shares his recent work proving that neural networks can learn symbolic equations through gradient descent, a surprising result given that gradient descent is continuous while symbolic structures are discrete. We talked about why neural nets learn clean, compositional mathematical structures at all, what the mathematical tools involved are, and the broader implications for understanding reasoning in AI systems.&lt;/p&gt;&lt;p&gt;The conversation then turns to neuro-symbolic approaches in practice: agents that discover rules through continued learning, propose them symbolically, verify them against domain-specific checkers, and refine their understanding.&lt;/p&gt;&lt;p&gt;On the finance side, Atlas pulls back the curtain on what AI research looks like at a high-frequency trading firm. The core problem sounds simple (predict future prices from past data). Still, the challenge is extreme: markets are dominated by noise, predictions hover near zero correlation, and success means eking out tiny margins across astronomical numbers of trades. He explains why synthetic data techniques that work elsewhere don&apos;t translate easily, and why XTX is building time series foundation models rather than adapting language models.&lt;/p&gt;&lt;p&gt;We also discuss the convergence of hiring between frontier AI labs and quantitative finance, and why this is an exceptional moment for ML researchers to consider the finance industry.&lt;/p&gt;&lt;p&gt;&lt;b&gt;Links&lt;/b&gt;:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Why Neural Network Can Discover Symbolic Structures with Gradient-based Training: An Algebraic and Geometric Foundation for Neurosymbolic Reasoning - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;http://arxiv.org/abs/2506.21797&quot; target=&quot;_blank&quot;&gt;arxiv.org/abs/2506.21797&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Atlas website - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://www.vita-group.space/&quot; target=&quot;_blank&quot;&gt;https://www.vita-group.space/&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;b&gt;Guest:&lt;/b&gt; Atlas Wang (UT Austin / XTX)&lt;/p&gt;&lt;p&gt;&lt;b&gt;Hosts:&lt;/b&gt; Ravid Shwartz-Ziv &amp;amp; Allen Roush&lt;/p&gt;&lt;p&gt;&lt;b&gt;Music: &lt;/b&gt;“Kid Kodi” — Blue Dot Sessions. Source: Free Music Archive. Licensed CC BY-NC 4.0.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:10:34</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/ffa6d122-48f8-4136-93e3-449edd89ea8e/images/9905b1dd-be1b-48c8-85a6-da08f0d62713.png"/><itunes:season>1</itunes:season><itunes:episode>19</itunes:episode><itunes:title>EP19: AI in Finance and Symbolic AI with Atlas Wang</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP18: AI Robotics]]></title><description><![CDATA[<p>In this episode, we hosted Judah Goldfeder, a PhD candidate at Columbia University and student researcher at Google, to discuss robotics, reproducibility in ML, and smart buildings.</p><p></p><p><b>Key topics covered:</b></p><p><b>Robotics challenges:</b> We discussed why robotics remains harder than many expected, compared to LLMs. The real world is unpredictable and unforgiving, and mistakes have physical consequences. Sim-to-real transfer remains a major bottleneck because simulators are tedious to configure accurately for each robot and environment. Unlike text, robotics lacks foundation models, partly due to limited clean, annotated datasets and the difficulty of collecting diverse real-world data.</p><p><b>Reproducibility crisis:</b> We discussed how self-reported benchmarks can lead to p-hacking and irreproducible results. Centralized evaluation systems (such as Kaggle or ImageNet challenges), where researchers submit algorithms for testing on hidden test sets, seem to drive faster progress.</p><p></p><p><b>Smart buildings:</b> Judah's work at Google focuses on using ML to optimize HVAC systems, potentially reducing energy costs and carbon emissions significantly. The challenge is that every building is different. It makes the simulation configuration extremely labor-intensive. Generative AI could help by automating the process of converting floor plans or images into accurate building simulations.</p><p></p><p><b>Links:</b></p><ul><li>Judah website<b> - </b><a rel="noopener noreferrer nofollow" href="https://judahgoldfeder.com/" target="_blank"><b>https://judahgoldfeder.com/</b></a></li></ul><p></p><p><b>Music:</b></p><p>"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>Changes: trimmed</p>]]></description><guid isPermaLink="false">e8cc4523-586e-44d4-8b86-fb53abf88bb5</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 01 Dec 2025 16:20:01 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/a8e870d9c6dd9d4ccad40c04ca6086522dff3315a5b20ca30ffe9baa5ce9b358/eyJlcGlzb2RlSWQiOiJlOGNjNDUyMy01ODZlLTQ0ZDQtOGI4Ni1mYjUzYWJmODhiYjUiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjkyZDI5NWEwZTViNDY1MTRhOWM0ODczL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS0xMi0xX182LTM2LTI2Lm1wMyJ9.mp3" length="83098444" type="audio/mpeg"/><itunes:summary>&lt;p&gt;In this episode, we hosted Judah Goldfeder, a PhD candidate at Columbia University and student researcher at Google, to discuss robotics, reproducibility in ML, and smart buildings.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Key topics covered:&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Robotics challenges:&lt;/b&gt; We discussed why robotics remains harder than many expected, compared to LLMs. The real world is unpredictable and unforgiving, and mistakes have physical consequences. Sim-to-real transfer remains a major bottleneck because simulators are tedious to configure accurately for each robot and environment. Unlike text, robotics lacks foundation models, partly due to limited clean, annotated datasets and the difficulty of collecting diverse real-world data.&lt;/p&gt;&lt;p&gt;&lt;b&gt;Reproducibility crisis:&lt;/b&gt; We discussed how self-reported benchmarks can lead to p-hacking and irreproducible results. Centralized evaluation systems (such as Kaggle or ImageNet challenges), where researchers submit algorithms for testing on hidden test sets, seem to drive faster progress.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Smart buildings:&lt;/b&gt; Judah&apos;s work at Google focuses on using ML to optimize HVAC systems, potentially reducing energy costs and carbon emissions significantly. The challenge is that every building is different. It makes the simulation configuration extremely labor-intensive. Generative AI could help by automating the process of converting floor plans or images into accurate building simulations.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Links:&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Judah website&lt;b&gt; - &lt;/b&gt;&lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://judahgoldfeder.com/&quot; target=&quot;_blank&quot;&gt;&lt;b&gt;https://judahgoldfeder.com/&lt;/b&gt;&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Music:&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&quot;Kid Kodi&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;&quot;Palms Down&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;Changes: trimmed&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:45:16</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/e8cc4523-586e-44d4-8b86-fb53abf88bb5/images/b20981bc-703e-40c3-8e22-fbf2e4932cbc.png"/><itunes:season>1</itunes:season><itunes:episode>18</itunes:episode><itunes:title>EP18: AI Robotics</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP17: RL with Will Brown]]></title><description><![CDATA[<p>In this episode, we talk with Will Brown, a research lead at <a rel="noopener noreferrer nofollow" href="https://www.primeintellect.ai/" target="_blank">Prime Intellect</a>, about his journey into reinforcement learning (RL) and multi-agent systems, exploring their theoretical foundations and practical applications. We discuss the importance of RL in the current LLMs pipeline and the challenges it faces. We also discuss applying agentic workflows to real-world applications and the ongoing evolution of AI development.</p><p></p><p><b>Chapters</b></p><p>00:00 Introduction to Reinforcement Learning and Will's Journey</p><p>03:10 Theoretical Foundations of Multi-Agent Systems</p><p>06:09 Transitioning from Theory to Practical Applications</p><p>09:01 The Role of Game Theory in AI</p><p>11:55 Exploring the Complexity of Games and AI</p><p>14:56 Optimization Techniques in Reinforcement Learning</p><p>17:58 The Evolution of RL in LLMs</p><p>21:04 Challenges and Opportunities in RL for LLMs</p><p>23:56 Key Components for Successful RL Implementation</p><p>27:00 Future Directions in Reinforcement Learning</p><p>36:29 Exploring Agentic Reinforcement Learning Paradigms</p><p>38:45 The Role of Intermediate Results in RL</p><p>41:16 Multi-Agent Systems: Challenges and Opportunities</p><p>45:08 Distributed Environments and Decentralized RL</p><p>49:31 Prompt Optimization Techniques in RL</p><p>52:25 Statistical Rigor in Evaluations</p><p>55:49 Future Directions in Reinforcement Learning</p><p>59:50 Task-Specific Models vs. General Models</p><p>01:02:04 Insights on Random Verifiers and Learning Dynamics</p><p>01:04:39 Real-World Applications of RL and Evaluation Challenges</p><p>01:05:58 Prime RL Framework: Goals and Trade-offs</p><p>01:10:38 Open Source vs. Closed Source Models</p><p>01:13:08 Continuous Learning and Knowledge Improvement</p><p></p><p><b>Music:</b></p><p>"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>Changes: trimmed</p>]]></description><guid isPermaLink="false">dcd9c81c-70b1-4769-9fb9-ff4e01ff003b</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 24 Nov 2025 19:07:23 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/f0e3e6c8024469a55427e7a537886634902d4e0c6be9e208b4839492107d4201/eyJlcGlzb2RlSWQiOiJkY2Q5YzgxYy03MGIxLTQ3NjktOWZiOS1mZjRlMDFmZjAwM2IiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjkyM2U3YjM0MzkwMGMzYmQ1NWRiMTgzL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS0xMS0yNF9fNi01LTU1Lm1wMyJ9.mp3" length="44889130" type="audio/mpeg"/><itunes:summary>&lt;p&gt;In this episode, we talk with Will Brown, a research lead at &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://www.primeintellect.ai/&quot; target=&quot;_blank&quot;&gt;Prime Intellect&lt;/a&gt;, about his journey into reinforcement learning (RL) and multi-agent systems, exploring their theoretical foundations and practical applications. We discuss the importance of RL in the current LLMs pipeline and the challenges it faces. We also discuss applying agentic workflows to real-world applications and the ongoing evolution of AI development.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Chapters&lt;/b&gt;&lt;/p&gt;&lt;p&gt;00:00 Introduction to Reinforcement Learning and Will&apos;s Journey&lt;/p&gt;&lt;p&gt;03:10 Theoretical Foundations of Multi-Agent Systems&lt;/p&gt;&lt;p&gt;06:09 Transitioning from Theory to Practical Applications&lt;/p&gt;&lt;p&gt;09:01 The Role of Game Theory in AI&lt;/p&gt;&lt;p&gt;11:55 Exploring the Complexity of Games and AI&lt;/p&gt;&lt;p&gt;14:56 Optimization Techniques in Reinforcement Learning&lt;/p&gt;&lt;p&gt;17:58 The Evolution of RL in LLMs&lt;/p&gt;&lt;p&gt;21:04 Challenges and Opportunities in RL for LLMs&lt;/p&gt;&lt;p&gt;23:56 Key Components for Successful RL Implementation&lt;/p&gt;&lt;p&gt;27:00 Future Directions in Reinforcement Learning&lt;/p&gt;&lt;p&gt;36:29 Exploring Agentic Reinforcement Learning Paradigms&lt;/p&gt;&lt;p&gt;38:45 The Role of Intermediate Results in RL&lt;/p&gt;&lt;p&gt;41:16 Multi-Agent Systems: Challenges and Opportunities&lt;/p&gt;&lt;p&gt;45:08 Distributed Environments and Decentralized RL&lt;/p&gt;&lt;p&gt;49:31 Prompt Optimization Techniques in RL&lt;/p&gt;&lt;p&gt;52:25 Statistical Rigor in Evaluations&lt;/p&gt;&lt;p&gt;55:49 Future Directions in Reinforcement Learning&lt;/p&gt;&lt;p&gt;59:50 Task-Specific Models vs. General Models&lt;/p&gt;&lt;p&gt;01:02:04 Insights on Random Verifiers and Learning Dynamics&lt;/p&gt;&lt;p&gt;01:04:39 Real-World Applications of RL and Evaluation Challenges&lt;/p&gt;&lt;p&gt;01:05:58 Prime RL Framework: Goals and Trade-offs&lt;/p&gt;&lt;p&gt;01:10:38 Open Source vs. Closed Source Models&lt;/p&gt;&lt;p&gt;01:13:08 Continuous Learning and Knowledge Improvement&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Music:&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&quot;Kid Kodi&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;&quot;Palms Down&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;Changes: trimmed&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:05:43</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/dcd9c81c-70b1-4769-9fb9-ff4e01ff003b/images/73305543-9044-4926-ad4c-e1dbd0221e84.png"/><itunes:season>1</itunes:season><itunes:episode>17</itunes:episode><itunes:title>EP17: RL with Will Brown</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP16: AI News and Papers]]></title><description><![CDATA[<p>In this episode, we discuss various topics in AI, including the challenges of the conference review process, the capabilities of Kimi K2 thinking, the advancements in TPU technology, the significance of real-world data in robotics, and recent innovations in AI research. We also talk about the cool "Chain of Thought Hijacking" paper, how to use simple ideas to scale RL, and the implications of the Cosmos project, which aims to enable autonomous scientific discovery through AI.</p><p></p><p><b>Papers and links:</b></p><ul><li>Chain-of-Thought Hijacking - <a rel="noopener noreferrer nofollow" href="https://arxiv.org/pdf/2510.26418" target="_blank">https://arxiv.org/pdf/2510.26418</a></li><li>Kosmos: An AI Scientist for Autonomous Discovery - <a rel="noopener noreferrer nofollow" href="https://t.co/9pCr6AUXAe" target="_blank">https://t.co/9pCr6AUXAe</a></li><li>JustRL: Scaling a 1.5B LLM with a Simple RL Recipe - <a rel="noopener noreferrer nofollow" href="https://relieved-cafe-fe1.notion.site/JustRL-Scaling-a-1-5B-LLM-with-a-Simple-RL-Recipe-24f6198b0b6b80e48e74f519bfdaf0a8" target="_blank">https://relieved-cafe-fe1.notion.site/JustRL-Scaling-a-1-5B-LLM-with-a-Simple-RL-Recipe-24f6198b0b6b80e48e74f519bfdaf0a8</a></li></ul><p></p><p></p><p><b>Chapters</b></p><p>00:00 Navigating the Peer Review Process</p><p>04:17 Kimi K2 Thinking: A New Era in AI</p><p>12:27 The Future of Tool Calls in AI</p><p>17:12 Exploring Google's New TPUs</p><p>22:04 The Importance of Real-World Data in Robotics</p><p>28:10 World Models: The Next Frontier in AI</p><p>31:36 Nvidia's Dominance in AI Partnerships</p><p>32:08 Exploring Recent AI Research Papers</p><p>37:46 Chain of Thought Hijacking: A New Threat</p><p>43:05 Simplifying Reinforcement Learning Training</p><p>54:03 Cosmos: AI for Autonomous Scientific Discovery</p><p></p><p><b>Music:</b></p><p>"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>Changes: trimmed</p>]]></description><guid isPermaLink="false">4b1ea5f3-0b32-4bbf-aa41-98f65128d21b</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 17 Nov 2025 18:08:27 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/a6a3a71ed41129d47b3de9032922b6c28e01bd7b49668fb44c5085db3de26430/eyJlcGlzb2RlSWQiOiI0YjFlYTVmMy0wYjMyLTRiYmYtYWE0MS05OGY2NTEyOGQyMWIiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjkxYjM2ZjEzNmY5M2VhOTJjMWJkYmY1L3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS0xMS0xN19fMTUtNTMtMzcubXAzIn0=.mp3" length="45095666" type="audio/mpeg"/><itunes:summary>&lt;p&gt;In this episode, we discuss various topics in AI, including the challenges of the conference review process, the capabilities of Kimi K2 thinking, the advancements in TPU technology, the significance of real-world data in robotics, and recent innovations in AI research. We also talk about the cool &quot;Chain of Thought Hijacking&quot; paper, how to use simple ideas to scale RL, and the implications of the Cosmos project, which aims to enable autonomous scientific discovery through AI.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Papers and links:&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Chain-of-Thought Hijacking - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://arxiv.org/pdf/2510.26418&quot; target=&quot;_blank&quot;&gt;https://arxiv.org/pdf/2510.26418&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Kosmos: An AI Scientist for Autonomous Discovery - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://t.co/9pCr6AUXAe&quot; target=&quot;_blank&quot;&gt;https://t.co/9pCr6AUXAe&lt;/a&gt;&lt;/li&gt;&lt;li&gt;JustRL: Scaling a 1.5B LLM with a Simple RL Recipe - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://relieved-cafe-fe1.notion.site/JustRL-Scaling-a-1-5B-LLM-with-a-Simple-RL-Recipe-24f6198b0b6b80e48e74f519bfdaf0a8&quot; target=&quot;_blank&quot;&gt;https://relieved-cafe-fe1.notion.site/JustRL-Scaling-a-1-5B-LLM-with-a-Simple-RL-Recipe-24f6198b0b6b80e48e74f519bfdaf0a8&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Chapters&lt;/b&gt;&lt;/p&gt;&lt;p&gt;00:00 Navigating the Peer Review Process&lt;/p&gt;&lt;p&gt;04:17 Kimi K2 Thinking: A New Era in AI&lt;/p&gt;&lt;p&gt;12:27 The Future of Tool Calls in AI&lt;/p&gt;&lt;p&gt;17:12 Exploring Google&apos;s New TPUs&lt;/p&gt;&lt;p&gt;22:04 The Importance of Real-World Data in Robotics&lt;/p&gt;&lt;p&gt;28:10 World Models: The Next Frontier in AI&lt;/p&gt;&lt;p&gt;31:36 Nvidia&apos;s Dominance in AI Partnerships&lt;/p&gt;&lt;p&gt;32:08 Exploring Recent AI Research Papers&lt;/p&gt;&lt;p&gt;37:46 Chain of Thought Hijacking: A New Threat&lt;/p&gt;&lt;p&gt;43:05 Simplifying Reinforcement Learning Training&lt;/p&gt;&lt;p&gt;54:03 Cosmos: AI for Autonomous Scientific Discovery&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Music:&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&quot;Kid Kodi&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;&quot;Palms Down&quot; — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;Changes: trimmed&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>00:59:20</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/4b1ea5f3-0b32-4bbf-aa41-98f65128d21b/images/a025824c-c2f6-45fb-b5e2-dcc7d1498049.png"/><itunes:season>1</itunes:season><itunes:episode>16</itunes:episode><itunes:title>EP16: AI News and Papers</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP15: The Information Bottleneck and Scaling Laws with Alex Alemi]]></title><description><![CDATA[<p>In this episode, we sit down with Alex Alemi, an AI researcher at Anthropic (previously at Google Brain and Disney), to explore the powerful framework of the information bottleneck and its profound implications for modern machine learning.</p><p>We break down what the information bottleneck really means, a principled approach to retaining only the most informative parts of data while compressing away the irrelevant. We discuss why compression is still important in our era of big data, how it prevents overfitting, and why it's essential for building models that generalize well.</p><p>We also dive into scaling laws: why they matter, what we can learn from them, and what they tell us about the future of AI research.</p><p></p><p>Papers and links:</p><ul><li>Alex's website - <a rel="noopener noreferrer nofollow" href="https://www.alexalemi.com/" target="_blank">https://www.alexalemi.com/</a></li><li>Scaling exponents across parameterizations and optimizers - <a rel="noopener noreferrer nofollow" href="https://arxiv.org/abs/2407.05872" target="_blank">https://arxiv.org/abs/2407.05872</a></li><li>Deep Variational Information Bottleneck - <a rel="noopener noreferrer nofollow" href="https://arxiv.org/abs/1612.00410" target="_blank"><b>https://arxiv.org/abs/1612.00410</b></a></li><li>Layer by Layer: Uncovering Hidden Representations in Language Models - <a rel="noopener noreferrer nofollow" href="https://arxiv.org/abs/2502.02013" target="_blank">https://arxiv.org/abs/2502.02013</a></li><li>Information in Infinite Ensembles of Infinitely-Wide Neural Networks - <a rel="noopener noreferrer nofollow" href="https://proceedings.mlr.press/v118/shwartz-ziv20a.html" target="_blank">https://proceedings.mlr.press/v118/shwartz-ziv20a.html</a></li></ul><p></p><p>Music:</p><p>“Kid Kodi” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>“Palms Down” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>Changes: trimmed</p><ul><li></li></ul>]]></description><guid isPermaLink="false">cdf2b0a0-66ca-4577-b70a-141b9d7e80f1</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Thu, 13 Nov 2025 18:17:15 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/78ec5dab84118aef1cf508a43fbba45b0d32a5f4eb01705fd6dcaf77492fdd5d/eyJlcGlzb2RlSWQiOiJjZGYyYjBhMC02NmNhLTQ1NzctYjcwYS0xNDFiOWQ3ZTgwZjEiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjkxNjBlNzk4YzQzNTU1MjlmZWY3YTkyL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS0xMS0xM19fMTctNTktMzcubXAzIn0=.mp3" length="67766798" type="audio/mpeg"/><itunes:summary>&lt;p&gt;In this episode, we sit down with Alex Alemi, an AI researcher at Anthropic (previously at Google Brain and Disney), to explore the powerful framework of the information bottleneck and its profound implications for modern machine learning.&lt;/p&gt;&lt;p&gt;We break down what the information bottleneck really means, a principled approach to retaining only the most informative parts of data while compressing away the irrelevant. We discuss why compression is still important in our era of big data, how it prevents overfitting, and why it&apos;s essential for building models that generalize well.&lt;/p&gt;&lt;p&gt;We also dive into scaling laws: why they matter, what we can learn from them, and what they tell us about the future of AI research.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Papers and links:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Alex&apos;s website - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://www.alexalemi.com/&quot; target=&quot;_blank&quot;&gt;https://www.alexalemi.com/&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Scaling exponents across parameterizations and optimizers - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://arxiv.org/abs/2407.05872&quot; target=&quot;_blank&quot;&gt;https://arxiv.org/abs/2407.05872&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Deep Variational Information Bottleneck - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://arxiv.org/abs/1612.00410&quot; target=&quot;_blank&quot;&gt;&lt;b&gt;https://arxiv.org/abs/1612.00410&lt;/b&gt;&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Layer by Layer: Uncovering Hidden Representations in Language Models - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://arxiv.org/abs/2502.02013&quot; target=&quot;_blank&quot;&gt;https://arxiv.org/abs/2502.02013&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Information in Infinite Ensembles of Infinitely-Wide Neural Networks - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://proceedings.mlr.press/v118/shwartz-ziv20a.html&quot; target=&quot;_blank&quot;&gt;https://proceedings.mlr.press/v118/shwartz-ziv20a.html&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;p&gt;“Kid Kodi” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;“Palms Down” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;Changes: trimmed&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&lt;/li&gt;&lt;/ul&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:22:50</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/cdf2b0a0-66ca-4577-b70a-141b9d7e80f1/images/3fc772a8-410f-4aa1-82f6-e1f4d943a84e.png"/><itunes:season>1</itunes:season><itunes:episode>15</itunes:episode><itunes:title>EP15: The Information Bottleneck and Scaling Laws with Alex Alemi</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP14: AI News and Papers ]]></title><description><![CDATA[<p>In this episode, we talked about AI news and recent papers. We explored the complexities of using AI models in healthcare (the Nature Medicine paper on GPT-5's fragile intelligence in medical contexts). We discussed the delicate balance between leveraging LLMs as powerful research tools and the risks of over-reliance, touching on issues such as hallucinations, medical disagreements among practitioners, and the need for better education on responsible AI use in healthcare.</p><p>We also talked about Stanford's "Cartridges" paper, which presents an innovative approach to long-context language models. The paper tackles the expensive computational costs of billion-token context windows by compressing KV caches through a clever "self-study" method using synthetic question-answer pairs and context distillation. We discussed the implications for personalization, composability, and making long-context models more practical.</p><p>Additionally, we explored the "Continuous Autoregressive Language Models" paper and touched on insights from the Smol Training Playbook.</p><p><b>Papers discussed:</b></p><ul><li>The fragile intelligence of GPT-5 in medicine: <a rel="noopener noreferrer nofollow" href="https://www.nature.com/articles/s41591-025-04008-8" target="_blank">https://www.nature.com/articles/s41591-025-04008-8</a></li><li>Cartridges: Lightweight and general-purpose long context representations via self-study: <a rel="noopener noreferrer nofollow" href="https://arxiv.org/abs/2506.06266" target="_blank">https://arxiv.org/abs/2506.06266</a></li><li>Continuous Autoregressive Language Models: <a rel="noopener noreferrer nofollow" href="https://arxiv.org/abs/2510.27688" target="_blank">https://arxiv.org/abs/2510.27688</a></li><li>The Smol Training Playbook: <a rel="noopener noreferrer nofollow" href="https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook" target="_blank">https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook</a></li></ul><p></p><p>Music:</p><p>“Kid Kodi” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>“Palms Down” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>Changes: trimmed</p><p></p><p>This is an experimental format for us, just news and papers without a guest interview. Let us know what you think!<br /><br /></p>]]></description><guid isPermaLink="false">bb45146c-15d6-4f88-ab22-91ca31759664</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 10 Nov 2025 15:42:55 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/abb11b378cc7954779dba3b1706d97164345062ac62a86a4e966c5210886d085/eyJlcGlzb2RlSWQiOiJiYjQ1MTQ2Yy0xNWQ2LTRmODgtYWIyMi05MWNhMzE3NTk2NjQiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjkxMWZkMDk4N2VjYzQ3ZDczYjA3NmRkL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS0xMS0xMF9fMTUtNTYtOS5tcDMifQ==.mp3" length="40556720" type="audio/mpeg"/><itunes:summary>&lt;p&gt;In this episode, we talked about AI news and recent papers. We explored the complexities of using AI models in healthcare (the Nature Medicine paper on GPT-5&apos;s fragile intelligence in medical contexts). We discussed the delicate balance between leveraging LLMs as powerful research tools and the risks of over-reliance, touching on issues such as hallucinations, medical disagreements among practitioners, and the need for better education on responsible AI use in healthcare.&lt;/p&gt;&lt;p&gt;We also talked about Stanford&apos;s &quot;Cartridges&quot; paper, which presents an innovative approach to long-context language models. The paper tackles the expensive computational costs of billion-token context windows by compressing KV caches through a clever &quot;self-study&quot; method using synthetic question-answer pairs and context distillation. We discussed the implications for personalization, composability, and making long-context models more practical.&lt;/p&gt;&lt;p&gt;Additionally, we explored the &quot;Continuous Autoregressive Language Models&quot; paper and touched on insights from the Smol Training Playbook.&lt;/p&gt;&lt;p&gt;&lt;b&gt;Papers discussed:&lt;/b&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;The fragile intelligence of GPT-5 in medicine: &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://www.nature.com/articles/s41591-025-04008-8&quot; target=&quot;_blank&quot;&gt;https://www.nature.com/articles/s41591-025-04008-8&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Cartridges: Lightweight and general-purpose long context representations via self-study: &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://arxiv.org/abs/2506.06266&quot; target=&quot;_blank&quot;&gt;https://arxiv.org/abs/2506.06266&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Continuous Autoregressive Language Models: &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://arxiv.org/abs/2510.27688&quot; target=&quot;_blank&quot;&gt;https://arxiv.org/abs/2510.27688&lt;/a&gt;&lt;/li&gt;&lt;li&gt;The Smol Training Playbook: &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook&quot; target=&quot;_blank&quot;&gt;https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;p&gt;“Kid Kodi” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;“Palms Down” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;Changes: trimmed&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;This is an experimental format for us, just news and papers without a guest interview. Let us know what you think!&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>00:57:20</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/bb45146c-15d6-4f88-ab22-91ca31759664/images/406bf8f8-5e80-4c2a-a7d5-72c357f7aefd.png"/><itunes:season>1</itunes:season><itunes:episode>14</itunes:episode><itunes:title>EP14: AI News and Papers </itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP13: Recurrent-Depth Models and Latent Reasoning with Jonas Geiping ]]></title><description><![CDATA[<p>In this episode, we host Jonas Geiping from ELLIS Institute &amp; Max-Planck Institute for Intelligent Systems, Tübingen AI Center, Germany. We talked about his broad research on Recurrent-Depth Models and latent reasoning in large language models (LLMs). We talked about what these models can and can't do, what are the challenges and next breakthroughs in the field, world models, and the future of developing better models. We also talked about safety and interpretability, and the role of scaling laws in AI development.</p><p></p><p><b>Chapters</b></p><p><b>00:00 </b>Introduction and Guest Introduction</p><p><b>01:03 </b>Peer Review in Preprint Servers</p><p><b>06:57 </b>New Developments in Coding Models</p><p><b>09:34 </b>Open Source Models in Europe</p><p><b>11:00 </b>Dynamic Layers in LLMs</p><p><b>26:05 </b>Training Playbook Insights</p><p><b>30:05 </b>Recurrent Depth Models and Reasoning Tasks</p><p><b>43:59 </b>Exploring Recursive Reasoning Models</p><p><b>46:46 </b>The Role of World Models in AI</p><p><b>48:41 </b>Innovations in AI Training and Simulation</p><p><b>50:39 </b>The Promise of Recurrent Depth Models</p><p><b>52:34 </b>Navigating the Future of AI Algorithms</p><p><b>54:44 </b>The Bitter Lesson of AI Development</p><p><b>59:11 </b>Advising the Next Generation of Researchers</p><p><b>01:06:42 </b>Safety and Interpretability in AI Models</p><p><b>01:10:46 </b>Scaling Laws and Their Implications</p><p><b>01:16:19 </b>The Role of PhDs in AI Research</p><p></p><p>Links and paper:</p><ul><li>Jonas' website - <a rel="noopener noreferrer nofollow" href="https://jonasgeiping.github.io/" target="_blank">https://jonasgeiping.github.io/</a></li><li>Scaling up test-time compute with latent reasoning: A recurrent depth approach - <a rel="noopener noreferrer nofollow" href="https://arxiv.org/abs/2502.05171" target="_blank">https://arxiv.org/abs/2502.05171</a></li><li>The Smol Training Playbook: The Secrets to Building World-Class LLMs - <a rel="noopener noreferrer nofollow" href="https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook" target="_blank">https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook</a></li><li>VaultGemma: A Differentially Private Gemma Model - <a rel="noopener noreferrer nofollow" href="https://arxiv.org/abs/2510.15001" target="_blank">https://arxiv.org/abs/2510.15001</a></li></ul><p></p><p>Music:</p><p>“Kid Kodi” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>“Palms Down” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.</p><p>Changes: trimmed</p>]]></description><guid isPermaLink="false">f69710ab-265a-42d5-b682-730a23e26baa</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Fri, 07 Nov 2025 14:19:46 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/7adefce1f8ff6416f896b0038afa7416a7e0e1ef2a09c9456d6998d239bb4aae/eyJlcGlzb2RlSWQiOiJmNjk3MTBhYi0yNjVhLTQyZDUtYjY4Mi03MzBhMjNlMjZiYWEiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjkwZGYyNDUxZmI3Y2RjNTYwMGZhZTA5L3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS0xMS03X18xNC0yMS05Lm1wMyJ9.mp3" length="59366473" type="audio/mpeg"/><itunes:summary>&lt;p&gt;In this episode, we host Jonas Geiping from ELLIS Institute &amp;amp; Max-Planck Institute for Intelligent Systems, Tübingen AI Center, Germany. We talked about his broad research on Recurrent-Depth Models and latent reasoning in large language models (LLMs). We talked about what these models can and can&apos;t do, what are the challenges and next breakthroughs in the field, world models, and the future of developing better models. We also talked about safety and interpretability, and the role of scaling laws in AI development.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Chapters&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;00:00 &lt;/b&gt;Introduction and Guest Introduction&lt;/p&gt;&lt;p&gt;&lt;b&gt;01:03 &lt;/b&gt;Peer Review in Preprint Servers&lt;/p&gt;&lt;p&gt;&lt;b&gt;06:57 &lt;/b&gt;New Developments in Coding Models&lt;/p&gt;&lt;p&gt;&lt;b&gt;09:34 &lt;/b&gt;Open Source Models in Europe&lt;/p&gt;&lt;p&gt;&lt;b&gt;11:00 &lt;/b&gt;Dynamic Layers in LLMs&lt;/p&gt;&lt;p&gt;&lt;b&gt;26:05 &lt;/b&gt;Training Playbook Insights&lt;/p&gt;&lt;p&gt;&lt;b&gt;30:05 &lt;/b&gt;Recurrent Depth Models and Reasoning Tasks&lt;/p&gt;&lt;p&gt;&lt;b&gt;43:59 &lt;/b&gt;Exploring Recursive Reasoning Models&lt;/p&gt;&lt;p&gt;&lt;b&gt;46:46 &lt;/b&gt;The Role of World Models in AI&lt;/p&gt;&lt;p&gt;&lt;b&gt;48:41 &lt;/b&gt;Innovations in AI Training and Simulation&lt;/p&gt;&lt;p&gt;&lt;b&gt;50:39 &lt;/b&gt;The Promise of Recurrent Depth Models&lt;/p&gt;&lt;p&gt;&lt;b&gt;52:34 &lt;/b&gt;Navigating the Future of AI Algorithms&lt;/p&gt;&lt;p&gt;&lt;b&gt;54:44 &lt;/b&gt;The Bitter Lesson of AI Development&lt;/p&gt;&lt;p&gt;&lt;b&gt;59:11 &lt;/b&gt;Advising the Next Generation of Researchers&lt;/p&gt;&lt;p&gt;&lt;b&gt;01:06:42 &lt;/b&gt;Safety and Interpretability in AI Models&lt;/p&gt;&lt;p&gt;&lt;b&gt;01:10:46 &lt;/b&gt;Scaling Laws and Their Implications&lt;/p&gt;&lt;p&gt;&lt;b&gt;01:16:19 &lt;/b&gt;The Role of PhDs in AI Research&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Links and paper:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Jonas&apos; website - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://jonasgeiping.github.io/&quot; target=&quot;_blank&quot;&gt;https://jonasgeiping.github.io/&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Scaling up test-time compute with latent reasoning: A recurrent depth approach - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://arxiv.org/abs/2502.05171&quot; target=&quot;_blank&quot;&gt;https://arxiv.org/abs/2502.05171&lt;/a&gt;&lt;/li&gt;&lt;li&gt;The Smol Training Playbook: The Secrets to Building World-Class LLMs - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook&quot; target=&quot;_blank&quot;&gt;https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook&lt;/a&gt;&lt;/li&gt;&lt;li&gt;VaultGemma: A Differentially Private Gemma Model - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://arxiv.org/abs/2510.15001&quot; target=&quot;_blank&quot;&gt;https://arxiv.org/abs/2510.15001&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Music:&lt;/p&gt;&lt;p&gt;“Kid Kodi” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;“Palms Down” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.&lt;/p&gt;&lt;p&gt;Changes: trimmed&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:21:15</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/f69710ab-265a-42d5-b682-730a23e26baa/images/c522a8ca-f4da-46fd-975d-d019e825392e.png"/><itunes:season>1</itunes:season><itunes:episode>13</itunes:episode><itunes:title>EP13: Recurrent-Depth Models and Latent Reasoning with Jonas Geiping </itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP12:  Adversarial attacks and compression with Jack Morris
]]></title><description><![CDATA[<p>In this episode of the Information Bottleneck Podcast, we host Jack Morris, a PhD student at Cornell, to discuss adversarial examples (Jack created <a rel="noopener noreferrer nofollow" href="https://arxiv.org/abs/2005.05909" target="_blank">TextAttack</a>, the first software package for LLM jailbreaking), the Platonic representation hypothesis, the implications of inversion techniques, and the role of compression in language models.</p><p></p><p><b>Links:</b></p><p>Jack's Website - <a rel="noopener noreferrer nofollow" href="https://jxmo.io/" target="_blank">https://jxmo.io/</a></p><p>TextAttack - <a rel="noopener noreferrer nofollow" href="https://arxiv.org/abs/2005.05909" target="_blank">https://arxiv.org/abs/2005.05909</a></p><p>How much do language models memorize? <a rel="noopener noreferrer nofollow" href="https://arxiv.org/abs/2505.24832" target="_blank">https://arxiv.org/abs/2505.24832</a></p><p>DeepSeek OCR - <a rel="noopener noreferrer nofollow" href="https://www.arxiv.org/abs/2510.18234" target="_blank">https://www.arxiv.org/abs/2510.18234</a></p><p></p><p></p><p><b>Chapters:</b></p><p>00:00 Introduction and AI News Highlights</p><p>04:53 The Importance of Fine-Tuning Models</p><p>10:01 Challenges in Open Source AI Models</p><p>14:34 The Future of Model Scaling and Sparsity</p><p>19:39 Exploring Model Routing and User Experience</p><p>24:34 Jack's Research: Text Attack and Adversarial Examples</p><p>29:33 The Platonic Representation Hypothesis</p><p>34:23 Implications of Inversion and Security in AI</p><p>39:20 The Role of Compression in Language Models</p><p>44:10 Future Directions in AI Research and Personalization</p>]]></description><guid isPermaLink="false">7136436c-215e-4033-ae62-9833aeb47028</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 03 Nov 2025 02:40:41 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/91ceeb15faa3a87b4b16fe93c0520de97383b9635053d803f08144b12147c398/eyJlcGlzb2RlSWQiOiI3MTM2NDM2Yy0yMTVlLTQwMzMtYWU2Mi05ODMzYWViNDcwMjgiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjkwODA5ODVlOWNhMTVkY2EyYjhjNTUyL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS0xMS0zX18yLTQ2LTQ1Lm1wMyJ9.mp3" length="44125613" type="audio/mpeg"/><itunes:summary>&lt;p&gt;In this episode of the Information Bottleneck Podcast, we host Jack Morris, a PhD student at Cornell, to discuss adversarial examples (Jack created &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://arxiv.org/abs/2005.05909&quot; target=&quot;_blank&quot;&gt;TextAttack&lt;/a&gt;, the first software package for LLM jailbreaking), the Platonic representation hypothesis, the implications of inversion techniques, and the role of compression in language models.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Links:&lt;/b&gt;&lt;/p&gt;&lt;p&gt;Jack&apos;s Website - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://jxmo.io/&quot; target=&quot;_blank&quot;&gt;https://jxmo.io/&lt;/a&gt;&lt;/p&gt;&lt;p&gt;TextAttack - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://arxiv.org/abs/2005.05909&quot; target=&quot;_blank&quot;&gt;https://arxiv.org/abs/2005.05909&lt;/a&gt;&lt;/p&gt;&lt;p&gt;How much do language models memorize? &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://arxiv.org/abs/2505.24832&quot; target=&quot;_blank&quot;&gt;https://arxiv.org/abs/2505.24832&lt;/a&gt;&lt;/p&gt;&lt;p&gt;DeepSeek OCR - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://www.arxiv.org/abs/2510.18234&quot; target=&quot;_blank&quot;&gt;https://www.arxiv.org/abs/2510.18234&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Chapters:&lt;/b&gt;&lt;/p&gt;&lt;p&gt;00:00 Introduction and AI News Highlights&lt;/p&gt;&lt;p&gt;04:53 The Importance of Fine-Tuning Models&lt;/p&gt;&lt;p&gt;10:01 Challenges in Open Source AI Models&lt;/p&gt;&lt;p&gt;14:34 The Future of Model Scaling and Sparsity&lt;/p&gt;&lt;p&gt;19:39 Exploring Model Routing and User Experience&lt;/p&gt;&lt;p&gt;24:34 Jack&apos;s Research: Text Attack and Adversarial Examples&lt;/p&gt;&lt;p&gt;29:33 The Platonic Representation Hypothesis&lt;/p&gt;&lt;p&gt;34:23 Implications of Inversion and Security in AI&lt;/p&gt;&lt;p&gt;39:20 The Role of Compression in Language Models&lt;/p&gt;&lt;p&gt;44:10 Future Directions in AI Research and Personalization&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>00:58:07</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/7136436c-215e-4033-ae62-9833aeb47028/images/afb2bbf2-448e-4e04-a218-de7e5c5676a2.png"/><itunes:season>1</itunes:season><itunes:episode>12</itunes:episode><itunes:title>EP12:  Adversarial attacks and compression with Jack Morris
</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP11: JEPA with Randall Balestriero]]></title><description><![CDATA[<p>In this episode we talk with Randall Balestriero, an assistant professor at Brown University. We discuss the potential and challenges of Joint Embedding Predictive Architectures (JEPA). We explore the concept of JEPA, which aims to learn good data representations without reconstruction-based learning. We talk about the importance of understanding and compressing irrelevant details, the role of prediction tasks, and the challenges of preventing collapse.</p>]]></description><guid isPermaLink="false">f614b7f0-8635-48e1-bc5d-60315ac55b33</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Tue, 28 Oct 2025 01:35:12 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/767b3d85d3cb9cb465d4dc66928a54ee7aa263ce4d017d7fcb3f994d2fe5eebe/eyJlcGlzb2RlSWQiOiJmNjE0YjdmMC04NjM1LTQ4ZTEtYmM1ZC02MDMxNWFjNTViMzMiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjhmZjdiNWNmNmI3NzdkNDgxOTFhM2MxL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS0xMC0yN19fMTUtMi00Lm1wMyJ9.mp3" length="59945662" type="audio/mpeg"/><itunes:summary>&lt;p&gt;In this episode we talk with Randall Balestriero, an assistant professor at Brown University. We discuss the potential and challenges of Joint Embedding Predictive Architectures (JEPA). We explore the concept of JEPA, which aims to learn good data representations without reconstruction-based learning. We talk about the importance of understanding and compressing irrelevant details, the role of prediction tasks, and the challenges of preventing collapse.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:18:04</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/f614b7f0-8635-48e1-bc5d-60315ac55b33/images/ec090f57-2b2e-492c-b3a5-eb5f84e39e9f.png"/><itunes:season>1</itunes:season><itunes:episode>11</itunes:episode><itunes:title>EP11: JEPA with Randall Balestriero</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP10: Geometric Deep Learning with Michael Bronstein]]></title><description><![CDATA[<p>In this episode, we talked with Michael Bronstein, a professor of AI at the University of Oxford and a scientific director at AITHYRA, about the fascinating world of geometric deep learning. We explored how understanding the geometric structures in data can enhance the efficiency and accuracy of AI models. Michael shared insights on the limitations of small neural networks and the ongoing debate about the role of scaling in AI. We also talked about the future in scientific discovery, and the potential impact on fields like drug design and mathematics</p>]]></description><guid isPermaLink="false">c7035d99-53f8-4172-9b84-1cfdaf950185</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 20 Oct 2025 15:52:52 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/79d3a40f826fc6a39be77aae8edeae7714094ce00eb91eaa78860615bbbddae9/eyJlcGlzb2RlSWQiOiJjNzAzNWQ5OS01M2Y4LTQxNzItOWI4NC0xY2ZkYWY5NTAxODUiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjhmNTk2YmE5MGJkMzgxOTM1MzA3MjkxL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS0xMC0yMF9fMy01Ni0xMC5tcDMifQ==.mp3" length="62869271" type="audio/mpeg"/><itunes:summary>&lt;p&gt;In this episode, we talked with Michael Bronstein, a professor of AI at the University of Oxford and a scientific director at AITHYRA, about the fascinating world of geometric deep learning. We explored how understanding the geometric structures in data can enhance the efficiency and accuracy of AI models. Michael shared insights on the limitations of small neural networks and the ongoing debate about the role of scaling in AI. We also talked about the future in scientific discovery, and the potential impact on fields like drug design and mathematics&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:17:49</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/c7035d99-53f8-4172-9b84-1cfdaf950185/images/36fe61b0-757a-428e-ad88-235e6e43b637.png"/><itunes:season>1</itunes:season><itunes:episode>10</itunes:episode><itunes:title>EP10: Geometric Deep Learning with Michael Bronstein</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP9: AI in Natural Sciences with Tal Kachman]]></title><description><![CDATA[<p>In this episode we host Tal Kachman, an assistant professor at Radboud University, to explore the fascinating intersection of artificial intelligence and natural sciences. Prof. Kachman's research focuses on multiagent interaction, complex systems, and reinforcement learning. We dive deep into how AI is revolutionizing materials discovery, chemical dynamics modeling, and experimental design through self-driving laboratories. Prof. Kachman shares insights on the challenges of integrating physics and chemistry with AI systems, the critical role of high-throughput experimentation in accelerating scientific discovery, and the transformative potential of generative models to unlock new materials and functionalities.</p>]]></description><guid isPermaLink="false">c7401d57-69e1-48b4-bd26-0a05bc778468</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 13 Oct 2025 04:20:02 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/20e470f1cd94876d6691f702d543ccbde1f622f8bbd4ebdbe06c162cc619f4cf/eyJlcGlzb2RlSWQiOiJjNzQwMWQ1Ny02OWUxLTQ4YjQtYmQyNi0wYTA1YmM3Nzg0NjgiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjhlYzdhNGYyZDIwNGQ5OWEyZjE3MDBkL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS0xMC0xM19fNi00LTMxLm1wMyJ9.mp3" length="45258973" type="audio/mpeg"/><itunes:summary>&lt;p&gt;In this episode we host Tal Kachman, an assistant professor at Radboud University, to explore the fascinating intersection of artificial intelligence and natural sciences. Prof. Kachman&apos;s research focuses on multiagent interaction, complex systems, and reinforcement learning. We dive deep into how AI is revolutionizing materials discovery, chemical dynamics modeling, and experimental design through self-driving laboratories. Prof. Kachman shares insights on the challenges of integrating physics and chemistry with AI systems, the critical role of high-throughput experimentation in accelerating scientific discovery, and the transformative potential of generative models to unlock new materials and functionalities.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:07:42</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/c7401d57-69e1-48b4-bd26-0a05bc778468/images/063d6b3a-b5dd-4b56-b42f-5eb3a2614250.png"/><itunes:season>1</itunes:season><itunes:episode>9</itunes:episode><itunes:title>EP9: AI in Natural Sciences with Tal Kachman</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP8: RL with Ahmad Beirami]]></title><description><![CDATA[<p>In this episode, we talked with Ahmad Beirami, an ex-researcher at Google, to discuss various topics. We explored the complexities of reinforcement learning, its applications in LLMs, and the evaluation challenges in AI research. We also discussed the dynamics of academic conferences and the broken review system. Finally, we discussed how to integrate theory and practice in AI research and why the community should prioritize a deeper understanding over surface-level improvements.</p>]]></description><guid isPermaLink="false">1eb60600-ae79-4c59-82f6-ac402da6f3f4</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Tue, 07 Oct 2025 02:38:14 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/d93d9a0fa9665721af813dcdade0894e7fc7544511035f3ae9dd4ff4c1c7b1f1/eyJlcGlzb2RlSWQiOiIxZWI2MDYwMC1hZTc5LTRjNTktODJmNi1hYzQwMmRhNmYzZjQiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjhlNDBiYjEzNTdjNDU1ZmQyYjJmZDhiL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS0xMC02X18yMC0zNC0yNS5tcDMifQ==.mp3" length="51115198" type="audio/mpeg"/><itunes:summary>&lt;p&gt;In this episode, we talked with Ahmad Beirami, an ex-researcher at Google, to discuss various topics. We explored the complexities of reinforcement learning, its applications in LLMs, and the evaluation challenges in AI research. We also discussed the dynamics of academic conferences and the broken review system. Finally, we discussed how to integrate theory and practice in AI research and why the community should prioritize a deeper understanding over surface-level improvements.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:07:09</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/1eb60600-ae79-4c59-82f6-ac402da6f3f4/images/92f8b6f6-0387-4aef-9ed9-6f29ef4bbf3a.png"/><itunes:season>1</itunes:season><itunes:episode>8</itunes:episode><itunes:title>EP8: RL with Ahmad Beirami</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP7: AI and Neuroscience with Aran Nayebi ]]></title><description><![CDATA[<p>In this episode of the "Information Bottleneck" podcast, we hosted Aran Nayeb, an assistant professor at Carnegie Mellon University, to discuss the intersection of computational neuroscience and machine learning. We talked about the challenges and opportunities in understanding intelligence through the lens of both biological and artificial systems. We talked about topics such as the evolution of neural networks, the role of intrinsic motivation in AI, and the future of brain-machine interfaces. </p>]]></description><guid isPermaLink="false">f8276a52-3a0a-48a6-8475-0b6aa88a1032</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 29 Sep 2025 13:55:24 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/340fbf27025076ca3130fd92d9f38d7947f55ab946b15beb7e948a962ca69f75/eyJlcGlzb2RlSWQiOiJmODI3NmE1Mi0zYTBhLTQ4YTYtODQ3NS0wYjZhYTg4YTEwMzIiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjhkOTViZjAyODJhYmUzODU1MmViMDNhL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS05LTI4X18xOC0xLTUyLm1wMyJ9.mp3" length="52757329" type="audio/mpeg"/><itunes:summary>&lt;p&gt;In this episode of the &quot;Information Bottleneck&quot; podcast, we hosted Aran Nayeb, an assistant professor at Carnegie Mellon University, to discuss the intersection of computational neuroscience and machine learning. We talked about the challenges and opportunities in understanding intelligence through the lens of both biological and artificial systems. We talked about topics such as the evolution of neural networks, the role of intrinsic motivation in AI, and the future of brain-machine interfaces. &lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:09:12</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/f8276a52-3a0a-48a6-8475-0b6aa88a1032/images/6e7b5ed4-e28f-4ca0-8be4-f70731c3a838.png"/><itunes:season>1</itunes:season><itunes:episode>7</itunes:episode><itunes:title>EP7: AI and Neuroscience with Aran Nayebi </itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP6:  Urban Design Meets AI: With Ariel Noyman]]></title><description><![CDATA[<p>We talked with Ariel Noyman, an urban scientist, working in the intersection of cities and technology. Ariel is a research scientist at the MIT Media Lab, exploring novel methods of urban modeling and simulation using AI. We discussed the potential of virtual environments to enhance urban design processes, the challenges associated with them, and the future of utilizing AI. </p><p>Links:</p><ul><li>TravelAgent: Generative agents in the built environment - <a rel="noopener noreferrer nofollow" href="https://journals.sagepub.com/doi/10.1177/23998083251360458" target="_blank">https://journals.sagepub.com/doi/10.1177/23998083251360458</a></li><li>Ariel Neumann's websites -<ul><li><a rel="noopener noreferrer nofollow" href="https://www.arielnoyman.com/" target="_blank">https://www.arielnoyman.com/</a></li><li><a rel="noopener noreferrer nofollow" href="https://www.media.mit.edu/people/noyman/overview/" target="_blank">https://www.media.mit.edu/people/noyman/overview/</a></li></ul></li></ul>]]></description><guid isPermaLink="false">281e3269-66e0-4895-a5d0-9174a8719983</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Sun, 21 Sep 2025 01:58:29 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/19831c8c81da9b9e8a2ebd850e68c9d4ea80572a51b36c08af49f686994a4374/eyJlcGlzb2RlSWQiOiIyODFlMzI2OS02NmUwLTQ4OTUtYTVkMC05MTc0YTg3MTk5ODMiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjhjZTFjNWM2YmE4OWJlNzBmMThjYTEyL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS05LTIwX181LTE1LTQwLm1wMyJ9.mp3" length="41753268" type="audio/mpeg"/><itunes:summary>&lt;p&gt;We talked with Ariel Noyman, an urban scientist, working in the intersection of cities and technology. Ariel is a research scientist at the MIT Media Lab, exploring novel methods of urban modeling and simulation using AI. We discussed the potential of virtual environments to enhance urban design processes, the challenges associated with them, and the future of utilizing AI. &lt;/p&gt;&lt;p&gt;Links:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;TravelAgent: Generative agents in the built environment - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://journals.sagepub.com/doi/10.1177/23998083251360458&quot; target=&quot;_blank&quot;&gt;https://journals.sagepub.com/doi/10.1177/23998083251360458&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Ariel Neumann&apos;s websites -&lt;ul&gt;&lt;li&gt;&lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://www.arielnoyman.com/&quot; target=&quot;_blank&quot;&gt;https://www.arielnoyman.com/&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://www.media.mit.edu/people/noyman/overview/&quot; target=&quot;_blank&quot;&gt;https://www.media.mit.edu/people/noyman/overview/&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:07:05</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/281e3269-66e0-4895-a5d0-9174a8719983/images/ef496407-7a3e-437e-a28d-62b783a09157.png"/><itunes:season>1</itunes:season><itunes:episode>6</itunes:episode><itunes:title>EP6:  Urban Design Meets AI: With Ariel Noyman</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP5: Speculative Decoding with Nadav Timor]]></title><description><![CDATA[<p>We discussed the inference optimization technique known as Speculative Decoding with a world class researcher, expert, and ex-coworker of the podcast hosts: Nadav Timor.</p><p></p><p>Papers and links:</p><ul><li>Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies, Timor et al, ICML 2025, <a rel="noopener noreferrer nofollow" href="https://arxiv.org/abs/2502.05202" target="_blank">https://arxiv.org/abs/2502.05202</a></li><li>Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model Inference, Timor et al, ICLR, 2025, <a rel="noopener noreferrer nofollow" href="https://arxiv.org/abs/2405.14105" target="_blank">https://arxiv.org/abs/2405.14105</a></li><li>Fast Inference from Transformers via Speculative Decoding, Leviathan et al, 2022, <a rel="noopener noreferrer nofollow" href="https://arxiv.org/abs/2502.05202" target="_blank">https://arxiv.org/abs/2502.05202</a></li></ul><ul><li>FindPDFs - <a rel="noopener noreferrer nofollow" href="https://huggingface.co/datasets/HuggingFaceFW/finepdfs" target="_blank">https://huggingface.co/datasets/HuggingFaceFW/finepdfs</a></li></ul><p></p><p></p>]]></description><guid isPermaLink="false">8dd62ab2-116b-43bb-8ab0-09f062d8b89f</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Tue, 16 Sep 2025 21:00:40 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/54299f8ddf78c848add890bedebc1c1c64cb22fac9dc422a8d14910e5ed3c663/eyJlcGlzb2RlSWQiOiI4ZGQ2MmFiMi0xMTZiLTQzYmItOGFiMC0wOWYwNjJkOGI4OWYiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjhjYjM5ZTQ3NzhiMmJiODI1NjE0YzYzL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS05LTE4X18wLTQ0LTUyLm1wMyJ9.mp3" length="48558822" type="audio/mpeg"/><itunes:summary>&lt;p&gt;We discussed the inference optimization technique known as Speculative Decoding with a world class researcher, expert, and ex-coworker of the podcast hosts: Nadav Timor.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Papers and links:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies, Timor et al, ICML 2025, &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://arxiv.org/abs/2502.05202&quot; target=&quot;_blank&quot;&gt;https://arxiv.org/abs/2502.05202&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model Inference, Timor et al, ICLR, 2025, &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://arxiv.org/abs/2405.14105&quot; target=&quot;_blank&quot;&gt;https://arxiv.org/abs/2405.14105&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Fast Inference from Transformers via Speculative Decoding, Leviathan et al, 2022, &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://arxiv.org/abs/2502.05202&quot; target=&quot;_blank&quot;&gt;https://arxiv.org/abs/2502.05202&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;FindPDFs - &lt;a rel=&quot;noopener noreferrer nofollow&quot; href=&quot;https://huggingface.co/datasets/HuggingFaceFW/finepdfs&quot; target=&quot;_blank&quot;&gt;https://huggingface.co/datasets/HuggingFaceFW/finepdfs&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:02:22</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/8dd62ab2-116b-43bb-8ab0-09f062d8b89f/images/a829b2bd-819c-4fdd-9932-ee1d61ddcea0.png"/><itunes:season>1</itunes:season><itunes:episode>5</itunes:episode><itunes:title>EP5: Speculative Decoding with Nadav Timor</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP4: AI Coding]]></title><description><![CDATA[<p>In this episode, Ravid and Allen discuss the evolving landscape of AI coding. They explore the rise of AI-assisted development tools, the challenges faced in software engineering, and the potential future of AI in creative fields. The conversation highlights both the benefits and limitations of AI in coding, emphasizing the need for careful consideration of its impact on the industry and society.</p><p></p><p><b>Chapters</b></p><p><b>00:00</b>Introduction to AI Coding and Recent Developments</p><p><b>03:10</b>OpenAI's Paper on Hallucinations in LLMs</p><p><b>06:03</b>Critique of OpenAI's Research Approach</p><p><b>08:50</b>Copyright Issues in AI Training Data</p><p><b>12:00</b>The Value of Data in AI Training</p><p><b>14:50</b>Watermarking AI Generated Content</p><p><b>17:54</b>The Future of AI Investment and Market Dynamics</p><p><b>20:49</b>AI Coding and Its Impact on Software Development</p><p><b>31:36</b>The Evolution of AI in Software Development</p><p><b>33:54</b>Vibe Coding: The Future or a Fad?</p><p><b>38:24</b>Navigating AI Tools: Personal Experiences and Challenges</p><p><b>41:53</b>The Limitations of AI in Complex Coding Tasks</p><p><b>46:52</b>Security Vulnerabilities in AI-Generated Code</p><p><b>50:28</b>The Role of Human Intuition in AI-Assisted Coding</p><p><b>53:28</b>The Impact of AI on Developer Productivity</p><p><b>56:53</b>The Future of AI in Creative Fields</p>]]></description><guid isPermaLink="false">a47380de-7da1-4e02-b726-c2044269f8e6</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Mon, 08 Sep 2025 01:52:12 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/4237e29e46366f39e112273d2403773f7fa4a2cb509ac1cdfdd9f750308a8970/eyJlcGlzb2RlSWQiOiJhNDczODBkZS03ZGExLTRlMDItYjcyNi1jMjA0NDI2OWY4ZTYiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjhiZTMzZjcwOGExMzE1NjU5NDcyZWQ1L3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS05LThfXzMtNDAtNy5tcDMifQ==.mp3" length="49113081" type="audio/mpeg"/><itunes:summary>&lt;p&gt;In this episode, Ravid and Allen discuss the evolving landscape of AI coding. They explore the rise of AI-assisted development tools, the challenges faced in software engineering, and the potential future of AI in creative fields. The conversation highlights both the benefits and limitations of AI in coding, emphasizing the need for careful consideration of its impact on the industry and society.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Chapters&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;00:00&lt;/b&gt;Introduction to AI Coding and Recent Developments&lt;/p&gt;&lt;p&gt;&lt;b&gt;03:10&lt;/b&gt;OpenAI&apos;s Paper on Hallucinations in LLMs&lt;/p&gt;&lt;p&gt;&lt;b&gt;06:03&lt;/b&gt;Critique of OpenAI&apos;s Research Approach&lt;/p&gt;&lt;p&gt;&lt;b&gt;08:50&lt;/b&gt;Copyright Issues in AI Training Data&lt;/p&gt;&lt;p&gt;&lt;b&gt;12:00&lt;/b&gt;The Value of Data in AI Training&lt;/p&gt;&lt;p&gt;&lt;b&gt;14:50&lt;/b&gt;Watermarking AI Generated Content&lt;/p&gt;&lt;p&gt;&lt;b&gt;17:54&lt;/b&gt;The Future of AI Investment and Market Dynamics&lt;/p&gt;&lt;p&gt;&lt;b&gt;20:49&lt;/b&gt;AI Coding and Its Impact on Software Development&lt;/p&gt;&lt;p&gt;&lt;b&gt;31:36&lt;/b&gt;The Evolution of AI in Software Development&lt;/p&gt;&lt;p&gt;&lt;b&gt;33:54&lt;/b&gt;Vibe Coding: The Future or a Fad?&lt;/p&gt;&lt;p&gt;&lt;b&gt;38:24&lt;/b&gt;Navigating AI Tools: Personal Experiences and Challenges&lt;/p&gt;&lt;p&gt;&lt;b&gt;41:53&lt;/b&gt;The Limitations of AI in Complex Coding Tasks&lt;/p&gt;&lt;p&gt;&lt;b&gt;46:52&lt;/b&gt;Security Vulnerabilities in AI-Generated Code&lt;/p&gt;&lt;p&gt;&lt;b&gt;50:28&lt;/b&gt;The Role of Human Intuition in AI-Assisted Coding&lt;/p&gt;&lt;p&gt;&lt;b&gt;53:28&lt;/b&gt;The Impact of AI on Developer Productivity&lt;/p&gt;&lt;p&gt;&lt;b&gt;56:53&lt;/b&gt;The Future of AI in Creative Fields&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:03:01</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/a47380de-7da1-4e02-b726-c2044269f8e6/images/a8af3dd8-f1d3-4702-b066-b4581766a722.png"/><itunes:season>1</itunes:season><itunes:episode>4</itunes:episode><itunes:title>EP4: AI Coding</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP3: GPU Cloud]]></title><description><![CDATA[<p>Allen and Ravid discuss the dynamics associated with the extreme need for GPUs that AI researchers utilize. They also discuss the latest advancements in AI, including Google's Nano Banana and DeepSeek V3.1, exploring the implications of synthetic data, perplexity, and the influence of AI on human communication. They also delve into the challenges faced by AI researchers in the job market, the importance of GPU infrastructure, and a recent papers examining knowledge and reasoning in LLMs.</p>]]></description><guid isPermaLink="false">321f7e00-b325-44f2-9fd7-06e70e0a2690</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Tue, 02 Sep 2025 22:48:00 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/321f027b505258e12465e005f6a232d72cba2e342e870a6cd9666c334b107a9a/eyJlcGlzb2RlSWQiOiIzMjFmN2UwMC1iMzI1LTQ0ZjItOWZkNy0wNmU3MGUwYTI2OTAiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjhiNzZmYzczNjBhYmI0ZjJhMzc2ZjE0L3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS05LTNfXzAtMjktMjcubXAzIn0=.mp3" length="51502900" type="audio/mpeg"/><itunes:summary>&lt;p&gt;Allen and Ravid discuss the dynamics associated with the extreme need for GPUs that AI researchers utilize. They also discuss the latest advancements in AI, including Google&apos;s Nano Banana and DeepSeek V3.1, exploring the implications of synthetic data, perplexity, and the influence of AI on human communication. They also delve into the challenges faced by AI researchers in the job market, the importance of GPU infrastructure, and a recent papers examining knowledge and reasoning in LLMs.&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:06:43</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/321f7e00-b325-44f2-9fd7-06e70e0a2690/images/f5a1875b-5ab6-4dbc-9127-5928b297a259.png"/><itunes:season>1</itunes:season><itunes:episode>3</itunes:episode><itunes:title>EP3: GPU Cloud</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP2: PeFT]]></title><description><![CDATA[<p>Allen and Ravid sit down and talk about Parameter Efficient Fine Tuning (PeFT) along with the latest updated in AI/ML news. </p>]]></description><guid isPermaLink="false">d02de2eb-64dd-4f5a-a8b7-51d09d13090d</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Wed, 27 Aug 2025 01:17:54 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/b3ac2a5b0d433089d22b3510bf4b924f5eee9f09363ec26a16d0338d5a72838e/eyJlcGlzb2RlSWQiOiJkMDJkZTJlYi02NGRkLTRmNWEtYThiNy01MWQwOWQxMzA5MGQiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjhhZTU2OTY5M2MwZjczMDMwMmRlNTBhL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS04LTI3X18yLTUxLTM0Lm1wMyJ9.mp3" length="52813942" type="audio/mpeg"/><itunes:summary>&lt;p&gt;Allen and Ravid sit down and talk about Parameter Efficient Fine Tuning (PeFT) along with the latest updated in AI/ML news. &lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:12:37</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/logos/71320ada-c62f-4607-9641-81e40c066e46.png"/><itunes:season>1</itunes:season><itunes:episode>2</itunes:episode><itunes:title>EP2: PeFT</itunes:title><itunes:episodeType>full</itunes:episodeType></item><item><title><![CDATA[EP1: Sampling]]></title><description><![CDATA[<p>Allen and Ravid discuss a topic near and dear to their hearts, LLM Sampling!</p><p></p><p></p><p>In this episode of the Information Bottleneck Podcast, Ravid Shwartz-Ziv and Alan Rausch discuss the latest developments in AI, focusing on the controversial release of GPT-5 and its implications for users. They explore the future of large language models and the importance of sampling techniques in AI. </p><p></p><p>Chapters</p><p>00:00 Introduction to the Information Bottleneck Podcast</p><p>01:42 The GPT-5 Debacle: Expectations vs. Reality</p><p>05:48 Shifting Paradigms in AI Research</p><p>09:46 The Future of Large Language Models</p><p>12:56 OpenAI's New Model: A Mixed Bag</p><p>17:55 Corporate Dynamics in AI: Mergers and Acquisitions</p><p>21:39 The GPU Monopoly: Challenges and Opportunities</p><p>25:31 Deep Dive into Samplers in AI</p><p>35:38 Innovations in Sampling Techniques</p><p>42:31 Dynamic Sampling Methods and Their Implications</p><p>51:50 Learning Samplers: A New Frontier</p><p>59:51 Recent Papers and Their Impact on AI Research</p><p></p>]]></description><guid isPermaLink="false">787ad8b3-dba3-4f51-b838-d29f978efefc</guid><dc:creator><![CDATA[Ravid Shwartz-Ziv & Allen Roush]]></dc:creator><pubDate>Thu, 21 Aug 2025 04:05:16 GMT</pubDate><enclosure url="https://api.riverside.com/hosting-analytics/media/b95acb7c7371d63f5c684f2844aa247aaf91bca66e22c1785ad8afb50739478e/eyJlcGlzb2RlSWQiOiI3ODdhZDhiMy1kYmEzLTRmNTEtYjgzOC1kMjlmOTc4ZWZlZmMiLCJwb2RjYXN0SWQiOiJlZWJmYWM4Zi0yMzY5LTRhODktOTI0Yi1iZTI3OWI1N2YxOTAiLCJhY2NvdW50SWQiOiI2OGE1ZjAzMzFkNjQ0MTNlODhlN2FkMmYiLCJwYXRoIjoibWVkaWEvY2xpcHMvNjhhNjk4YmEwMzI5YjY0Y2QxY2M4YjIzL3JhdmlkLXNod2FydHoteml2cy1zdHVkaW8tY29tcG9zZXItMjAyNS04LTIxX181LTU1LTM4Lm1wMyJ9.mp3" length="50328396" type="audio/mpeg"/><itunes:summary>&lt;p&gt;Allen and Ravid discuss a topic near and dear to their hearts, LLM Sampling!&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;In this episode of the Information Bottleneck Podcast, Ravid Shwartz-Ziv and Alan Rausch discuss the latest developments in AI, focusing on the controversial release of GPT-5 and its implications for users. They explore the future of large language models and the importance of sampling techniques in AI. &lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;Chapters&lt;/p&gt;&lt;p&gt;00:00 Introduction to the Information Bottleneck Podcast&lt;/p&gt;&lt;p&gt;01:42 The GPT-5 Debacle: Expectations vs. Reality&lt;/p&gt;&lt;p&gt;05:48 Shifting Paradigms in AI Research&lt;/p&gt;&lt;p&gt;09:46 The Future of Large Language Models&lt;/p&gt;&lt;p&gt;12:56 OpenAI&apos;s New Model: A Mixed Bag&lt;/p&gt;&lt;p&gt;17:55 Corporate Dynamics in AI: Mergers and Acquisitions&lt;/p&gt;&lt;p&gt;21:39 The GPU Monopoly: Challenges and Opportunities&lt;/p&gt;&lt;p&gt;25:31 Deep Dive into Samplers in AI&lt;/p&gt;&lt;p&gt;35:38 Innovations in Sampling Techniques&lt;/p&gt;&lt;p&gt;42:31 Dynamic Sampling Methods and Their Implications&lt;/p&gt;&lt;p&gt;51:50 Learning Samplers: A New Frontier&lt;/p&gt;&lt;p&gt;59:51 Recent Papers and Their Impact on AI Research&lt;/p&gt;&lt;p&gt;&lt;/p&gt;</itunes:summary><itunes:explicit>no</itunes:explicit><itunes:duration>01:10:26</itunes:duration><itunes:image href="https://hosting-media.riverside.com/media/podcasts/eebfac8f-2369-4a89-924b-be279b57f190/episodes/787ad8b3-dba3-4f51-b838-d29f978efefc/images/8a7c00b2-4706-411f-b214-c213b1daa674.png"/><itunes:season>1</itunes:season><itunes:episode>1</itunes:episode><itunes:title>EP1: Sampling</itunes:title><itunes:episodeType>full</itunes:episodeType></item></channel></rss>