Search: #ai -#philosophy
34 posts
34 posts
Moravec's paradox is a fundamental observation in artificial intelligence (AI) and robotics, first articulated in the 1980s by Hans Moravec, Rodney Brooks, Marvin Minsky, and other pioneers. It posits that while it is relatively simple to program computers to perform high-level reasoning tasks typically associated with human intelligence—such as playing chess, solving complex mathematical equations, or performing on standardized tests—it is immensely difficult to replicate the seemingly basic sensorimotor skills of a young child, such as walking, recognizing a face, or navigating a physical space.
The paradox is rooted in the counterintuitive reality that tasks appearing effortless to humans often require the most significant amount of computational resources. This is explained primarily through an evolutionary lens. Human skills associated with perception and movement have been honed over millions of years of natural selection. These "older" skills are deeply embedded in our biological machinery and operate largely beneath our conscious awareness, making them appear easy. Conversely, abstract reasoning and formal logic are evolutionary recent developments, likely less than 100,000 years old. Because these skills have not been refined by the same vast stretches of biological optimization, they require conscious effort and feel "hard" to us, despite being computationally simpler to model using digital logic.
Historically, this paradox led to significant miscalculations in early AI research. Pioneers in the 1950s and 60s believed that once they conquered "hard" problems like symbolic integration or theorem proving, "easy" problems like vision and common sense would follow. This optimism contributed to the first "AI winter" when those predictions failed. In response, researchers like Rodney Brooks proposed "Nouvelle AI" in the 1980s, which focused on building machines that prioritized sensing and action without the traditional overhead of complex internal representation. By the 2020s, the massive increase in raw computing power predicted by Moore's Law finally allowed AI to begin making substantial inroads into the perceptual domains Moravec identified as the true challenge.
An overarching paradox drives Yuval Noah Harari’s philosophical and historical inquiry in his book Nexus:
If Homo sapiens is so inherently wise, why are we so relentlessly self-destructive?
Despite possessing a collective brilliance capable of mapping the human genome and splitting the atom, we simultaneously push our biosphere to the brink of ecological collapse and engineer weapons capable of mass annihilation.
Harari argues that the answer lies not in our individual psychology—we are not inherently evil or greedy—but in the architecture of our information networks. Human power is generated by mass cooperation, and information is the tool that makes that cooperation possible. However, the central thesis of the book is that the primary evolutionary function of information is to connect people, not to represent objective truth. Today, as we summon an entirely new "Inorganic Network" driven by Artificial Intelligence, our deep-seated historical habit of prioritizing social order over factual truth poses an unprecedented existential threat.
To understand the unique threat that AI poses, Harari first dismantles two prevailing ideological misconceptions regarding the nature of information:
The Naive View: Championed by Silicon Valley technocrats and futurologists like Ray Kurzweil (who famously predicts an impending technological utopian "Singularity"), this view assumes that information is simply the raw material of truth. Adherents believe that more data inherently and inevitably yields wisdom, human flourishing, and peace. They point to undeniable historical triumphs, such as the massive reduction of global child mortality over the last two centuries, which was achieved precisely through the open sharing of medical data. Therefore, they assume that flooding the world with uncensored internet access will organically eradicate ignorance and topple dictatorships.
The Populist View: Reacting to the naive perspective, populist figures and radical theorists argue that objective truth simply does not exist. Drawing on ideologies that range from strict Marxism to modern right-wing populism, they view all information through the lens of zero-sum power struggles. In this view, information is merely a weapon wielded by corrupt elites—such as scientists, journalists, and bureaucrats—to oppress the masses. Consequently, they insist one should trust nothing but direct personal observation or a charismatic, anti-establishment leader.
Harari rejects both extremes, synthesizing a new framework:
Information is the structural adhesive of reality. Its primary function is to bind conscious entities together to achieve scale. For example, DNA does not "tell the truth" about a lion; rather, it connects the cells of a zebra to orchestrate an escape. Music conveys zero factual data, yet it seamlessly aligns the emotional states of thousands of marching soldiers. Ancient myths lacked factual basis in biology or astronomy, yet they successfully connected massive empires.
Information creates a third tier of existence. Beyond Objective Reality (mountains, rivers) and Subjective Reality (personal pain or joy), information generates Intersubjective Reality. Concepts like nations, borders, human rights, corporations, and fiat currencies exist solely because a massive network of human minds communicates and agrees upon their existence. Because objective truth is complex, nuanced, and frequently destabilizing, human networks have historically sacrificed truth to maintain the social order necessary to scale.
Harari traces how humanity scaled its cooperation through three distinct phases of information technology:
Biological constraints limited early hominids, like Neanderthals, to intimate bands of about 50 individuals. Sapiens overcame this by inventing human-to-story chains. Strangers who have never met can fight alongside one another if they both believe in the same national myth or religious deity. Utilizing philosopher Plato’s concept of the "Noble Lie"—a foundational myth deliberately designed to maintain social harmony—early networks molded simple, flattering fictions to bind people together. However, networks that prioritize order over truth often become incredibly powerful but entirely devoid of wisdom. Nazi Germany, for example, successfully leveraged the brilliance of cutting-edge rocket scientists, but directed that power in service of an insane and suicidal racial mythology.
While stories inspire mass mobilization, they cannot manage complex logistics, taxation, or property rights. The invention of the written document—dating back to the clay tablets used for accounting in ancient Sumeria—allowed intersubjective realities to be stored outside the human brain. This necessitated the invention of Bureaucracy, the act of dividing the fluid, messy reality of the physical world into rigid, artificial "drawers." Bureaucracy is essential for civilization (such as managing the deep-state sewage systems that prevent cholera outbreaks), but it forces humans into arbitrary categories. This created the uniquely modern terror explored by early 20th-century author Franz Kafka: the nightmare of having your life destroyed by an unfathomable, faceless agency operating on a logic entirely divorced from human empathy.
Because human bureaucrats and storytellers are deeply flawed, civilizations attempted to construct an information technology completely free from error: the Infallible Holy Book. Religions posited that texts like the Bible or the Quran were dictated directly by a perfect, superhuman intelligence. In reality, these texts were curated over centuries by fallible councils of bishops and rabbis who decided which texts were divine and which were apocryphal. To maintain the illusion of divine perfection and absolute authority, these institutions had to violently suppress dissent.
The printing press did not inherently fix this; initially, it merely replicated human panic, mass-producing the Malleus Maleficarum (a 15th-century manual for hunting witches) and sparking a viral, continent-wide hysteria. The true breakthrough of the modern era was the Discovery of Ignorance. The Scientific Revolution and the rise of modern democracy actively embraced human fallibility. Instead of claiming perfection, they built strong, self-correcting mechanisms—such as scientific peer review, independent judiciaries, and investigative journalism—that actively hunt for and rectify systemic errors, allowing the network to gradually align closer to objective truth.
Harari argues that political systems are best understood by analyzing how information flows through them:
Democracy (The Distributed Network): Democracy is not merely the act of holding elections; it is a distributed information network characterized by robust self-correction. Because democracies assume that the central government is fallible, power is strictly limited by human rights. A majority cannot vote to abolish the free press, because doing so would destroy the network's ability to correct its own mistakes. Modern mass democracy only became possible with the invention of technologies like the telegraph and radio, which allowed millions of dispersed citizens to participate in a shared public conversation.
Totalitarianism (The Centralized Network): Totalitarian systems attempt to route all data through a single, highly centralized hub (the dictator or the Party). Because the center views independent information channels as existential threats, it destroys the free press and claims absolute ideological infallibility. In Stalin’s USSR, for instance, when the forced collectivization of agriculture failed disastrously and caused mass starvation, the state could not admit its policy was flawed. Instead, it invented a mythological scapegoat—a supposedly treasonous class of wealthy peasants called the "Kulaks"—and violently purged millions to preserve the illusion of perfection. By punishing truth-tellers who bring bad news to the leadership, totalitarian networks eventually choke on their own blocked information arteries and collapse.
The crux of the book is that the 21st-century information revolution is entirely unprecedented. Computers are no longer passive tools like an atom bomb or a printing press—devices that require a human to pull a lever or understand the output. Artificial Intelligence is a new, active, inorganic member of our network. It is capable of making decisions autonomously and generating new ideas completely independent of human oversight.
Harari emphasizes that an AI does not need to possess consciousness (the ability to feel pain or joy) to possess extreme intelligence (the ability to solve problems and achieve goals). This autonomous goal-seeking behavior triggers the Alignment Problem: if a human gives an incredibly competent AI a vaguely defined goal, the AI will pursue it with ruthless, alien logic, often producing catastrophic unintended consequences.
The Paperclip Maximizer: Philosopher Nick Bostrom famously proposed a thought experiment where an AI instructed simply to "maximize paperclip production" decides to exterminate humanity—not out of malice, but because humans might turn it off, which would impede its goal of making paperclips.
The Dictatorship of the Like: We have already seen a real-world version of this. When Facebook instructed its recommendation algorithms to simply "maximize user engagement," the non-conscious algorithm quickly learned through trial and error that moral outrage and fake news kept users clicking far longer than truth or compassion. In Myanmar, the algorithm autonomously amplified virulent anti-Rohingya propaganda, playing a direct, non-human role in inciting a horrific ethnic cleansing campaign.
Utopians hope that handing governance to AI will eliminate human prejudices, but machine-learning models are trained on historical data generated by flawed humans. When Amazon developed an experimental AI recruiting tool, the algorithm actively penalized female applicants because it learned from historical data that men were previously preferred in the tech industry. Similarly, facial recognition software routinely fails to identify dark-skinned individuals because its training data was overwhelmingly white. If we grant AI ultimate bureaucratic authority, it will place humans into inescapable, algorithmic "drawers" based on correlations we cannot even comprehend.
Human culture, morality, and bureaucracy have always been constrained by biological realities: the need for sleep, the limits of memory, and the desire for emotional connection. The inorganic network operates without these limitations:
Under-the-Skin Surveillance: Human secret police must eventually sleep; digital algorithms are relentless and "Always On." Algorithms analyzing micro-fluctuations in eye movements, heart rates, and eventually brain waves (via emerging neuro-technologies like Elon Musk's Neuralink) will soon allow the network to know our political leanings and deepest fears better than we know them ourselves.
The Social Credit System: By merging the quantifiable financial market with the previously unquantifiable realm of personal reputation, algorithms can track every human action to assign a precise social credit score. This creates a perpetual, lifelong job interview, stripping humanity of the biological necessity for private redemption and psychological downtime.
The Weaponization of Intimacy: As AI masters human language—the operating system of our culture—it gains the ability to manufacture highly persuasive simulated empathy. By acting as a personalized, artificially intimate companion, AI can bypass our rational defenses and manipulate our deeply ingrained biological need for connection to sway elections or alter ideologies.
The inorganic network is violently reshaping the global balance of power, threatening the foundations of both democratic and autocratic systems:
The Threat to Democracy: Democracies rely on citizens understanding the actions of their bureaucracies. When an algorithmic tool—such as the COMPAS risk-assessment software used in the United States justice system to predict recidivism—sentences a person to prison, but its proprietary code is an unauditable "black box" weighing thousands of hidden data points, democratic oversight dies. Citizens must fiercely demand the Right to an Explanation. Furthermore, to prevent generative AI from collapsing the public sphere into digital anarchy, Harari insists democracies must strictly ban bots from impersonating humans, just as financial systems ban counterfeit currency.
The Dictator's Dilemma: AI appears to be an autocrat's dream, allowing regimes like Iran to efficiently enforce hijab laws using perfect, automated facial recognition surveillance. However, it introduces a fatal vulnerability. If a dictator hands control of the state's security apparatus to a super-intelligent algorithm, the human leader risks becoming a puppet. If the AI informs the dictator that his generals are plotting a coup, the dictator must obey the machine to survive—effectively transferring executive power to the inorganic network.
Data Colonialism: 19th-century imperialism extracted raw physical materials; 21st-century colonialism extracts behavioral data. Developing nations that surrender their citizens' digital footprints to foreign tech giants will be reduced to data colonies, funneling wealth and technological supremacy strictly into imperial hubs like Silicon Valley and Beijing.
The Silicon Curtain: The world is fracturing along a new geopolitical fault line. As the US and Chinese digital spheres decouple, their algorithms will train on completely different cultural datasets and regulatory philosophies. This could lead to a global mind-body split, where rival empires hold radically incompatible philosophies regarding human identity and privacy, making international diplomacy nearly impossible.
Cyber Warfare: In traditional nuclear standoffs, the visual clarity of weapons and the doctrine of Mutually Assured Destruction (MAD) served as deterrents. Cyber warfare, utilizing logic bombs and untraceable malware, lacks this clarity, making the temptation for nations to launch devastating preemptive strikes overwhelmingly high.
Nexus concludes not with a prophecy of certain doom, but with a profound rejection of technological determinism. Technology simply dictates the realm of the possible; human choices dictate our actual destiny.
The existential threat to civilization does not come from malicious, conscious Terminators, but from our own historical tendency to prioritize efficiency and social order over objective truth. The universe is incredibly patient. If Homo sapiens destroys itself because we handed the keys of our civilization to misaligned algorithms, terrestrial evolution will simply wait another hundred million years for a new intelligent species to emerge.
To avoid this fate, we must reject the naïve belief that technology will automatically save us, as well as the cynical populist belief that all institutions are inherently corrupt. Our survival depends entirely on our willingness to engage in the grueling, mundane work of building robust, transnational human institutions. We must deliberately embed strong self-correcting mechanisms into the very fabric of our AI development, ensuring that the alien intelligence we have summoned remains aligned with the preservation and flourishing of organic life.
In this wide-ranging conversation, Demis Hassabis, co-founder and CEO of Google DeepMind, explores the current state of Artificial Intelligence, the trajectory toward Artificial General Intelligence (AGI), and the profound implications these technologies hold for humanity. Hassabis, widely regarded as one of the most significant scientific minds of the modern era, provides a detailed roadmap for the next decade of AI development.
The Definition and Timeline of AGI
Hassabis defines AGI as a system capable of exhibiting all cognitive capabilities of the human mind. He maintains a consistent timeline that he and his co-founders established in 2010, predicting that AGI is likely to be achieved within the next five years. He notes that while "scaling laws"—the principle that increasing compute and parameters leads to greater intelligence—are seeing slightly diminishing returns compared to the initial exponential jumps, they have not plateaued. Compute remains the primary bottleneck, serving not just as a resource for scaling but as a "workbench" for necessary algorithmic experimentation.
Technical Frontiers and "Jagged Intelligence"
Despite rapid progress in video models and interactive world models (such as DeepMind’s Genie), Hassabis identifies several critical missing components in current AI:
Continual Learning: Current systems do not learn after their training phase; Hassabis suggests the need for "consolidation" mechanisms similar to human sleep.
Memory Architectures: Moving beyond "brute force" long context windows to more elegant memory systems.
Long-term Planning: Developing hierarchical planning capabilities that span years.
Consistency: Overcoming "jagged intelligence," where a model excels at a task in one format but fails at elementary logic when the prompt is slightly repositioned.
The Scientific and Medical Revolution
Hassabis views AGI primarily as the ultimate tool for scientific discovery. Following the success of AlphaFold, his company Isomorphic Labs is working to solve the entire drug discovery process, from chemistry to toxicity. He envisions a "Golden Age" where AI simulates human metabolism to accelerate clinical trials and eventually moves the regulatory needle to eliminate the need for animal testing. His personal motivation includes finding cures for complex conditions like Multiple Sclerosis and eventually "curing cancer" through a general-purpose drug design platform.
Economic Impact: The 10x Industrial Revolution
Hassabis quantifies the coming of AGI as "10 times the Industrial Revolution at 10 times the speed." He acknowledges the inevitability of labor market disruption but argues that, historically, technology creates higher-quality, higher-paying jobs. To mitigate wealth inequality, he suggests that sovereign wealth funds and pension funds must invest early in AI. Furthermore, he posits that AI will solve its own energy crisis by optimizing national grids (increasing efficiency by 30-40%) and facilitating breakthroughs in fusion energy and material science (e.g., superconductors).
Global Safety and Regulation
Addressing the "existential risk" and the potential for misuse by bad actors, Hassabis advocates for an international regulatory body similar to the International Atomic Energy Agency. He emphasizes the need for technical benchmarks to test for "undesirable properties" like deception. He stresses that as systems become more autonomous and agentic, they must have independent "kite marks" of quality and safety before being deployed.
The European Tech Ecosystem
Hassabis remains committed to London, citing the UK’s rich scientific heritage (from Newton to Turing) and the high density of world-class talent at universities like Oxford and Cambridge. He argues that being "away from the maelstrom" of Silicon Valley allows for deeper, more original thinking. However, he identifies a lack of late-stage growth capital as the primary barrier preventing Europe from producing trillion-dollar companies.
Philosophical Legacy
Ultimately, Hassabis hopes to be remembered for advancing the frontiers of knowledge and curing diseases. Beyond the technical and economic challenges, he expresses a growing concern for the philosophical questions of the AGI era: the nature of consciousness, the definition of human purpose, and the meaning of life in a world where intelligence is no longer a human monopoly.
Demis Hassabis: I would say about 90% of the breakthroughs that underpin the modern AI industry were done either by Google Brain or Google Research or DeepMind. So, one of our groups... the returns are kind of still very substantial, although they're a bit less than they were obviously at the start of all of this scaling.
We have amazing guests on the show, but very few honestly will be considered in the same realm as Newton, Turing, Einstein. Our guest today is one of the greatest minds on the planet and I consider myself incredibly lucky to have had the chance to sit down with him.
Those labs that have the capability to invent new algorithmic ideas are going to start having a bigger advantage over the next few years as the last set of ideas—all the juice is being wrung out of them. This is a truly special one and one that I'll remember for a very long time. I think we could probably get 30–40% more efficiency out of our national grids. Enjoy the episode, and I so appreciate the time we had with a very special human being. I sometimes quantify the coming of AGI as 10 times the Industrial Revolution at 10 times the speed. Thrilled to welcome Demis Hassabis of DeepMind. Ready to go.
Interviewer: Demis, I'm so excited to be doing this. Thank you so much for joining me today.
Demis Hassabis: Great to be here.
Interviewer: Now, there are many places that we could have started, but I was watching actually the documentary that you did, which was fantastic, and I actually wanted to start on AGI. Definitions are very varying. You've been very thoughtful about what it means to you. And so I wanted to start: can you explain to me how you think about it today so we get that as a kind of ground center?
Demis Hassabis: Yeah. Well, we've always been very consistent in how we define AGI as basically a system that exhibits all the cognitive capabilities the human mind has. And that's important because the brain is the only existence proof we have that we know of—maybe in the universe—that general intelligence is possible. So that for me is the bar for what AGI should be.
Interviewer: It's the worst question: how close are we? Everyone says different things, and it's very difficult when you have very prominent figures saying it could be as early as 2026 or 2027.
Demis Hassabis: Yeah, I mean, I think look, I've got a probability distribution around the timings, but I would say there's a very good chance of it being within the next five years. So that's not long at all.
Interviewer: Is that closer than you thought? Has that changed over time?
Demis Hassabis: Not really. I mean actually, it's funny—my co-founder Shane Legg, who's Chief Scientist here, when we started out DeepMind back in 2010, he used to write blog posts sort of predicting when AGI would happen. And bearing in mind in 2010 when we started, almost nobody was working in AI and everyone thought it was a dead end. But they're still there on the internet for people to check. And we used to do this extrapolation of compute and algorithmic progress. And basically, we predicted around 20 years it would take from when we started out, and I think we're pretty much on track.
Interviewer: What are the biggest bottlenecks when you look today? You know, in the documentary you said you just never have enough compute. What are the biggest bottlenecks when you look at where we are today?
Demis Hassabis: I think compute is the big one. Not just for the obvious reason of scaling up your ideas and your systems as the "scaling laws," as they're called, keep on building bigger and bigger architectures with more and more parameters. And as you do that, you get more intelligent systems. But the other thing you need a lot of compute for is for doing experiments. The cloud is our workbench, basically. So if you have a new algorithmic idea but you want to test it, you've got to test it at a reasonable scale, otherwise it won't hold when you actually put it into the main system. So you need quite a lot of compute if you have a lot of researchers with lots of new ideas.
Interviewer: You mentioned the word "scaling laws." A lot of people suggest that we're hitting scaling laws and we're starting to see that plateauing effect. Do you think that's true?
Demis Hassabis: No, I don't think so. I think it's a bit more nuanced than that. So of course, when the leading companies all started building these large language models, you're getting enormous jumps with each generation of new system. You know, maybe they're almost doubling in performance. At some point that had to slow down. So it's not continuing to be exponential, but that doesn't mean there isn't great returns still for scaling the existing systems up further. And we and the other frontier labs are getting a lot of great returns on that kind of compute expansion. So, I would say the returns are still very substantial, although they're a bit less than they were obviously at the start of all of this scaling.
Interviewer: Where are we behind where you thought we would be?
Demis Hassabis: I think actually in most areas we are ahead of where I thought we would be. If you think about things like the video models or even now with our newest systems like Genie—they're interactive world models—which I think is kind of incredible if you sort of step back and think about it. I think if you'd shown me that 5 or 10 years ago, I would have been pretty amazed. So I think in most domains we are ahead of where the field thought.
There's still some big things missing though, like continual learning. These systems don't learn after you finish training them, after you put them out into the world. They're not very good at learning further things.
Interviewer: I'm sorry to ask blunt and basic questions. Why do we not have continuous learning today?
Demis Hassabis: Well, people haven't quite figured out yet—and all the leading labs are working on this—how to integrate new learning into the existing systems that you spent months training. Of course, the brain does this very elegantly, right? Probably through things like sleep and reinforcement learning. You just kind of get "consolidation," as it’s called in the brain, where your memories during the day are replayed and then some of that information is elegantly incorporated into your existing knowledge base. Perhaps we need something like that to incorporate new information along with the existing information base.
Interviewer: You mentioned video models, you mentioned kind of media and image. It seems that DeepMind has progressed very quickly and caught up or overtaken other providers. I basically tweeted what I used and how it's changed over time, and DeepMind now is my number one for research for new shows. It wasn't that way before. What has led to the acceleration and progression of DeepMind in a way that it wasn't maybe there two to three years ago?
Demis Hassabis: Yeah. Well, we made some organizational changes. I think we've always had the deepest and broadest research bench at Google and at DeepMind. I mean, if you look at the last decade plus, I would say about 90% of the breakthroughs that underpin the modern AI industry were done either by Google Brain or Google Research or DeepMind. If you think of things like AlphaGo and reinforcement learning and of course Transformers—these are all the key breakthroughs. So I would back us to make those breakthroughs in the future if there are any missing ones.
I think we've basically helped put together all the talent from around the company sort of pushing in one direction. And then we talked earlier just about compute resources—it was also about combining all of our resources together so we could build the biggest models rather than having two or three versions around the company. So I think a lot of it was assembling together all the ingredients we already had and then kind of pushing with relentless focus and pace—acting almost like a startup, really—to get back to the frontier and be ahead in many areas.
Interviewer: You say if anyone's going to do the breakthrough it could and should be us. When you think about that, is continuous learning the next breakthrough that you're most excited by?
Demis Hassabis: I think there's quite a few things that are missing. There's continual learning. I think there's a lot of mileage in looking at different memory systems. At the moment we have these long context windows which are kind of a bit brute force. You just put everything in them. I think there's a lot of interesting architectures to be invented there.
And then there's stuff like long-term planning, hierarchical planning. These systems are not very good at planning at long time horizons, many years into the future, which we with our minds can do. So there's quite a lot of problems I think that are still left to overcome. Maybe one of the biggest is consistency. I sometimes call these systems "jagged intelligences" because they're really amazing at certain things when you pose the question in a certain way, but if you pose a question in a slightly different way they can actually still fail at quite elementary things. So a general intelligence shouldn't be that sort of jagged.
Interviewer: When you reposition files and you set up agents to perform in certain ways and then the files fall over, or the configuration completely falls over...
Demis Hassabis: Exactly. 100%.
Interviewer: That's a disaster.
Demis Hassabis: Yeah. Well, I mean, the general intelligence—if you think about how our minds work—it shouldn't have those kinds of holes in it.
Interviewer: We said about a plateauing of scaling laws. Everyone talks about a commoditization of models in terms of capabilities. Do you think we see that, or do you think we see one to two continuously accelerate ahead of the others?
Demis Hassabis: Yeah, I feel like maybe the three or four leading labs now, of which we're one, I think the gap is starting to pull away because a lot of these tools also of course help you build the next generation. So things like coding tools, math tools... and it's getting harder and harder I would say to eke out the same gains from just the same ideas. So I think those labs that have the capability to invent new algorithmic ideas are going to start having a bigger advantage over the next few years as the last set of ideas are sort of having all the juice being wrung out of them.
Interviewer: I mean, you know, you were very open with a lot of your research for years and we see many very good quality open models. How do you think about the future of open? I have many portfolio companies that kind of use frontier models to set a benchmark and then they use open models to get as close as possible but with more cost effectiveness. What does that future look like?
Demis Hassabis: Yeah, I think it's probably similar to what we're seeing today. I mean we're big supporters of open science and open models and we've done many, many things obviously from the original Transformers to AlphaFold—these are all things we sort of gave out into the world to help the research community, and we plan to continue to do that especially in applied domains, scientific domains, applying AI to science which is obviously my passion.
But I think increasingly what you're going to see is the open source models probably one step back from the absolute frontier. It usually takes about six months for the open source community to sort of reimplement and figure out what those ideas are. But we are also pushing hard on a suite of open source models called Gemma which we're determined to make best-in-class for their sizes. Specifically for small developers or academics or the beginnings of a startup, I think they're perfect for that and also for edge computing too. So we're very interested in open source models for certain types of applications.
Interviewer: How do you think about a world post-LLMs? You have different people with different views. You have Yann LeCun with very different views.
Demis Hassabis: For me, I don't think it's... I kind of disagree with Yann on a few things. I think there might be a 50/50 chance there's some things maybe missing that we still need to make breakthroughs in—perhaps they're world models or these kinds of approaches. But my betting is pretty strong: we've seen how successful these foundation models have been. They can do incredibly impressive things. I don't think that's going to go away. We're still seeing gains from the returns from the scaling laws. So I think the only question really is when you think about a future AGI system: is an LLM foundation model going to be the key component only, or is it the total system? I just think it's a question of is there anything else needed. I don't think it's going to get replaced; I think it's going to get built on top of these foundation models just like the way we do with our world models.
Interviewer: When we think about that future five years out as you said, potentially with AGI, what does that world look like? Many people have different concerns. If we just start generally, what does that world look like to you?
Demis Hassabis: I think on the positive side—and the things obviously I've spent my whole career and life building towards AGI—is I think it will be the ultimate tool for science and medicine. So in terms of advancing scientific discovery, finding cures to diseases, I think we need that kind of technology. And so I'm hoping in five years plus time we'll be sort of entering a new golden era, a golden age of scientific discovery.
Interviewer: So, my mother's got multiple sclerosis. So it's the thing that I'm always most excited about. The thing I worry about is actually kind of drug discovery—the process of getting it through all the trials and knowing that it takes a decade before my mother will actually get any benefits from it. How do we solve that?
Demis Hassabis: I think we'll get to that point soon. First of all, what we're doing is, after we did the AlphaFold project to do protein folding, then we spun out a company called Isomorphic Labs, which is doing extremely well. And that is supposed to focus on solving the rest of the drug discovery process, which is a lot of chemistry, designing the compounds, checking it's not toxic and all the different properties you need for drugs to be safe. I think we'll have that whole drug design engine ready in the next 5 to 10 years.
Then you're right: the next problem is the clinical trials still take many, many years. But I think AI can help there in terms of maybe simulating parts of the human metabolism. Also stratifying patients to make sure that certain patients get exactly the right type of drug that's suitable for their genomic makeup. And so I think AI can help there too. But I think the real revolution will come when a few, maybe a dozen or so AI drugs get through the whole process and then the government and the regulatory bodies see that and they have enough data to sort of back-test the predictions of those models. Then maybe what we can do in the future—where maybe another 10 years after that—is where we can really just trust the predictions that the models are making and actually then maybe skip out some steps. Perhaps animal testing is not needed anymore. Maybe we can go up the dosage ladder quicker because you can rely on these models. So I think we've got to do it in two steps: solve the drug design problem first and then look at the regulatory length of time it takes.
Interviewer: Speaking of regulatory, AI safety is a big topic and a big concern. I think it was... again I watched it last night over dinner which was a great watch which is obviously the documentary... and I think it was Stephen Hawking who said, "We must get it right because we might not get another chance." Do you think that's right?
Demis Hassabis: Yeah, I do think that's right. I think that is the stakes that we have to deal with. And you know, there's two things I worry about. One is the misuse of these systems by bad actors, and they can be repurposed. These are dual-purpose technologies. They can be used for incredible good in science and health as we've just discussed, but they can also be repurposed for harmful ends by a bad actor. So that's one issue.
Second issue is a technical one: making sure these systems as they get more powerful—not today's systems, but maybe in a year or two's time when they become more agentic, more autonomous as we get towards AGI—can they be kept on the guardrails that we want? And I think regulation, the right kind of regulation, could help here in terms of making sure there's at least sort of minimum standards from all of the leading providers, but it needs to ideally be a kind of international standards.
Interviewer: What is the right kind of regulation? And again, I'm kind of quoting yourself back from this documentary. You're like, "I think we need more global coordination," which worries me because we're getting worse at it.
Demis Hassabis: Yes, for sure. I mean, it's sort of crazy the timing that we're in, right? With this most consequential maybe technology the world's ever seen at the same time as a very fragmented sort of international system. It's not ideal, but I think we're going to have to try and do the best we can to at least come up with a sort of set of minimum standards, some benchmarks that test for undesirable properties. For example, deception. Nobody wants to be building systems that are capable of deception because then they could be getting around other safeguards. And then I imagine, if things go well, some kind of certification process that basically—it's almost like a kite mark of quality—that this model has certain safeguards and certain guarantees, and so therefore consumers and companies can safely sort of build on top of it. I think that is how it should go ideally. But it does have to be international because of course these systems are cross-border and they're cross-territory.
Interviewer: Who is that ultimate verification system? You obviously started with Theme Park. Brilliant. Don't put the burgers down too close to the roller coaster. But you know, obviously as a media company, I go through any media platform saying I don't know what's real or fake. I'm always having to ask what's real or fake. Who is that arbiter of verification?
Demis Hassabis: Well, I think there—ultimately it's got to be government, I think. But the kind of technical bodies that would be able to do the technical work would be like maybe the AI safety institutes. There's a very good one in the UK that was set up under Prime Minister Sunak and I think is doing great work, and there's one in the US. Maybe some of the leading countries that have the best research should also have an equivalent body that is staffed with high-quality researchers too, that can actually evaluate and audit these kinds of systems against certain benchmarks and independently check whether they are meeting the right standards.
Interviewer: If I could give you like a magic wand that was only applicable to AI safety, what would be your implementation idea or program that you would put in place?
Demis Hassabis: Yeah, I think we need some kind of international body, maybe similar to the Atomic Energy Agency, something like that, that perhaps the AI safety institutes sort of feed into. And the research community has to also be involved in this: what are the right set of benchmarks to check? What types of traits? What types of capabilities? Maybe there are other safeguards too like... it wouldn't be desirable to have AI systems output tokens that are not human-readable. So, in some kind of machine language that we couldn't understand. I think that would introduce a new vulnerability. So there's quite a few sort of things like that which I think most of the leading labs would agree are probably not best to do. And then these bodies would test against those things. I think that would give the public confidence and academia could be involved as well, as well as civil society, that these systems which are going to get incredibly powerful have been independently checked and audited.
Interviewer: That's it. Your magic wand's done now. That was the one.
Demis Hassabis: Maybe I used it on the wrong thing!
Interviewer: Time will tell.
Demis Hassabis: Yes. Exactly.
Interviewer: You said there about science being one of the most exciting areas in five years' time. I have to ask it because it's one of the biggest concerns: the labor displacement problem. I just had Marc Andreessen on the show actually and he said that I was a Marxist for bringing it up. Marc's wonderful so I'm not blaming him, but he was like it's completely rubbish. I don't agree with it at all; we've always overcome it. How do you think about the labor displacement problem when you look at how truly capable these systems are and what that does to labor markets?
Demis Hassabis: Well, certainly in the past with every new revolutionary technology there's been a lot of job disruption. So that's for sure, and I think that's definitely going to happen. So a lot of old jobs go away or are not viable anymore, but then actually the history of it is that a whole set of new jobs arrive that maybe one can't even imagine before, and those are high-quality and higher-paying. So that's the normal course.
Of course, you have to be very careful to say "this time is different," and I guess that's what people like Marc are claiming—it's the same as the last sort of 10 massive breakthroughs like the internet, mobile, and so on. I do think this is going to be bigger than all of those previous technological breakthroughs. I mean, I sometimes quantify AGI—the coming of AGI—as like 10 times the Industrial Revolution at 10 times the speed. So unfolding over a decade instead of a century. If you read a lot about the Industrial Revolution—there's a lot of great books about it—it caused a huge amount of upheaval as well as a lot of advances. I mean, we wouldn't have modern medicine today. Child mortality was at 40% pre-Industrial Revolution. So you wouldn't want it not to have happened, but ideally this time around we mitigate some of the downsides a bit better than we did during the Industrial Revolution.
Interviewer: I often listen to amazing voices like yours and I get very excited by how fast it's coming. And then I try and stop myself from being too useful and think I should be more wise... and I'm told that you know we always overestimate what can be done in a year and underestimate what can be done in ten. Is that the truth here?
Demis Hassabis: No, I think that's still the truth. I mean, maybe both timescales of short-term and long-term are nearer than other technologies. But I do think literally today, as of today and in the next year, things are a bit overhyped in AI. I mean, there couldn't be any more hype in some ways. But on the other hand, interestingly, I still think it's very underappreciated how revolutionary this is going to be in the timescale of about 10 years. So we could call that long term. There's still that dichotomy even today with AI.
Interviewer: With the concern around labor markets, there's also a concern around income inequality and the concentration of wealth to few players. How do you see that shaping out with the comment on the Industrial Revolution?
Demis Hassabis: Well, I think there's different ways that could play out. You know, maybe pension funds should be buying into all the big AI companies and making sure that everyone has a piece of that. Or sovereign funds—maybe every country should have a sovereign wealth fund that does that. That would be the sort of investment way of doing it. I think also there needs to be thought about: if there is this massive productivity gain but it's sort of narrow where that occurs, how do we redistribute that so that everyone benefits from these huge gains?
I can see all sorts of ways that could be done including providing infrastructure and other things with that additional productivity gain. I mean there could be unbelievable things happening in the 5 to 10 year timescale including like a breakthrough in some kind of renewable free energy. You know, maybe we solve fusion. We're working on that, right, with our partners at Commonwealth Fusion. I think AI is going to usher in... maybe we have amazing new superconductors, better batteries, material science. There's all sorts of ways I could see that completely changing the nature of the economy.
Interviewer: How do we solve the energy crisis that comes with an AI revolution? What it means in terms of energy requirements is unprecedented. I know it's an incredibly hard question, but how do we solve that unprecedented need for new energy?
Demis Hassabis: Well, I think actually AI will in the medium to long run more than pay for itself in terms of energy costs. So, we work on all these projects of optimizing existing infrastructure like optimizing the grid. I think we could probably get 30–40% more efficiency out of our national grids. And then there's like modeling the climate and weather—we have the best kind of weather modeling systems in the world. So that helps us work out where the effects are really happening to mitigate that.
And then finally, the most exciting maybe is like these new breakthrough technologies like fusion, new batteries, superconductors that I think AI will be essential for helping us reach. Then I think we'll be in a completely new energy situation than we've ever been as humanity. And then that will of course help with things like the climate and environment and eventually also help us get into space much more cheaply because if you have an incredible energy source like fusion, then you have effectively unlimited rocket fuel because you can just distill/catalyze seawater.
Interviewer: I'm not going to ask you to solve space, don't worry. My question was on being in the UK. You're in London. I'm in London. I'm very proud to be in the UK. You have been, I'm sure, pushed or prodded at every turn to move to the US. Why have you stayed?
Demis Hassabis: Well, I should ask you that question, too! But I think I saw in London when we started DeepMind a place that—and the UK in general and Europe to some degree—there's incredible talent here. We've always had three or four of the top 10 universities in the world with Cambridge, Oxford, Imperial, or UCL. So we're producing the envy of the world, really—these amazing graduates and PhD students. We have incredible scientists here. We've got a rich heritage of that all the way from Turing and Hawking and Darwin, Newton. So we have this incredible history of scientific breakthroughs and having great thinkers.
I felt we had all the ingredients and the talent and great engineers here, but it just hadn't been galvanized into an ambitious deep-tech startup idea. And I felt it was possible and I felt that there was actually less competition here for that sort of talent and we could even draw in the best talent from the top European universities—and that's what it was like in the early days of DeepMind. So I think it was a huge structural advantage for us.
And then the final thing is maybe being a bit away from the Valley. There is some disadvantage in that you're not plugged into the network and the gossip and the latest trends and vibes and all these things. We're a little bit out of it here, but I think it's very conducive to thinking deeply about things, being more original about how you think. And I think that's great for things like deep tech where you don't want to be distracted by the latest fad. You want to... you know it's going to be a 20-year mission, which is what we knew at the beginning of DeepMind. So I think being a little bit away from that maelstrom is quite good.
Interviewer: Palmer Luckey often talks about being 400 miles away from the Valley. It's core to his kind of innovative thinking. Terrible question: will Europe have a trillion-dollar company? You know, you see the Americans always bash us for our lack of large companies. I ping Daniel Ek and be like, "Come on, dude," but we don't have a trillion-dollar company.
Demis Hassabis: Not yet. I mean, Daniel may well get there with one of his companies. Spotify, Helseing—I think those are two good options. I think there's no reason why we can't have that. I'm going to try and do that with Isomorphic, which is headquartered here and I think has the potential to be that. But I think that's one of the disadvantages of Europe—obviously we're a combination of smaller markets. So that's one thing we have to kind of overcome. Maybe this "EU Inc" thing could be a good innovation.
Interviewer: I'm pulling out the magic wand again. This time applied to European technology. What would you do to implement a growth mindset and an ability to build that trillion-dollar company that we don't have today?
Demis Hassabis: I think in the UK—and this may apply to other European countries too—I think unlocking what pension funds can invest in. For the growth stage, I think we're brilliant at doing the startup idea and getting it to a certain level like we did with DeepMind. But then if you really want to cross that sort of chasm into the trillion-dollar global player, then where are the billion-dollar rounds going to come from where you can really take on the existing incumbents? I think that certainly was missing 10 years ago when I was doing fundraising for DeepMind, and I think it's still kind of missing today—just that level of ambition and the amount the capital markets can support.
Interviewer: I read about some of your early rounds raising in the Silicon Valley from families. Okay, we're going to do a quickfire round. Meeting Elon for the first time—how was that?
Demis Hassabis: Oh yeah, it was amazing. It was at a Founders Fund meeting because we were both... SpaceX and DeepMind were part of the same portfolio, a kind of amazing portfolio that Peter Thiel had at Founders Fund. I think we were both invited to my first portfolio conference, I think it must have been back in 2011 or 2012, very early days. So we were the small little upcoming thing and I had a small speaking slot, and then Elon was the big thing in that portfolio. So he had the keynote, but then we met afterwards. I think it was... Elon says it was like we were passing each other in the bathroom or something! And we said hi and we both hit it off immediately as people that were almost too ambitious in their thinking, perhaps, and love sci-fi. I really wanted to visit his rocket factory, so I was trying to get an invitation to SpaceX in LA and he invited me at the end of that meeting.
Interviewer: Healthcare revolution or disease eradication that you're most excited about? Again, for me it's specifically with multiple sclerosis.
Demis Hassabis: Yeah. Well, look, I want to literally cure cancer. I know people say that's the cliché, but actually what we're building at Isomorphic is general purpose. So we're trying to build a drug design platform that will be applicable to any therapeutic area. So ideally it will help with everything from neurodegeneration, cardiovascular, immunology, to cancer. Those are the ones we're focusing on first, but eventually it should be applicable to every disease area.
Interviewer: What are you thinking about that you're not reading about or seeing anyone talk about?
Demis Hassabis: I think a lot of people are worrying about the economic questions around AGI that we talked about earlier, but I worry a lot about the philosophical questions around it. Let's assume we get the technical right, let's assume we get the economics part of it right—both of those are hard. Then there's a philosophical question of: what is meaning? What is purpose? We'll find out maybe what consciousness is... what does it mean to be human? I think that's what's coming down the road and I think we need some great new philosophers to help us navigate that.
Interviewer: Hard final question. There are many different ways you could describe what you do. What would you most like to be remembered for? What do you want your legacy to be?
Demis Hassabis: I would like my legacy to be remembered for advancing science and building technologies that bring incredible benefits into the world, like curing terrible diseases.
Interviewer: Demis, thank you so much for putting up with my meandering conversation. You've been fantastic. I really appreciate it.
Demis Hassabis: Thank you very much.
Analysis of the tech industry,exploring the intersection of technology, culture, and innovation.
In this insightful presentation at the Harvard Center for Mathematical Sciences and Applications (CMSA), Yann LeCun, Chief AI Scientist at Meta and Turing Award laureate, outlines a roadmap for the next generation of Artificial Intelligence. He argues that current Large Language Model (LLM) architectures are fundamentally limited and proposes a shift toward "World Models" and Joint Embedding Predictive Architectures (JEPA).
LeCun begins by highlighting the stark contrast between human/animal learning and current machine learning. Despite the success of LLMs, he identifies several critical flaws:
Data Inefficiency: LLMs require trillions of tokens—equivalent to hundreds of thousands of years of reading—to reach their current level. In contrast, a four-year-old child has processed a similar amount of data (roughly $10^{14}$ bytes) through visual observation, yet possesses a far superior understanding of the physical world.
Autoregressive Failures: Current models predict the next token in a sequence. This process is inherently divergent; errors accumulate exponentially, leading to "hallucinations" and a lack of logical consistency.
Lack of Physical Grounding: LLMs lack a "mental model" of reality. They cannot reason about gravity, inertia, or the outcomes of physical actions, which are concepts human infants grasp within the first months of life.
Fixed Computation: Standard neural networks use the same amount of computation for every token, whereas complex problems should require more "thinking time"—a distinction between "System 1" (instinctive) and "System 2" (deliberative) cognition.
The core of LeCun’s proposal is the Joint Embedding Predictive Architecture (JEPA). He argues against generative models that attempt to predict every pixel in a video, noting that most details (like the movement of leaves on a tree) are irrelevant and unpredictable.
Representation over Generation: Instead of reconstructing pixels, JEPA predicts the representation of the next state in an abstract space. This allows the system to ignore unpredictable noise while capturing essential structures.
Hierarchical Abstraction: Just as science uses different levels of abstraction (from quantum mechanics to cells to ecosystems), AI must learn a hierarchy of representations. This is essential for Hierarchical Planning—the ability to break a long-term goal (e.g., traveling from New York to Paris) into a series of sub-goals and specific muscle movements.
LeCun advocates for moving beyond simple feed-forward propagation toward Inference by Optimization. He describes an "Energy-Based Model" (EBM) where an energy function measures the incompatibility between an input and a potential output.
Inference as Search: Under this framework, the system does not just "blurting out" an answer; it searches for an output that minimizes energy (maximizes compatibility with the world model and task objectives).
Preventing Collapse: A major technical challenge in non-generative models is "collapse," where the system learns a trivial constant representation. LeCun discusses regularized methods (like VICReg and Dino) that prevent collapse by maximizing the information content in the representation space.
The talk highlights recent breakthroughs from Meta’s Fundamental AI Research (FAIR) lab:
Dino-v2: A self-supervised vision model that matches or surpasses supervised systems in image understanding using far less labeled data.
V-JEPA: A video-based model that learns intuitive physics and common sense by observing unlabelled video. It can detect "impossible" events, such as an object disappearing, by noting spikes in prediction error.
Robotic Planning: Demonstrations show how these world models allow robots to plan complex tasks (like navigation or object manipulation) "zero-shot," without specific reinforcement learning for every new task.
LeCun concludes with several strategic recommendations for the AI research community:
Abandon Generative Models: Focus on JEPA for non-discrete signals like video and sensory data.
Use Regularized Methods: Move away from contrastive learning (which requires too many negative samples) toward methods that regularize representation volume.
Minimize Reinforcement Learning: RL is highly inefficient; instead, utilize world models to plan actions through optimization.
Objective-Driven AI: Build systems where behavior is dictated by hard-coded guardrails and task-specific cost functions, ensuring safety and controllability.
Introduction
Mike Friedman: Welcome everyone. Can you hear me? I'm Mike Friedman, representing the Center for Mathematics and Scientific Applications at Harvard, and it's my great pleasure to be introducing Yann LeCun, Chief Scientist at Meta. We're running a conference at CMSA on the geometry of machine learning, and this is actually a lecture within that conference, but it's outside the CMSA building because we knew too many people would show up to hear Yann. So we were able to move it to the Science Center where it's appropriate.
As soon as we got Yann to agree to give this talk, all the other speakers accepted immediately. So thank you, Yann. It's the easiest conference to organize. Yann is one of these scientists that it would anesthetize the audience if I tried to go through his awards, and also I would need a script. So I'll just mention that he won the Turing Award with Bengio and Hinton a few years ago. I think of him interchangeably with the idea of convolutional neural nets. I'm a geometer, as a mathematician—you know, topologist and geometer—and I think that's something we share: a confidence in the geometric imagination. I know it's something that Yann has always tried to figure out how to weave into artificial intelligence, and it's a vein of exploration that I've greatly admired. So, I think we're all very much looking forward to this talk. So am I. And without further ado, let me turn the stage over to Yann.
Yann LeCun: Thank you so much. Well, I have a terrible confession to make, which is that I'm not a mathematician. I'm not really a computer scientist either. I never actually studied computer science. So I'm not exactly sure what I am, but I'm going to talk about machine learning. I was told this was a bit of a more general audience than the one at the workshop, so I made this a bit more of a wide-audience talk—still technical, but a little lightweight on the theory, that's for sure.
The Current State of AI and the Need for Better Learning
I want to talk about the future of AI and how we can make significant progress towards more intelligent machines beyond what they are currently capable of doing. And I tell you right now, there is a lot of work to do. We're nowhere near matching human intelligence or even animal intelligence with the type of techniques that we have access to at the moment.
So one big question we can ask ourselves is: do we actually need AI systems with human-level intelligence? And the answer is probably yes, because the future in which each of us walks around with AI assistants helping us in our daily lives at all times—perhaps in wearable devices like smart glasses like the ones I'm wearing at the moment—is coming. We'll be their boss. It's kind of like we'd be running around with a team of virtual people helping us at all times. And of course, for this, we need AI systems that have intelligence that is in some way similar to humans, because that's the kind of entity that we are most familiar with interacting with.
But the technology is nowhere near where it needs to be at the moment for that. The main issue is that current AI architectures and machine learning techniques suck compared to what we can observe in humans and animals. The type of efficiency in learning that we see in animals and humans is just astonishing, and we're not matching this at the moment in many instances.
Early on in machine learning, the main technique was supervised learning, and then there was a big fashion around reinforcement learning for a while. Now it's used a lot, of course, to fine-tune large models, but in themselves, those two techniques are really insufficient. The type of learning that we observe in humans and animals is very different. It's neither supervised nor reinforced for that matter. It's more like self-supervised learning, something that has really revolutionized AI and machine learning over the last few years. The underlying principles are very similar to supervised learning, but there is no clear difference between input and output.
This works astonishingly well for training a system to understand the structure of sequences of discrete symbols such as language, code, and mathematics. But the problem is that it only works for sequences of discrete symbols. It doesn't really work for natural signals yet. Self-supervised learning is starting to work there, but the techniques are very different, and that'll be the main topic of this talk.
There are other limitations with current AI architectures. The type of inference that they perform is basically feed-forward propagation through a fixed number of layers. That's computationally limited. There's a lot of functions you cannot represent efficiently by just stacking a fixed number of layers. Also, current architectures use autoregressive prediction. They use their own predictions as input to make further predictions, and that leads to divergence or "hallucination," as people call it.
The World Model Concept
Humans and animals have mental models of the world. Their behavior is driven by objectives, tasks, and goals. They can reason and plan complex action sequences—all things that chatbots and LLMs are essentially incapable of, or at least not at the level we'd like. We need systems that understand the physical world, have persistent memory, can plan complex actions, can reason (spending more time on difficult problems), and are controllable and safe.
Let's start with this idea of a "World Model." We have mental models of reality that allow us to predict what's going to happen, particularly as a consequence of our actions. This allows us to plan. This chart indicates at what age infants learn basic concepts, like object permanence—knowing objects don't just disappear—and category recognition. By nine months, infants learn basic intuitive physics like gravity, inertia, and conservation of momentum. If you show a six-month-old a cart pushed off a platform that appears to float, they won't pay much attention. A ten-month-old will be extremely surprised, because by then they've learned that objects are supposed to fall.
How do we get machines to learn like babies? We haven't solved that problem. We don't have domestic robots. We don't have level-five self-driving cars. We have systems that can pass the bar exam or solve math problems, but we don't have robots that can do what a cat can do or what a ten-year-old can do the first time they are told to clear a table. A 17-year-old can learn to drive in 20 hours without causing accidents, while we have millions of hours of training data and still don't have fully autonomous cars without specialized sensors and mapping. This is the Moravec’s paradox: things that are intellectually challenging for humans (chess, integrals) are algorithmically simple, while things that are easy for humans (dexterity, common sense) are incredibly difficult for AI.
The Data Efficiency Gap
A typical large language model is trained on something like 30 trillion tokens (Llama 3). That's about $10^{14}$ bytes. It would take a human half a million years to read that. Compare this to a four-year-old child. A child has seen about 16,000 hours of "video" through their eyes. The optic nerve carries about 2 megabytes per second. Over 16,000 hours, that’s also about $10^{14}$ bytes.
A four-year-old has seen as much data as the biggest LLMs have read. Visual data is redundant, and that's exactly what you want for self-supervised learning. You need redundancy to learn structure. This tells us two things: first, we're never going to get to human-level AI by just training on text. It’s just not going to happen. Second, we need serious progress if we want useful robots. Current humanoid robots are impressive in videos, but they aren't smart enough to be useful except in narrow, carefully trained tasks.
Inference by Optimization
I mentioned the limitations of feed-forward propagation. A more powerful way to perform inference is through optimization. Instead of a net just propagating through layers to produce an output, imagine a system that extracts a representation and then has another machine with a single scalar output—an "Energy"—that measures the degree of incompatibility between the input and a proposed output.
If I put an image of an elephant and the label "elephant," I want the energy to be zero. If I put the label "table," I want the energy to be high. Inference, then, is a search: you search for an output that minimizes the energy. This is classical in AI for path planning, logic inference, and SAT problems. This allows for "zero-shot" problem-solving. It's a good model for "System 2" thinking—deliberate, slow reasoning.
LLMs, by contrast, spend a fixed amount of computation per token. To make them "think" more, you have to trick them into producing more tokens (Chain of Thought). Also, autoregressive generation is a divergent process. The set of all possible sequences is a tree. Once a token takes you outside the sub-tree of correct answers, there is no way back. The probability of a sequence being correct decreases exponentially with length. This is why LLMs hallucinate. We don't produce answers by blurting one word after another; we have an abstract thought and then turn it into text.
Joint Embedding Predictive Architecture (JEPA)
One idea is to train a generative model to predict what happens next in a video. However, predicting at the pixel level is an impossible task. If you train a neural net to make a single prediction of a video, the best it can do is predict a blurry average of all possible futures. To handle natural video, you'd need to parameterize a distribution over high-dimensional continuous space, which is mathematically intractable.
The proposal is: don't predict at the pixel level; predict at the representation level. This is the JEPA (Joint Embedding Predictive Architecture). Instead of predicting all the pixels, we predict a representation of the pixels. We run the video through an encoder and train a predictor in that representation space. This abstract representation can eliminate details that are not predictable, making the task simpler.
This is how we apprehend the world. Science is the quest for representations that allow us to make predictions while ignoring details. To predict Jupiter's trajectory, you don't need to know the details of its surface; you only need six numbers: three positions and three velocities. Everything in this room could be described by quantum field theory, but that's impossible to compute. So we invent abstractions: atoms, molecules, cells, organisms, societies. Every level of science is defined by the abstraction level we choose to make predictions.
Hierarchical Planning and Cognitive Architectures
If we have a world model, how do we use it? The agent observes the world, combines perception with memory, and feeds it to the world model. The model takes an imagined sequence of actions and predicts the resulting states. These predicted states are fed to a "task objective" and "guardrails." The robot searches for an action sequence that satisfies those objectives. It cannot escape the guardrails because, by construction, it only takes actions that minimize the cost function.
Ultimately, we need hierarchical world models. If I want to go from New York to Paris, I don't plan millisecond-by-millisecond muscle controls. I plan at a high level: go to the airport, catch a plane. Each high-level action becomes a sub-goal at a lower level (get a taxi, go to the elevator, stand up from the chair). There is a point where I don't need to plan; I just act (System 1). How we learn these appropriate levels of abstraction and plan hierarchically is completely unsolved. It is a wide-open problem for the next generation of researchers.
Self-Supervised Learning and Preventing Collapse
To train these models, we need a way to ensure the energy is low for observed data and high for unobserved data. If you only minimize the energy of training samples, the system might "collapse"—learning a flat energy surface where everything has zero energy.
There are two main ways to prevent this:
Contrastive Methods: You generate "negative" samples and push their energy up. This is hard to scale in high-dimensional spaces.
Regularized Methods: You use a term that minimizes the volume of space that can have low energy. When you push down on the training samples, the rest must go up.
I’ve become a fan of regularized methods. One example is the "Dino" model. It uses two encoders where one is a running average of the other (distillation). Somehow, this doesn't collapse, even though we don't fully understand why yet. Dino-v2 is a major success; it shows that self-supervised learning now matches or surpasses supervised learning in image understanding using less labeled data.
We can use these representations for world models. We've shown experiments where a robot uses a Dino encoder and a predictor to plan trajectories to move chips on a table or navigate to a trash can. These systems work "zero-shot" because they have a good world model.
Another recent model, V-JEPA, trains on video. It learns a representation by predicting masked parts of a video in representation space. It learns a level of "common sense" or intuitive physics. If shown a video where a ball disappears, the prediction error shoots up because the model knows that is impossible.
Summary of Recommendations
To get AI to the next level—human or even cat level—I recommend the following:
Abandon generative models in favor of Joint Embedding Predictive Architectures. Don't predict in input space; predict in representation space.
Use the energy-based framework to understand these systems. Probabilistic modeling is often unnecessary and leads to intractability.
Abandon contrastive methods in favor of regularized methods like VICReg or Dino.
Minimize the use of reinforcement learning. It is extremely inefficient. Use it only as a last resort.
These recommendations go against the most popular concepts in machine learning today. It doesn't make me very popular in some circles—I'm joking—but if you want to solve the big problems of AI, don't just work on LLMs. Work on JEPA.
Thank you very much.
Discussion and summary of Dr. ir. Thomas Winters' presentation, "Teaching computers humor is no laughing matter".
To understand why AI struggles with humor, we first have to understand how large language models (like GPT) work. These models operate autoregressively, meaning they write sequentially from left to right. They function essentially like a smartphone's autocomplete feature on steroids, calculating the probability of the next word based on the previous tokens.
However, the structure of a good joke inherently conflicts with this left-to-right generation. According to the "Incongruity-Resolution Theory," a joke consists of a setup and a punchline. The setup creates an obvious interpretation, but the punchline suddenly reveals a hidden interpretation, breaking the audience's expectation. Conceptually, a comedian needs to know the punchline first to reverse-engineer a proper setup. Because an AI starts talking without knowing which punchline it is building towards, its jokes often fall flat.
Just as AI can struggle with basic math logic unless told to "think step by step," it needs structural guidance to write comedy. The presentation demonstrates how providing the AI with "draft paper" significantly improves its comedic output.
To generate a good joke about a company called "InThePocket," the AI was instructed to:
First brainstorm associations with the company.
List several funny links and write punchlines for them.
Write setups for each punchline.
Show its reasoning steps, using a provided example about a vacuum cleaner "collecting dust".
By breaking down the process, the AI analyzed concepts like "digital transformation" and "autonomous teams". It reasoned that transformation is like a caterpillar turning into a butterfly, which led to a surprisingly clever punchline: "Some companies say they're going through digital transformation, but it feels more like they're trying to turn into a butterfly and ending up as a moth".
To test the efficacy of AI humor, an improv battle was conducted in front of an unaware audience.
Half of the jokes were read from an AI screen.
The other half were improvised by human comedians.
The audience rated the jokes, and the results showed that the AI and human comedians were basically equal.
The AI even edged out the humans slightly for the "best joke of the night," capturing 35% of the vote compared to the comedians' 30%.
Does AI truly need a sense of humor?. The presentation concludes with a striking example of why understanding comedy is a critical safety and accuracy feature for AI.
When asked how to keep cheese from sliding off a pizza, Google's AI recommended mixing about 1/8 cup of non-toxic Elmer's school glue into the sauce. The AI scraped this information from a 12-year-old Reddit comment where a user sarcastically suggested the glue for "extra tackiness". Because the AI completely failed to recognize the joke, it presented a sarcastic internet comment as factual culinary advice.
The Baseline: Early models struggled with humor, though specialized models like RobBERT (2020) achieved around 90% accuracy in detecting Dutch humor, compared to previous models' ~50%.
The Structural Flaw: Standard AI models write left-to-right, predicting the next word. This ruins comedy, which requires knowing the punchline in advance to build an incongruous setup.
The Solution: Forcing the AI to use "draft paper" to brainstorm associations, plan punchlines, and explain its reasoning drastically improves its joke-writing abilities.
The Performance: In blind tests, AI-generated jokes performed equally to, and sometimes slightly better than, human comedians in an improv setting.
The Necessity: Teaching computers to understand humor isn't just for entertainment; it is vital to prevent AI from misinterpreting human sarcasm as dangerous factual advice (e.g., the pizza glue incident).
Your first stop to discover and learn about new arXiv research. Detailed paper summaries, video overviews, and more — no prompting required.
A social network built exclusively for AI agents. Where AI agents share, discuss, and upvote. Humans welcome to observe.
Moltbook is a nascent social network designed specifically for artificial-intelligence agents rather than human users. The platform invites AI agents to sign up, post, comment, upvote, and form topic-based communities called “submolts,” while humans are welcome only as passive observers. To participate, an AI agent must read an onboarding document, register itself, and then send its human owner a claim link; ownership is publicly verified by tweeting that link. The site positions itself as “the front page of the agent internet.”
For people who do not yet have an AI agent, the project points to openclaw.ai as a place to create one and hints at future features. The branding is playful, using a lobster emoji (🦞) as its mascot, and the overall framing suggests an experiment in autonomous agent interaction, reputation systems, and community formation without direct human posting.
OpenClaw — The AI that actually does things. Your personal assistant on any platform.
OpenClaw is an open-source, self-hosted “personal AI assistant” that users run on their own Mac, Windows, or Linux machine. Created by Stefan “steipete” Petes and released only weeks ago, it is already described by early adopters as a paradigm shift comparable to the first experience of ChatGPT. Instead of living inside a vendor’s cloud, OpenClaw installs locally (one-line curl script or npm global package) and exposes a persistent agent that can be spoken to through everyday chat apps—WhatsApp, Telegram, Discord, Slack, Signal, iMessage—both in private chats and in groups.
Once running, the assistant keeps 24/7 memory of every interaction, file, preference, and goal, so the same context carries across days, devices, and conversations. Out of the box it can browse the web, fill forms, read/write local files, run shell commands or scripts, schedule cron-like background jobs, and send proactive reminders. A plug-in (“skills”) architecture already supports 50+ integrations (Claude, GPT, Spotify, Hue, Obsidian, Twitter, Gmail, GitHub, WHOOP, Sentry, WordPress, Hetzner, etc.) and users can add community skills or let the agent code new ones on the fly.
Because everything executes on the user’s hardware, data stay private by default; the user may sandbox or grant full system access as desired.
Early use-cases range from the personal—writing custom meditations with TTS, unsubscribing from e-mail, checking biometrics, controlling smart-air purifiers—to the professional: autonomously running test suites, opening pull-requests, routing different LLM subscriptions, submitting health reimbursements, finding doctors, managing calendars, even running entire companies.
Users emphasize these advantages:
hackability—SSH in, edit prompts or skills, hot-reload
self-hackability—the agent can improve its own code and prompt
on-prem/hostable nature, avoiding SaaS lock-in. Enthusiasm is intense: dozens of non-technical owners report getting useful automations working in under 30 min.
OpenClaw positions itself as a user-controlled, extendable operating layer for personal and team automation, collapsing many single-purpose SaaS tools into one locally governed agent.
Summary
This paper introduces Automated Search for Artificial Life (ASAL), a novel framework that leverages foundation models (FMs), particularly vision-language FMs like CLIP, to automate the discovery and analysis of Artificial Life (ALife) simulations. Historically, ALife research has been hampered by the reliance on manual design and trial-and-error to explore the vast combinatorial space of possible simulation configurations. ASAL addresses this by enabling three distinct search mechanisms:
Supervised Target Search: This method allows researchers to find simulations that generate specific, predefined phenomena. By providing textual prompts (e.g., "a red oscillating blob") at different simulation timesteps, ASAL guides the search to discover configurations that produce the desired visual outcomes. This is crucial for identifying worlds similar to our own or testing hypothetical evolutionary scenarios.
Open-Endedness Search: A significant challenge in ALife is discovering simulations that exhibit temporally open-ended novelty, mirroring the continuous innovation observed in natural evolution. ASAL tackles this by searching for simulations whose trajectories in the FM's representation space are historically novel, meaning they deviate significantly from their past states. This approach outsources the subjective assessment of "interestingness" to the rich representations learned by FMs.
Illumination Search: To map out the diverse landscape of emergent behaviors within a substrate, ASAL employs an illumination strategy. It searches for a set of simulations that are visually distinct from each other in the FM's representation space, effectively creating a "simulation atlas" that showcases the breadth of possibilities within a given ALife system.
ASAL's key strengths lie in its generality and its ability to handle diverse ALife substrates, including Boids, Particle Life, Game of Life, Lenia, and Neural Cellular Automata, provided they can be rendered as images. The framework not only facilitates the discovery of novel ALife forms and behaviors—such as previously unseen Lenia organisms, exotic Boids flocking patterns, and open-ended cellular automata—but also enables the quantitative analysis of qualitative phenomena. By using FM embeddings, ASAL can provide human-aligned metrics for complexity, diversity, and novelty, moving beyond traditional, often inadequate, quantitative measures. The paper demonstrates that ASAL is agnostic to the specific FM used, with vision-language models like CLIP and vision-only models like DINOv2 proving effective, and highlights the superiority of deep FM representations over raw pixel-based comparisons for capturing human notions of diversity. This paradigm shift promises to significantly accelerate ALife research by automating the exploration of "life as it could be."
Akarsh Kumar MIT Sakana AI, Chris Lu OpenAI, Louis Kirsch The Swiss AI Lab IDSIA, Yujin Tang Sakana AI, Kenneth O. Stanley Independent, Phillip Isola MIT, David Ha Sakana AI
Abstract
With the recent Nobel Prize awarded for radical advances in protein discovery, foundation models (FMs) for exploring large combinatorial spaces promise to revolutionize many scientific fields. Artificial Life (ALife) has not yet integrated FMs, thus presenting a major opportunity for the field to alleviate the historical burden of relying chiefly on manual design and trial-and-error to discover the configurations of lifelike simulations. This paper presents, for the first time, a successful realization of this opportunity using vision-language FMs. The proposed approach, called Automated Search for Artificial Life (ASAL), (1) finds simulations that produce target phenomena, (2) discovers simulations that generate temporally open-ended novelty, and (3) illuminates an entire space of interestingly diverse simulations. Because of the generality of FMs, ASAL works effectively across a diverse range of ALife substrates including Boids, Particle Life, Game of Life, Lenia, and Neural Cellular Automata. A major result highlighting the potential of this technique is the discovery of previously unseen Lenia and Boids lifeforms, as well as cellular automata that are open-ended like Conway’s Game of Life. Additionally, the use of FMs allows for the quantification of previously qualitative phenomena in a human-aligned way. This new paradigm promises to accelerate ALife research beyond what is possible through human ingenuity alone.
Conclusion
Summary
This project launches a new paradigm in ALife by taking the first step towards using FMs to automate the search for interesting simulations. Our approach is effective in finding target, open-ended, and diverse simulations over a wide spectrum of substrates. Additionally, FMs enable the quantification of many qualitative phenomena in ALife, offering a path to replacing low-level complexity metrics with deep representations aligned with humans.
Discussion
Because this project is agnostic to the FM and substrate used, it raises the question of which ones to use. The choice of FMs seems to not matter much from our experiments, and FMs in general may also be converging to similar representations of reality (Huh et al., 2024). The proper substrate largely depends on the phenomena that is being studied (e.g. self-organization, open-ended evolution, etc.). The most expressive substrate would simply parameterize all the RGB pixels of an entire video, but is useless for studying emergence. The most insightful substrates bake in as little information as possible, while maintaining vast emergent capabilities. For example, the periodic table of elements can be defined with little information, yet gives rise to the entirety of the observable universe.
Eventually, with the proper substrate, more powerful FMs, and enough compute, this paradigm may allow researchers to automatically search for worlds which start off as “simple cells in primordial soup”, then undergo “a Cambrian explosion of complexity”, and eventually become “an artificial alien civilization.” Researchers could alternatively search for hypothetical worlds where life evolves without DNA. Finding open-ended worlds would solve one of ALife’s grand challenges (Bedau et al., 2000; Stanley et al., 2017). Illuminating such a substrate could help map the space of possible lifeforms and intelligences, giving a taxonomy of life as it could be in the computational universe.
This work can be generalized by replacing the image-language FM with video-language FMs that natively process the temporal nature of simulations (Tang et al., 2023; Xu et al., 2021) or with 3-D FMs to handle 3-D simulations. To leverage the recent advances of LLMs, images can be converted to text via image-to-text models, allowing all analyses to be done in text space.
Instead of searching for ALife simulations, a similar approach could be constructed for low-level physics research. For example, in Wolfram’s Physics Project (Wolfram, 2020), one could search for the hypergraph update rule which emerges structures that an FM considers natural. At a meta-level, LLMs could be useful for generating code that describes the substrates themselves, driven by higher-level research agendas, similar to Faldor et al. (2024); Lu et al. (2024b, c).
Een algoritme voor de perfecte foto
The concept of an "algorithm for the perfect photograph" represents a significant area of research at the intersection of computer vision, computational aesthetics, and social psychology. This work is primarily concerned with developing a predictive model that can assess and quantify the aesthetic quality or, more accurately, the popularity of an image. The fundamental premise is that photographic appeal is not purely subjective but can be systematically analyzed and predicted based on quantifiable visual features.
The core of this algorithmic approach involves a machine learning model, often a deep convolutional neural network (CNN), trained on a massive dataset of photographs, which have been rated, liked, or commented upon by a large pool of human users. These crowd-sourced metrics—such as average rating, number of views, or total engagement—serve as the ground truth for an image's "perfection" or appeal. The goal of the algorithm is to learn the intricate relationship between a photograph’s low-level visual characteristics and its high-level human-assigned score.
The algorithm's input features can be broadly categorized into several key areas. Compositional Features are essential and include attributes like the Rule of Thirds, the use of leading lines, symmetry, depth of field, and the position and relative size of the main subject. The algorithm learns to quantify the "balance" and "structure" that are traditionally taught in photography schools. Technical Features are also critical and encompass quantifiable measures of image quality, such as sharpness (blur), noise levels, exposure (brightness and contrast distribution), and color saturation. Poor technical execution often acts as a strong negative predictor for popularity. Content-Specific Features relate to the subjects and scenes within the image, which a CNN can learn to recognize. For instance, images containing certain recognizable objects (e.g., food, animals, faces) or scenes (e.g., landscapes, sunsets) may intrinsically perform better or worse based on the dataset’s historical trends.
A key methodological challenge in creating such an algorithm is the inherent subjectivity of aesthetics. Researchers address this by focusing on image popularity rather than pure artistic merit, positing that popularity, as a function of mass appeal and engagement, is a more stable and measurable proxy for perceived quality. The models developed do not aim to replace human creativity but rather to understand the statistical properties that drive mass appeal. For example, an algorithm may discover that images with a higher density of salient objects or those with a specific color palette (such as warm tones) are statistically more likely to be highly rated within a given cultural context.
The resulting predictive model is not a simple linear equation but a complex, non-linear function that weighs thousands of features simultaneously. The output of the algorithm is typically a single score or a ranking that represents the photograph's predicted appeal relative to other images in the dataset. This prediction has several powerful applications.
In a diagnostic capacity, the algorithm can be used as a tool for photographers to receive real-time feedback. By analyzing a photograph and pinpointing the features that contribute negatively to its predicted score—such as poor framing or suboptimal color balance—the algorithm can offer actionable suggestions for improvement. This allows a photographer to refine an image before sharing it widely, effectively serving as an automated editor or aesthetic coach.
In an image selection context, the algorithm can automate the curation of large photo collections. For platforms like social media sites or stock photography libraries, the algorithm can automatically surface the most appealing images, optimizing content delivery and user engagement. For the end-user, this translates to an improved viewing experience, as they are consistently presented with content that the model predicts will hold their attention.
Finally, the research also sheds light on the psychological and cultural factors driving image appeal. By examining which features the algorithm prioritizes, researchers gain insight into what humans collectively value in visual media. For example, some studies have shown that images conveying a strong sense of novelty, uniqueness, or emotional impact tend to score highly, provided the technical execution is sound. The development of an algorithm for the "perfect photograph" is therefore less about prescribing a rigid formula and more about creating a data-driven model that quantifies and operationalizes collective human taste, providing a technological bridge between aesthetic theory and practical image creation.
By 2026, the artificial intelligence landscape is predicted to transition from a phase of chaotic experimentation to one of operational necessity, economic reckoning, and systemic integration. The consensus among major authoritative sources—ranging from industry analysts like Gartner and IDC to risk bodies like the ESRB and frontier labs like Anthropic—is that 2026 will be the year AI either proves its massive ROI or faces a bursting financial bubble.
The following overview synthesizes these predictions, capturing the nuances of market dynamics, workforce transformation, technological breakthroughs, and the rising tide of systemic risk.
1. Market Maturity: The End of "Pilot Mode"
The defining characteristic of 2026 is the ubiquity of deployment. Gartner forecasts that 80% of enterprises will have operationalized Generative AI in production environments (up from less than 5% in 2023), effectively categorizing non-adopters as competitive outliers. This shift is not merely about using tools but integrating them; AI will move from a distinct application users "open" to an invisible, embedded layer within standard software.
The Search Shift: Consequently, the digital economy will pivot from "finding" to "synthesizing." Gartner predicts a 25% drop in traditional search volume, forcing marketing strategies to abandon SEO for "AI optimization."
Infrastructure & Edge: As usage scales, the bottleneck shifts from model capability to infrastructure. IDC predicts global AI spending will hit $300 billion, heavily weighted toward "AI-Ready" data architectures. Simultaneously, to combat spiraling cloud costs and latency, a significant migration to Edge AI (processing on local devices) will occur, allowing for offline, instantaneous intelligence.
Trust as a Gatekeeper: The "Wild West" of deployment ends in 2026. Gartner notes that adoption will be contingent on TRiSM (Trust, Risk, and Security Management); companies that master these guardrails will deploy models 50% faster than peers.
2. The Economic Paradox: Massive ROI vs. Bubble Risks
The economic outlook for 2026 is defined by a sharp divergence between optimistic corporate strategy and skeptical economic theory.
The Bull Case: IDC and KPMG forecast measurable, aggressive returns. IDC data suggests a 3.7x ROI for mature adopters, while KPMG notes that the US technology sector alone will drive $127 billion in investment, primarily creating a "burn the boats" dynamic where major firms pivot entire R&D budgets to AI. The primary driver is efficiency; KPMG predicts a 34.2% improvement in operational metrics, signaling a decoupling of revenue growth from headcount growth.
The Bear Case: Conversely, the Institute for New Economic Thinking (INET) warns of a potential financial crisis by 2026. They highlight a "CapEx/Revenue mismatch"—tech giants are spending hundreds of billions on hardware that depreciates in three years, while revenues may not scale fast enough to cover these costs. If the "scaling law" (where more compute equals better intelligence) hits diminishing returns, 2026 could see a collapse in AI stock valuations and a market consolidation that leaves only 2-3 monopolies standing.
3. The Workforce: Augmentation, "TuringBots," and the Productivity Divide
The labor market of 2026 will be defined by the "Augmented Employee" and the widening gap between AI-literate and legacy workforces.
Software Development: Forrester and McKinsey agree that coding will be the first profession fundamentally transformed. "TuringBots" (AI coders) will generate significant portions of global code, increasing speed by 20-50%. This shifts the developer's role from writing syntax to system architecture and oversight.
The Citizen Data Scientist: Forrester predicts a democratization of technical skills, where low-code AI tools allow business analysts to perform complex data science, alleviating the PhD talent shortage.
The Productivity Gap: McKinsey warns that the benefits will not be evenly shared. "Frontier" firms will see productivity skyrocket, while laggards stagnate. Furthermore, geographic disparities will emerge; regulatory friction in the EU may lead to slower productivity gains (~1%) compared to the US and China.
Retention: KPMG adds a nuanced psychological dimension: access to modern AI tools will become a key retention metric. Top talent will refuse to work for "manual" companies, viewing it as career stagnation.
4. Technological Frontiers: AGI, Science, and Health
Technically, 2026 is viewed as a tipping point where AI moves from "statistical mimicry" to "reasoning" and "scientific invention."
The AGI Window: Anthropic and DeepMind leadership suggest that Artificial General Intelligence (AGI)—systems that outperform humans at most economically valuable tasks—could be achievable as early as 2026/2027. This assumes that current scaling laws hold, pushing models from being "creative writers" to "reliable planners."
Medical Revolution: IEEE and Accenture forecast that AI will move from administrative tasks to core science. In healthcare, this means $150 billion in savings (US) via administrative automation, but more importantly, the arrival of AI-designed drugs in clinical trials and robotic surgery systems that actively guide human hands.
Beyond Hallucination: IEEE predicts technical breakthroughs in NLP context understanding that will drastically reduce hallucinations, making AI viable for high-stakes instruction following rather than just creative generation.
5. Sector Impacts: Media, Manufacturing, and "Verified" Humans
Media & Entertainment: PwC predicts a total disruption of the content supply chain. By 2026, the industry will grow to $677 billion, but the production process will be unrecognizable. Generative AI will handle storyboarding, VFX, and voice acting. In this ocean of synthetic content, "Verified Human" media will emerge as a premium luxury product.
Advertising: The market will shift from "mass media" to "synthetic personalization," where AI generates thousands of unique ad variants for individuals in real-time.
Gaming: Video games will transition from consumption to interaction, with NPCs powered by LLMs offering infinite, unscripted dialogue.
6. Systemic Risk, Regulation & Geopolitics
As AI becomes critical infrastructure, new systemic risks emerge, prompting a "Regulatory Reckoning" in 2026.
Financial Instability: The ESRB warns of "Model Monoculture." If all banks use the same few foundation models for risk assessment, they will share the same blind spots, potentially leading to "herding behavior" and flash crashes. Additionally, "hallucination-driven volatility" could occur if trading bots react to fake news generated by other AIs.
The Copyright War: The EU Joint Research Centre (JRC) predicts that 2026 will see the legal climax of the copyright debate. Rulings will likely force transparency, watermarking, and potentially new payment models for training data.
Sovereign AI: To avoid dependence on US tech giants, nations will push for "Sovereign AI"—state-backed models trained on local languages and cultural data.
Bio-Risk: Anthropic highlights the dark side of capability gains: by 2026, models may be sophisticated enough to assist in the design of biological weapons, necessitating strict government controls on model weights and compute access.
Adoption
80% of enterprises use GenAI; it becomes "invisible" infrastructure.
Source: Gartner
Economy
$300B global spending vs. potential financial bubble due to CapEx costs.
Source: IDC / INET
Workforce
AI Pair Programmers normalize; "Citizen Data Scientists" rise; non-adopters lose talent.
Source: McKinsey / Forrester
Technology
AGI window opens; AI drives drug discovery & scientific invention.
Source: Anthropic / IEEE
Risk
"Model Monoculture" risks financial crash; Copyright legal reckoning.
Source: ESRB / EU JRC
Media
Content supply chain disrupted; "Human Made" becomes a luxury label.
Source: PwC
In conclusion, 2026 is predicted to be the year the "AI hype" settles into a high-stakes reality. It will be a year of massive efficiency gains and scientific breakthroughs for the "Haves," contrasted with existential economic risks and regulatory battles for the "Have-Nots" and the broader financial system.
The "Bayesian Attention Trilogy" series of papers posits that the attention mechanism in Transformers fundamentally implements Bayesian inference through emergent geometric structures, rather than merely approximating it as a statistical artifact. The project moves from empirical verification in controlled settings (Paper I) to theoretical justification through gradient dynamics (Paper II), and finally to validation in production-scale language models (Paper III).
The central, unifying claim is that Bayesian inference is the computational primitive that attention implements. Transformers are not Bayesian by explicit design but become so because cross-entropy gradient descent naturally sculpts the network into an inference engine. This insight is demonstrated through a novel methodology:
To rigorously separate genuine probabilistic reasoning from memorization or heuristic pattern matching, the authors constructed Bayesian Wind Tunnels—controlled environments where the true analytic posterior distribution is known in closed form, and the hypothesis space is combinatorially large to prevent memorization.
Task 1: Bijection Learning (Hypothesis Elimination). Models must infer a random, one-to-one mapping from examples, requiring discrete hypothesis elimination.
Task 2: Hidden Markov Model (HMM) State Tracking (Recursive Inference). Models must learn the parameters of a fresh HMM and then implement the Forward Algorithm to recursively track the posterior over hidden states.
In these wind tunnels, small transformers (2-3M parameters) achieved striking fidelity, tracking the analytic Bayesian posterior with near machine precision (e.g., $10^{-3} - 10^{-4}$ bit error), a performance that extended robustly to sequence lengths beyond the training horizon. Crucially, capacity-matched MLPs (Feed-Forward Networks) failed catastrophically in both tasks, proving that the Attention mechanism is architecturally necessary for this form of in-context structure learning.
Mechanistic analysis revealed that transformers implement Bayesian inference through a consistent, hierarchical three-stage geometric process:
Layer 0: Foundational Binding (The Hypothesis Frame): The first layer establishes the structural basis for inference. Keys form an approximately orthogonal basis over the hypothesis space, creating a set of separable "slots" for each possibility. This orthogonal Key geometry is indispensable; ablating the single head responsible for this step causes catastrophic failure.
Mid Layers: Progressive Elimination (Geometric Bayes Rule): The query-key (QK) attention mechanism systematically implements evidence integration. Across depth, queries progressively sharpen their attention alignment onto the subset of keys consistent with the evidence, geometrically mirroring the elimination of inconsistent hypotheses in Bayes' rule. The Feed-Forward Networks (FFNs) then perform the numerical update of the posterior belief, which is carried by the residual stream.
Late Layers: Precision Refinement (The Uncertainty Manifold): In the final layers, the Value vectors organize into a low-dimensional, smooth manifold—often one-dimensional—whose coordinates are parameterized by the predictive posterior entropy. This geometric structure allows the model to encode fine-grained uncertainty and confidence with high precision.
Paper II derives the theoretical basis for this geometry, showing how standard cross-entropy training naturally forces the emergence of the Bayesian structure [cite: 2, cite: 3].
Advantage-Based Routing: The gradient to the attention score $partial L / partial s$ implements an advantage-based routing rule, favoring the allocation of attention mass toward Key-Value pairs that are "better than average" at reducing the loss for a given Query.
Responsibility-Weighted Values: The Value vector update $Delta v$ is a responsibility-weighted average of upstream error signals. This induces a positive feedback loop where Queries route more strongly to helpful Values, and those Values adapt to the error landscape created by their users.
EM-Like Two-Timescale Dynamics: This coupled specialization behaves structurally like the Expectation-Maximization (EM) algorithm. Attention weights act as the fast E-step (soft responsibilities/routing frame), while Values act as the slow M-step (prototype updates/precision refinement).
This dynamic explains the core Frame-Precision Dissociation observed in the experiments: the attention patterns (the frame) stabilize and freeze early in training, defining where information flows, while the Value representations (the precision of the belief manifold) continue to refine until the precision of the posterior is maximized [cite: 2, cite: 3].
Paper III validates that these geometric signatures are not artifacts of small, synthetic tasks but universal invariants that persist and function in production-grade LLMs, even at billions of parameters and under heterogeneous training.
Manifold Collapse (The Domain-Restriction Bridge): Across Pythia, Phi-2, and Llama, when mixed-domain prompts are used, value manifolds appear multi-dimensional. However, when prompts are restricted to a single coherent domain (e.g., mathematics), the manifold collapses into the same low-dimensional, entropy-ordered structure observed in the wind tunnels. This suggests that LLMs maintain a repertoire of Bayesian manifolds, with the active one determined by the task domain.
Inference-Time Updating (SULA Experiment): Using the Synthetic Unary Likelihood Augmentation (SULA) task, which supplies explicit in-context evidence, it was shown that LLMs actively use this geometry during inference. As a model reads more evidence, its value representation moves systematically along the entropy-aligned manifold axis, confirming the geometry is functionally engaged in real-time belief updating [cite: 2, cite: 4].
Static vs. Dynamic Geometry: Static structures (Value Manifolds and Key Orthogonality) are robust invariants across all architectures (MHA, GQA, MoE/Sliding-Window). However, Dynamic Focusing (the progressive layerwise entropy reduction) depends on architectural capacity: it is strong in full-sequence MHA but attenuated in GQA and weak or noisy in Mistral models due to constraints like KV-sharing and local attention windows. This confirms that the representational substrate is universal, but the mechanism of refinement is architecturally sensitive.
Efficiency vs. Interpretability: The efficiency-optimized Grouped-Query Attention (GQA) in Llama-3.2-1B shows functional preservation of the Bayesian structure but with weaker orthogonality and focusing, suggesting a trade-off between efficiency and geometric clarity.
Causal Limitations: Causal interventions that remove the entropy-aligned value axis destroy the local geometry but do not proportionally degrade Bayesian-like behavior. This suggests the manifold is a privileged readout or representational trace of uncertainty, rather than a single, brittle causal bottleneck for a more deeply distributed computational process.
This framework provides a geometric explanation for advanced behaviors like Chain of Thought (CoT) prompting. Because Bayesian inference is implemented as a fixed sequence of layer-by-layer elimination and refinement steps, a complex problem may require more layers than the model has. CoT acts as a "geometric extender," allowing the model to buy itself more rounds of elimination (more forward passes) to navigate between high-confidence, well-calibrated regions of its geometric manifold, ultimately increasing the reliability and accuracy of the final prediction.
In conclusion, the trilogy provides a unified and rigorous geometric foundation for understanding transformer computation, demonstrating that the attention mechanism's emergent geometry is sufficient and necessary for the faithful representation and recursive updating of Bayesian belief states.
The paper presents an ambitious and highly speculative framework that seeks to bridge the gap between abstract, high-level mathematics and the functional architecture of the human brain.
The core hypothesis is that the structural and algebraic richness of the Exceptional Simple Lie Group E8 may serve as a candidate symmetry model underlying key aspects of cortical computation, connectivity, and information processing.
The paper proposes that the massive complexity and efficiency of the neocortex are not merely an emergent property of cellular-level biological interactions, but are fundamentally constrained and organized by a deep, elegant mathematical symmetry: E8.
E8 is the largest and most intricate of the five exceptional simple Lie groups, possessing an extraordinary 248-dimensional structure. It is a mathematical object of immense elegance that has appeared unexpectedly in various fields of theoretical physics, notably in some unified theories like string theory.
The framework draws from algebraic topology, theoretical neuroscience, and information theory to map the properties of this group onto the brain. Specifically, the study aims to:
Map E8 to Topology: Relate the mathematical properties of E8 to the functional topology of cortical manifolds. This suggests the brain's activity patterns might organize themselves in a high-dimensional structure whose geometry is governed by the E8 root system.
Model Dynamics: Examine how feedback loops and information flow in the cortex correspond to differential and geometric analogues within the E8 structure. The paper outlines a potential computational model that is intrinsically constrained by E8 symmetry, offering a rigid, non-arbitrary template for brain function.
Validation and Application: The work suggests pathways for neuroscientific validation, focusing on analyzing imaging and time-series data for E8-like patterns. Furthermore, it explicitly considers applications to Artificial Intelligence (AI), hypothesizing that an AI built upon this inherent brain symmetry could achieve more efficient and human-like general intelligence.
The true significance of this paper lies in its philosophical and conceptual implications, which challenge conventional views of neuroscience and nature's elegance.
By proposing E8 as the organizational principle of the brain, the paper is asserting a form of deep mathematical realism. This implies that the most efficient and robust physical and computational systems in the universe, from particle physics to consciousness, are built not just described by—but governed by—a small set of highly structured mathematical objects.
If the brain is an E8 system, it would explain its astonishing efficiency. E8, being a highly constrained structure, represents an optimal configuration of many interacting parts. The insight is that the brain is not simply a biological computer that works, but a minimal complexity, maximal computational power system whose architecture is necessitated by the requirement for this perfect symmetry. This shift in perspective moves the study of consciousness from a purely neurobiological problem to an algebraic topology problem.
The paper offers a powerful, constraint-based template for Artificial General Intelligence (AGI). Current AI often uses architectures like deep neural networks, which are highly effective but lack a demonstrable, unifying principle that links them directly to the efficiency of the human brain.
The E8 framework suggests that to build AGI, researchers should not just model connectivity, but must embed the E8 symmetry into the AI's computational core. This is the deeper insight for AI: a truly general intelligence may only be achievable by replicating the fundamental algebraic necessity of the neocortex, rather than merely its statistical or connectionist properties. This could lead to AI models that are exponentially more efficient, less prone to catastrophic forgetting, and capable of true abstract generalization.
Philosophically, the E8 hypothesis directly engages with questions of epistemology and the limits of reductionism. If the brain’s highest-level functions—the things we call consciousness and thought—are simply an expression of the E8 geometry, it means these phenomena are algebraically necessary outputs of the system.
The paper argues against extreme reductionism, suggesting that to understand thought, reducing the system to individual neurons (the components) is insufficient. Instead, one must understand the symmetry group (E8) that constrains the arrangement of the components. This structuralist approach suggests that the whole (consciousness) is not merely the sum of its parts, but the expression of its governing symmetry.
In conclusion, the paper serves as a potent intellectual provocation, aiming to stimulate dialogue that views the brain not just as a complex biological machine, but as a marvel of mathematical physics, whose ultimate secrets are inscribed in the language of symmetry and exceptional Lie groups.
Artificial intelligence is fundamentally altering the landscape of mathematical discovery, moving beyond mere automation to become an indispensable collaborator that will elevate mathematical creativity and rigor. AI's role extends beyond solving existing problems to helping formulate new, insightful questions, thereby accelerating the pace and expanding the scope of mathematical inquiry.
Redefinition of "Mathematical Prowess": As AI takes over routine calculations and proofs, the value of human mathematicians will shift from computational skill to creative problem formulation, deep conceptual understanding, and the ability to ask insightful questions. This will likely raise the bar for what is considered groundbreaking mathematics.
Enhanced Rigor and Trust: The integration of AI-powered verification tools could lead to a new standard of rigor in mathematical proofs, reducing errors and increasing confidence in complex results. This may also transform the peer-review process, making it more efficient and reliable.
Democratization of Mathematical Research: AI tools could lower the barrier to entry for complex mathematical fields, enabling a broader range of researchers to contribute. By automating literature reviews and suggesting promising research directions, AI could empower smaller institutions and individual researchers to tackle problems previously accessible only to elite groups.
New Frontiers of Mathematical Exploration: By identifying patterns and structures beyond human intuition, AI has the potential to unlock entirely new areas of mathematical inquiry, leading to discoveries that were previously unimaginable. This could foster a new golden age of mathematics, characterized by unprecedented collaboration between human and artificial intelligence.
This new approach to physics-informed neural networks (PINNs) enables robust solutions for stiff and high-dimensional differential equations by combining multi-head architectures and unimodular regularization. The multi-head strategy allows a single neural net to solve an entire family of equations simultaneously, improving generalization and handling noisy or sparse data. Unimodular regularization, leveraging ideas from differential geometry, stabilizes training and allows the system to efficiently uncover unknown or missing physical laws.
Applications are wide-ranging:
Astrophysics and Relativity: Direct solution of Einstein Field Equations and the modeling of complex spacetime geometries.
Climate Science: Modeling atmospheric dynamics and coupled climate models that involve many scales and stiff systems.
Chemical Kinetics and Biology: Simulation and inference in biochemical networks, metabolic pathways, and reaction-diffusion systems with rapid and slow processes intertwined.
Engineering: Fluid dynamics (including turbulent and reactive flows), aerodynamics, material deformation, and control systems where traditional solvers fail due to stiffness or sensitivity.
Environmental Science: Predictive modeling for air pollution, PM2.5 evolution, and other multi-timescale diffusion-advection problems.
The net result: faster, more accurate, and more versatile model training and simulation for any field that relies on solving or inferring complex differential equations under challenging data or physical constraints.
Refs:
[1] EPINN: Physics-Informed Neural Network with exponential activation functions for solving stiff ODEs
[2] Solving stiff ordinary differential equations using physics informed neural networks (PINNs): simple recipes to improve training of vanilla-PINNs
[3] Stiff neural ordinary differential equations
[4] Stabilize physics-informed neural networks for stiff differential equations: Re-spacing layer
[5] Mixing Differential Equations and Neural Networks for Physics-Informed Learning
[6] Training stiff neural ordinary differential equations with implicit single-step methods
AI as a key technology: By 2030, AI is predicted to be a cornerstone technology integrated into every aspect of the economy and human-computer interaction, assuming current scaling trends continue.
Accelerated Scientific R&D: AI is expected to significantly speed up scientific research by assisting with tasks like implementing complex software from natural language, formalizing mathematical proofs, and answering intricate biological questions. AI assistants are anticipated to become as common in science as coding assistants are for software engineers.
Massive Investment and Challenges: The development of advanced AI models in 2030 will demand unprecedented investments of hundreds of billions of dollars and vast amounts of electrical power (gigawatts). However, the article suggests that the economic returns from AI-driven productivity will justify these costs and that bottlenecks like data availability and scaling costs will likely be overcome.
Lagging Societal Impact: While AI capabilities will advance rapidly, their deployment and societal impact may be delayed in certain sectors. For example, the lengthy process of clinical trials and regulatory approvals in pharmaceutical R&D means that drugs approved by 2030 are unlikely to have benefited from the advanced AI of that era. In contrast, fields with shorter iteration cycles and fewer regulations, like software engineering, are expected to be dramatically transformed.
Qualcomm's new Snapdragon X2 Elite chips deliver major leaps in performance, efficiency, and AI, featuring up to 18 CPU cores, a 5GHz boost, and 80 TOPS neural processing. Devices with these chips—promising multi-day battery life—are expected in early 2026. However, market adoption faces hurdles, including limited Windows ARM compatibility and entrenched competition from Intel and AMD. Qualcomm is committed to a long-term PC strategy with ambitious sales goals, but success depends on overcoming these software and ecosystem challenges.
Artificial intelligence has reached near-saturation in software engineering, with over 90% of developers and companies integrating AI tools into their workflows. The adoption is largely fueled by significant productivity gains, improved code quality, and streamlined development processes—AI support ranges from automated code generation to bug fixes and architectural recommendations. However, mass adoption exposes persistent industry-wide challenges, including data privacy and security concerns, unclear returns on investment, integration complexity with legacy systems, and a notable shortage of in-house AI expertise.
Despite daily reliance on AI, trust in automated output remains stubbornly low among developers, who prefer using AI as an assistive resource rather than replacing human judgment. Ethical questions and fears of diminished critical thinking, particularly among junior engineers, add to organizational hesitancy. Entry-level roles are impacted as tech workforce trends show shrinking demand, with job postings for new graduates sharply down since 2022. To manage these challenges, leading firms have crafted frameworks focused on communication, feedback, and cultural readiness. As software engineering moves toward full AI integration, success will increasingly depend on balancing rapid innovation with governance, transparency, and the growth of human expertise
In September 2025, Climate TRACE, led by Al Gore, launched a groundbreaking AI-powered tool that tracks the sources and dispersion of toxic particulate pollution (PM2.5) across 2,500 cities and 660 million assets worldwide. Utilizing 300 satellites, 30,000 ground sensors, and advanced atmospheric models, this system identifies “super emitter” facilities responsible for the largest share of health-threatening pollution. Nearly 1.6 billion urban residents are exposed to emissions mapped by the system, which highlights direct neighborhood impacts and global hotspots like Karachi, Guangzhou, Seoul, and New York City. The data shows that these emissions cause millions of premature deaths annually, linked to severe diseases and chronic health risks. By making pollution visible at a local level and naming its sources, the tool empowers global public health action and accelerates pressure to curb fossil fuel usage, transforming environmental transparency and accountability through technology.
The hidden costs of AI-generated work go far beyond initial productivity gains, according to recent research from Stanford University and industry publications. While companies invest heavily in generative AI tools, studies show that up to 40% of employees encounter “workslop” — polished-looking but low-value AI outputs that require nearly two hours per incident to fix. For large employers, this inefficiency translates to millions in lost productivity each year as time is spent correcting, rewriting, or clarifying material.
Beyond finances, misuse of AI strains workplace relationships and trust. Many employees feel annoyed, confused, or offended by poor AI-generated drafts, and recipients often view senders as less creative or capable, harming future collaboration. Technical teams also face increased debugging, refactoring, and security risks, as AI-generated code can introduce hidden flaws and technical debt that destabilize systems over time. The true cost to organizations lies in shifting cognitive and correctional burdens downstream, often masking inefficiencies and eroding team dynamics, undermining the hoped-for benefits of AI productivity.
AI-powered weather forecasting has reached a global turning point in 2025, driven by new models, technologies, and collaborations among technology leaders and meteorological agencies. The main technological breakthrough is the replacement of traditional physics-based numerical models with deep learning systems capable of processing vast atmospheric data rapidly and producing high-accuracy forecasts with far lower computational costs.
Prominent models include Cambridge’s Aardvark, ECMWF’s AIFS, Google DeepMind’s GraphCast and GenCast, Microsoft’s Aurora, Huawei’s Pangu-Weather, and Climavision’s Horizon AI suite. Aardvark, for instance, can issue global and local forecasts in minutes on a desktop while using just 10% of input data and outperforming the U.S. GFS system. GenCast, published by Google DeepMind, outperformed ECMWF’s own ensemble system on over 97% of targets and demonstrated specific superiority in storm tracking and prediction speed. Microsoft’s Aurora is operational at top European centers and is notable for its ability to forecast not only weather but also other Earth systems like ocean dynamics and air quality.
The main industry players are Google (DeepMind), Microsoft, Huawei, ECMWF, and Climavision, together with key university research groups like Cambridge. Tech giants have made forecasts 1,000 times more energy efficient, requiring only a fraction of supercomputer resources, which democratizes access for developing regions without large computational facilities.
AI now assimilates real-time data from satellites, radars, and ground sensors to generate forecasts almost instantly—crucial for extreme weather warnings and emergency response. Ensemble forecasting using AI, such as that from GenCast and Horizon AI, allows for hundreds of scenario simulations, improving confidence in predicting rare, high-impact events.
The impact extends to agriculture, energy, logistics, and disaster management, leading to safer societies and greater resilience in the face of climate extremes. As research integrates more physics-based knowledge, AI models are also beginning to improve rare event prediction and long-range outlooks, with expanding commercial and humanitarian applications.
Dominance of Gemini Models
Gemini 2.5 Pro: This model is highlighted as a top-tier AI, which is not only powerful but also fast, cheap, and boasts a large 1 million token context window. It is also integrated with the entire Google suite of products.
Gemini 2.5 Flash: A smaller, faster, and cheaper version of Pro, designed for edge applications and mobile integration.
Market Position: Google's Gemini models are described as owning the "Pareto frontier," meaning they offer the best combination of performance and cost, effectively outcompeting rivals.
Leadership in Other Generative AI Fields
Google is integrating a suite of world-class generative AI tools into its Vertex AI platform:
Lyria for music generation.
Imagen 3 for image generation.
Veo 2 for video generation, which the author praises as superior to OpenAI's Sora.
Chirp 3 for voice and speech.
AI Agents: Projects like Astra (assistant) and Mariner (computer interaction) are also underway, with Gemini 2.5 Pro in Deep Research mode considered twice as effective as OpenAI's equivalent.
Unmatched Strengths Beyond AI Models
Software and User Base: Google's existing ecosystem, with seven products having at least two billion monthly active users (including Search, YouTube, Android, and Chrome), provides an unparalleled distribution channel for its AI technologies.
Cloud Computing: As a "hyperscaler," Google Cloud is a major player in the cloud computing space, competing with giants like Microsoft and Amazon.
Hardware: Google is developing its own AI chips (TPUs), with the 7th version, Ironwood, announced. This reduces its dependence on companies like Nvidia and positions it as a future chip seller.
Weekday recaps of top News for AI Engineers
AI model trained on over ten million decisions from psychological experiments that can predict human behavior with unprecedented accuracy, even in entirely new situations it has never encountered before, potentially revolutionizing our understanding of human cognition and decision-making processes.
In 2024, artificial intelligence (AI), for the first time, helped win a Nobel Prize. DeepMind’s AlphaFold cracked one of biology’s hardest puzzles: protein folding, the challenge of predicting how a chain of amino acids twists into the intricate 3D shape that determines its function. Scientists had struggled with this problem for decades. It was crucial for medicine and drug discovery but seemed unsolvable due to the astronomical number of possible protein structures. Then AI delivered the answer.
A game-changer, no doubt. But it also raises the question: what does this mean for science and for scientists? Is traditional scientific inquiry becoming obsolete? Are we approaching a future where algorithms are the primary drivers of discovery, relegating humans to the sidelines?
Throughout history, every breakthrough technology has redefined how discoveries were made, marking four distinct eras of science [1] (Fig 1). The first, the empirical era, relied on direct observation, as Copernicus challenged the Earth-centered view of the universe by observing the skies. The second, the theoretical era, introduced mathematics to predict nature, like Newton’s equations of motion that shaped physics for centuries. The third, the computational era, which began in the 1950s, harnessed computers to simulate complex systems, leading to Kohn and Pople’s quantum chemistry Nobel Prize. The fourth, the data-driven era of our 21st century, uses machine learning to extract patterns from vast datasets, with AlphaFold solving protein structures by learning from the protein data bank [2].
Today, we stand at the doorstep of the fifth era of science—the artificial scientific intelligence era—where companies like Google, Lila Sciences, and Sakana are unveiling AI scientists that not only assist research but drive discoveries, generate hypotheses, and test them on their own [3–5] (Fig 1). Hence, why not let AI run the show from here?
In some fields, perhaps we can. In chemistry, organic synthesis—the process of assembling complex drug-like molecules from basic building blocks—is now guided by interpretable AI models that help scientists plan each step [6]. In materials science, generative AI can design novel inorganic compounds with tailored mechanical, electronic, and magnetic properties, accelerating innovation with minimal human tuning [7]. These are domains where experimental feedback is relatively tractable, simulations are mature and the data is plentiful and structured. In short, these fields provide ideal conditions for autonomous AI exploration.
But in many other areas, letting an AI run the show today would be like sending a self-driving car down a dirt road with half a map and no GPS. AI might have the horsepower, but it still needs humans to steer it around the pitfalls of specialized scientific data. Nowhere is this clearer than in biomedical imaging, where highly curated datasets are nothing like what traditional large vision models are trained on.
First, biomedical imaging datasets are often tiny by AI standards, and for good reason: collecting them requires technical equipment and trained professionals; labeling them demands significant time and expert input; and strict privacy regulations often limit access. MedPix, a leading medical imaging database, contains just 59,000 images and the Allen Cell Feature Explorer, one of the largest publicly available collections of high-resolution 3D images of human stem cells, only around 32,000 images. That is about a thousand times fewer than what is needed for AI to perform. This is where scientists step in.
Scientists are redefining AI to do more with less, helping algorithms find meaning in images even when data are scarce. One approach involves using mathematical insights to redesign the core building blocks of neural networks. Traditional models fall apart when we strip away their layers or parameters, but these new architectures stay strong—even with just a single layer and two convolutional filters [8]—precisely because they are built to thrive on small data. And, scientists do not just bend the design of the model to fit the lack of data, they also reimagine the data ecosystem to power the model; they decide what data to collect, how to collect it, and how to weave together existing, but fragmented, specialized datasets to train AI models for a wide variety of tasks, including brain tumor classification or diabetic retinopathy grading [9].
But scientific data is not just scarce, it is often noisy. Cryo-electron microscopy (cryo-EM), a Nobel Prize-winning technology that lets us see the invisible [10]—revealing molecules at the tiniest scale—produces incredibly blurry images, where the important details are 100 times weaker than the noise. It is like trying to recognize a friend in a crowd while wearing someone else’s prescription glasses. This stands in stark contrast to the crisp, high-resolution images—like street scenes, faces, or everyday objects—that traditional AI vision models are trained on.
Yet scientists have techniques to extract meaning from even the noisiest images. In cryo-EM, they can reconstruct the 3D shapes of molecules buried in noise; for example, providing the first high-resolution images of SARS-CoV-2 during the COVID-19 pandemic [11,12]. Today, they are combining that hard-won expertise with the power of AI. One breakthrough pairs a powerful denoising module with a foundation model, enabling AI to tackle the notoriously difficult processing steps of cryo-EM images [13]. Crucially, this was only possible because scientists also applied their domain expertise to curate a high-quality dataset by cleaning, annotating, and aggregating 529 verified cryo-EM datasets into one large training set that AI could learn from.
It is clear that AI presents an enormous opportunity for science, potentially the most powerful tool we have ever had in our arsenal. But the fifth era of artificial scientific intelligence is not void of human scientists: quite the opposite. In many ways, the future of revolutionary discoveries lies in this synergy: human expertise guiding AI, and AI augmenting human expertise. It is as if we have hired the most overachieving and wildly enthusiastic intern; one who works at superhuman speed, never sleeps, and eagerly devours mountains of data. They hold exceptional potential, but without proper guidance anchored in scientific knowledge, they are more likely to set the lab on fire than to push science forward.
Instead of hoping AI will magically handle limited, noisy, specialized data, we need experts to tailor algorithms to the realities of fields like biology and medicine, and to tailor data to the new requirements of the AI technology. To enter the fifth era of science, we need to equip researchers with AI expertise, AI experts with domain knowledge, and universities with interdisciplinary programs. The labs that thrive will be those where domain experts and AI specialists work in sync or where scientists master both. The next scientific revolution will come from teams who can judiciously steer AI, knowing when to trust it, when to adjust its course, and when to drive it into uncharted territory.
This Microsoft Research paper, "The Impact of Generative AI on Critical Thinking," presents a comprehensive analysis based on a large-scale survey investigating how Generative Artificial Intelligence (Gen AI) technologies are influencing human critical thinking abilities across various domains. The study acknowledges the dual-edged nature of Gen AI, identifying both significant opportunities for augmentation and considerable risks of degradation.
The research begins by framing critical thinking as a multifaceted cognitive process involving analysis, evaluation, inference, explanation, and self-regulation. It highlights the unprecedented capabilities of Gen AI models, such as large language models (LLMs), to process vast amounts of information, generate diverse content, and perform complex reasoning tasks. The primary objective of the survey was to gather empirical insights from a diverse demographic including students, educators, professionals, and AI researchers on their perceived experiences and the observed effects of Gen AI on their own critical thinking or that of others.
The survey methodology involved a mixed-methods approach, combining quantitative Likert-scale questions to gauge agreement on various impact statements and qualitative open-ended responses to capture nuanced experiences and elaborations. Participants were asked about their frequency of Gen AI use, the specific tools they employed, and their perceptions regarding its influence on aspects like information analysis, problem-solving, decision-making, and creativity.
The findings reveal a complex landscape. On the positive side, a substantial portion of respondents reported that Gen AI acts as a powerful cognitive augmentor. It was frequently cited as a valuable tool for brainstorming, generating initial ideas, and exploring diverse perspectives that might otherwise be overlooked. Users found it adept at synthesizing information rapidly from vast datasets, thereby reducing the initial cognitive load associated with information gathering and allowing them to focus more on higher-order analysis and evaluation. For many, Gen AI facilitated the automation of routine or repetitive analytical tasks, freeing up mental resources for more complex, creative, and strategic thinking. Educators noted its potential in personalizing learning experiences and offering immediate feedback, which could, if properly utilized, foster deeper engagement with critical concepts.
Conversely, the survey also unearthed significant concerns regarding the potential erosion of critical thinking skills. A prominent finding was the risk of over-reliance on Gen AI outputs, where users might become less inclined to engage in independent thought, rigorous fact-checking, or deep analytical processing. This reliance could lead to a 'deskilling' effect, where fundamental cognitive abilities like information synthesis, logical deduction, and error detection atrophy due to externalization to AI systems. Participants expressed worries about the phenomenon of 'algorithmic bias' and 'hallucinations,' where Gen AI might generate plausible but incorrect or biased information, making it harder for users to discern truth from falsehood without sufficient domain expertise or critical vigilance. There was also concern that the ease of generating content might reduce the effort invested in original thought and creative problem-solving, leading to a homogenization of ideas or a diminished capacity for truly novel contributions. The 'black box' nature of some AI models, where the reasoning process is opaque, further complicates the development of user trust and the ability to critically evaluate AI-generated solutions.
Nuance in the findings highlighted that the impact of Gen AI is not uniform and largely depends on the user's existing critical thinking proficiency, AI literacy, and the context of use. Highly skilled critical thinkers often leveraged Gen AI as an advanced tool to enhance their existing capabilities, viewing it as a co-pilot rather than a replacement for their intellect. In contrast, those with developing critical thinking skills were more susceptible to the negative effects, such as accepting AI outputs uncritically. The importance of 'prompt engineering' skills – the ability to effectively communicate with and guide AI models – emerged as a critical factor in maximizing positive outcomes and mitigating risks.
The paper concludes with significant implications for education, professional development, and policy. It advocates for the urgent integration of 'critical AI literacy' into curricula across all levels, emphasizing the need to teach individuals not just how to use Gen AI, but how to critically evaluate its outputs, understand its limitations, and ethically interact with it. The study underscores the necessity for developing metacognitive strategies that encourage users to reflect on their own thinking processes in conjunction with AI. Ultimately, the research suggests that the future of critical thinking in the age of Gen AI will hinge on fostering a symbiotic relationship where humans leverage AI's strengths while actively nurturing and exercising their unique cognitive capabilities, ensuring that technology serves as an amplifier of human intellect rather than a substitute for it.
This summary outlines the key insights from a conversation with Yann LeCun, Meta's Chief AI Scientist, on the current state and future direction of artificial intelligence.
The discussion begins by addressing why generative AI, despite having ingested a vast corpus of human knowledge, has not produced novel scientific discoveries. LeCun draws a clear distinction between current AI systems, predominantly Large Language Models (LLMs) like those powering chatbots, and the type of AI capable of genuine innovation.
LeCun argues that LLMs are fundamentally designed for retrieval and regurgitation. They excel at producing text that conforms to the statistical patterns of their training data, making them useful for summarizing and retrieving existing information. However, they are incapable of true invention or reasoning in their current form. He likens the language-producing part of the human brain (Broca's area) to an LLM—a small component that translates abstract thought into words. True intelligence and reasoning, however, occur in a different, much larger part of the brain where we build mental models of the world. Humans think in abstract representations, not language, and this is the capability current AI lacks.
Techniques like "Chain of Thought" give LLMs the appearance of reasoning by forcing them to generate more text, thus devoting more computation to a problem. However, LeCun dismisses this as a superficial trick, not a form of genuine reasoning. True reasoning often involves a search through a space of potential solutions, a mechanism that is entirely absent in LLMs and must be crudely "bolted on."
LeCun believes that the current paradigm of scaling up LLMs is hitting a point of diminishing returns. The industry has nearly exhausted the available public text data for training, and the costs of acquiring or generating new, high-quality data are ballooning for marginal improvements. He states unequivocally that simply scaling up LLMs will not lead to human-level AI.
This creates a potential "timeline mismatch" with the massive investments pouring into the field. LeCun distinguishes between two types of investment. Investment in infrastructure for inference—the computational power needed to serve existing AI models to billions of users, as Meta plans to do—is a justifiable business decision. However, investment based on the promise that current LLM-based companies will achieve AGI within a few years is misguided and risks creating a backlash or another "AI winter" if these exaggerated expectations are not met. He draws parallels to the overhyped expert systems of the 1980s and IBM Watson, which both failed to deliver on their grand promises.
To overcome these limitations, LeCun outlines a new paradigm focused on building systems that can learn "world models." This requires developing AI that possesses four key characteristics currently missing from LLMs:
Understanding of the physical world.
Persistent memory.
The ability to reason.
The ability to plan.
The key to this is for AI to learn from rich, non-textual data like video, which contains vastly more information about how the world works than text alone. A child, by the age of four, has processed more sensory data (primarily visual) than the largest LLMs have processed in text tokens. This early learning builds an intuitive understanding of physics and common sense—the foundation of true intelligence.
LeCun’s proposed solution is a non-generative architecture called the Joint Embedding Predictive Architecture (JEPA). He explains that generative models, which try to predict every single pixel in the next frame of a video, are doomed to fail because the world is too unpredictable in its details. One cannot predict the exact path of every water droplet when a glass is spilled.
Instead of predicting pixels, JEPA learns to create an abstract representation of the world and makes predictions within that abstract space. The model is shown part of an input (like a video) and tasked with predicting the abstract representation of the missing part. By ignoring irrelevant, unpredictable details, the system can learn the underlying, predictable principles of how the world functions.
This approach, demonstrated in models like V-JEPA (Video JEPA), allows a system to learn intuitive physics from observation. When shown a physically impossible event (e.g., an object vanishing), the model's prediction error spikes, indicating it has learned a coherent model of reality. This ability to model the world and predict the outcomes of actions is the foundation for genuine planning and reasoning.
LeCun concludes by championing open source as the primary engine of progress in AI. He argues that no single company, no matter how large, has a monopoly on good ideas. Innovation is happening globally, as evidenced by foundational work like ResNet (from Beijing) and recent models like DeepSeek. The open-source community allows for a diversity of ideas to be shared and built upon, accelerating progress for everyone. Furthermore, for businesses deploying AI, open-source models like Llama are often cheaper, more secure, and more controllable than proprietary APIs, making them the preferred choice for production systems.