Day 2025-03-20

7 bookmarks for 2025-03-20

This summary outlines the key insights from a conversation with Yann LeCun, Meta's Chief AI Scientist, on the current state and future direction of artificial intelligence.

The Limitations of Large Language Models (LLMs)

The discussion begins by addressing why generative AI, despite having ingested a vast corpus of human knowledge, has not produced novel scientific discoveries. LeCun draws a clear distinction between current AI systems, predominantly Large Language Models (LLMs) like those powering chatbots, and the type of AI capable of genuine innovation.

LeCun argues that LLMs are fundamentally designed for retrieval and regurgitation. They excel at producing text that conforms to the statistical patterns of their training data, making them useful for summarizing and retrieving existing information. However, they are incapable of true invention or reasoning in their current form. He likens the language-producing part of the human brain (Broca's area) to an LLM—a small component that translates abstract thought into words. True intelligence and reasoning, however, occur in a different, much larger part of the brain where we build mental models of the world. Humans think in abstract representations, not language, and this is the capability current AI lacks.

Techniques like "Chain of Thought" give LLMs the appearance of reasoning by forcing them to generate more text, thus devoting more computation to a problem. However, LeCun dismisses this as a superficial trick, not a form of genuine reasoning. True reasoning often involves a search through a space of potential solutions, a mechanism that is entirely absent in LLMs and must be crudely "bolted on."

Diminishing Returns and Investment Risks

LeCun believes that the current paradigm of scaling up LLMs is hitting a point of diminishing returns. The industry has nearly exhausted the available public text data for training, and the costs of acquiring or generating new, high-quality data are ballooning for marginal improvements. He states unequivocally that simply scaling up LLMs will not lead to human-level AI.

This creates a potential "timeline mismatch" with the massive investments pouring into the field. LeCun distinguishes between two types of investment. Investment in infrastructure for inference—the computational power needed to serve existing AI models to billions of users, as Meta plans to do—is a justifiable business decision. However, investment based on the promise that current LLM-based companies will achieve AGI within a few years is misguided and risks creating a backlash or another "AI winter" if these exaggerated expectations are not met. He draws parallels to the overhyped expert systems of the 1980s and IBM Watson, which both failed to deliver on their grand promises.

The Path Forward: World Models and a New Paradigm

To overcome these limitations, LeCun outlines a new paradigm focused on building systems that can learn "world models." This requires developing AI that possesses four key characteristics currently missing from LLMs:

Understanding of the physical world.
Persistent memory.
The ability to reason.
The ability to plan.

The key to this is for AI to learn from rich, non-textual data like video, which contains vastly more information about how the world works than text alone. A child, by the age of four, has processed more sensory data (primarily visual) than the largest LLMs have processed in text tokens. This early learning builds an intuitive understanding of physics and common sense—the foundation of true intelligence.

JEPA: A Non-Generative Architecture for Learning

LeCun’s proposed solution is a non-generative architecture called the Joint Embedding Predictive Architecture (JEPA). He explains that generative models, which try to predict every single pixel in the next frame of a video, are doomed to fail because the world is too unpredictable in its details. One cannot predict the exact path of every water droplet when a glass is spilled.

Instead of predicting pixels, JEPA learns to create an abstract representation of the world and makes predictions within that abstract space. The model is shown part of an input (like a video) and tasked with predicting the abstract representation of the missing part. By ignoring irrelevant, unpredictable details, the system can learn the underlying, predictable principles of how the world functions.

This approach, demonstrated in models like V-JEPA (Video JEPA), allows a system to learn intuitive physics from observation. When shown a physically impossible event (e.g., an object vanishing), the model's prediction error spikes, indicating it has learned a coherent model of reality. This ability to model the world and predict the outcomes of actions is the foundation for genuine planning and reasoning.

The Crucial Role of Open Source

LeCun concludes by championing open source as the primary engine of progress in AI. He argues that no single company, no matter how large, has a monopoly on good ideas. Innovation is happening globally, as evidenced by foundational work like ResNet (from Beijing) and recent models like DeepSeek. The open-source community allows for a diversity of ideas to be shared and built upon, accelerating progress for everyone. Furthermore, for businesses deploying AI, open-source models like Llama are often cheaper, more secure, and more controllable than proprietary APIs, making them the preferred choice for production systems.

🌐 youtu.be, ai, lecun, video

2001.

Cats are (almost) liquid!—Cats selectively rely on body size awareness when negotiating short openings

www.cell.com/iscience/fulltext/S2589-0042(24)02024-8

Various animal species can make a priori decisions about the passability of openings, based on their own size knowledge. So far no one has tested the ability for self-representation in cats. We hypothesized that cats may rely on their size awareness when they have to negotiate small openings. Companion cats (N = 30) were tested with incrementally decreasing sized openings, which were either the same height, or the same width. Cats approached and entered even the narrowest openings, but they slowed down before reaching, and while passing through the shortest ones. Because of their specific anatomical features and cautious locomotory strategy, cats readily opt for the trial-and-error method to negotiate narrow apertures, but they seemingly rely on their body-size representing capacity in the case of uncomfortably short openings. Ecologically valid methodologies can provide answers in the future as to whether cats would rely on their body awareness in other challenging spatial tasks.

🌐 cell.com, cats, fun

1998.

Bluesky: An Open Social Web - Bluesky

bsky.social/about/blog/02-22-2024-open-social-web

Your data, such as your posts, likes, and follows, needs to be stored somewhere. With traditional social media, your data is stored by the social media company whose services you've signed up for. If you ever want to stop using that company's services, you can do that—but you would have to leave that social network and lose your existing connections.
It doesn't have to be this way! An alternative model is how the internet itself works. Anyone can put up a website on the internet. You can choose from one of many companies to host your site (or even host it yourself), and you can always change your mind about this later. If you move to another hosting provider, your visitors won't even notice. No matter where your site's data is managed and stored, your visitors can find your site simply by typing the name of the website or by clicking a link.
We think social media should work the same way. When you register on Bluesky, by default we'll suggest that Bluesky will store your data.
But if you'd like to let another company store it, or even store it yourself, you can do that. You'll also be able to change your mind at any point, moving your data to another provider without losing any of your existing posts, likes, or follows. From your followers' perspective, your profile is always available at your handle—no matter where your information is actually stored, or how many times it has been moved.

🌐 bsky.social, fediverse, indieweb, social

1996.

Marginalia Search Engine - Marginalia Search

marginalia-search.com

The need for discovery

Nothing you do to try to make the web a better place matters if nobody can find what you did. There are a lot of precious websites out there that deserve an audience, but instead are languishing in obscurity.

This makes alternative discovery mechanisms an urgent priority of the free and independent web, both document search as well as blog and RSS-feed discovery.

🌐 marginalia-search.com, indieweb, search, web

1995.

Opinion | It May Not Be Brainwashing, but It’s Not Democracy, Either - The New York Times

www.nytimes.com/2025/03/18/opinion/andreessen-musk-trump-silicon-valley.html

Google commands 90 percent of the search market. Seven in 10 of all Americans use Facebook. Amazon, Microsoft and Google control two-thirds of the internet’s cloud architecture — if any of it goes down, so does the web. Amazon owns 40 percent of the American e-commerce market.

What’s happening now, in one sense, is that the tech titans who have secured such large swaths of power over the digital world are increasingly comfortable wielding that power, openly, in the “real” world too; the tech oligarchs are becoming the American oligarchs, period, often using leverage from their digital platforms in tandem with their war chests of old-fashioned cash.

🌐 nytimes.com, dystopia, politics, technocracy

1993.

The Fediverse Isn’t the Future. It’s the Present We’ve Been Denied.

www.joanwestenberg.com/the-fediverse-isnt-the-future-its-the-present-weve-been-denied

For years, the internet has been shrinking. Not in size, not in data, but in ownership. A vast, decentralized network of personal blogs, forums, and independent communities has been corralled into a handful of paved prison yards controlled by a few massive corporations. Every post, every “friend,” every creative work—locked behind closed doors, and you don’t have the keys.

The fediverse is a jailbreak. It’s not a product, not a single platform, it’s not something you can buy stock in or use to enrich yourself at the cost of our shared humanity. It’s a network of independent, interconnected social platforms, all running on open protocols like ActivityPub. It’s an ecosystem where you - not some incellionaire obsessed with eugenics - own your digital identity. Where your social graph belongs to you, not an algorithm’s shifting fucking whims. Where moving from one service to another doesn’t mean losing everything you’ve built and everything you’ve ever said.

🌐 joanwestenberg.com, fediverse

1990.

Uiua programming language

www.uiua.org

Uiua is a general purpose array-oriented programming language with a focus on simplicity, beauty, and tacit code.
Uiua has the terseness and expressivity afforded by Unicode glyphs without the need for special keyboard or editor support. Instead, the language comes with a formatter that converts the names of built-in functions into glyphs.

🌐 uiua.org, programming