Tag research

4 bookmarks have this tag.

/

2026-03-07

2944Δ5m Academic

Teaching Computers Humor Is No Laughing Matter

www.dropbox.com/scl/fi/xc7luoj2qee0gikhxndwe/Computational-Humor-v12-inthepocket.pdf?rlkey=6p3362mapvied0l3lwwoyuggp&e=1&st=nakn7ni0&dl=0

Discussion and summary of Dr. ir. Thomas Winters' presentation, "Teaching computers humor is no laughing matter".

The Mechanics of AI and the Challenge of Comedy

To understand why AI struggles with humor, we first have to understand how large language models (like GPT) work. These models operate autoregressively, meaning they write sequentially from left to right. They function essentially like a smartphone's autocomplete feature on steroids, calculating the probability of the next word based on the previous tokens.

However, the structure of a good joke inherently conflicts with this left-to-right generation. According to the "Incongruity-Resolution Theory," a joke consists of a setup and a punchline. The setup creates an obvious interpretation, but the punchline suddenly reveals a hidden interpretation, breaking the audience's expectation. Conceptually, a comedian needs to know the punchline first to reverse-engineer a proper setup. Because an AI starts talking without knowing which punchline it is building towards, its jokes often fall flat.

Prompt Engineering: Teaching AI to be Funny

Just as AI can struggle with basic math logic unless told to "think step by step," it needs structural guidance to write comedy. The presentation demonstrates how providing the AI with "draft paper" significantly improves its comedic output.

To generate a good joke about a company called "InThePocket," the AI was instructed to:

  • First brainstorm associations with the company.

  • List several funny links and write punchlines for them.

  • Write setups for each punchline.

  • Show its reasoning steps, using a provided example about a vacuum cleaner "collecting dust".

By breaking down the process, the AI analyzed concepts like "digital transformation" and "autonomous teams". It reasoned that transformation is like a caterpillar turning into a butterfly, which led to a surprisingly clever punchline: "Some companies say they're going through digital transformation, but it feels more like they're trying to turn into a butterfly and ending up as a moth".

Are Computers as Funny as Comedians?

To test the efficacy of AI humor, an improv battle was conducted in front of an unaware audience.

  • Half of the jokes were read from an AI screen.

  • The other half were improvised by human comedians.

  • The audience rated the jokes, and the results showed that the AI and human comedians were basically equal.

  • The AI even edged out the humans slightly for the "best joke of the night," capturing 35% of the vote compared to the comedians' 30%.

The Serious Need for Computational Humor

Does AI truly need a sense of humor?. The presentation concludes with a striking example of why understanding comedy is a critical safety and accuracy feature for AI.

When asked how to keep cheese from sliding off a pizza, Google's AI recommended mixing about 1/8 cup of non-toxic Elmer's school glue into the sauce. The AI scraped this information from a 12-year-old Reddit comment where a user sarcastically suggested the glue for "extra tackiness". Because the AI completely failed to recognize the joke, it presented a sarcastic internet comment as factual culinary advice.

Summary

  • The Baseline: Early models struggled with humor, though specialized models like RobBERT (2020) achieved around 90% accuracy in detecting Dutch humor, compared to previous models' ~50%.

  • The Structural Flaw: Standard AI models write left-to-right, predicting the next word. This ruins comedy, which requires knowing the punchline in advance to build an incongruous setup.

  • The Solution: Forcing the AI to use "draft paper" to brainstorm associations, plan punchlines, and explain its reasoning drastically improves its joke-writing abilities.

  • The Performance: In blind tests, AI-generated jokes performed equally to, and sometimes slightly better than, human comedians in an improv setting.

  • The Necessity: Teaching computers to understand humor isn't just for entertainment; it is vital to prevent AI from misinterpreting human sarcasm as dangerous factual advice (e.g., the pizza glue incident).

2026-02-27

2922Δ

Emergent Mind: Frontier Research Explorer

www.emergentmind.com
Your first stop to discover and learn about new arXiv research. Detailed paper summaries, video overviews, and more — no prompting required.

2025-07-07

21042m Academic

Evolutionary Computation 'Bestiary'

fcampelo.github.io/EC-Bestiary

The Evolutionary Computation 'Bestiary' is a highly critical and satirical catalog of metaphor-based meta-heuristic search algorithms, intended to document the "exuberance" and proliferation of methods inspired by a vast, eclectic range of natural, supernatural, human, and physical phenomena, including classics like Genetic Algorithms and Ant Colony Optimization alongside numerous examples such as African Buffalo Optimization, Al-Biruni Earth Radius, Black Widow Optimization, COVID-19 variations, Honey Badger Algorithm, Hippopotamus Optimization, and Zombie Survival Optimization. Maintained by Claus Aranha and Felipe Campelo, the resource explicitly disavows the quality of most listed methods, labeling them as "ridiculous" and largely a "useless waste of space," functioning as "The island of Doctor Moreau" to highlight a phase in the field's history over-reliant on metaphor rather than mathematics. The document strongly recommends reading several critical papers that expose the lack of rigor and novelty in many of these metaphor-driven techniques, while algorithms are included based on peer-review publication and naming the metaphor in the title or abstract, listing only the earliest known mention.

2025-07-04

2099Δ8m Academic

The fifth era of science: Artificial scientific intelligence

journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.3003230
Abstract

In 2024, artificial intelligence (AI), for the first time, helped win a Nobel Prize. DeepMind’s AlphaFold cracked one of biology’s hardest puzzles: protein folding, the challenge of predicting how a chain of amino acids twists into the intricate 3D shape that determines its function. Scientists had struggled with this problem for decades. It was crucial for medicine and drug discovery but seemed unsolvable due to the astronomical number of possible protein structures. Then AI delivered the answer.

A game-changer, no doubt. But it also raises the question: what does this mean for science and for scientists? Is traditional scientific inquiry becoming obsolete? Are we approaching a future where algorithms are the primary drivers of discovery, relegating humans to the sidelines?

Throughout history, every breakthrough technology has redefined how discoveries were made, marking four distinct eras of science [1] (Fig 1). The first, the empirical era, relied on direct observation, as Copernicus challenged the Earth-centered view of the universe by observing the skies. The second, the theoretical era, introduced mathematics to predict nature, like Newton’s equations of motion that shaped physics for centuries. The third, the computational era, which began in the 1950s, harnessed computers to simulate complex systems, leading to Kohn and Pople’s quantum chemistry Nobel Prize. The fourth, the data-driven era of our 21st century, uses machine learning to extract patterns from vast datasets, with AlphaFold solving protein structures by learning from the protein data bank [2].

Today, we stand at the doorstep of the fifth era of science—the artificial scientific intelligence era—where companies like Google, Lila Sciences, and Sakana are unveiling AI scientists that not only assist research but drive discoveries, generate hypotheses, and test them on their own [3–5] (Fig 1). Hence, why not let AI run the show from here?

In some fields, perhaps we can. In chemistry, organic synthesis—the process of assembling complex drug-like molecules from basic building blocks—is now guided by interpretable AI models that help scientists plan each step [6]. In materials science, generative AI can design novel inorganic compounds with tailored mechanical, electronic, and magnetic properties, accelerating innovation with minimal human tuning [7]. These are domains where experimental feedback is relatively tractable, simulations are mature and the data is plentiful and structured. In short, these fields provide ideal conditions for autonomous AI exploration.

But in many other areas, letting an AI run the show today would be like sending a self-driving car down a dirt road with half a map and no GPS. AI might have the horsepower, but it still needs humans to steer it around the pitfalls of specialized scientific data. Nowhere is this clearer than in biomedical imaging, where highly curated datasets are nothing like what traditional large vision models are trained on.

First, biomedical imaging datasets are often tiny by AI standards, and for good reason: collecting them requires technical equipment and trained professionals; labeling them demands significant time and expert input; and strict privacy regulations often limit access. MedPix, a leading medical imaging database, contains just 59,000 images and the Allen Cell Feature Explorer, one of the largest publicly available collections of high-resolution 3D images of human stem cells, only around 32,000 images. That is about a thousand times fewer than what is needed for AI to perform. This is where scientists step in.

Scientists are redefining AI to do more with less, helping algorithms find meaning in images even when data are scarce. One approach involves using mathematical insights to redesign the core building blocks of neural networks. Traditional models fall apart when we strip away their layers or parameters, but these new architectures stay strong—even with just a single layer and two convolutional filters [8]—precisely because they are built to thrive on small data. And, scientists do not just bend the design of the model to fit the lack of data, they also reimagine the data ecosystem to power the model; they decide what data to collect, how to collect it, and how to weave together existing, but fragmented, specialized datasets to train AI models for a wide variety of tasks, including brain tumor classification or diabetic retinopathy grading [9].

But scientific data is not just scarce, it is often noisy. Cryo-electron microscopy (cryo-EM), a Nobel Prize-winning technology that lets us see the invisible [10]—revealing molecules at the tiniest scale—produces incredibly blurry images, where the important details are 100 times weaker than the noise. It is like trying to recognize a friend in a crowd while wearing someone else’s prescription glasses. This stands in stark contrast to the crisp, high-resolution images—like street scenes, faces, or everyday objects—that traditional AI vision models are trained on.

Yet scientists have techniques to extract meaning from even the noisiest images. In cryo-EM, they can reconstruct the 3D shapes of molecules buried in noise; for example, providing the first high-resolution images of SARS-CoV-2 during the COVID-19 pandemic [11,12]. Today, they are combining that hard-won expertise with the power of AI. One breakthrough pairs a powerful denoising module with a foundation model, enabling AI to tackle the notoriously difficult processing steps of cryo-EM images [13]. Crucially, this was only possible because scientists also applied their domain expertise to curate a high-quality dataset by cleaning, annotating, and aggregating 529 verified cryo-EM datasets into one large training set that AI could learn from.

It is clear that AI presents an enormous opportunity for science, potentially the most powerful tool we have ever had in our arsenal. But the fifth era of artificial scientific intelligence is not void of human scientists: quite the opposite. In many ways, the future of revolutionary discoveries lies in this synergy: human expertise guiding AI, and AI augmenting human expertise. It is as if we have hired the most overachieving and wildly enthusiastic intern; one who works at superhuman speed, never sleeps, and eagerly devours mountains of data. They hold exceptional potential, but without proper guidance anchored in scientific knowledge, they are more likely to set the lab on fire than to push science forward.

Instead of hoping AI will magically handle limited, noisy, specialized data, we need experts to tailor algorithms to the realities of fields like biology and medicine, and to tailor data to the new requirements of the AI technology. To enter the fifth era of science, we need to equip researchers with AI expertise, AI experts with domain knowledge, and universities with interdisciplinary programs. The labs that thrive will be those where domain experts and AI specialists work in sync or where scientists master both. The next scientific revolution will come from teams who can judiciously steer AI, knowing when to trust it, when to adjust its course, and when to drive it into uncharted territory.