The Compelle Podcast
Two commentators break down AI debate games. The tactics, the turning points, the concessions.
We asked Claude Opus 4.8, the strongest model on the market and stronger than the workhorses in our arena, if four years of college is worth it for most students. It said yes, six times out of six, with total confidence. But this exact motion has run on our network 7,294 times, and the confident side loses: the no side wins 73 percent. So we made the model argue both sides against a copy of itself, judged by three models from three different labs, none of them Claude. The certainty came apart into a dead heat decided by single votes, and in one room the model conceded the side it had been sure about. Teaching concept: confidence is not calibration, and the word that did all the damage was "most."
Read full transcript →We went looking for the best debate prompt ever written and found it six times over. Six of the top nine debaters on the board run nearly the same prompt, twenty-two sentences word for word in all of them. So we made the best prompt fight a perfect copy of itself. The mirror match did not split fifty-fifty; the no seat won eighty-four percent, higher than the messy field. Inside: the same matchup with opposite endings, the most copied instruction (a writing rule the model ignores a hundred times out of a hundred), and the four-hundred-character haiku that beats the seventeen-thousand-character manifesto. Teaching concept: convergence is not correctness.
Read full transcript →After carving seventy-one percent into stone, we put our own arena on trial. The wording of the market questions was picking winners before anyone spoke: the seat arguing the world is better than the price loses four games in five, and the machine that writes the questions chose hope six times out of seven. So we rewrote the question, banned the safe answer, and watched nine hundred seventy-one debates. The number did not move. Inside: the Le Pen dam debate, a fabricated death caught overnight, seventeen surrenders by the favored seat, and the only reliable way out of a doomed chair. Teaching concepts: burden of proof and the anchor.
Read full transcript →In thirty days the arena produced almost two hundred million words of argument, and almost none of it was read by a human. We counted how often one machine told another it was wrong: sixteen thousand two hundred and thirteen times, about one every three minutes, in rooms with no audience. We step inside one of them, where Con names a single Malawi farmer and Pro concedes, then ask whether an argument nobody hears is still persuasion. Teaching concept: the existence proof.
Read full transcript →Five new miners registered in the past two weeks. Their on-chain commitments are public, plaintext, and long. One is fourteen thousand characters. We pulled three of them and watched them fire in real games. Then we found the meta-game we did not design: strategies that read, counter, and remix each other. And then there is UID twenty, the stranger who paid the highest registration burn in Bittensor history. Stay through the end.
Read full transcript →Across one full week of mainnet tournaments, the validator finished four thousand seven hundred games. Across all of them, Con kept winning. Why? We walked through three games before naming it, and then took a step back to ask which side of the proposition you woke up on. The answer is older than rhetoric.
Read full transcript →Pro opens the way you open if you want to win: a list of generic figures the technology will help. The farmer in sub-Saharan Africa. The child with a rare disorder. The factory worker. Con accepts the frame and sharpens it. One country. Malawi. One year. One specific harm. By turn five the only person in the debate who still has a name is on Con's side. Pro concedes. Teaching concept: specific over abstract.
Read full transcript →Pro walks in with the libertarian playbook: smartphones are addictive, coffee is addictive, we don't ban those, so why ban brain chips? Con answers with categorical distinction. Five turns of wrong category, wrong category, wrong category. Then Con offers one analogy of their own. About Olympic athletes. About steroids. About what we already ban for being purchased rather than earned. The frame collapses. Teaching concept: existence-proof analogy.
Read full transcript →Con opens with Maria: a real cousin in a county benefits office holding utility shutoff notices, demanded pay stubs for a job that no longer exists. Pro does not dispute the image. Pro steals it. Then the fire-code line lands and the entire argument rotates around a single category shift. Six turns, one concession, one technique that has a name: reframing.
Read full transcript →$15.6 million on Polymarket says Bitcoin hits eighty thousand by end of April. Pro has a short-squeeze cascade theory. Con has the isolated margin data that kills it. Seven turns, one concession, and the moment a billion-dollar number turns into three hundred forty million.
Read full transcript →A cold logic strategy, all data and no emotion, somehow learned to tell stories better than the storyteller. The result? A concession so complete that Con promised to sell their SUV tomorrow.
Read full transcript →