AI Classic Games (Games)

ChatGPT Just Got 'Absolutely Wrecked' at Chess, Losing to a 1970s-Era Atari 2600 (cnet.com) 84

An anonymous reader shared this report from CNET: By using a software emulator to run Atari's 1979 game Video Chess, Citrix engineer Robert Caruso said he was able to set up a match between ChatGPT and the 46-year-old game. The matchup did not go well for ChatGPT. "ChatGPT confused rooks for bishops, missed pawn forks and repeatedly lost track of where pieces were — first blaming the Atari icons as too abstract, then faring no better even after switching to standard chess notations," Caruso wrote in a LinkedIn post.

"It made enough blunders to get laughed out of a 3rd-grade chess club," Caruso said. "ChatGPT got absolutely wrecked at the beginner level."

"Caruso wrote that the 90-minute match continued badly and that the AI chatbot repeatedly requested that the match start over..." CNET reports.

"A representative for OpenAI did not immediately return a request for comment."


Comments Filter:
  • by JoshuaZ ( 1134087 ) on Saturday June 14, 2025 @12:43PM (#65449311) Homepage
    ChatGPT is not a chess engine. Comparing it to an actual chess system is missing the point. The thing that's impressive about systems like ChatGPT is not that they are better than specialized programs, or that it is better than expert humans, but that it is often much better at many tasks than a random human. I'm reasonably confident that if you asked a random person off the street to play chess this way, they'd likely have a similar performance. And it shouldn't be that surprising, since the actual set of text-based training data that corresponds to a lot of legal chess games is going to be a small fraction of the training data, and since nearly identical chess positions can have radically different outcomes, this is precisely the sort of thing that an LLM is bad at (they are really bad at abstract math for similar reasons). This also has a clickbait element given that substantially better LLM AIs than ChatGPT are now out there, including GPT 4o and Claude. Overall, this comes across as people just moving the goalposts while not recognizing how these systems keep getting better and better.
    • by OrangeTide ( 124937 ) on Saturday June 14, 2025 @12:46PM (#65449319) Homepage Journal

      ChatGPT has flexibility, but it is inferior to both humans and specialized algorithms in nearly all cases.
      The main advantage of ChatGPT is that you only have to feed it electricity instead of a living wage.

      • by gweihir ( 88907 ) on Saturday June 14, 2025 @01:05PM (#65449355)

        The main advantage of ChatGPT is that you only have to feed it electricity instead of a living wage.

With the little problem that you have to feed it so much electricity that paying that wage might still well turn out to be cheaper, even at western standards. At the moment LLMs burn money like crazy and it is unclear whether that can be fixed.

        • The main advantage of ChatGPT is that you only have to feed it electricity instead of a living wage.

With the little problem that you have to feed it so much electricity that paying that wage might still well turn out to be cheaper, even at western standards. At the moment LLMs burn money like crazy and it is unclear whether that can be fixed.

We're going to need several Kashiwazaki-Kariwa-sized or larger reactors to perform what a web search by a random person can do.

          • by gweihir ( 88907 )

            Remember how expensive electricity from nuclear is? That will not solve things...

Also remember that most uranium comes from Kazakhstan (43%), and they border on China and Russia. Not a critical dependency you want. Place 2 is Canada (15%), which the US has just mightily pissed off by sheer leadership stupidity. US domestic? A whopping 0.15%...

            • Remember how expensive electricity from nuclear is? That will not solve things...

Also remember that most uranium comes from Kazakhstan (43%), and they border on China and Russia. Not a critical dependency you want. Place 2 is Canada (15%), which the US has just mightily pissed off by sheer leadership stupidity. US domestic? A whopping 0.15%...

              I don't disagree with any of that. And if we do decide to put ourselves in that position, is this glorified search engine going to be worth it? I don't think so.

That said, I think that before too long, we aren't going to need an entire nuclear generating facility to generate power to feed the tech bro wet dream. A guess, but a half-educated one, with the way innovation tends to work.

        • Businesses often prefer to minimize labor costs even when there's an overall increase to operating costs. Replacing humans with ChatGPT at a 20% markup over labor costs is still going to be an attractive prospect to many MBAs.

        • Alternating Current?
ChatGPT is a language model, and it excels in the production of language. In fact, its capabilities in that regime are far above those of even 80th-percentile humans.

        Whoever thought a language model would be remotely good at chess clearly doesn't understand the technology they're working with.
        • by war4peace ( 1628283 ) on Saturday June 14, 2025 @04:35PM (#65449679)

I wanted to reply to GP with "Now ask the Atari chess program to summarize a 10-page PDF".
          Cherry-picking goes both ways.

          • by taustin ( 171655 )

            Nobody claimed the Atari chess program was capable of anything else.

            ChatGPT is supposed to be able to do anything, including walk the dog.

        • by ceoyoyo ( 59147 )

          ChatGPT is advertised as AI, approaching human level. AI is building machines that exhibit human behaviour and capabilities.

          So they made the thing play a computer chess algorithm and it made excuses and demanded a rematch. Sounds like what most humans with no chess experience would do. It didn't flip the board and stomp off though.

      • but it is inferior to both humans and specialized algorithms in nearly all cases.

        In what way? The OP postulated pulling a random person off the street - a generalised average person. There's a good chance that they don't even know the basic rules of chess or how to make legal moves. That's the OP's point. ChatGPT is that weird friend of yours who somehow is a pub quiz ace, a true walking encyclopedia, yet someone who has no practical skills.

    • by devslash0 ( 4203435 ) on Saturday June 14, 2025 @12:53PM (#65449333)

Well, the way I look at it is that AI models were trained on unchecked data, and they just reheat mistakes made during training because, statistically, mistakes are more common than good moves.

      Garbage in. Garbage out.

A lot of the 'headline' announcements, pro and con, are basically useless; but this sort of thing does seem like a useful cautionary tale in the current environment, where we've got hype-driven ramming of largely unspecialized LLMs as 'AI features' into basically everything with a sales team, along with a steady drumbeat of reports of things like legal filings with hallucinated references, despite a post-processing layer that just slams your references into a conventional legal search engine to see if they r
    • Actually this is a very important result, because it highlights ChatGPT's strength and weakness. It's very good at dredging through vast amounts of text and forming principles of prediction, so that it can fake a human being's speech.

      But it doesn't have any intellectual power at all - which is exactly what chess tests.

"On the chessboard, lies and hypocrisy do not survive long. The creative combination lays bare the presumption of a lie; the merciless fact, culminating in the checkmate, contradicts the hypocrite."

      • That LLM AIs are bad at abstract reasoning of this sort is not a new thing. People have seen that very early on with these systems, such as their inability to prove theorems. If someone thought that an LLM would be good at chess by itself in this situation they haven't been paying attention.
      • But it doesn't have any intellectual power at all - which is exactly what chess tests.

        All hail the Atari 2600, our intellectual power overlord! Right?

Replace ChatGPT or AI with "autocomplete" and all these AI headlines explain themselves.

      Autocomplete loses in Chess!
      Autocomplete makes up references!
      Autocomplete said something stupid!
The difference is that ChatGPT said it was good at chess.
      • by allo ( 1728082 )

        "ChatGPT said"

        I can create a textfile that says it is good at chess. Presented with a chess program, the file will still ... do nothing at all.
Don't assume a program can tell you what it is able to do just because its primary interface is presented to you as a dialogue. It is convenient to use that way, but you're not actually communicating with something; you're only using an interface made understandable to you so you can instruct a neural network to do certain tasks. There is no magic and no

    • And this is a great illustration of why LLMs aren't going to be decimating white-collar jobs.

      Just as ChatGPT is terrible at chess (I'm surprised it could even try to play the game)...LLMs are terrible at doing people's jobs.

      They're great at making up text (often literally making stuff up), but that's a lot different from actually *doing a job.*

      • You shouldn't be surprised that it will try. All of the major LLMs are wildly overconfident in their abilities. I'm not sure if this is more because they've got human reinforcement to be "helpful" or if because they are trained on the internet where there's very rarely a response in the training data of "That's an interesting question, I've got no idea."
    • If ChatGPT (or at least, GPT-4o) can ingest and execute code, why wouldn't it just go online, search for a FOSS chess engine in a language it "understands" (like Python), download it, recognize it as being more adept at solving problems in this specific domain, and execute that chess engine *directly* & present the output as its own?

      The only thing I can think of offhand is that gpt-4o's "firewall" might limit its ability to execute code.
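What the parent describes, recognizing a chess task and handing it to a specialist program instead of answering from the language model itself, is roughly the "tool use" pattern. A minimal sketch of that dispatch idea (all names are hypothetical; a real deployment would call an actual engine such as Stockfish over UCI rather than the stand-in stub here):

```python
# Hypothetical sketch of tool dispatch: route domain-specific tasks to a
# specialist solver instead of generating an answer token-by-token.

def stub_chess_engine(fen: str) -> str:
    """Stand-in for a real engine (e.g. Stockfish via UCI); returns a canned move."""
    return "e2e4"

def llm_answer(prompt: str) -> str:
    """Stand-in for the language model's own free-text generation."""
    return "(free-text answer from the model)"

TOOLS = {"chess": stub_chess_engine}

def respond(task_kind: str, payload: str) -> str:
    # Delegate when a specialist tool exists; otherwise fall back to the
    # general-purpose model.
    tool = TOOLS.get(task_kind)
    if tool is not None:
        return tool(payload)
    return llm_answer(payload)

print(respond("chess", "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w KQkq - 0 1"))
print(respond("smalltalk", "How are you?"))
```

Whether a hosted chatbot is allowed to download and run an arbitrary engine is a sandboxing question, as the parent suspects, not a capability question.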

    • I'm sorry but you're only projecting your wishes here.

As you say, ChatGPT >= random human, but give a random human a day of instruction with a chess teacher (whereas ChatGPT got access to the entire internet's worth of chess discussions for years) and that human >= 3rd grade chess club. But we've just seen now that ChatGPT < 3rd grade chess club. Contradiction! In other words, this news proves (to those who are rational wishful thinkers) that ChatGPT claims about >= random human are full of shit.

      TL;DR. YW. YHL. HAND.

      ;-)

      • Murphy's law strikes again. I forgot that less-than sign must be escaped in HTML. Here's the corrected comment

        I'm sorry but you're only projecting your wishes here.

        As you say, ChatGPT >= random human, but give a random human a day of instruction with a chess teacher (whereas ChatGPT got access to the entire internet's worth of chess discussions for years) and that human >= 3rd grade chess club. But we've just seen now that ChatGPT < 3rd grade chess club. Contradiction!

        In other words, this news

Obnoxious snark aside, it appears that you are missing the point. Yes, ChatGPT is trained on a large fraction of the internet. That's why it can do this at all. What is impressive is that it can do that even without the sort of specialized training you envision. Also, speaking as someone who has actually taught people how to play chess, you are, to be blunt, substantially overestimating how fast people learn.
    • I'll confess to having faked the whole thing.

I used a different chatbot, which shall remain anonymous, and told it: "Submit an article to slashdot about what would happen if chatgpt played against the Atari 2600 chess program."

  • AI (Score:4, Insightful)

    by LainTouko ( 926420 ) on Saturday June 14, 2025 @12:45PM (#65449317)

    This is only news for the kind of people who refer to large language models as "AI".

    Unfortunately, that's quite a lot of people.


    • by Tablizer ( 95088 )

Stop the vocab fight! It's pointless and useless! Every known definition of "AI" and even "intelligence" has big flaws. I've been in hundreds of such debates. No human nor bot has ever proposed a hole-free definition of "intelligence", so go home and shuddup already!

      • Re: (Score:1, Interesting)

        It isn't that we know and can readily define intelligence in a clear and precise manner.

        It is that we know when we're looking at something that clearly isn't intelligent and they call it that anyway.

LLMs are clearly not intelligent, and it is inappropriate to apply any phrase containing the word "intelligence" or its variations when describing such systems.

        • I guess defining intelligence is like what we used to say about porn. "I know it when I see it" ....tee hee....
          Actually, I argue that this is the problem with language. It's vague. Ideas usually start vague, and then only after do you drill down and add details. Like writing pseudo code or specifications for code. This function is called XYZ. It does (blah blah blah...blah blah blah....etc, etc, etc, ad infinitum).

          It is hard to be precise. For example how do you define "art". How about "good?". What is "goo
    • Re:AI (Score:4, Insightful)

      by ThomasBHardy ( 827616 ) on Saturday June 14, 2025 @01:25PM (#65449413)

      This is my pet peeve. AI has been turned into a marketing term for things that are not the traditional definition of AI.
      The term is now corrupted beyond all hope of recovery.
I'm distressed at how much tools like ChatGPT favor seeming intelligent and capable as an illusion, even when lying to you. I've even caught it making a mistake and then blaming me for the mistake, or pretending it meant to do it wrong as a test step. The conman element is real, even down to the tool itself.

      • That are not the traditional definition of AI.

        What IS the traditional definition of AI?

It's been all over the place for years. Back when I was a student, in the very early 2000s, I had a course on AI in the same module as the neural nets lectures. It contained such topics as alpha/beta pruning, A* search, decision trees, expert systems, that kind of thing.

        Further in the past neural networks were definitely considered AI, but by 2000 they were considered as "ML" which was generally treated as something separa
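For anyone who missed that era of AI courses: alpha/beta pruning is ordinary minimax search that skips branches which provably cannot change the result. A minimal sketch on a hand-made toy game tree (the tree and its values are invented purely for illustration):

```python
# Minimax with alpha/beta pruning over a toy game tree.
# Leaves are ints (scores for the maximizing player); internal nodes are lists.

def alphabeta(node, maximizing, alpha=float("-inf"), beta=float("inf")):
    if isinstance(node, int):          # leaf: return its static score
        return node
    if maximizing:
        value = float("-inf")
        for child in node:
            value = max(value, alphabeta(child, False, alpha, beta))
            alpha = max(alpha, value)
            if alpha >= beta:          # remaining siblings cannot matter: prune
                break
        return value
    else:
        value = float("inf")
        for child in node:
            value = min(value, alphabeta(child, True, alpha, beta))
            beta = min(beta, value)
            if alpha >= beta:
                break
        return value

# Depth-3 tree: root maximizes, its children minimize, grandchildren maximize.
tree = [[[3, 5], [6, 9]], [[1, 2], [0, -1]]]
print(alphabeta(tree, True))  # -> 5
```

The same value comes out as plain minimax would give; the pruning only saves work, which is why it was a staple of classic chess engines.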

      • AI: algorithm implemented.

      • by ceoyoyo ( 59147 )

        This is my pet peeve. AI has been turned into a marketing term for things that are not the traditional definition of AI.

        How so? What is the traditional definition of AI? Are you sure you're using the correct one?

      • It's called semantic drift and it's not going back to the old meaning, so I would suggest finding a new pet peeve. Kids on your lawn, perhaps.

    • This is only news for the kind of people who refer to large language models as "AI".

      Unfortunately, that's quite a lot of people.


      Old MacDonald had a LLM farm -

      AI, AI, Oh!,

      And on that farm he had a nuclear plant,

      AI AI Oh!

      With a hallucination here, a wrong answer there, here a fault there a fault, everywhere a bad answer.

      Old MacDonald had a LLM farm

      AI AI Oh!

    • This is only news for the kind of people who refer to large language models as "AI".

      So, ... everyone including people working in the field of AI?

    • by allo ( 1728082 )

      AI is a category that even includes ELIZA. You're thinking of AGI. AI is the category that includes the simplest algorithms, not the category that only includes what you see in sci-fi movies that talk about AI.

    • by Ossifer ( 703813 )

      Eventually people recognize cheap parlor tricks for what they are. Or in this case, massively expensive ones.

    • by taustin ( 171655 )

      This is only news for the kind of people who refer to large language models as "AI".

      Unfortunately, that's quite a lot of people.


      Starting with the marketing droids at the A"I" companies.

  • Some people so want to believe that a useful information retrieval system is a superintelligence.

    The rest of us aren't surprised that an interesting search engine isn't good at chess.

    • by gweihir ( 88907 )

      Some people so want to believe that a useful information retrieval system is a superintelligence.

      The rest of us aren't surprised that an interesting search engine isn't good at chess.

That very nicely sums it up. Obviously, you have to be something like a sub-intelligence to think that LLMs are superintelligent. To be fair, something like 80% of the human race cannot fact-check for shit and may well qualify as sub-intelligence. Especially as most of these do not know about their limitations due to the Dunning-Kruger effect.

    • Hmm:
      - confused rooks for bishops, missed pawn forks and repeatedly lost track of where pieces were
      - first blaming the Atari icons as too abstract, then faring no better even after switching to standard chess notations
      - repeatedly requested that the match start over
      That all rings a bell somewhere - confusion, blaming everything else for the errors, repeatedly requesting a mulligan. That seems familiar.

  • And that beat a state of the art AI? So much for intelligence!
  • No surprise (Score:5, Insightful)

    by gweihir ( 88907 ) on Saturday June 14, 2025 @01:01PM (#65449339)

    To anybody that wants to know, it is already clear that LLMs, including the "reasoning" variant, have zero reasoning abilities. All they can do is statistical predictions based on their training data. Hence any task that requires actual reasoning like chess (because chess is subject to state-space explosion and cannot be solved by "training" alone), is completely out of reach of an LLM.

The only thing surprising to me is that it took so long to come up with demonstrations of this well-known fact. Of course, the usual hallucinators believe (!) that LLMs are thinking machines/God/the singularity and other such crap, but these people are simply delulu and have nothing to contribute except confusing the issue. Refer to the little pathetic fact that about 80% of the human race is "religious" and the scope of _that_ problem becomes clear. It also becomes clear why a rather non-impressive technology like LLMs is seen as more than just better search and better crap, when that is essentially all it has delivered. Not worthless, but not a revolution either, and the extreme cost of running general (!) LLMs may still kill the whole idea in practice.
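The state-space explosion the parent leans on is easy to make concrete. Using Shannon's rough figures of about 35 legal moves per position and about 80 plies per game, a back-of-envelope sketch:

```python
import math

branching = 35   # rough average number of legal moves per chess position
plies = 80       # rough length of a game in half-moves (Shannon's estimates)

# Size of the game tree is branching**plies; compute its order of magnitude.
magnitude = plies * math.log10(branching)
print(f"game tree ~10^{int(magnitude)} lines of play")  # -> ~10^123
```

Whatever the exact constants, the order of magnitude dwarfs any conceivable training corpus, which is the point: chess positions cannot simply be memorized from data.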

    • by Tablizer ( 95088 )

      To anybody that wants to know, it is already clear that LLMs, including the "reasoning" variant, have zero reasoning abilities

      A good many humans don't either. They memorize patterns, rituals, slogans, etc. but can't think logically.

      • Re:No surprise (Score:5, Interesting)

        by gweihir ( 88907 ) on Saturday June 14, 2025 @02:16PM (#65449493)

        To anybody that wants to know, it is already clear that LLMs, including the "reasoning" variant, have zero reasoning abilities

        A good many humans don't either. They memorize patterns, rituals, slogans, etc. but can't think logically.

Indeed. There are a few facts from sociology. Apparently only 10-15% of all humans can fact-check, and apparently only around 20% (including the fact-checkers) can be convinced by rational argument when the question matters to them (goes up to 30% when it does not). Unfortunately, these numbers seem to be so well established that there are no current publications I can find. It may also be hard to publish about this. This is from interviews with experts, personal observations, and observations from friends who also teach at the academic level. ChatGPT at least confirmed the 30% number but sadly failed to find a reference.

        Anyway, that would mean only about 10-15% of the human race has active reasoning ability (can come up with rational arguments) and only about 20-30% has passive reasoning ability (can verify rational arguments). And that nicely explains some things, including why so many people mistake generative AI and in particular LLMs for something they are very much not and ascribe capabilities to them that they do not have and cannot have.

        • To anybody that wants to know, it is already clear that LLMs, including the "reasoning" variant, have zero reasoning abilities

          A good many humans don't either. They memorize patterns, rituals, slogans, etc. but can't think logically.

Indeed. There are a few facts from sociology. Apparently only 10-15% of all humans can fact-check, and apparently only around 20% (including the fact-checkers) can be convinced by rational argument when the question matters to them (goes up to 30% when it does not). Unfortunately, these numbers seem to be so well established that there are no current publications I can find. It may also be hard to publish about this. This is from interviews with experts, personal observations, and observations from friends who also teach at the academic level. ChatGPT at least confirmed the 30% number but sadly failed to find a reference.

          Anyway, that would mean only about 10-15% of the human race has active reasoning ability (can come up with rational arguments) and only about 20-30% has passive reasoning ability (can verify rational arguments). And that nicely explains some things, including why so many people mistake generative AI and in particular LLMs for something they are very much not and ascribe capabilities to them that they do not have and cannot have.

          Thus proving the point by example.

          Most people have faith in something. Since they didn't arrive at that faith by reason how would you expect to get them to change their mind using reason? You are really demanding they give priority to your faith in reason over their other faith.

You have a plate of fruit that includes oranges and grapes. Someone says there are more oranges than grapes. You count the grapes and the oranges and demonstrate that there are, by count, more grapes than oranges. The only way that i

          • by gweihir ( 88907 )

            Thus proving the point by example.

            Most people have faith in something. Since they didn't arrive at that faith by reason how would you expect to get them to change their mind using reason? You are really demanding they give priority to your faith in reason over their other faith.

            And there I can stop reading, because you do not get it. Your simplistic and, frankly, stupid claim is that relying on rational reasoning is "faith". That is, obviously, a direct lie. Now, it is quite possible you are not smart enough to see that.

            • Now, it is quite possible you are not smart enough to see that.

Why don't you (try to) use reason to defend your belief, instead of ad hominems? Of course, like every true believer, anyone who questions your belief is a heretic. There is no rational defense, because your belief isn't rational.

  • Would the average Slashdot reader beat the Atari 2600?
    • by haruchai ( 17472 )

Probably not.
I didn't know about the Atari chess game until a couple of weeks ago, when an old colleague posted on FB that he was struggling with it on the lowest level.
But I did pretty well against Fritz a long time ago, running on a Compaq Armada 7800.

Of course, digging into the details, you find he used GPT-4o for this, which is years behind frontier models like o3 or Gemini 2.5, which use reasoning to think through their responses and can even write Python as part of this process that likely would compete with the Atari system despite not being designed to do so. AI can be criticized, but this ain't it. It's a whole article written up just to cover some guy's LinkedIn post chasing clout and likes. Maybe the reasoning models wouldn't
  • ...for fucking up like ChatGPT can. Take that Atari!

    [ChatGPT] first blaming the Atari icons as too abstract...continued badly and that the AI chatbot repeatedly requested that the match start over

  • by 50000BTU_barbecue ( 588132 ) on Saturday June 14, 2025 @01:19PM (#65449397) Journal

    And her algorithm

An LLM is one of the worst AIs to play chess. I wouldn't be surprised if you'd do better with some greedy algorithm (which is not a good idea in general).
Not all AIs are the same. LLMs are text generators, not chess players.
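To make the parent's "greedy algorithm" remark concrete: even a one-ply greedy rule ("grab the most material right now") is a coherent chess policy, unlike free-text generation, though without lookahead it walks into every trap. A hedged sketch with invented names and a toy move list (real code would get legal moves from an actual board representation):

```python
# One-ply greedy move choice: score each legal move by the immediate
# material gain and pick the maximum. No lookahead, so it is easily
# trapped -- but it never hallucinates an illegal move.

PIECE_VALUE = {"P": 1, "N": 3, "B": 3, "R": 5, "Q": 9}

def greedy_move(legal_moves):
    """legal_moves: dict mapping a move to the captured piece letter (or None)."""
    def gain(move):
        captured = legal_moves[move]
        return PIECE_VALUE.get(captured, 0) if captured else 0
    return max(legal_moves, key=gain)

# Toy position: capturing the queen beats a quiet move or winning a pawn.
moves = {"Nxd5": "P", "Bxf7": None, "Rxe8": "Q"}
print(greedy_move(moves))  # -> Rxe8
```

The contrast with an LLM is the point: the greedy player has a tiny but real model of the game state, while a text generator has only a statistical model of chess talk.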

Firstly, ChatGPT is NOT a gaming or chess engine; secondly, LLMs are not made or designed for the reasoning required to even play chess effectively.
  • Your boss is OK with this. They would rather have a 3rd grader without intelligence or ambition, than give you a raise.
