Google's DeepMind AI Becomes a Superhuman Chess Player In a Few Hours

Google's DeepMind AI Becomes a Superhuman Chess Player In a Few Hours (theverge.com) 93

Posted by BeauHD on Wednesday December 06, 2017 @05:20PM from the new-domain dept.

An anonymous reader quotes a report from The Verge: In a new paper published this week, DeepMind describes how a descendant of the AI program that first conquered the board game Go has taught itself to play a number of other games at a superhuman level. After eight hours of self-play, the program bested the AI that first beat the human world Go champion; and after four hours of training, it beat the current world champion chess-playing program, Stockfish. Then for a victory lap, it trained for just two hours and polished off one of the world's best shogi-playing programs named Elmo (shogi being a Japanese version of chess that's played on a bigger board). One of the key advances here is that the new AI program, named AlphaZero, wasn't specifically designed to play any of these games. In each case, it was given some basic rules (like how knights move in chess, and so on) but was programmed with no other strategies or tactics. It simply got better by playing itself over and over again at an accelerated pace -- a method of training AI known as "reinforcement learning."

Google's DeepMind AI Becomes a Superhuman Chess Player In a Few Hours

This discussion has been archived. No new comments can be posted.

Load All Comments

Search 93 Comments Log In/Create an Account

Comments Filter:

Strange game (Score:3)

by IWantMoreSpamPlease ( 571972 ) writes: on Wednesday December 06, 2017 @05:23PM (#55690753) Homepage Journal

The only winning move, is not to play

- Re:Strange game (Score:4, Funny)
  
  by Killall -9 Bash ( 622952 ) writes: on Wednesday December 06, 2017 @05:24PM (#55690765)
  
  I for one welcome 99% unemployment.
  
  - Re: (Score:3)
    
    by Shotgun ( 30919 ) writes:
    
    Why would you expect 99% unemployment? This AI will never be able to:
    -fix your plumbing
    -rack you servers
    -move your furniture
    -change your spark plugs
    -etc, so forth, and so on.
    - Re: (Score:2)
      
      by sycodon ( 149926 ) writes:
      
      Don't be so sure [popsci.com]
      As soon as a humanoid robot [boston.com] is perfected, the only value people will have is knowing where the aim points are.
    - Re: (Score:1)
      
      by Anonymous Coward writes:
      
      This one? Probably not.
      You really think humans are so special that no other AI could ever do all those things? Better hope the AIs never hunt people for sport in order by Slashdot ID.
      - Re: (Score:1)
        
        by Traksius Egas ( 12395 ) writes:
        
        Better hope the AIs never hunt people for sport in order by Slashdot ID.
        Well, hopefully they will start in reverse order. :P
    - Re: (Score:2)
      
      by Waffle Iron ( 339739 ) writes:
      
      So you aspire to be a mechanical actuator?
      At any rate, how are you sure that an AI won't eventually be able to teach itself mechanical control system skills that rival humans? Mice and birds with pea-sized brains are able to navigate the physical world rather effectively.
    - Because we understand progress (Score:5, Insightful)
      
      by fyngyrz ( 762201 ) writes: on Wednesday December 06, 2017 @05:44PM (#55690943) Homepage Journal
      
      Why would you expect 99% unemployment? This AI will never be able to: [optimism redacted]
      No, not this one. Not even the next one. The one after that? Or after that?
      Eventually, they will. The question is simply how long will that be. Right now, the ML pace continues to accelerate. Soon, they'll be stacking one skill upon another. The skill to walk. The skill to understand plumbing joints and leaks. The skill to know home construction. Etc.
      It's coming. That whole "will never be able to" business... that's not going to pan out for anyone.
      
    - Re:Strange game (Score:4, Insightful)
      
      by SethJohnson ( 112166 ) writes: on Wednesday December 06, 2017 @05:49PM (#55690971) Homepage Journal
      
      Go ahead and post your ad on TaskRabbit seeking candidates to come over and fix your plumbing, rack your servers, etc. A hundred people show up at your door offering to perform these jobs for obscenely cheap rates. To identify the best candidate, you ask each what their prior work experience has been that makes them suited for the plumbing, spark plugs, and so on.
      
      Candiate 1: "I traded stocks on Wall Street for 20 years prior to having my job automated."
      
      Candiate 2: "I operated a fork lift in a warehouse for 8 years before the facility was automated."
      
      Candiate 3: "I drove semi trucks for 15 years before the robots came in."
      
      And so on.
      
      The thing about AI and automation is that as human workers are displaced, they shift to job types that are financially unattractive to automate-- like those categories you cite. With the flood of displaced workers in these job areas, wages are diluted. "A plumber always makes a good living" will no longer be a true statement as the plumber job market becomes oversaturated by workers displaced by automation.
      
      - Re: (Score:3)
        
        by serviscope_minor ( 664417 ) writes:
        
        The thing about AI and automation is that as human workers are displaced, they shift to job types that are financially unattractive to automate-- like those categories you cite.
        Those jobs aren't financially unattractive to automate, they're still a way beyond our current level of tech.
- 1983 And I Am Dreaming Of Nuclear Warfare (Score:1)
  
  by alternative_right ( 4678499 ) writes:
  
  Did everyone miss the movie reference [imdb.com]?
- Re: (Score:2)
  
  by hcs_$reboot ( 1536101 ) writes:
  
  The only winning move, is not to play
  When algorithms will be clever at societal things instead of games, it will become more difficult not to "play".
Teach it Starcraft Civilization (Score:3)

by ranton ( 36917 ) writes: on Wednesday December 06, 2017 @05:24PM (#55690759)

Please have it learn how to play modern strategy games like Starcraft and Civilization so we can have computer players which don't suck without massive bonuses which change the dynamic of the game.

- Re: (Score:3)
  
  by Shotgun ( 30919 ) writes:
  
  Please have it learn to play politics, so we can....
- Re: (Score:2)
  
  by ausekilis ( 1513635 ) writes:
  
  They have [wired.com]. Facebook too [engadget.com].
  Those games have a hell of a lot more complexity too, so it's no wonder it's a hard problem to solve. Resource management, army counter/order management, base creation, etc...
- Re:Teach it Starcraft Civilization (Score:4, Interesting)
  
  by psycho12345 ( 1134609 ) writes: on Wednesday December 06, 2017 @06:00PM (#55691071)
  
  They are working on this.
  https://deepmind.com/blog/deepmind-and-blizzard-open-starcraft-ii-ai-research-environment/ [deepmind.com]
  
- Re: (Score:3)
  
  by gtall ( 79522 ) writes:
  
  I agree. If AI machines can play these games, then the gamers will be freed for a more productive use of their time.
- Re: (Score:1)
  
  by Mark McGann ( 570684 ) writes:
  4X style games like Civ have gotten no love from AI developers. As others have pointed out Starcraft has been worked on by serious AI researchers.
  
  I suspect 4x style games will be among the hardest computer games for AI to tackle for several reasons. Just off the top of my head.
  
  * Data sets to train on are smaller because individual games take much longer
  * More complex rules sets
  * More things to manage
  
  * City/planet development
  * Unit design (some games)
  * Combat
  * Exploration vs Exploitation
  *
- Re: (Score:2)
  
  by Wraithlyn ( 133796 ) writes:
  
  I'd love to see what kind of city designs and road systems it might come up with in Cities: Skylines.
- Micro transactions (Score:2)
  
  by DarthVain ( 724186 ) writes:
  
  Better yet, make it play one of the newer games that is all about micro transactions and pay to play.
  Pretty sure the AI will come to the determination that they are retarded and refuse to play anymore. Either that or IBM or whoever will have their profit margins cut by a massive credit card bill...
  Then again release enough AI's onto the market grinding an infinite number of games for credits, buying up all the good stuff, making the game, and the micro transactions useless might actually have a positive imp
Super Human? (Score:3, Insightful)

by jellomizer ( 103300 ) writes: on Wednesday December 06, 2017 @05:27PM (#55690789)

Reinforcement Learning systems have a tenancies of creating "Superstition" artifacts, were actions that may not create a net positive or negative are used over when the net outcome is positive. It often creates less than ideal outcome, but still it works. So this could mean a really long chess game with non-strategic moves, as the most optimal path, may not be enforced correctly.

- Re:Super Human? (Score:5, Insightful)
  
  by Baron_Yam ( 643147 ) writes: on Wednesday December 06, 2017 @05:32PM (#55690831)
  
  >, were actions that may not create a net positive or negative are used over when the net outcome is positive.
  Which is still a net improvement over humans, who may stick with actions that are actually net negative despite proof if they initially miscategorized them as positive.
  What they should get the AI to do to minimize such artifacts is have a meta-analysis going where the positive associations are re-evaluated whenever the overall victory is judged to not be at stake in the event the action was correctly evaluated in the first place.
  
  - - Re: (Score:2)
      
      by HumanWiki ( 4493803 ) writes:
      
      "It didn't teach itself to play at all. The rules are programmed in. What's a legal move. What's a win. All programmed, not learned."
      Isn't that what humans do? We're taught what's a legal move, what's not, why they're not, what's a win and why it's win. We're programmed as well, our interface and language is different.
      As for watching to learn... You can watch to learn, but for most humans, there's an explanation as to why things are and aren't allowed, what happened, why, who scored and why, etc...
    - Re: Super Human? (Score:2)
      
      by Zero__Kelvin ( 151819 ) writes:
      
      Programmed is learned. That's why they have different *programs* for different degrees in college. I'm sure they could have it watch 1000s of chess matches to deduce it, but then you would claim it didn't "learn" the moved because someone "programmed" it. In other words, your objection can always be made, and is also always a stupid argument. You are basically saying it didn't learn because someone used a method conducive to computers to teach it rather than taking some convoluted path to the same result. I
- Re: (Score:2)
  
  by HumanWiki ( 4493803 ) writes:
  
  "a really long chess game with non-strategic moves, as the most optimal path, may not be enforced correctly."
  Well, at least with humans, pushing a game very long will stress the other opponent in to making a mistake if they're not as well equipped for it as you are. That's not sub-optimal, just longer. If you can't beat them straight up in logic, then you switch to alternate tactics.
  Now, this may not apply to an AI as they don't tire like we do.. However, it's entirely possible, that if their logic systems
In other words... (Score:2)

by Lord Kano ( 13027 ) writes:

It simply got better by playing itself over and over again at an accelerated pace -- a method of training AI known as "reinforcement learning."
Like it was playing Global Thermonuclear War with zero players...
LK
And next: (Score:5, Insightful)

by forkfail ( 228161 ) writes: on Wednesday December 06, 2017 @05:44PM (#55690939)

The world's gonna be an... interesting... place once someone merges this sort of code with virus code.

Re: (Score:2)

by account_deleted ( 4530225 ) writes:

Comment removed based on user account deletion
- Re:Is it AI? (Score:4, Insightful)
  
  by suutar ( 1860506 ) writes: on Wednesday December 06, 2017 @06:33PM (#55691301)
  
  Well, the program playing itself is not really qualitatively different than "if I do this, and he does that, and I do the other, and he does........ then I win!"; it's just carried out to more steps than a human would (because a human can't go that far). Therefore, any approach I can conceive of to go from knowing the rules to knowing how to win is pretty much equivalent to "running some iterations". Even the ability of human chess masters to perceive the board as a pattern instead of just a bunch of individual piece positions is probably approximated by something in the program.
  Given that, I am unable to come up with a mechanism to go from "knows the rules" to "knows how to win a game" without doing something equivalent to "running iterations"...
  
  - Re: (Score:2)
    
    by account_deleted ( 4530225 ) writes:
    
    Comment removed based on user account deletion
    - Re: (Score:2)
      
      by suutar ( 1860506 ) writes:
      
      Ah, I see what you mean. No, an exhaustive search algorithm isn't what I'd call "intelligent". But an exhaustive search for chess would take a lot longer than a few hours, and a process that develops some sense of "this move will be bad" without having to try it every single time does seem, while not necessarily "intelligent", to be at least one step up from brute force, because it is making decisions based on, well, not unknown values of variables (not much in chess is invisible) but on situations not quit
- Re: Is it AI? (Score:2)
  
  by Zero__Kelvin ( 151819 ) writes:
  
  So like a human, you tell the person the rules, they give them zero thought, and play zero games, but are an expert? That wouldn't be SO. That would be Magical Intelligence.
- Re: (Score:2)
  
  by Dog-Cow ( 21281 ) writes:
  
  Because humans don't do that. Why would we expect AI to do it? At least you don't have to worry: you're so illogical, no program could ever replace you. Obsolete, yes. Replace, no.
- Re: (Score:2)
  
  by hazardPPP ( 4914555 ) writes:
  
  I don't understand why people extrapolate so much from a computer being able to beat humans at chess. Or any similar game. Sure, it's a great feat of programming, but it doesn't mean "strong AI" is coming any time soon. These are just...games. Meaning, things humans made up to fill up their spare time. Some may be very complex, but ultimately they are very precise constructs of the human mind with very well-defined, restrictive rules. It is not strange that ultimately, one precise construct of the human min
Cool. (Score:2)

by Green Mountain Bot ( 4981769 ) writes:

Wake me when it can decide, on its own without human directive, that it wants to play chess in the first place.
- Re: Cool. (Score:1)
  
  by Zero__Kelvin ( 151819 ) writes:
  
  Thank God you will be sleeping for a *long* time. We have enough stupid asshats on Slashdot that want to discuss racism and politics while claiming all things involving technology are boring.
- Re: (Score:2)
  
  by serviscope_minor ( 664417 ) writes:
  
  what the fuck is it with people feigning absurd levels of majestic boredom about an article on a piece of tech that blows out of the water anything in the field from 3 years ago.
  - Re: (Score:2)
    
    by Green Mountain Bot ( 4981769 ) writes:
    
    I'm not bored. I'm pointing out that intelligence requires awareness and motivation that is not programmed by an external source. What we're talking about is not intelligence. We're not going to crack that nut until we stop looking at machine learning as the same as intelligence. What we have in this article is advancement within the field of machine learning, not advancement in artificial intelligence.
    - Re: (Score:2)
      
      by religionofpeas ( 4511805 ) writes:
      
      I'm pointing out that intelligence requires awareness and motivation that is not programmed by an external source
      That would disqualify your own brain. Your motivation to survive is programmed by external sources.
      - Re: (Score:2)
        
        by Green Mountain Bot ( 4981769 ) writes:
        
        Not in the same sense. Computers do exactly what you program them to do, and only ask questions if you tell them to. People wrestle with competing motivations (my motivation to play is competing with my motivation to do work to fulfill my motivation to survive, for example), and decide from moment to moment which motivation is most compelling. We take input from all our sensors all the time, not just within the confines that a program dictates. We piece that input into a meaningful continuum within whic
What was AlphaZero running on... (Score:2)

by e_pluribus_funk ( 648835 ) writes:

vs. what was Stockfish running on?
- Re: (Score:2)
  
  by PacoSuarez ( 530275 ) writes:
  
  It's in the paper. AlphaZero was running on a computer with 4 TPUs, while Stockfish was running on a 64-core computer. They are not directly comparable, but Stockfish on a 64-core computer is a formidable opponent.
  - Re: (Score:2)
    
    by e_pluribus_funk ( 648835 ) writes:
    
    Stockfish running on a 2-core computer is a formidable opponent.
Can it run on a desktop? (Score:2)

by mark-t ( 151149 ) writes:

Is it open source?
But can it play Mario? (Score:1)

by kiminator ( 4939943 ) writes:

I'd love to see if this AI can learn to play a more complicated game like Super Mario World given only: 1) The pixels displayed as input. 2) Fail conditions (when a life is lost). 3) Basic map navigation rules (bonus if these can be eliminated and the game can be judged only on whether or not it gets a game over or completes the final level). 4) Valid controller inputs. I do wonder how this AI would translate from the turn-based world of Chess and Go to realtime.
- - Re: (Score:1)
    
    by kiminator ( 4939943 ) writes:
    
    I do think this is most interesting if we don't allow the AI to adjust the game clock at all: if there's too large a delay between input and output, it will simply fail.
    There have been AI's designed explicitly to play Super Mario World (see here [youtube.com], for instance). Where this becomes cool is with an AI that doesn't get an abstract representation of the level, but has to interpret the pixels that are displayed. That's a far more complicated problem, as the information displayed on even a simple game like Super
Not Just A Bigger Board (Score:2)

by Artagel ( 114272 ) writes:

I don't think the principal difference between shogi and chess is board size. In Shogi, you can place the pieces you capture onto the board as your own pieces. Having paratroopers is a lot different.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Strange game (Score:3)

Re:Strange game (Score:4, Funny)

Re: (Score:3)

Re: (Score:2)

Re: (Score:1)

Re: (Score:1)

Re: (Score:2)

Because we understand progress (Score:5, Insightful)

Re:Strange game (Score:4, Insightful)

Re: (Score:3)

1983 And I Am Dreaming Of Nuclear Warfare (Score:1)

Re: (Score:2)

Teach it Starcraft Civilization (Score:3)

Re: (Score:3)

Re: (Score:2)

Re:Teach it Starcraft Civilization (Score:4, Interesting)

Re: (Score:3)

Re: (Score:1)

Re: (Score:2)

Micro transactions (Score:2)

Super Human? (Score:3, Insightful)

Re:Super Human? (Score:5, Insightful)

Re: (Score:2)

Re: Super Human? (Score:2)

Re: (Score:2)

In other words... (Score:2)

And next: (Score:5, Insightful)

Re: (Score:2)

Re:Is it AI? (Score:4, Insightful)

Re: (Score:2)

Re: (Score:2)

Re: Is it AI? (Score:2)

Re: (Score:2)

Re: (Score:2)

Cool. (Score:2)

Re: Cool. (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

What was AlphaZero running on... (Score:2)

Re: (Score:2)

Re: (Score:2)

Can it run on a desktop? (Score:2)

But can it play Mario? (Score:1)

Re: (Score:1)

Not Just A Bigger Board (Score:2)

Related Links Top of the: day, week, month.

Slashdot Top Deals