DeepMind's Go-Playing AI Doesn't Need Human Help To Beat Us Anymore (theverge.com)

An anonymous reader quotes a report from The Verge: Google's AI subsidiary DeepMind has unveiled the latest version of its Go-playing software, AlphaGo Zero. The new program is a significantly better player than the version that beat the game's world champion earlier this year, but, more importantly, it's also entirely self-taught. DeepMind says this means the company is one step closer to creating general purpose algorithms that can intelligently tackle some of the hardest problems in science, from designing new drugs to more accurately modeling the effects of climate change. The original AlphaGo demonstrated superhuman Go-playing ability, but needed the expertise of human players to get there. Namely, it used a dataset of more than 100,000 Go games as a starting point for its own knowledge. AlphaGo Zero, by comparison, has only been programmed with the basic rules of Go. Everything else it learned from scratch. As described in a paper published in Nature today, Zero developed its Go skills by competing against itself. It started with random moves on the board, but every time it won, Zero updated its own system, and played itself again. And again. Millions of times over. After three days of self-play, Zero was strong enough to defeat the version of itself that beat 18-time world champion Lee Se-dol, winning handily -- 100 games to nil. After 40 days, it had a 90 percent win rate against the most advanced version of the original AlphaGo software. DeepMind says this makes it arguably the strongest Go player in history.
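
For a feel of the loop described above, here is a minimal self-play sketch in Python. It is not DeepMind's method (AlphaGo Zero pairs Monte Carlo tree search with a deep residual network); the game of Nim, the value table, and the constants below are toy stand-ins, chosen only to show the "start random, play yourself, reinforce what won, repeat" pattern.

    import random
    from collections import defaultdict

    N_STONES = 21
    q = defaultdict(float)   # learned value of (stones_left, take) for the mover
    EPS, LR = 0.1, 0.05      # exploration rate, learning rate

    def pick(stones):
        # Epsilon-greedy: usually the best-known take, sometimes a random one.
        moves = [t for t in (1, 2, 3) if t <= stones]
        if random.random() < EPS:
            return random.choice(moves)
        return max(moves, key=lambda t: q[(stones, t)])

    for episode in range(50_000):    # "millions of times over", scaled down
        stones, history, player = N_STONES, {0: [], 1: []}, 0
        while stones > 0:
            take = pick(stones)
            history[player].append((stones, take))
            stones -= take
            if stones == 0:
                winner = player      # whoever takes the last stone wins
            player ^= 1
        # Reinforce the winner's moves, discourage the loser's.
        for s, t in history[winner]:
            q[(s, t)] += LR * (1.0 - q[(s, t)])
        for s, t in history[1 - winner]:
            q[(s, t)] += LR * (-1.0 - q[(s, t)])

    # The learned policy should leave the opponent a multiple of 4 stones.
    print([max((t for t in (1, 2, 3) if t <= s), key=lambda t: q[(s, t)])
           for s in range(1, 10)])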

  • by sconeu ( 64226 ) on Wednesday October 18, 2017 @03:03PM (#55392083) Homepage Journal

    I, for one, welcome our new Go-playing robotic overlords.

  • > After three days of self-play, Zero was strong enough to defeat the version of itself that beat 18-time world champion Lee Se-dol, winning handily -- 100 games to nil. After 40 days, it had a 90 percent win rate against the most advanced version of the original AlphaGo software.

    So after 3 days it had a 100% win rate, yet after 40 days it had only a 90% win rate.
    • by zlives ( 2009072 )

      I think they are saying that perhaps the "most advanced version" of AlphaGo is better than the version that beat Lee...

      What I want to know is what happens when it plays with itself... also, how does it feel when it wins?

    • No. It happened to have an initial win-streak of 100 games to zero. That doesn't guarantee a win is going to happen every single time.

      There's no information given on how long it took to play the first 100 games. It might have been hours, or a few days, or a week. Continuing to play against the original software for 40 days, it eventually trended towards an overall winning percentage of 90%. We don't know how many games were played across those 40 days, it could have been thousands. Also perhaps the original software was improving its own play at the same time?

      • by Anonymous Coward

        No. It happened to have an initial win-streak of 100 games to zero. That doesn't guarantee a win is going to happen every single time. There's no information given on how long it took to play the first 100 games. It might have been hours, or a few days, or a week. Continuing to play against the original software for 40 days, it eventually trended towards an overall winning percentage of 90%. We don't know how many games were played across those 40 days, it could have been thousands. Also perhaps the original software was improving its own play at the same time?

        Why speculate? The paper says there is AlphaGo Lee, which AlphaGo Zero defeated after three days of training, and AlphaGo Master, which took forty days of training to beat 90% of the time.

      • by mlyle ( 148697 )

        What happened was, Google made a version of AlphaGo that beat Lee Se-dol. Call this AlphaGo One. It won, but the games were at least close.

        Then Google updated it and had it play lots and lots of top players, and it trounced them all. Call this AlphaGo Two.

        Then they did this new version, AlphaGo Zero. Early in training, Zero beat AlphaGo One 100 games to 0. Late in training, it defeated AlphaGo Two 90% of the time.

  • Not impressed, doesn't prove anything, and why should anyone even care?
    • by Anonymous Coward on Wednesday October 18, 2017 @03:27PM (#55392225)

      That's just what an AI would say, Mr. Schumann

    • by MrDozR ( 1476411 ) on Wednesday October 18, 2017 @03:44PM (#55392323)

      Not impressed, doesn't prove anything, and why should anyone even care?

      Maybe because they're not trying to prove anything? Maybe their actual goal is to improve general purpose algorithms by an iterative approach? Like it says in the article. Which you read of course.

    • Well, the TotallyFake non-AI is coming for your jobs regardless of how impressed you are.

  • Sorry, I doubt there is any chance that a neural network can be used in a meaningful way in drug design or climate modeling.

    • Sorry, I disagree.

    • Re: (Score:2, Insightful)

      by Lisandro ( 799651 )

      We're all neural networks designing drugs and climate models.

      • Re: (Score:2, Insightful)

        by Anonymous Coward

        Though it's true that we're neural networks, we aren't the same neural networks as these are. Remember folks, "neural network" in the sense of AI is a marketing term, it does not in any way imply that it functions in a manner similar to how our brains work. Fact is, we have no idea how our brains work. We know what certain parts are responsible for, but no idea how they do it. If anybody claims to know, then please ask them to describe in detail how memory is encoded in our brains, and have them demonstrate by altering a memory in a predetermined way.

        • "Remember folks, "neural network" in the sense of AI is a marketing term, it does not in any way imply that it functions in a manner similar to how our brains work."

          Neural network in the sense of AI is in fact, at its core, an implementation of a mathematical replication of at least part of how our brains work.

          "If anybody claims to know, then please ask them to describe in detail how memory is encoded in our brains, and have them demonstrate by altering a memory in a predetermined way."

          We can't even do that.
        • Though it's true that we're neural networks, we aren't the same neural networks as these are.

          Indeed. The machine's version is better, as demonstrated by its superior Go playing abilities.

          • Better at playing Go, but a neural network will probably never play Chess.
            And AlphaGo can only do Go ... nothing else.

    • I seriously doubt that a world-class English major with a doctorate in English literature and accolades in their field can be used in a meaningful way in drug design or climate modeling.

      But... a very smart person trained for 10 years in those fields might be able to contribute.

      However, AI trained in those fields has probably already been contributing in a meaningful way to drug design (even coming up with new drugs humans hadn't considered) for ... well ... several years now.

      And while I don't personally know about climate modeling ...

      • AI != neural network

        A neural network is a very specific kind of data structure plus algorithms.

        It can do amazing things, but, bottom line, it will always be a kind of 'pattern matching'.

        I see no way something like this can be used to model chemical reactions or enzyme interactions, aka modeling drugs.

        And modeling climate, sorry: no way at all.
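
        As a concrete illustration of that point, here is a toy forward pass using only the Python standard library; the layer sizes and random weights are arbitrary stand-ins. The weights are the data structure, the loop below is the one generic algorithm, and nothing in it "knows" about Go, drugs, or climate.

        import math, random

        def forward(x, layers):
            # Each layer: weighted sums of the previous layer, then a squashing function.
            for weights, biases in layers:
                x = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
                     for row, b in zip(weights, biases)]
            return x

        rand = lambda n, m: [[random.uniform(-1, 1) for _ in range(m)] for _ in range(n)]
        layers = [(rand(4, 3), [0.0] * 4), (rand(2, 4), [0.0] * 2)]  # a 3 -> 4 -> 2 net
        print(forward([0.5, -0.2, 0.9], layers))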

    • I doubt there is any chance that a neural network can be used in a meaningful way in ... climate modeling.

      You mean a system that is pretty much defined as a series of interlocking simple equations? One of the few areas where the outputs of humans look like neural networks already?

      • A climate model takes an input state and computes an output state.
        That output state can then be used as input for the next step.

        While at first glance that looks similar to an NN, the computations behind it are completely different.
        The algorithms in an NN 'just run the NN'; in other words, they don't really change depending on the domain you want to use the NN in.

        A climate model is driven by its equations, i.e. the algorithms change all the time as we learn more or add another 'factor'.
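
        A toy sketch of the stepping described above, assuming only the Python standard library: a zero-dimensional energy-balance "model" whose output temperature becomes the next step's input. The constants are simplified textbook-style values, not a real climate model.

        SOLAR = 342.0      # mean absorbed solar flux, W/m^2
        ALBEDO = 0.30      # fraction of sunlight reflected
        SIGMA = 5.67e-8    # Stefan-Boltzmann constant, W/(m^2 K^4)
        HEAT_CAP = 2.0e8   # effective heat capacity, J/(m^2 K)
        DT = 86400.0 * 30  # one 30-day step, in seconds

        def step(temp_k):
            # One explicit Euler step: energy in minus energy radiated out.
            flux = SOLAR * (1 - ALBEDO) - SIGMA * temp_k ** 4
            return temp_k + DT * flux / HEAT_CAP

        t = 200.0                  # start far from equilibrium
        for _ in range(1000):
            t = step(t)            # output state becomes the next input state
        print(round(t, 1))         # settles near ~255 K, the classic bare-rock value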

        • I really don't see any difference, other than one is equations derived by humans, the other by a guided random walk.

          And feeding the output of one state into the next is a very common NN technique.
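
          A minimal sketch of that feedback technique, with random weights as stand-ins (training would fit them to data): a recurrent cell whose output state is fed back in as the next step's input, structurally much like the model-stepping above.

          import math, random

          random.seed(0)
          STATE, INPUT = 3, 2
          W_h = [[random.uniform(-1, 1) for _ in range(STATE)] for _ in range(STATE)]
          W_x = [[random.uniform(-1, 1) for _ in range(INPUT)] for _ in range(STATE)]

          def rnn_step(h, x):
              # next_state[i] = tanh( W_h[i] . h + W_x[i] . x )
              return [math.tanh(sum(w * hj for w, hj in zip(W_h[i], h)) +
                                sum(w * xj for w, xj in zip(W_x[i], x)))
                      for i in range(STATE)]

          h = [0.0] * STATE
          for x in ([1.0, 0.0], [0.0, 1.0], [1.0, 1.0]):  # a short input sequence
              h = rnn_step(h, x)   # the state threads through time
          print(h)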

          • I really don't see any difference, other than one is equations derived by humans, the other by a guided random walk.

            Then you perhaps should read up what an NN/ANN is and how it works: https://en.wikipedia.org/wiki/... [wikipedia.org]

            Sorry, perhaps I simply lack imagination. However, I see no way that could be used for climate models.

            An ANN is not modeling anything; it is recognizing things. The two are as far apart on the spectrum (of algorithms) as one can imagine.

              • I'm talking about the endpoint. The actual climate model isn't "modeling"; it's a bunch of math. A bunch of math that happens to be designed to model climate. And the NN is a bunch of math designed to pattern-match climate. I don't follow the distinction you're positing.

              And, when I said "I don't see any difference", I was making an implicit statement that I don't think you defined a difference. I'd be interested in seeing it, but I don't follow whatever you're thinking.

                • Sorry, you have to read up on what an ANN is and how it works.
                  I lack the English skills to properly explain it in a forum (or would need two or three days of work, time that I don't have right now).

                  Obviously everything a computer does is a bunch of equations, and applying them, solving them ... that does not make everything equal.

                • I think our miscommunication is not on ANNs, but on climate models [wikipedia.org]. In fact, like SETI@Home, you can donate cycles to trying to machine develop models [climateprediction.net].

                  I understand the methods are different, but I'm not sure the results are distinguishable.

  • by tommeke100 ( 755660 ) on Wednesday October 18, 2017 @03:27PM (#55392227)
    They could have mined tons of bitcoins instead with that computing power.
    • How many bitcoins are in a ton?
    • No. For neural networks you need simple floating point calculations. For bitcoin you need to do SHA-256 hashing.

      These are totally different tasks, each running on its own optimized hardware.
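
      The contrast in miniature, using only the Python standard library: a multiply-accumulate on floats (the neural-network style of work) versus a double SHA-256 (bitcoin's proof-of-work hash). The 80-byte header below is an all-zero stand-in, not a real block.

      import hashlib

      # Neural-net style work: a dot product (floating-point multiply-accumulate).
      weights, inputs = [0.2, -0.5, 1.3], [0.9, 0.4, -0.1]
      activation = sum(w * x for w, x in zip(weights, inputs))

      # Bitcoin-style work: double SHA-256 over an 80-byte block header.
      header = bytes(80)  # stand-in header, all zeros
      digest = hashlib.sha256(hashlib.sha256(header).digest()).digest()

      print(activation, digest.hex()[:16])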

  • by JMZero ( 449047 ) on Wednesday October 18, 2017 @04:35PM (#55392683) Homepage

    I've decided that this accomplishment -- a dizzying milestone in artificial intelligence that not long ago was thought impossible, or at least decades away -- is actually meaningless and doesn't prove anything, and they should clearly have been working on some other problem. I have no idea how their system works, but I'm confident that their approach is just "brute force" (or something; I clearly have no idea what even that means) and won't generalize to any "real" problem solving (with my definition of "real problem" subject to change without notice).

    I will only admit that any progress has been made towards artificial intelligence when computers perform exactly equivalent to humans in all tasks with no human intervention. I mean, I won't really, because I have weird quasi-spiritual hangups about believing computers can be intelligent, but that's where I'm putting the goal posts for now. Digital computers can't think, but I can because reasons. Free will or quantum mechanics or something else that I haven't thought about at all, probably.

    Also, cotton gins and blacksmiths, therefore computers will never take our jobs. Amen.

    • > Also, cotton gins and blacksmiths, therefore computers will never take our jobs.

      They don't need to take all our jobs. Human needs don't increase beyond shelter, security, food, and water. The 'need' for entertainment is a bottomless pit into which you can throw money, and out comes stories, sports, children, and art, to name a few. Humans are definitely bright, but we are also creatures of routine, and we like it when things at work don't change that much, so we can keep up.

      When the majority of us like average, easy to

      • by Maxo-Texas ( 864189 ) on Wednesday October 18, 2017 @07:24PM (#55393479)

        Never forget that many of the people who lost their jobs to industrialization died homeless of exposure and starvation after being put down by the military. They were provided neither training nor jobs on the new machines (which was their true grievance).

        Similarly, we could have a very rough 20 years where jobs are destroyed faster than they can be created, where workers over 50 (40?) can't afford to retrain for the new jobs, and where there are more unemployed than society is willing to pay for (even though our productivity is 100x what it was a hundred years ago, so one worker should be able to completely support 99 unemployed with 200 square feet of living space and basic food).

        It's coming. It could be better, but it's probably going to be bad. Possibly even "great depression" or "financial panic of 18xx" bad. (They had a lot of financial panics in the 1800s that were pretty terrible.)

      • by jezwel ( 2451108 )

        This leads to a quick saturation of highly educated individuals still without positions. So they go into the only jobs on offer, in the service sector, creating massive underemployment.

        They (we?) will join the service industry, selling our time for $$$ in whatever way we can to pay for food. It's already underway with the massive amount of vlogging, prostitution, etc. The price of unskilled human labour will drop like a stone, welfare costs will balloon, and the ruling elite will eventually legalise things that currently exist only as fiction: The Purge, Running Man, etc.
        Time to upgrade that zombie-proof castle to human-proof?

    • "Digital computers can't think, but I can because reasons. Free will or quantum mechanics or something else that I haven't thought about at all, probably."

      Digital computers CAN'T think. Digital computers are nothing like how the brain works. You seem to be talking about yourself. You don't think at all. If computers playing games is impressive to you, then you must be easily impressed. Computers excel at games because they have strict rules. Computers LOVE rules.
      • by JMZero ( 449047 )

        You seem to be talking about yourself.

        Uh... dude... I was making fun of you. Well, you and Gweihir (sic?), who I assume is taking a day off to scream at pigeons.

  • Can you make the "game" be the game of learning? I can imagine the dataset would be the rules of many different games and the solutions would be networks that learn those games with solution quality based on some balance of leanness and efficacy of the networks. You'd then let it loose teaching itself how to best teach networks. Hmmm.
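
    One speculative way to sketch that idea in Python: treat "learning" itself as the game, and search over candidate learners, scoring each on a balance of leanness and efficacy after it trains on a toy task. Everything below (the tiny y = 2x task, the two-parameter "learner", the penalty weight) is a hypothetical stand-in, and the outer loop is plain random search, not anything DeepMind has described.

    import random

    random.seed(1)

    def train_and_score(lr, size):
        # Train a "network" of `size` redundant weights to fit y = 2x by
        # gradient descent, then score it: efficacy (low error) minus a
        # leanness penalty for using more weights than needed.
        w = [0.0] * size
        data = [(x, 2.0 * x) for x in (0.5, 1.0, 1.5, 2.0)]
        for _ in range(200):
            for x, y in data:
                pred = sum(w) * x              # the weights act jointly
                grad = 2 * (pred - y) * x
                w = [wi - lr * grad / size for wi in w]
        err = sum((sum(w) * x - y) ** 2 for x, y in data)
        return -err - 0.01 * size

    # Outer loop: the "game of learning" -- propose learners, keep the best.
    candidates = [(random.uniform(0.001, 0.3), random.choice([1, 4, 16]))
                  for _ in range(30)]
    best = max(candidates, key=lambda c: train_and_score(*c))
    print("best (learning rate, size) found:", best)
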
  • The essence of intelligence is that it enables one to predict the outcome of a unique situation based upon an understanding of its essential elements.

    Starting with only the rules of Go, Zero explored a variety of combinations, learning that some were more likely to give a satisfactory result. It developed a sense of what types of moves are best. Thus, without playing or studying an infinite number of games it could know the type of move that should be best in each unique situation.

    Theoretically, a vast intelligence, given only the facts of the Big Bang, could anticipate most of the resulting evolution of our universe. Zero has taken the first small step.

    • by slew ( 2918 )

      The essence of intelligence is that it enables one to predict the outcome of a unique situation based upon an understanding of its essential elements.

      Starting with only the rules of Go, Zero explored a variety of combinations, learning that some were more likely to give a satisfactory result. It developed a sense of what types of moves are best. Thus, without playing or studying an infinite number of games it could know the type of move that should be best in each unique situation.

      Theoretically, a vast intelligence, given only the facts of the Big Bang, could anticipate most of the resulting evolution of our universe. Zero has taken the first small step.

      You are making quite a few assumptions. One, that somehow a "game" that has a goal (e.g., a "winner") is the same as predicting an open-ended problem. Two, that somehow AlphaGo Zero developed a "sense" of what types of moves are best.

      First, because of the limited rules and state space of the game Go, and the fact that there is a "winner", the Go universe is certainly closed and quite bounded.

      In contrast, the real universe has a much larger state space and the rules are unknown (although some approximate rules are known). Secondly, it is unclear if a "sense" of what types of moves are best is being learned ...

      • by swell ( 195815 )

        Great reply, slew;
        "Secondly, it is unclear if a "sense" of what types of moves are best is being learned"

        I understand that it can be hard for a programmer to accept a program that doesn't lend its 'thinking' to analysis. But I believe that until computers can sense things, they are not intelligent.

        There is a corollary in the classroom. The children have a math problem. Most of them repeat the steps they have been taught and come up with the correct answer. But one or two look at the problem with a fuzzy logic ...

  • Boards are typically 19x19, but may be other sizes as well.

  • by thisisauniqueid ( 825395 ) on Wednesday October 18, 2017 @09:15PM (#55394033)
    This doesn't show we are winning at creating AI. It simply shows that the game of Go is more tractable than we previously thought. Claims about the number of positions in Go being vastly greater than the number of atoms in the universe (something like the number of atoms squared) completely miss the point: this is a straw man argument for why algorithms weren't good at Go until recently, since (obviously) humans are not searching the entire space of all possible board positions either. It stands to reason that once a sufficiently flexible fuzzy hierarchical pattern matching algorithm were produced, it would be able to play Go much better than a human.
    • by SnowZero ( 92219 )

      That argument is meaningless because it always works:
      1903: Heavier-than-air powered flight is more tractable than we previously thought.
      1957: Putting objects in orbit is more tractable than we previously thought.
      1969: Landing on the moon is more tractable than we previously thought.
      2016: The game of Go is more tractable than we previously thought.
      20xx: Human-level cognition is more tractable than we previously thought.

      Yes, a lot of people are over-optimistic about AI at any given time, but it is moving forward.

  • And that, son, is how Skynet was born. Damn humans never learn anything.
  • I have programmed this type of learning algorithm in the past -- about 30 years ago, when computers were about 30,000 times slower than they are now.

    Anyway, you can have the program play itself for a while and it becomes quite good. But you won't know how it will perform against a human unless you try. It might be very good against those moves that the computer player will come up with, but very bad against moves thought up by a human.

    30 years ago I tackled a simpler game than Go.
