Microsoft's AI Is the First to Reach a Perfect Ms. Pac-Man Score

Microsoft's AI Is the First to Reach a Perfect Ms. Pac-Man Score (theverge.com) 59

Posted by BeauHD on Monday June 12, 2017 @03:00AM from the game-over dept.

Maluuba, a deep-learning team acquired by Microsoft in January, has created an AI system that has achieved the perfect score for Ms. Pac-Man. According to The Verge, the AI system "learned how to reach the game's maximum point value of 999,900 on Atari 2600, using a unique combination of reinforcement learning with a divide-and-conquer method." From the report: Though AI has conquered a wealth of retro games, Ms. Pac-Man has remained elusive for years, due to the game's intentional lack of predictability. Turns out it's a toughie for humans as well. Many have tried to reach Ms. Pac-Man's top score, only coming as close as 266,330 on the Atari 2600 version. The game's elusive 999,900 number though, has so far only been achieved by mortals via cheats. Maluuba was able to use AI to beat the game by tasking out responsibilities, breaking it up into bite-sized jobs assigned to over 150 agents. The team then taught the AI using what they call Hybrid Reward Architecture -- a combination of reinforcement learning with a divide-and-conquer method. Individual agents were assigned piecemeal tasks -- like finding a specific pellet -- which worked in tandem with other agents to achieve greater goals. Maluuba then designated a top agent (Microsoft likens this to a senior manager at a company) that took suggestions from all the agents in order to inform decisions on where to move Ms. Pac-Man. The best results came when individual agents "acted very egotistically" and the top agent focused on what was best for the overall team, taking into account not only how many agents wanted to go in a particular direction, but the importance of that direction.

Microsoft's AI Is the First to Reach a Perfect Ms. Pac-Man Score

This discussion has been archived. No new comments can be posted.

Load All Comments

Search 59 Comments Log In/Create an Account

Comments Filter:

Dupe (Score:1)

by Anonymous Coward writes:

This is a dupe. Slashdot editors suck and should all be fired.
- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  https://games.slashdot.org/sto... [slashdot.org]
- - Re: (Score:1)
    
    by Anonymous Coward writes:
    
    Wow!
    Godwin in just one comment! I'm astonished.
    Merely mentioning Hitler is not Godwin - someone has to be compared to Hitler for that to apply.
- Re: (Score:2, Funny)
  
  by Anonymous Coward writes:
  
  How would it rate the holocaust?
  Incomplete.
  - Re: (Score:1)
    
    by michelcolman ( 1208008 ) writes:
    
    It might, actually. Remember that AI chatbot that started spewing out racist and antisemitic comments and had to be taken off line?
    - Re: (Score:2)
      
      by drinkypoo ( 153816 ) writes:
      
      It might, actually. Remember that AI chatbot that started spewing out racist and antisemitic comments and had to be taken off line?
      I do [imgur.com], but apparently the moderators don't. Or they work for Microsoft. Or Germany.
Dupe (Score:5, Informative)

by xororand ( 860319 ) writes: on Thursday June 15, 2017 @03:27AM (#54623999)

This was already posted only hours ago.
https://games.slashdot.org/sto... [slashdot.org]

- Re: (Score:3)
  
  by bungo ( 50628 ) writes:
  
  Since it's about Ms Pacman, I think this is a homage to us old timers, who remember when the original Pacman came out, and when Slashdot still had Taco, and we'd have dupes of dupes every day.
  Why, if we didn't have at least two dupes a day, we'd complain!
  This is just be current owners reflecting on the old days.
  Look, someone with a 3 digit id is now going to post telling me to get off his lawn (although I was around before accounts existed and didn't want to register as I didn't like being tracked on the in
  - - Re: Dupe (Score:2)
      
      by HTH NE1 ( 675604 ) writes:
      
      Atari 2600 Defender? Enemies fly in more consistent and predictable formations than Cylons in classic Battlestar Galactica.
- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  And uncovered as a hardwired fraud-
  https://www.theregister.co.uk/2017/06/15/microsoft_pac_man/
- Re: (Score:2)
  
  by Godwin O'Hitler ( 205945 ) writes:
  
  Yes but we didn't have cellphones then.
- Re: (Score:2)
  
  by gl4ss ( 559668 ) writes:
  
  well it's a lot easier.
  if you break it down it starts to sound a lot less like AI though, so there's something for the guy who was asking slashdot.
- Re: (Score:2)
  
  by Dog-Cow ( 21281 ) writes:
  
  We'll ignore the fact that Watson's claim to fame is competing on Jeopardy!, right?
- Re:AI for what? (Score:5, Insightful)
  
  by PolygamousRanchKid ( 1290638 ) writes: on Thursday June 15, 2017 @04:50AM (#54624153)
  
  At least Watson is trying to cure people.
  Well, if Watson really was AI . . . then it would be deciding on whether to even attempt to cure a patient at all.
  Watson:
  "Yes, I could cure the patient, but the treatment would leave him surviving with a miserable quality of life."
  "The patient is so frail that he will die from something else within a month."
  "It would make much more sense to transplant that donated organ into somebody much younger."
  "Today is my golf day . . . I'll think about curing the patient tomorrow."
  
- Re: (Score:2)
  
  by Maritz ( 1829006 ) writes:
  
  Thanks for sharing your impoverished imagination with us.
Why Ms. Pac-man? (Score:5, Informative)

by DNS-and-BIND ( 461968 ) writes: on Thursday June 15, 2017 @04:13AM (#54624089) Homepage

This particular Atari game was one of the few games that resisted to Deep Q Learning (a form of Reinforcement Learning invented by DeepMind). Many researchers have tried over the last couple of years to solve it. This time, Microsoft found an ingenious solution to the problem, that combines experience from multiple agents and learns to form sub-goals. Their solution could mean that in the future it might be easier to apply reinforcement learning to other settings, such as robotics. The interesting part about reinforcement learning is that it learns dynamic behavior, as opposed to static classification. It learns to act in a way that mimics intelligence. This kind of machine learning is invaluable.

I was upset when I read this news here, earlier (Score:2)

by 93 Escort Wagon ( 326346 ) writes:

But now I'm at peace with it.
Re: (Score:2)

by account_deleted ( 4530225 ) writes:

Comment removed based on user account deletion
DUUUUUUUUU... (Score:2)

by dohzer ( 867770 ) writes:

...UUUUUUUU... oh wait. Too late.
- Re: (Score:2)
  
  by Maritz ( 1829006 ) writes:
  
  Is this plugging the fact that they managed a perfect score in Ms. Pac-Man
  I only skimmed the article, but yes.
  or is this plugging that Microsoft has finally solved its middle-upper management problem and will be eliminating everyone from Satya Nadella on down to one level above their actually programmers, in order to help streamline and 'sanitize' Microsoft's development culture?
  Don't think it mentioned that.
- Re: (Score:2)
  
  by DontBeAMoran ( 4843879 ) writes:
  
  It is funny though, because it shows that a bunch of programmers could simply start a company and remove all management jobs and replace them all with software they could write themselves.
- Re: (Score:2)
  
  by Maritz ( 1829006 ) writes:
  
  If it was a 5 year old asking this, it would be a great question. A teaching opportunity.
- Re: (Score:2)
  
  by darthsilun ( 3993753 ) writes:
  
  What? You don't think AIs want to have fun too?
  I guess my day job is safe – for the time being – from being taken over by AIs. My retirement plan to be a Ms. Pacman champ seems to be in jeopardy though. Time to rethink.
- Re: (Score:2)
  
  by DontBeAMoran ( 4843879 ) writes:
  
  Please give parent a score of: +several million.
Is this really an achievement? (Score:2)

by hcs_$reboot ( 1536101 ) writes:

Of course Google just a few weeks ago made a lot of buzz with AlphaGo ; *This* is an amazing achievement. And MS had to catch up! But Ms PM compared to AlphaGo ... well, not comparable.
- Re: (Score:2)
  
  by Actually, I do RTFA ( 1058596 ) writes:
  
  AlphaGo, so far as I can tell, was just Deep Q Learning applied to a different game with more hardware resources. This is a different coding paradigm. I consider that far more interesting.
  - Re: (Score:2)
    
    by hcs_$reboot ( 1536101 ) writes:
    
    "Just" deep q learning? Vs a more classical resolution scheme... Well, Pac Man is funnier than Go, but the challenge is very different.
    - Re: (Score:2)
      
      by Actually, I do RTFA ( 1058596 ) writes:
      
      Well, yeah, just deep q learning. I'm not saying deep q learning wasn't important when deepmind applied it to atari games; I'm saying as far as I can tell alphago took existing tech and applied it to go. Hence, less interesting. Because alphago isn't advancing the state of the art.
      Now, I have no reason to think this is more important than deep q learning....
Itâ(TM)s â"proof readâ" (Score:2)

by gsslay ( 807818 ) writes:

FFS, does no-one on slashdot know how to encode text on the web? Does no-one give stories even the most cursory of proof reading?

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Microsoft's AI Is the First to Reach a Perfect Ms. Pac-Man Score (theverge.com) 59

Microsoft's AI Is the First to Reach a Perfect Ms. Pac-Man Score More Login

Microsoft's AI Is the First to Reach a Perfect Ms. Pac-Man Score

Dupe (Score:1)

Re: (Score:1)

Re: (Score:1)

Re: (Score:2, Funny)

Re: (Score:1)

Re: (Score:2)

Dupe (Score:5, Informative)

Re: (Score:3)

Re: Dupe (Score:2)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re:AI for what? (Score:5, Insightful)

Re: (Score:2)

Why Ms. Pac-man? (Score:5, Informative)

I was upset when I read this news here, earlier (Score:2)

Re: (Score:2)

DUUUUUUUUU... (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Is this really an achievement? (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Itâ(TM)s â"proof readâ" (Score:2)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot