Mario AI Competition

Posted by Soulskill on Wednesday August 05, 2009 @03:37AM from the must-be-on-mushrooms-when-you-write-the-code dept.

togelius writes "We're running a competition to see who can program the best AI for a version of Super Mario Bros. It's about deciding what to do at each time step — run, jump, shoot etc. — based on a description of the platforms, items and enemies around Mario. This is hard. It's so hard we believe that some sort of machine learning algorithm will be necessary to reach good playing performance. But really, any approach is fair game. We welcome hard-coded submissions, commercial AI programmers, academics and amateurs alike. Whoever wins, it will be really interesting. The competition is associated with two IEEE conferences, and there are cash prizes available for the best submissions."

This discussion has been archived. No new comments can be posted.

Uhhhh (Score:1, Redundant)

by Idiomatick ( 976696 ) writes:

Can I keylog myself beating the whole game on an emulator and submit the log? ... Seems kind of silly having AI for games that have nothing random in them.
- - Re: (Score:1, Informative)
    
    by Anonymous Coward writes:
    
    Too bad. Otherwise, you could send them one of the files from the TAS of SMB (available somewhere on here [tasvideos.org]) which is very probably frame-perfect at this point.
- Re:Uhhhh (Score:4, Informative)
  
  by Toridas ( 742267 ) writes: on Wednesday August 05, 2009 @04:03AM (#28953221)
  
  If you had read TFA you'd know that they are using the game Infinite Mario Bros, which has randomly generated levels.
  
  - Re:Uhhhh (Score:4, Interesting)
    
    by TheRaven64 ( 641858 ) writes: on Wednesday August 05, 2009 @07:31AM (#28954755) Journal
    
    Note, however, that Mario levels are composed of a grid of something like 15x20 blocks visible at one time. Each of these has a small number of relevant states (wall, enemy, hazard, tube), let's say 8 possible states. That gives 8^300 possible states for the visible game. From each of these states, you have a small number of options. This means that Infinite Mario is really finite Mario with a really large set of levels.
    Brute forcing this is not really feasible, but there are probably a large number of states that you can treat as equivalent. For example, you don't care about whether a block is destructible if you are above it, you don't care about the state of any tile under the one you are standing on, and so on.
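To put a number on the estimate above: 15x20 tiles with 8 states each gives 8^300 configurations, which equals 2^900. The sketch below simply checks that arithmetic. The tile count and the 8 states per tile are the comment's rough guesses, not figures taken from the competition.

```python
# Rough size of the visible state space, using the comment's guesses.
tiles = 15 * 20            # assumed number of visible tiles
states_per_tile = 8        # assumed number of relevant states per tile

total = states_per_tile ** tiles   # Python integers are arbitrary precision
digits = len(str(total))

print(f"{tiles} tiles -> a count with {digits} decimal digits")
```

Any merging of equivalent states, as suggested above, has to shrink this count by many orders of magnitude before enumeration becomes thinkable.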
    
    - Re: (Score:2)
      
      by jerep ( 794296 ) writes:
      
      You don't need Mario to explore every last block in the game; all you need is for him to finish the level, and if the AI is smart, it will do it without the use of items (stars, mushrooms, etc.). You could just use a simple pathfinding algorithm with detection for enemies and obstacles, and let Mario complete the entire level while running, stopping only to wait for certain events (such as those stairs moving up or down).
      8^300 states is a lot of data and would never perform well in real-time AI calculation.
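A minimal sketch of the "simple pathfinding" idea, assuming a toy tile grid and made-up movement rules (walk one tile, hop up one tile, jump over one tile, fall when unsupported). The grid encoding, the function name `find_path` and the move set are illustrative assumptions and do not reflect the competition's actual physics or API.

```python
from collections import deque

def find_path(level, start, goal):
    """Breadth-first search over tiles. Returns a list of (row, col) or None.

    level: list of equal-length strings, '#' is solid, anything else is empty.
    The movement rules below are a made-up simplification, not the game's physics.
    """
    rows, cols = len(level), len(level[0])

    def inside(r, c):
        return 0 <= r < rows and 0 <= c < cols

    def empty(r, c):
        return inside(r, c) and level[r][c] != "#"

    def solid(r, c):
        return inside(r, c) and level[r][c] == "#"

    def moves(r, c):
        if not solid(r + 1, c):
            # Nothing underfoot: the only option is to fall one tile.
            # Falling off the bottom of the grid yields no moves (a lost life).
            if empty(r + 1, c):
                yield (r + 1, c)
            return
        for dc in (-1, 1):
            if empty(r, c + dc):
                yield (r, c + dc)                      # walk one tile
            if empty(r - 1, c) and empty(r - 1, c + dc):
                yield (r - 1, c + dc)                  # hop up one tile
                if empty(r - 1, c + 2 * dc) and empty(r, c + 2 * dc):
                    yield (r, c + 2 * dc)              # jump over one tile

    parent = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        for nxt in moves(*node):
            if nxt not in parent:
                parent[nxt] = node
                queue.append(nxt)
    return None


# Two rows of air above a floor with a one-tile hole in it.
level = [
    "......",
    "......",
    "##.###",
]
path = find_path(level, (1, 0), (1, 5))
print(path)
```

Because the search only ever looks at the tiles Mario can actually reach, it never touches the bulk of the 8^300 configurations. Moving enemies would need time added to the search state, which this sketch leaves out.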
    - Re: (Score:1)
      
      by deadkennedy ( 1594629 ) writes:
      
      I think exploring various state machine configurations would help here.
    - Re: (Score:3, Informative)
      
      by alexandreracine ( 859693 ) writes:
      
      Here is one guy doing it. Pretty impressive... http://www.youtube.com/watch?v=0s3d1LfjWCI [youtube.com]
Let's see if any of these guys have a go... (Score:5, Informative)

by VinylRecords ( 1292374 ) writes: on Wednesday August 05, 2009 @04:11AM (#28953283)

http://tasvideos.org/ [tasvideos.org]
TAS = tool-assisted speedrun. Basically, you program controller inputs (at very slow speeds), then play them back at 1:1 speed and watch a pre-programmed controller run through an entire game as quickly as possible. There are runs for most of the more popular NES and SNES games, as well as other games. Pretty interesting stuff, and creating a TAS of a game is usually a daunting task.

- Re: (Score:3, Insightful)
  
  by Sockatume ( 732728 ) writes:
  
  I doubt that would count, seeing as you've just got a human making the decisions. It's "artificial intelligence", not "artificial fingers".
  - - Re: (Score:2)
      
      by tepples ( 727027 ) writes:
      
      At the very least they're going to have a head start: they'll have a good grasp of the SMB physical simulation.
      But does Infinite Mario Bros. use an identical simulation, down to the 1/16th or 1/256th pixel resolution that a lot of these games used to represent coordinates? And if so, are Nintendo's lawyers interested?
    - Re: (Score:1)
      
      by KDR_11k ( 778916 ) writes:
      
      The competition doesn't use the original SMB game so the detailed physics may differ.
      TASing does occasionally involve the use of bots but AFAIK they only brute force it. I think the entire autoscrolling part of Pulseman was done with a bot.
- Re: (Score:1)
  
  by astrowill ( 1593647 ) writes:
  
  True, but TFA states that they are running "with the added benefit of endless random level generation".
The prize seems kind of paltry (Score:5, Insightful)

by BadAnalogyGuy ( 945258 ) writes: <BadAnalogyGuy@gmail.com> on Wednesday August 05, 2009 @04:20AM (#28953345)

500 dollars for the winner, but you are expecting evolutionary neural nets, genetic programming, fuzzy logic, and temporal difference learning.
The temporal difference between the effort to build such an AI and 500 bucks seems a little too great.

- Re: (Score:2, Informative)
  
  by Anonymous Coward writes:
  
  I love how you included fuzzy logic in your list of otherwise hard-to-pull-off AI functions.
- Re: (Score:2, Funny)
  
  by TheCowSaysMooNotBoo ( 997535 ) writes:
  
  Not to mention that you need to be present at the conference to claim your prize. Otherwise you just get a certificate.
  - Airfare and a hotel stay (Score:2)
    
    by tepples ( 727027 ) writes:
    
    Not to mention that you need to be present at the conference to claim your prize. Otherwise you just get a certificate.
    In that case, the prize might not even cover airfare and a hotel stay.
    - Re: (Score:1)
      
      by xtracto ( 837672 ) writes:
      
      And the worst thing is that the prize might not even cover the conference registration fee!
      But the scientific value is quite interesting; moreover, this is aimed more at the scientific community, where researchers usually get sponsored by their institutions when publishing a paper and going to conferences.
- Re: (Score:3, Insightful)
  
  by Sockatume ( 732728 ) writes:
  
  On the other hand offering a larger prize for a competition of this nature is pointless. I doubt that you'd get MIT to devote a research grant even if it was offering up $500,000.
  - Re:The prize seems kind of paltry (Score:4, Informative)
    
    by jtogel ( 840879 ) writes: <julian@togelius.com> on Wednesday August 05, 2009 @08:01AM (#28955021) Homepage Journal
    
    In previous competitions on simulated car racing AI we've had submissions from Imperial College, National University of Singapore, Politecnico di Milano, University of Birmingham and other internationally leading universities. So a submission from MIT would not surprise me in the least.
    
    - Re: (Score:2)
      
      by Sockatume ( 732728 ) writes:
      
      I'm sure it'll get plenty of interest from academic research groups, but no amount of prize money is going to tip it over into the "new research investment" category, which is normally the situation for using prizes to draw in academics.
- Re: (Score:2)
  
  by SoVeryTired ( 967875 ) writes:
  
  A cash prize like that is more like a recognition of the achievement than an incentive to compete. Same way no-one goes after the Fields medal for the money.
- Re:The prize seems kind of paltry (Score:4, Informative)
  
  by jtogel ( 840879 ) writes: <julian@togelius.com> on Wednesday August 05, 2009 @07:57AM (#28954975) Homepage Journal
  
  Actually, this is not true. The competition is mainly aimed at academic researchers, who work with these techniques anyway, and for whom 500 dollars (into your own pocket, not your research fund) is not a completely insignificant amount. But the main motivation for researchers to take part is, of course, the recognition. And people other than academics are very welcome to take part as well! We're very much looking to broaden the participation.
  
- Re: (Score:1)
  
  by damien_kane ( 519267 ) writes:
  
  The temporal difference between the effort to build such an AI and 500 bucks seems a little too great.
  Sure, because nobody's spending hundreds of millions of dollars in research and development for the 1-million dollar purse in the various X-Prize competitions (launch vehicles, lunar landers, autonomous vehicles, genomics, etc)
  
  Oh, wait...
Thanks for the advanced notification! (Score:5, Insightful)

by risinganger ( 586395 ) writes: on Wednesday August 05, 2009 @04:41AM (#28953471)

We welcome hard-coded submissions, commercial AI programmers, academics and amateurs alike.
Yet you only post this on Slashdot 13 days before the deadline. You couldn't have posted it here back in May? (the earliest date a post seems to have in your Google group).

- Re:Thanks for the advanced notification! (Score:4, Insightful)
  
  by TheRaven64 ( 641858 ) writes: on Wednesday August 05, 2009 @07:33AM (#28954779) Journal
  
  Maybe he submitted it back in May and it was waiting since then for an 'editor' to approve it?
  
- Re:Thanks for the advanced notification! (Score:4, Informative)
  
  by jtogel ( 840879 ) writes: <julian@togelius.com> on Wednesday August 05, 2009 @08:04AM (#28955063) Homepage Journal
  
  We did some advertising for this within the academic research community in spring, but for various reasons we were a bit late with reaching out beyond academia. Definitely an oversight on our part. Still, the deadline for the CIG phase (you don't have to submit to the first phase) is almost a month away, and if the competition is a success this year we'll run it next year as well.
  
- - Re: (Score:1)
    
    by grumbel ( 592662 ) writes:
    
    Enemy stomping shouldn't be much harder than just normal jumping from platform to platform, as both are a simple matter of pathfinding, and pathfinding happens to be a well enough understood problem, at least when it comes to simple tile-based worlds. It gets a little trickier than static platforms, as the enemies move around, but not that much harder, as they behave completely predictably.
This is hard (Score:5, Interesting)

by phantomfive ( 622387 ) writes: on Wednesday August 05, 2009 @04:47AM (#28953507) Journal

This is hard. I think if I were going to do it, I would break it up into steps.

First, I would teach the AI to move around on flat surfaces. Then I would teach it how to navigate over holes. Then I would add pipes and things it would need to jump over. Finally I would add random bricks. These are hard because if you jump underneath them, you might bump your head and change your trajectory.
Secondly I would start adding bad guys. Start with goombas, then add green turtles, then red turtles, then piranha plants, then bullets.

This is hard: the AI will need to learn to recognize certain features of the landscape, which is something humans are really good at doing. It will have to learn things like, "if I stand next to a tube, the piranha plant will not come out." It will have to learn that sometimes a short hop is appropriate, and sometimes a long jump is better. It will have to recognize that if a red turtle is on a ledge, it doesn't need to worry about it falling, and it can run underneath at full speed.

Heh, maybe I'll enter. How hard can it be?

- Re: (Score:2)
  
  by jtogel ( 840879 ) writes:
  
  Yeah, please enter! You can set the level difficulty from 0 to 10, and choose whether to have enemies in or not (with the world paused/unpaused option), so basically you can evaluate your solutions first on a paused level with difficulty 0 (few holes and few obstacles) and then incrementally increase the difficulty.
  - Re: (Score:3, Funny)
    
    by BJ_Covert_Action ( 1499847 ) writes:
    
    My AI's level difficulty processing goes to 11...
- Re: (Score:2)
  
  by Lisandro ( 799651 ) writes:
  
  Heh, maybe I'll enter. How hard can it be?
  Very. As in "really fucking hard".
- - Re:This is hard (Score:4, Informative)
    
    by hesiod ( 111176 ) writes: on Wednesday August 05, 2009 @09:49AM (#28956477)
    
    even the most advanced AI doesn't learn.
    Depends how you define that. If human learning is just based on strengthened signals between synapses, then a weighted neural net certainly DOES learn.
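One way to make "a weighted neural net learns" concrete is the classic perceptron rule, where connection weights are strengthened or weakened only after a wrong answer. The sketch below is a toy under stated assumptions: a single neuron, logical AND as the task, and function names chosen here for illustration. It has nothing to do with the Mario interface.

```python
def train_perceptron(samples, epochs=20, rate=1):
    """Classic perceptron rule: nudge weights only after a wrong answer."""
    weights = [0, 0]
    bias = 0
    for _ in range(epochs):
        for inputs, target in samples:
            total = bias + sum(w * x for w, x in zip(weights, inputs))
            output = 1 if total > 0 else 0
            error = target - output          # -1, 0 or +1
            bias += rate * error
            weights = [w + rate * error * x for w, x in zip(weights, inputs)]
    return weights, bias

def predict(weights, bias, inputs):
    """Fire (1) when the weighted sum of the inputs exceeds zero."""
    return 1 if bias + sum(w * x for w, x in zip(weights, inputs)) > 0 else 0

# Logical AND as the toy task: nothing about AND is coded into the neuron.
samples = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
weights, bias = train_perceptron(samples)
predictions = [predict(weights, bias, x) for x, _ in samples]
print(predictions)
```

The neuron starts with all weights at zero and ends up answering every case correctly purely from the error signal, which is the sense of "learning" argued for above.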
    
  - Re: (Score:1)
    
    by plastbox ( 1577037 ) writes:
    
    Uh? Sorry if I misunderstood and you have some weird, religious-like definition of "learning" as something only God's special, soul-endowed children can do.
    As far as I am concerned, any system that applies knowledge gained from experience to a situation in order to solve it has learned. A tic-tac-toe playing computer program that starts out not knowing the rules of the game but eventually ends up winning or playing you to a draw every time, without you doing anything except playing against it (or, in true W
Shard of glass in my delicious pie! *gruff* (Score:1)

by plastbox ( 1577037 ) writes:

First thought: "WOAH, this seems awesome! Can't wait to see what kind of crazy awesome stuff people come up with! Perhaps something like Air Man versus Genetic Algorithm [youtube.com]?"
Second thought: "Hmm.. Emulating keypresses is easy as cake, but I wonder how their game passes info about Mario's environment to the controller? After all, this is a contest of skill and creativity, so there should be a system in place to allow code monkeys and 1337 programmers to contribute, regardless of their tools of choice."
Still thin
- Re: (Score:1)
  
  by Arthurio ( 1392181 ) writes:
  
  get eclipse http://www.eclipse.org/downloads/ [eclipse.org] and jdk - that's it. happy coding!
  - Re: (Score:2)
    
    by andi75 ( 84413 ) writes:
    
    While I think Eclipse is great, I believe NetBeans [netbeans.org] is even better these days, at least for someone just picking up Java. The advanced features don't get in your way as much as with Eclipse.
    - - Re: (Score:1)
        
        by plastbox ( 1577037 ) writes:
        
        Thanks for the friendly replies! ^^
        Just to make one thing clear though, I'm not running for 1st place here. I just want to tinker around with this to see if I can make Mario survive any significant distance at all.
        I'm thinking I might make a genetic algorithm type thing in which each string of "DNA" contains conditions and reactions. Like, conditions might be "blocking tile in position 1.0 (relative), no blocking tile in position 1.1" and the reaction would be "tap jump,press forward". Of course, represente
    - Re: (Score:1)
      
      by WarpGiGA ( 647974 ) writes:
      
      Netbeans.org all the way! (or eclipse if you wanna be like that)...
- - Re: (Score:1)
    
    by plastbox ( 1577037 ) writes:
    
    HAH! I also program loads of VB6 stuff! All your base, Lucy!
    Anyways.. Yes, I know php isn't a compiled language. Yes, I know I'm a php scripter who enjoys php scripting. Does the fact that php is a language I enjoy immensely as a recreational platform for doing things like distributed processing MD5 brute forcing, the classic mona lisa genetic algorithm thingie and tic-tac-toe AI crap as well as making CMS/gallery/forum-systems for friends, a better helpdesk-system for work, a gatekeeper to merge several se
    - - Re: (Score:1)
        
        by mrrudge ( 1120279 ) writes:
        
        I'll reply, AC, just wondering, what's the next level ?
        
        What you did ?
        What makes you useful as a bland cog in a corporate machine ?
        Following your passions with the tools available to you ?
        
        Seems to me you've got some kind of superiority thing going on because you're better-with-computers ? Because you've sat through the many fun eclipse hours of now-where-the-hell-would-a-developer-hide-that-option. Because the language you know is compiled, ffs ? I don't think you're in a position to be talking about
        
        Re: (Score:2)
        
        by hesiod ( 111176 ) writes:
        
        We need to post a sign around here or something: "Please don't feed the trolls."
        
        Re: (Score:2)
        
        by phantomfive ( 622387 ) writes:
        
        I can't speak for him, but I can tell you, in many cases, getting to the next level is a matter of realizing that the language doesn't actually matter all that much, it's all about the person and how well they organize their code. Most decent programmers I know can pick up a language in a week pretty well, and start doing useful things the first day, and moving from C++ to Java is almost like moving from one pond to another, they are so similar. So whining about the language gets the same kind of eye-roll
        
        Re: (Score:1)
        
        by holmedog ( 1130941 ) writes:
        
        Except one thing, EMACS SUCKS!!!
      - Re: (Score:1)
        
        by plastbox ( 1577037 ) writes:
        
        I'm not even going to get into a genitalia-measuring match with you. Nope, na-ah, no way. You know what? Because I am 24 years old and I know what I can and cannot do. Since you seem to be having some issues comprehending what I am trying to say, I will (probably without much hope) try to clear things up for you.
        I use php as a recreational platform. Why?
        
        Because it's a very rich language and still fairly snappy for a scripted language
        It's a very light and fast tool to play around with when a cool ideas drop
    - - Re: (Score:1)
        
        by plastbox ( 1577037 ) writes:
        
        Sorry, I come on a bit strong when I get excited. I'm not saying Java isn't good, I'm not saying I can't write Java if I have to. My reaction was purely based on the fact that if I was hosting that sort of competition, I'd make a TCP interface the main option, one that anyone, regardless of platform or language, could use. After all, with all the awesome possibilities here, it should be about the algorithm used, not the language used for the implementation.
        When you see the Shell eco-marathon, it's all about creat
- - Re: (Score:1, Offtopic)
    
    by plastbox ( 1577037 ) writes:
    
    Just so happens, my girlfriend and I have been together and lived with each other for two years now. She might be lying but considering
    A. She's hot as hell
    B. I'm not exactly Johnny Depp or Brad Pitt
    C. I don't have a very deep wallet =(
    
    I'm guessing my 16cm, all-in-all pretty average self somehow manages to keep her happy ^^,
    
    Oh, btw.. From what I know both from personal experience and what friends of both genders tell me, neither the size nor the speed matter all that much without stability, and that o
    - - Re: (Score:1)
        
        by plastbox ( 1577037 ) writes:
        
        I know, I know ^^ I also work out. As in, at the gym, not with our Nintendo Wii. =O
        
        Re: (Score:1)
        
        by plastbox ( 1577037 ) writes:
        
        With the way you get into these personal attacks, I'd love to see what you look like. I'm a perfectly normal guy who tries to stay in shape and keep my diabetes under control. I don't use linux because.. well.. Like c/c++, even though I am fairly competent at it, for most of my needs it would be a whole hell of a lot more effort to use linux for day-to-day activities than Windows.
        And by "normal" I don't mean 20kg++ overweight, and by "diabetes" I mean the type (type 1) you get for no specific known reason, not
        
        Re: (Score:1)
        
        by plastbox ( 1577037 ) writes:
        
        Heh, I'm not American, but the eating habits and general ignorance they have over there seem to be spreading, just like every other type of "americanization" (language, fast-food, coke, advertising, etc.). 10 years ago nearly everyone I knew was into football (soccer), karate, or something and everyone used their bikes to get around if the weather allowed it.
        These days, I know one person who works out actively (as in, needing a shower and change of clothes), and that is the guy who owns the basement gym I w
- Re:Shard of glass in my delicious pie! *gruff* (Score:4, Informative)
  
  by RegularFry ( 137639 ) writes: on Wednesday August 05, 2009 @07:21AM (#28954679)
  
  "Java only? What the hell !?"
  Um... no.
  Controllers written in any language are welcome, as long as they can be interfaced to an unmodified version of the marioai package - directly if written in Java, through the TCP interface otherwise. In any case, the controllers must be able to run in real time on an Intel machine running either Mac OS X (preferred), Ubuntu Linux or Windows XP.
  
  - Re: (Score:1)
    
    by plastbox ( 1577037 ) writes:
    
    Uuuh! Awesome! Seriously, how the hell did I not see that? It's not like that site is all that content heavy. o_O
    I might just suffer from some sort of sneaky invisible ninja selective blindness but I can't seem to find any info on this TCP interface. Curse you, sneaky invisible ninja selective blindness!
- Re: (Score:3, Interesting)
  
  by TheRaven64 ( 641858 ) writes:
  
  Java is a language. It's a descendant of Smalltalk via StrongTalk/Objective-C with syntax inspired by C++.
  This competition is about algorithms. The implementation language is largely irrelevant. Java is far from my favourite language, but it's expressive enough for a project of this nature.
  - Re: (Score:2)
    
    by Helios1182 ( 629010 ) writes:
    
    Not only is it expressive, but there are lots of useful AI and Machine Learning libraries written in it. Java may get a bad reputation from a lot of people, but it is heavily used by academics.
- Re:Shard of glass in my delicious pie! *gruff* (Score:4, Informative)
  
  by jtogel ( 840879 ) writes: <julian@togelius.com> on Wednesday August 05, 2009 @08:18AM (#28955239) Homepage Journal
  
  Personally, I love Java, but I recognize that not everybody does. As another poster has already commented below, any language is permitted as long as it can somehow interface to the game code. To begin with, there are several languages other than Java that run on the JVM (Scala, for example) and these can interface directly to the code. You can also interface via the provided TCP interface; we've included a Python example. Or via JNI (Java Native Interface) for C programs.
  
  - Re: (Score:1)
    
    by plastbox ( 1577037 ) writes:
    
    Ok ok, I was too quick to post. I get it *wanders off to corner with both feet firmly lodged in mouth*
    Luckily, I never cared much about making an ass of myself, "thinking before speaking is like wiping your ass before you.. well, you get the idea" and all. So I'll just ask and hope I get an answer and not a painful reaming.
    How do I launch the network agent? I've tried everything I can find here [google.com] but at best it launches the game in keyboard control mode. Do I just suck? Should it be blatantly obvious how to g
    - Re: (Score:2)
      
      by somersault ( 912633 ) writes:
      
      thinking before speaking is like wiping your ass before you.. well, you get the idea
      It sounds like whoever coined that idea really should have thought before they spoke. Unless they were being intentionally ironic.
too short. (Score:5, Insightful)

by tetha ( 1612425 ) writes: on Wednesday August 05, 2009 @06:14AM (#28954207)

I really like this competition's idea, but I won't participate, because the deadline is far, far too soon.
I mean, I am supposed to understand their framework and implement, test and tweak an artificial intelligence for a pretty complicated task like this in a month (let alone 2 weeks), with my rusty Java and rusty AI knowledge (I'd try emergent behaviour, probably)? Sorry, but this is just plain impossible, since there is enough work to do from the university and other hobby projects. Give me until, like, Christmas and I'd try.
Plus, the time shortens even further, as it appears that there are documentation issues, so one would probably have to work out how the game state is given to the AI.
So overall: very interesting, but too short for someone who actually has other work to do.

- Re: (Score:2, Insightful)
  
  by WarpGiGA ( 647974 ) writes:
  
  Consider just doing it for fun then, the $500 prize isn't worth whining about anyways.
- Re: (Score:2)
  
  by jtogel ( 840879 ) writes:
  
  The CIG deadline is September 3, which is almost a month away (you don't need to submit to the first phase). Plus, if the competition is a success, it will run next year as well.
  
  While I admit the documentation is a bit on the short side, it should be perfectly enough to get started. All you need to do is look at the Agent interface, and there you have the format of the data the game is giving to you.
  - Re: (Score:2)
    
    by jandrese ( 485 ) writes:
    
    A month is not much time at all for something like this.
- Re: (Score:1)
  
  by xtracto ( 837672 ) writes:
  
  It is amazing [youtube.com] what less whining and more working can achieve.
That's nothing. (Score:1)

by jovius ( 974690 ) writes:

I'm running a competition to see who can program an AI that will create the game Super Mario Bros, by analyzing the behavior patterns of human subjects.
- Re: (Score:3, Insightful)
  
  by Hatta ( 162192 ) * writes:
  
  So how does it test that a random level is completable? Seems to me that if there were an algorithm to do this, this competition would be moot. If there's not, there's going to be a lot of trouble with impossible random levels.
Nintendo's AI (Score:1)

by hellfish006 ( 1000936 ) writes:

I assume this competition will end before New Super Mario Bros. Wii comes out. But when it does, someone should try to get that AI to work on the game for this competition. The AI I am speaking of is the new feature where if something is too difficult you can press start, select an option, and it will complete the area for you.
- Re: (Score:2)
  
  by aardwolf64 ( 160070 ) writes:
  
  Unfortunately, New Super Mario Bros. Wii is using a fixed level... So it's not so much AI as it is preprogrammed response. It likely wouldn't know how to deal with a randomly generated level.
Neat! (Score:2, Interesting)

by The_Duck271 ( 1494641 ) writes:

Programming games are fun!

However, I've yet to see such a contest in which the successful entries used AI techniques rather than handcoded decision-making. My money says the winners of this will be handcoded and possibly tuned automatically, and not based on neural networks or genetic programming or whatever. I suspect this is true because these games are set up so that the game mechanics and the outlines of good strategy are very intuitive to humans, and so it's most efficient for the human programme
Mechanical Turk? (Score:2, Funny)

by GranBurguesa ( 720088 ) writes:

But really, any approach is fair game
Just how random is random? (Score:1, Interesting)

by Anonymous Coward writes:

Just how random is the 'random' level generator? If an AI is beaten by a weaker AI because another team managed to exploit patterns in the pseudo-randomness, then your competition results wouldn't really mean anything.
2009 Reinforcement Learning Competition (Score:3, Interesting)

by gizmoguy4242 ( 262865 ) writes: on Wednesday August 05, 2009 @09:24AM (#28956117)

Just thought I'd point out that we also did this in the 2009 Reinforcement Learning Competition (I was the general chair):
http://2009.rl-competition.org [rl-competition.org]

We also used Infinite Mario Bros, but combined it with the RL-glue coding framework to make the interface easier. That way, a well-coded agent is automatically compatible with any other domain that is RL-glue compatible.

The prizes were also comparable: ~$450 for the first place team, ~$250 for the second place team.

The results were interesting: far from developing interesting and novel RL algorithms, most competitors used clever feature engineering combined with dimensionality reduction to reduce the full Mario problem to a simpler one that could be solved efficiently using existing RL algorithms that are robust and well understood.
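As a rough illustration of what such feature engineering can look like, the sketch below collapses a tile grid into three small integers that a standard RL algorithm could use as its state. The grid encoding ('#', 'E', '.'), the function name `extract_features` and the choice of features are all assumptions made for this example; they are not the representation any competitor actually used.

```python
def extract_features(grid, row, col):
    """Summarise what lies to the right of Mario as three small integers.

    grid: list of equal-length strings using a made-up encoding:
          '#' solid tile, 'E' enemy, '.' empty.
    (row, col): Mario's tile, assumed to be directly above the floor row.
    Returns distances in tiles, or -1 when nothing of that kind is visible.
    """
    ahead = range(col + 1, len(grid[0]))

    def first(r, wanted):
        # Distance to the first tile in row r, right of Mario, matching `wanted`.
        return next((c - col for c in ahead if wanted(grid[r][c])), -1)

    gap = first(row + 1, lambda tile: tile != "#")    # next hole in the floor
    enemy = first(row, lambda tile: tile == "E")      # next enemy at Mario's height
    wall = first(row, lambda tile: tile == "#")       # next obstacle to jump over
    return gap, enemy, wall


grid = [
    "........",
    "...E..#.",
    "#####.##",
]
features = extract_features(grid, 1, 0)
print(features)
```

With three bounded integers instead of a full grid, a plain table of values indexed by state becomes small enough for textbook methods, which matches the pattern described above.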

One of the big lessons that we took away from this was that we haven't solved the mechanism design problem of competitions in AI. While Mario sounds like a good "grand challenge" problem for RL / AI, it turns out that simple heuristics work pretty well. I think this is a common problem for most of these competitions -- there's the Trading Agent Competition, there's Netflix, there's the General Game Playing Competition, etc. They all have the same goals, and they all have the same problem: competitors engineer algorithms to solve the competition, not to spur progress in general AI. These games are all a proxy for what we really care about (like the Turing test), and the proxy isn't perfect (like the Turing test).

I think the only way to get around this is to craft a domain that mimics the real world, because then if anyone "solves the competition," you've made progress on what you really care about.

It would be interesting to design a competition with these goals in mind. Maybe an extraordinarily complex simulator based on a physics engine (Bullet or Havok) would be a step in the right direction -- different objects with continuous, high-dimensional state spaces and complex material properties (some are soft, some are rigid, some break, etc); interesting physical interactions between objects (collisions, joints, hinges, stacking, breaking, etc.); multiple levels of spatio-temporal abstraction (from low-level motor control to abstract tasks) and a strong vision component. Now that would be a cool competition!

David Wingate
wingated@mit.edu

- Re: (Score:1)
  
  by CnlPepper ( 140772 ) writes:
  
  It would be more interesting to make a more complex game/simulator; however, the problem with this is that it rapidly raises the bar for entry into the competition. Amateurs without the necessary computing power could end up being rather left out.
  A game where the input space is larger and less predictable, which also has scope for live tactical and/or strategic decisions would be interesting. How about a 3D, fully Newtonian physics space combat simulator in a chaotic asteroid field. Basically robocode++. The agen
This has been done for the original asteroids (Score:1)

by Gunstick ( 312804 ) writes:

Asteroids played by robots. Contest: http://www.heise.de/ct/projekte/machmit/asteroids/ [heise.de] Results: http://www.heise.de/ct/creativ/08/02/ergebnisse/ [heise.de] (push on the play buttons to see the videos)
Can humans even do this reasonably? (Score:2)

by immakiku ( 777365 ) writes:

Can humans even do this reasonably? By that what I mean is, when we play, we don't play based off what we see on the current screen. I doubt many people have beaten the game on their first play through. Most of us had multiple tries, seen more of the level and honed our timing in tricky spots.
- Re: (Score:2)
  
  by jerep ( 794296 ) writes:
  
  An AI could have some memory of the level too; it's really not hard to just save the actions up to where it died, keep a state of what the area looked like, and then plan ahead for its second try.
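  A rough sketch of that "remember what worked, retry from there" idea, on a made-up one-dimensional level. Everything here (the tile names, `attempt`, `learn_level`) is hypothetical and only for illustration; it is not the competition's API, and a real agent would also have to deal with enemies and timing.

```python
import random


def attempt(level, memory, rng):
    """Play one attempt on a toy 1-D level; return (cleared, safe_actions)."""
    actions = []
    for pos, tile in enumerate(level):
        if pos < len(memory):
            action = memory[pos]  # replay what survived last time
        else:
            action = rng.choice(["run", "jump"])  # explore unknown territory
        # Assumption of this toy model: a gap must be jumped,
        # and on flat ground either action is safe.
        if tile == "gap" and action != "jump":
            return False, actions  # died here; actions is the safe prefix
        actions.append(action)
    return True, actions


def learn_level(level, max_tries=1000, seed=0):
    """Retry until the level is cleared, keeping the surviving prefix each time."""
    rng = random.Random(seed)
    memory = []
    for tries in range(1, max_tries + 1):
        cleared, actions = attempt(level, memory, rng)
        memory = actions  # saved actions up to where it died
        if cleared:
            return tries, memory
    return None, memory
```

  Calling `learn_level(["flat", "gap", "flat", "gap"])` returns the number of attempts used and the final action list. The saved prefix never shrinks, because replayed actions are already known to be safe.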
- Re: (Score:3, Insightful)
  
  by Bakkster ( 1529253 ) writes:
  
  Can humans even do this reasonably?
  No, but shouldn't a computer be able to do it better? Perfect concentration, perfect timing, the ability to make split-second decisions, no visual limitations of how much of the screen can be seen; computers have the potential to do far better. Isn't that the idea?
Most difficult yrs in marriage R after the Wedding (Score:1)

by t2000kw ( 1066988 ) writes:

A few words of advice from someone who is in a similar marriage and has been since 1974. The most difficult years in marriage are after the wedding. :-) I say that because so many people think that marriage will bring eternal bliss. It doesn't. But it doesn't have to be a living hell, either. Marriage isn't so much about how compatible you are but how well you deal with incompatibility. It helps to find common interests that you can share in together. Look at disagreements as a challenge to work out toge
- Re: (Score:2)
  
  by X0563511 ( 793323 ) writes:
  
  Stop spamming already.
- Re: (Score:2)
  
  by TheRaven64 ( 641858 ) writes:
  
  Are you an idiot who doesn't realise that this has absolutely no relevance to the discussion and that CUDA is incredibly poorly suited to the kind of branch-heavy coding that a solution to this is likely to involve, or is nVidia paying you to spam?
  - Re: (Score:1)
    
    by CnlPepper ( 140772 ) writes:
    
    Actually you are quite wrong to say CUDA is irrelevant. Not all the AI techniques that could be applied to this problem require heavy branching. Neural networks, for example, can have minimal branching and will be considerably accelerated using CUDA/OpenCL.
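    To illustrate the "minimal branching" point, here is a minimal sketch of a fully connected network's forward pass in plain Python. The layer layout, the action names and the helpers `forward` and `pick_action` are assumptions made up for this example, not anything from the competition. The same multiply-add-tanh arithmetic runs for every input with no data-dependent `if`, which is the kind of workload that maps well onto a GPU.

```python
import math


def forward(weights, biases, inputs):
    """Feed inputs through fully connected tanh layers.

    weights: one matrix (list of rows) per layer; biases: one list per layer.
    """
    activations = inputs
    for layer_w, layer_b in zip(weights, biases):
        # Each output unit is a weighted sum plus bias, squashed by tanh.
        activations = [
            math.tanh(sum(w * a for w, a in zip(row, activations)) + b)
            for row, b in zip(layer_w, layer_b)
        ]
    return activations


def pick_action(outputs, names=("left", "right", "jump", "run")):
    """Choose the action whose output unit is largest (hypothetical action set)."""
    return names[max(range(len(outputs)), key=outputs.__getitem__)]
```

    The only decision is the final argmax in `pick_action`; everything before it is straight-line arithmetic.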
