Rules-Unknown Artificial Intelligence Competition 176
OOglyDOOde writes: "This link points to a competition being hosted by a company that makes research on artificial intelligence. The task? Build a program that can play a number of games whose rules are totally unknown -- and earn the best score while competing against various opponents. Your program is told the possible choices available, when it should make a move, what did the opponent do; and what was your score for the last turn. There are no entry fees yet there is a cash prize. Submissions can be done in various languages, or in Linux or Windows binaries." This is certainly one of the odder ones I've ever seen, but has interesting prizes (trip to Israel) and rules (fairly broad entry categories).
My Statistics Professor Has Already Done This (Score:5, Insightful)
This code could be similarly adapted to any game, inasmuch as the code can register a table with all the possible moves provided by the interface. It doesn't even have to know what those moves do; it only needs to know if, by doing certain moves according the "state" (or the attributes) of the game, it gains points (or stays alive or whatever) or loses points. The moves are then given a distribution weighting factor. Then, the algorithm just needs to approximate the game state with the registered table entries, determine which moves have the highest "survival rate" based on the current game attributes, and then perform those moves.
Depending on the game, it may take a long time before the random variable distribution table gets populated to the point where the algorithm can make "intelligent" decisions, but it works nonetheless.
Aha! (Score:1)
So that's where all those camping bastards came from!
Is there a page for this? (Score:2)
Even if I could write this program i wouldn't... (Score:1, Flamebait)
Going through UN archives or resolutions, even the United Nations condemns Israel for taking the land by force the way they did. And what's more ridiculous, is that now they say "You palestinian's shouldn't fight us, we're ready to make peace on our Israeli terms". Ridiculous! That's like me walking into Taco's home and kicking his butt out and then when he fights back, I cry to all of Slashdot saying he is psychotic. And when that's over, I tell him that he can live in the mulch pile in the back which I sleep in his room.
I know this message will get moderated down to -1, and is it doesn't people will sit here an attack me. Both sides have done wrong, I just think the people who started it (Israeli's) should take their stuff and walk away. Didn't the holocaust help them realize anything? What's ironic is that the holocaust of 21st century is being conducted by the Israeli's and our US taxpayer money supports them with weapons!
Ultimate lameness...!
Re:Even if I could write this program i wouldn't.. (Score:1)
Re:Even if I could write this program i wouldn't.. (Score:1)
Re:Even if I could write this program i wouldn't.. (Score:1)
Just because someone hates you idelogically, that's no reason to treat them like shit.
Re:Even if I could write this program i wouldn't.. (Score:1, Insightful)
Re:Even if I could write this program i wouldn't.. (Score:1)
Making generalities about people is exactly the problem here... I have met plenty of Palestinians who don't want to kill the Israelis, and plenty of Israelis who don't want to kill the Palestinians.
Okay - I'm done. No more political crap from me
Re:Even if I could write this program i wouldn't.. (Score:1)
Last I checked, democracy isnt perfect.
There are many other factors for why people choose who they choose. Keep looking at the shallow side, and you'll never get anywhere.
Not that you tried to, anyway.
PS:
Incase you didnt notice, we didnt fire a single shot against those "poor, innocent" people for months. And yet, they keep on shooting. When we do shoot, its rare, and in response to something they did. Very inhuman. I guess that we should stop responding at all and just sit back and get killed. Fun, isnt it.
And dont start with the "who did what first". When you get killed, it doesnt matter. When we tried to talk, they didnt stop shooting for a moment. During the peace process, they kept saying that "peace is just a cover! We will kill them all anyway!" in public radio and TV. But, its ok. They're palestinians. They are permitted to do it.
Re:Even if I could write this program i wouldn't.. (Score:1)
some design specs for potential participants (Score:4, Informative)
After reading the guidelines to the contest, I figured I'd offer the following models/design specs for those interested in participating:
But the real question is (Score:3, Funny)
Re:some design specs for potential participants (Score:1)
I just finished reading the second and it is a thought provocing article.
Some black humor (Score:1, Funny)
How will they do this? (Score:1)
A round-robin tournament will be held to select the winner of the Learning Machine Challenge. All combinations of players will take part in all games, of which there will be between six and twelve.
As I see it, they plan to let every contestant play against every other, on 6-12 games, several thousand moves each.
Where will they find the time to do this if they get more than just a few dozens of entries?
Re:How will they do this? (Score:1)
Re:How will they do this? (Score:1)
I realized that.
But, let's say we get 1000 contestants.
Also, they specify 6-12 different games for each, let's say 10 on average.
This will mean 10*1000 * 999 / 2 matches (when A has played against B, B has played against A) of many thousand moves (let's say just 2000) each.
A program is required to process at least 10 moves per second. Worst case, this will lead to roughly 1000 million seconds, or 30 years, of computer time.
When you get more than 1000 contestants, the time required for the match will rise quadratically.
I realize that the average program will be faster than 10 moves/second, and that you can use several computers to speed things up (from the height of the prize, I gather that their budget is not unlimited, so I think more than 10 computers will be out of the question), but still, if you get a significant number of contestants, letting every contestant play agains very other may be prohibitive.
Re:How will they do this? (Score:1)
Interesting (Score:1)
Re:Interesting (Score:1)
One method is with classifier systems, where you evolve the rules that determine the output based either directly on the input, or a chain leading from some input. It starts with a pool of random bit strings which are evolved based on their success. The rule used is determined by a bidding scheme.
Another method, which is about as general as you can get, is genetic programming (GP). GP involves creating a set of functions and terminals and randomly generating a set of parse trees using them. Each of these programs is evaluated, and based on that the standard genetic algorithm operations are performed for form a new generation. Essentially, genetic programming is automatic programming, if given the right function and terminal set. Unfortunately, it would probably be too slow a process for this competition.
Both of the above methods have been proved over and over again. Classifier systems, for instance, have been used to run a simulated oil pipeline (with leaks, blockages, etc). Starting from a random population, it achieved human competitive results. Genetic programming has produced results that are not only human-competitive, but also infringe on pre-existing patents. [genetic-programming.com]
The simple solution (Score:1)
Even more interesting... (Score:1)
User-submitted binaries? (Score:1)
Somehow, I don't get the feeling that these people have planned this very thoroughly. There are other little things that don't quite seem right, too...
Re:User-submitted binaries? (Score:1)
Easier than I feared (Score:3, Interesting)
I was under the misconception that at each turn in the game, the judge will inform the player of all possible moves (as in chess, checkers, or the like) but looking at the specification, it seems that the moves are detailed at the outset of the game, and then are available to each player at each stage in the game.
now the odd thing to me is the measure of 'state' in the game. Is the score that's returned after each move the current cumulative score, the score for that move alone, or what? Also, what is the goal of the game? It would be short-sighted to assume it's to amass the highest score. In effect, the score is just another input variable, along with the opponents move, which may or may not be useful for judging what is a good move or a bad move.
For example, if you were trying to make an algorithm to solve the A8 puzzle (the 'sliding tile puzzle' with 15 tiles and 16 spots), and the computer judged your score by totalling up manhattan distances to the goal state, that may or may not be a fair scale of how many moves away you are from winning in an ideal case.
The system is still underspecified. Without knowing what 'score' means, and whether it is an estimate or a deterministic function, then the project is pretty much a game of luck, and coding is not an effort of skill.
Re:Easier than I feared (Score:1, Informative)
"Score" is the number of points scored that turn. It is a floating point number between -1 and 1.
The goal *is* to get the highest score.
Re:Easier than I feared (Score:2)
Imagine in chess where you scrifice a pawn to gain position - in that case your "score" that turn would be negative yet you would be able to gain a higher overall score later as a result. With only knowledge of the "score" for that turn, how can you decide when a sacrificial move might leave you better off?
Re:Easier than I feared (Score:1)
Re:Easier than I feared (Score:2)
Lose a queen and you never know what it is to not have lost a queen, for example.
Re:Easier than I feared (Score:1)
Re:Easier than I feared (Score:1)
It really doesnt matter so long as the names for the all legal moves are given to you at the beginning. Just think of them as symbols, buttons to push at any given time. You shouldnt be trying to interpret meaning in the moves that are made, only value. It will be impossible to create some kind of high level of understanding about what a move means because the program does not know what the game is. It is possible to learn what values moves have though using some kind of reinforcement learning algorithm. Think of it this way: If your program believes it is in a certain state and its opponent executes a certain move, it should pick the move available to it that maximizes the expected reward
Is the score that's returned after each move the current cumulative score, the score for that move alone, or what?
Based on the example used in the specs, I would imagine that the score is for the most recent action if given after each action or cumulative if given only once at the very end. Either way, it will be relevant in the global context of the entire contest since the program with the highest overall score after playing all games against all opponents will be the winner.
The system is still underspecified. Without knowing what 'score' means, and whether it is an estimate or a deterministic function
I think its better to think of score like a reward or a penalty. The idea is to accumulate the most reward or the least penalty. As they say in the specs, the program with the highest score is the winner. If chess were to be one of the games(which I doubt it will due to its complexity), the score would probably be 1 for a win, 0.5 for a draw and 0 for a loss, just like at any chess tournament. There would be no intermediate score for taking a queen or getting into a strong position because those things are meaningless after the game is finished.
coding is not an effort of skill.
This contest is not about coding, its about learning. They dont care how good you are at writing programs, only at how good you are at designing a robust system that learns quickly. And as I hinted at before, this problem is just screaming for contestants to use a reinforcement learning algorithm.
Offtopic: Trip to Israel (Score:5, Funny)
Of course, there may be some connection between the prize and the game ("win a conflict where you've got no clue of the rules", that pretty much sums up the problems of both parties in the Middle East).
Re:Offtopic: Trip to Israel (now totally offtopic) (Score:1)
The Palestinian have nothing to loose, most of them are rather happy to die in this war, and the Israelian have everything to loose.
Re:Offtopic: Trip to Israel (not really offtopic) (Score:1)
Anyway, as an Israeli I can assure you that the
"situation" has some really low chances of
hurting any tourist. Fact: i'm taking the bus
on a daily basis and yet i'm still alive. And
I -live- here. So, really. The media just like
to over bloat things.
On the down side: Its freaking HOT. Dont get
here unless you're heat tolerant. I'd take
a trip to swiss instead at any time.
Re:Offtopic: Trip to Israel (not really offtopic) (Score:1)
Re:Offtopic: Trip to Israel (Score:1)
You are more likely to be killed by a car accident than by a terrorist attack, with the odds being 5:1 against.
Yes, there are terrorist attacks. Yes, they kill us, and we kill them. Yes, we live our daily lives, and so do they. Don't panic, folks.
Re:Offtopic: Trip to Israel (Score:2)
Still, who wants to go to that part of the world right now? Is there something about landing in the middle of a shooting-blowing-people-up war that that just says `must-do vacation' to some folks?
I do like your idea about it being to play out the Middle East peace process. Couldn't be any worse than any of the current players.
Re:Offtopic: Trip to Israel (Score:2)
Whose land are the Israelis camped on? Whom did they displace? Who are second-class citizens on their own real estate? Here's a hint: while it is a Semitic people, it sure as shootin' ain't the Israelis. They stole that land, and the Palestinians want it back. We have our Indian issues--at least we paid lip service to the Indians and their nations. The Israelis simply took the land they were given. Note also that the Israelis are in violation fo the fourth Geneva convention with their settlements of conquered territory. Note also that Sharon is shortly to be indicted for various crimes against humanity.
Oh, and dare I mention the $10,000 reward Saddam is paying to the family of every "martyred" child?
Good for him. What's your point?
My point was a) I'd rather not visit that part of the world b) I'd rather not give any money to the Israelis c) if I had to give money to any of 'em, it might as well be the Palestinians, who have at least a legitimate beef. Not that I care for them that much either.
The game is Slashdot, the score is Karma. (Score:2)
But seriously. How can one consider this contest artificial intelligence? It's not like the entries have to be intelligent. They just have to be logical and well designed, and good at pattern recognition.
Look at chess as an example. This is like having a chess computer that has to learn the rules. Compared to playing chess (which is computable), learning the moves is relatively easy.
Re:The game is Slashdot, the score is Karma. (Score:1)
Wrong. Let's suppose this program gets a list of available moves every turn. How is it to even know the board is a 8x8 board and the relations between each square? How is it supposed to know the significance and difference between a king, queen, rook, knight, bishop and pawn? In order to play intelligently, it will have to learn that sacrifices can be a good thing, but is generally bad if you aren't guaranteed an advantage later in the game. That making one little mistake in your tactic/strategy can quickly lose you the game against a strong player.
Without assumptions you're really screwed. So it very much depends on the interface, how much and what information the program gets every turn. There'll always be some assumptions left too, as what you choose will always have some limitations.
- Steeltoe
Re:The game is Slashdot, the score is Karma. (Score:3, Insightful)
All depends on the much debated definition of what is 'Intelligence'.
Certainly, pattern recognition is a sign (symptom?) of intelligence.
So, what are you actually saying? What do you mean when you say 'intelligence' ?
Re:The game is Slashdot, the score is Karma. (Score:1)
Re:The game is Slashdot, the score is Karma. (Score:1)
So for example, you aren't intelligent because I'm confident that you're a human not a machine?
Re:The game is Slashdot, the score is Karma. (Score:2)
Turing said the following, in his famous paper "Computing Machinery and Intelligence":
"I shall replace the question [can machines think] by another ..." and goes on to describe the 'Imitation Game', which is so often now mis-labeled 'The Turing Test'.
He then goes on to say, after describing the imitation game;
"We now ask the question, 'What will happen when a machine takes the part of A in this game?' Will the interrogator decide wrongly as often when the game is played like this as he does when the game is played between a man and a woman?"
The problem these days is that when the Imitation Game is played, the interrogator has sometimes picked humans as machines. So, basically, the Imitation Game is worthless as an indicator of intelligence.
If it can't tell humans from humans then it is useless to try and tell humans from machines.
Paranoia (Score:1, Interesting)
This should prove entertaining. (Score:3, Interesting)
Re:This should prove entertaining. (Score:1)
Re:This should prove entertaining. (Score:1)
Re:This should prove entertaining. (Score:2)
Try Mornington Crescent. (Score:2)
It's played on the stations of the London Underground network. For example, I could start with Albright's Opening, *Regent Street*.
Anyone want to take me on?
Re:Try Mornington Crescent. (Score:1)
In that case...St John's Wood.
Re:Try Mornington Crescent. (Score:2)
I'll go for Ealing Broadway (hence declaring a state of emergency).
Re:This should prove entertaining. (Score:1)
Calvinball (Score:5, Funny)
It's called Calvinball, and it's the sport of kings.
Re:Calvinball (Score:1)
Here [alloftheabove.net] are the rules of Calvinball
Re:This should prove entertaining. (Score:1, Insightful)
You're playing it right now, bud ..
Re:This should prove entertaining. (Score:1)
You get a penalty for asking about the rules.
I probably shouldn't even tell you that much...
That recall me a couple of clever hack... (Score:5, Interesting)
The entries had to be given in the form of a subroutine that played the next move (given the current score and the history). The judges were linking two of them together and run the resulting binary.
Of course, there have been an entry that looked in the stack and modified the scores.
But the greatest was one (IMHO) that fork()ed and returned one possible response in each of its child. At next turn, the one that did not make the point (ie: had top score), exit()ed.
Mind-blowing. Found the link [ualberta.ca]
That program was the "Fork Bot"
Cheers,
--fred
AI? (Score:1, Funny)
Sincerely, Mike Bouma
Why not pick a real problem? (Score:1, Flamebait)
Re:Why not pick a real problem? (Score:2, Insightful)
Re:Why not pick a real problem? (Score:1)
Daniel.
Re:Why not pick a real problem? (Score:2)
random fortune... (Score:5, Insightful)
Re:random fortune... (Score:1)
MOD UP! (Score:1)
Re:MOD UP! (Score:4, Insightful)
Otherwise known as the seminal work of AI philosophy.
This is truly on topic, moreso that the un-enlightened could ever know. ask yourelf: Are my mod points the mod points of the un-enlightened? if no, please mod up the parent's parent as +1, Insightful.
thankyou.
Re:random fortune... (Score:2, Informative)
Re:random fortune... (Score:1)
All Sample games only work by Collusion! (Score:1)
hmm (Score:1)
Re:hmm (Score:1)
Re:hmm (Score:1)
Damn (Score:1)
How many actual AI researchers reading slashdot? (Score:4, Funny)
Doing that and dumb enough to waste his time with this. Count me one.
Re:How many actual AI researchers reading slashdot (Score:1)
Glossary (Score:2)
Backprop - The backpropagation algorithm for training neural networks. "Brought neural networks back from the dead."-popular opinion
RBF - Radial Basis Function Neural Nets.
HMM - Hidden Markov Models(I don't know what they're for)
SVM - Support Vector Machine(latest hot thing,I still don't know why its so great)
SOFM - Self Organizing Feature Maps(another NN architecure)
ART - no idea here... Like Hal Cohen's Aaron?(the drawing program)
I would like to add that I think Computational Learning Theory(COLT) and other statistical approaches seem really promising and I think we are going to be seeing some really interesting stuff in the next 5 - 10 years(ok maybe 20).
Re:Glossary (Score:1)
Rob
Re:How many actual AI researchers reading slashdot (Score:2)
AI hasn't progressed in 50 years. Its a failure.
I'm one of those "chumps" that would say something like this. Quite frankly, "Artificial Intelligence" is an abject failure.
Just because there isn't "strong AI" yet doesn't mean the field has failed.
Ah, there we are. If you don't want your field to be called an "abject failure", then don't call it AI. If you want to say that pattern recognition has made some strides, say it. Or pick your particular problem. But when you use the words "artificial intelligence", then I expect intelligence created artificially. Not "very sophisticated algorithms".
Physics doesn't have a grand unified theory, medicine can't make people live forever, etc. & we don't consider those fields failures.
No. But comparing an AI scientist to a physicist is like comparing Aristotle to Einstein. Both brilliant people, both made huge contributions to their field, but immense differences in knowledge.
I'm not saying AI will never get there, but I think a dose of reality is needed here.
tit-for-tat algorithm (Score:4, Interesting)
If this is a 2+ player competition and they're the right sorts of games (like the rock-paper-scissors game that it mentioned), whoever wins it might have to figure out a way to consistently beat the tit-for-tat algorithm.
Tit-for-tat [umich.edu] is one of the dead simplest game playing algorithms, and collectively it's one of the most successful.
It's based on the rule of "always do what the other player did last move". Under most circumstances it's impossible for it to actually win a game because the other player is always one step ahead. But its strength is in winning tournaments.
While it always loses, it never loses by much. This is different from other algorithms which usually have about as many weaknesses as they have strengths and will usually flunk out in at least some trials.
If someone can beat it consistently in a tournament situation, they really will have accomplished something in AI. Of course, this whole thing depends on exactly how the rules are structured, the scoring system and the information available to the program.
I can beat tit-for-tat (Score:1)
The real prog answers an ID string with something similar and voíla the fake entries becomes nice cooperators. (see my also my earlier post)
Re:tit-for-tat algorithm (Score:2)
What are you talking about? Tit-for-tat wins prisoner's dilemma tournaments, not games in general. The name is even based on prisoner's dilemma, it doesn't make sense to call repeating your opponents's last move tit-for-tat in general.
Also, you could trivially beat it at rock-paper-scissors: just play rock, paper, scissors, rock, paper, scissors, and so on forever. You will win every round except maybe the first one. Finally, if you actually read the rules of this competition, you would find that there's no guarantee that there will always be one opponent move for every one of your moves, so you couldn't even implement tit-for-tat.
A lot easier for AI when... (Score:1)
Two thoughts (Score:4, Insightful)
2. I wonder whether the winner could visit me.
:-)
Re:Two thoughts (Score:1)
Furriners who win a trip to the US shouldn't come unless they also get a get out of jail free card.
Exercise in neural networks (Score:2, Insightful)
The challenge is so obscure that any entry submitted will have to deploy a very generic NN and a trainer. this basicly means that after enough training any entry would do sufficiently good at any simple game (such as scissors, rock, paper) but playing anything more complex than
that is shooting in the dark. The interface and the rules of the challenge themselves are too obscure.
If there is someone with a code that could win such uncertainty effectively and efficiently, he'd be stupid to submit it for $2000.
Then again I must give a person that can do something extraordinary as that some credit for not doing something that stupid.
I might know how to win or get an unfair advantage (Score:4, Interesting)
Re:I might know how to win or get an unfair advant (Score:1)
I suspect, if they ever try this contest again, they'll want to ponder that rule a bit more...
JMR
Re:I might know how to win or get an unfair advant (Score:1)
The point is not to decline playing against the other real programs, just for the fake progs to play to the best of their ability against everyone but you.
Sending ID will most likely mean making a commitment, but for long and somewhat stochastic games (which I have a feeling that we will be dealing with here), this early commitment will (hopefully) not be enough of a handicap for the fake progs. They will still get points (provided that your algorithm is any good in the first place).
The real program will only reply an ID string when it is statistically sure that its opponent is a "buddy", thus the real program gets the unfair advantage of getting a few/many easy wins that the programs made by the other contestants will not get, but sacrifices nothing. You will still need a good program to win. This is just how one could (if one were so inclined ;) give it that extra boost.
Re:I might know how to win or get an unfair advant (Score:1)
You can't send arbitrary strings. Read the webpage. You can only look at moves, and make moves when your asked, out of the list of legal moves you're given.
But you could adapt your idea to this situation. Simply have your "master" program know the internals of all your "slave" programs, so that it can identify a likely slave based on the first few moves. Since it knows the slave's internals, it can beat it 100% if it's actually a slave. If it deviates from the known slave behaviour, well then you need a real algorithm.
In this scenario, your slaves don't even have to suck. They can use the best algo you can muster, but the master prog will still beat them because it knows what they're thinking. You can even use the same algo in the master when it identifies a real opponent.
Dead simple to do this if you already have a half-decent algo. Someone want to try it? Easy 2K...
Re:I might know how to win or get an unfair advant (Score:1)
Oh, and I just remembered that the judge program can insert moves that are part of the game, but not initiated by your opponent. This means you don't know if the opponent made the move or if the judge did it. So you'll need some fuzzy matching on your identification algorithms... hmm. Maybe not so easy.
Re:I might know how to win or get an unfair advant (Score:2, Insightful)
rand() (Score:2, Interesting)
Re:rand() (Score:1)
World is weird....
Re:rand() (Score:1)
humans? (Score:1)
Re:humans? (Score:1, Funny)
I remember that some few years ago some guys used bacteria to predict the stock market. apparently they are more susceptible to the miniscule trends humans tend to overlook. They really know how to adapt to beneficial environments, and, accordingly beat many an analyst.
However, with a doubling rate of over 20 minutes, they won't have a chance in Quake... Not even against Joe Schmoe with a 28.8k modem!
However in a slow, perhaps turnbased system they could be killers.
Sensmoral - that gaming competition has little to do with TrueIntelligence(tm). If one gets beaten by a gang of Streptococcus sp. it says very little indeed, but more perhaps on stock market analysts.
How they pay for the prizes... (Score:5, Insightful)
Wish I'd thought of it!
95% of the solution is written already: (Score:1)