


Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Material Type: Assignment; Professor: Shelton; Class: ARTIFICIAL INTELLIGENCE; Subject: Computer Science; University: University of California-Riverside; Term: Fall 2006;
Typology: Assignments
1 / 4
This page cannot be seen from the preview
Don't miss anything!



Your goal is to design an electronic “gofer” (an assistant who performs chores by run- ning around fetching things). Your boss is a high-powered Hollywood executive who doesn’t have time to run around fetching coffee, paperwork, and the like. Therefore, he has selected you to come up with a robot that can do this for him. Unfortunately, you aren’t the only one so “chosen.” Your agent must compete against other agents for the right to be named as this executive’s official robotic assistant.
The service area is divided into a rectangular grid. Each cell in the grid is either empty or contains an impenetrable wall. Empty cells may contain robots (and wall cells can- not). Each robot occupies one grid cell, has an orientation (one of the four cardinal directions), and is controlled by one player.
Each robot is informed of its starting location and direction as well as those of its opponent. The layout of the rectangular grid is also known.
Unfortunately (or perhaps fortunately, depending on how you view things), the robot’s sensors are limited. While they know precisely the location of all current requests (flags), they can only sense the four nearest walls, and hear the sound of the opponent robot (robots are not quiet). And these senses are limited too. There is a fixed prob- ability of misreading the surrounding walls. The opponent robot does not count as a wall.
Furthermore, due to echos, robots may misinterpret their auditory sensors. The noise heard is directly related to the distance between the robot and its opponent. With a 50% chance, the robot predicts the correct distance. However, with a 25% chance, the robot estimates a distance that is 1 square too close. And, with a 25% chance, the robot estimates a distance that is 1 square too far. Distance is the length of shortest path through the floor plan that does not go through a wall.
Players alternate turns. Each turn a player’s robot observes its immediate surroundings (via the two sensors described above). Each turn the player must decide which action to take. There are four actions available. If the player does not decide within 2 seconds, its robot takes no action and the turn passes to the next player. The four actions are nothing, forward, turn left, and turn right.
The first causes the robot to remain in the same square with the same orientation.
The second advances the robot one square in its current direction. If there is a wall or another robot in the way, the robot does not move. There is also a chance (known to the robot) that this move fails, leaving the robot in its original location. The last two actions turn the robot 90 o^ to the left or right (respectively). There is a chance (known to the robots, but different than above) that this rotation fails, leaving the robot’s orientation unchanged.
At the beginning of the game, a set of service locations (known as flags) are specified by the boss. The first agent to arrive at any of these locations is awarded a point for having performed the necessary fetching service. Throughout the game, the boss may, from time-to-time, remember new tasks and add them to the list of flags. An agent only receives credit for a task if it is the first one to the location after the flag is revealed by the boss. The locations of all flags and whether or not they have been collected (and if so, by whom) is know to both agents.
When the competition starts, each player has 10 seconds to do any necessary initial- izations based on the preliminary map. The competition runs for a fixed (and known) number of rounds. A round consists of (in order): the placing of new flags, the turn for the first agent, and the turn for the second agent. At the end, the agent with the most number of claimed flags is the winner.
All entries must be submitted by 11:59pm on December 12, 2006. Students may only submit one entry.
All games will be run in pairs. Each game of a pair will pit the same two opponents against each other on the same game board with the same parameters, but with different random number seeds. The players swap places (who is player 1 and who is player 2) between games of a pair.
The entries will first be all played against each other in a “round robin” fashion: each competitor playing a pair of games against each other competitor. The results of this pool play will seed the competitors in a single elimintation tournament. The top teams will get “byes” (automatic victories) in the first round of the tournament if the number of entries is not a power of two. Ties will be broken by continuing to play pairs of games until the number of games won by each opponent is no longer even.
The pool play games will all be on the same board. It will not be revealed before the submission deadline. The knock-out tournament will also all be run on the same board. It too will not be revealed until after the submission deadline.
The initial stages of the tournament will be run on December 13th and 14th. The results will be posted online in real time as the tournament proceeds. The semi-finals and finals (all games after the field has been paired down to 4 agents) will be displayed live in
should explain how you approached the problem, which algorithms or methods you used (and why), and any modeling assumptions you made. I would also like to see some analysis of its behavior (why it works or doesn’t and how you would improve it).
As such, your grade will depend on how well you meet those criteria. Here is a rough grading rubric.
This assignment will test your ability to apply your knowledge to a new problem. Therefore, I will not tell you how to approach it. However, we have covered (or will cover) a number of topics, all which could be useful: search, game-playing, opti- mization, logical reasoning, planning, probabilistic inference, hidden Markov models, Markov decision processes, supervised learning, and parameter estimation.
May the best gofer win.