Ultimate 2048 aims to explore and compare the efficacy of various reinforcement learning (RL) algorithms in playing the game 2048, a puzzle game on a 4x4 grid where tiles of equal numbers are merged to create larger tiles until the 2048 tile is reached. Our focus is to (1) find and optimize an algorithm that can learn to reach the 2048 tile, and (2) perform a comparative analysis of Policy Gradient Algorithms - more specifically, Proximal Policy Optimization (PPO) and Advantage Actor-Critic (A2C) - and Monte Carlo Tree Search (MCTS) for this game under a single, controlled setting.
View our public repository @ Ultimate 2048 on GitHub
The following reports were written by our team for CS 175: Project in AI during the Winter 2025 quarter.
Learn more about our team @ Team Details