[q_learning]Flappy Bird

less than 1 minute read

Flappy Bird

I’ve set values like below.

EPISODES = 100000       COLLISION_PENALTY = 800    POINT_REWARD = 50      ALIVE_REWARD = 1
epsilon = 0.9           EPS_DECAY = 0.99998        STEP = 500             LEARNING_RATE = 0.3
DISCOUNT = 0.95         

The observation values are UPPER and LOWER which are calculated by the coordinate of bird and the coordinate of obstacles.

(UPPER and LOWER)

(result)

(average of episode rewards)

I modified flappy bird game on https://github.com/Anish-Malla/Flappy-birds-game-using-pygame

Share on

Twitter Facebook LinkedIn

Hyeong Jun An (Sam.An)

[q_learning]Flappy Bird

Flappy Bird

Share on

You may also enjoy

Medicine Vending Machine(2)

Find Missing People

Medicine Vending Machine

Algorithm from leetcode(3)