UPDATE:
The AI has now surpassed the world record in my new video:

An AI bot learns to play QWOP like a human and achieves a top-10 speedrun time (1m 8s). It was trained using reinforcement learning and imitation learning.

Writeup:
https://towardsdatascience.com/achieving-human-level-performance-in-qwop-using-reinforcement-learning-and-imitation-learning-81b0a9bbac96

Github repo:
https://github.com/Wesleyliao/QWOP-RL

Papers mentioned:
– Sample Efficient Actor-Critic with Experience Replay
https://arxiv.org/pdf/1611.01224.pdf
– Deep Q-learning from Demonstrations
https://arxiv.org/pdf/1704.03732.pdf

Kurodo’s channel:
https://www.youtube.com/channel/UCLxJfj_Dq8Ks89tUVR3z7ug

QWOP speedrun leaderboard:
https://www.speedrun.com/qwop
