qdive goes to space

We’re participating in this year’s Kaggle Kore competition! Competitors write algorithms that play the Kore game against each other. While conceptually simple (steer your spaceships to mine the most resources and/or defeat your opponent), the game features a few challenging aspects that any agent must handle in order to win. Especially if you want to tackle the problem with Reinforcement Learning, which we of course do!

Reinforcement Learning (RL) is hard. Even though some big names have “solved” complex games before (Chess, Go, StarCraft, etc.), the leaderboards of most of these smaller competitions are dominated by cleverly written rule-based algorithms. To train an RL agent, one needs patience, computing power, and a lot of trial and error refining the agent’s reward function as well as its state and action spaces. In this particular challenge, the latter two are further complicated by the fact that flight plans (an essential component of the game) have variable lengths: they can’t be mapped statically to a point in a fixed N-dimensional space. We sketched different approaches to this problem in the discussion forums (one possible encoding is shown below), but the choice is ultimately down to the algorithm designer.
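As an illustration, here is a minimal sketch of one way around the variable-length problem: pad or truncate every plan to a fixed number of tokens. The vocabulary, the padding token, and the MAX_LEN cap below are our illustrative assumptions, not part of the game’s API, and padding is only one of the approaches discussed in the forums.

```python
import numpy as np

VOCAB = list("NESW0123456789C")  # directions, step counts, convert command (assumed token set)
PAD = len(VOCAB)                 # extra id reserved as the padding token
MAX_LEN = 8                      # assumed cap on plan length for this sketch

def encode_plan(plan: str) -> np.ndarray:
    """Turn a flight-plan string like 'N5E3S' into a fixed-length int vector."""
    ids = [VOCAB.index(c) for c in plan[:MAX_LEN]]
    ids += [PAD] * (MAX_LEN - len(ids))
    return np.array(ids, dtype=np.int64)

def decode_plan(ids: np.ndarray) -> str:
    """Inverse mapping: drop the padding and rebuild the plan string."""
    return "".join(VOCAB[i] for i in ids if i != PAD)

assert decode_plan(encode_plan("N5E3S")) == "N5E3S"
```

The trade-off is that a hard cap silently truncates longer plans, so the length limit itself becomes a design decision for the agent.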

Another key element in training an RL agent is a solid implementation of the chosen RL algorithm. This can be quite tricky, too, especially for newcomers to the field. That’s why, following Kaggle’s philosophy of competing hard but supporting each other along the way, we built and published an OpenAI Gym wrapper that makes the Kore environment compatible with the powerful RL library stable-baselines3.
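For the curious, here is a minimal sketch of what such a wrapper can look like, assuming the kaggle_environments trainer interface. The flat observation and action spaces and the placeholder _encode_obs/_decode_action methods are illustrative stand-ins, not the published wrapper’s actual design:

```python
import gym
import numpy as np
from gym import spaces
from kaggle_environments import make

class KoreGymEnv(gym.Env):
    """Gym-style interface so stable-baselines3 can drive the Kore environment."""

    def __init__(self):
        super().__init__()
        self.kaggle_env = make("kore_fleets")
        # Hypothetical flat encodings; a real wrapper derives richer ones
        # from the 21x21 board.
        self.observation_space = spaces.Box(-1.0, 1.0, shape=(441,), dtype=np.float32)
        self.action_space = spaces.Box(-1.0, 1.0, shape=(2,), dtype=np.float32)
        self.trainer = None

    def reset(self):
        # Play against a built-in opponent; 'balanced' is our assumed choice,
        # any agent callable accepted by kaggle_environments works here.
        self.trainer = self.kaggle_env.train([None, "balanced"])
        return self._encode_obs(self.trainer.reset())

    def step(self, action):
        raw_obs, reward, done, info = self.trainer.step(self._decode_action(action))
        return self._encode_obs(raw_obs), float(reward or 0.0), done, info

    def _encode_obs(self, raw_obs):
        # Placeholder: a real wrapper maps the raw board state into the Box above.
        return np.zeros(self.observation_space.shape, dtype=np.float32)

    def _decode_action(self, action):
        # Placeholder: a real wrapper turns the network output into shipyard
        # commands and flight plans; an empty dict is a legal no-op action.
        return {}
```

The point of the wrapper is that once reset() and step() speak Gym, every algorithm in stable-baselines3 works against Kore out of the box.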

Check it out! Reinforcement Learning baseline in Python

This baseline radically simplifies training an RL algo on this competition and helps the community experiment with different approaches without having to start from scratch. Additionally, we demonstrate how to upload a trained agent, a step that is considerably more involved than in standard supervised competitions (see the sketch below).
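To make the workflow concrete, a training run could look like the following sketch, assuming the KoreGymEnv wrapper above. The choice of PPO and the hyperparameters are placeholders; the published baseline documents its own setup.

```python
from stable_baselines3 import PPO

# Train a PPO agent on the wrapped environment (illustrative settings only).
env = KoreGymEnv()
model = PPO("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=100_000)
model.save("kore_ppo")  # saved weights get bundled into the submission
```

At submission time, the agent’s entry script then loads the saved weights and translates each turn’s observation into shipyard commands, which is exactly the packaging step that makes uploading a trained agent more involved than a standard supervised submission.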

We’re not finished yet, though. There are still plenty of rule-based algos to beat before the competition closes! See you in Kore!
