Monte Carlo Tree Search
Yeshwanth Nagaraj
Democratizing Math and Core AI // Levelling playfield for the future
Monte Carlo Tree Search (MCTS) is an algorithm for finding optimal decisions in a given domain by incrementally building a search tree from random sampling of the decision space.
MCTS has gained widespread attention because of its simplicity and effectiveness, particularly in scenarios where the decision space is too large to exhaustively search. It is also a suitable choice for problems where the transition model and reward function are not known or too complex to describe with a simple formula.
How MCTS Works
The MCTS algorithm consists of four main steps repeated multiple times:
- Selection: Starting at the root node (representing the current game state), a tree policy guides the selection of child nodes down to a leaf node. The tree policy is typically based on a balance between exploiting nodes with high average reward and exploring less-visited nodes.
- Expansion: Once a leaf node is reached, one or more child nodes are added to expand the search tree, based on the available actions.
- Simulation: A simulation is run from the new leaf node according to a default policy to produce an outcome. This step is also called a "rollout."
- Backpropagation: The result of the rollout is backpropagated up the tree to update the information stored in the nodes visited during the selection phase.
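A common concrete tree policy for the selection step is UCB1, the rule behind the widely used UCT variant of MCTS. The sketch below is a minimal illustration, assuming each node tracks a cumulative value and a visit count; the exploration constant `c` is a tunable parameter (sqrt(2) ≈ 1.41 is a common default), not something prescribed by MCTS itself:

```python
import math

def ucb1(child_value, child_visits, parent_visits, c=1.41):
    """UCB1 score: average reward plus an exploration bonus.

    Higher `c` favors exploration; unvisited children get
    infinite priority so every action is tried at least once.
    """
    if child_visits == 0:
        return float("inf")
    exploit = child_value / child_visits
    explore = c * math.sqrt(math.log(parent_visits) / child_visits)
    return exploit + explore
```

During selection, the parent would pick the child maximizing this score, which naturally shifts from exploration to exploitation as visit counts grow.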
After a predefined number of iterations (or when a time budget is exhausted), the algorithm returns the action leading to the most promising child of the root, typically the one with the highest visit count.
Example Python Code
Here's a simplified Python snippet illustrating a very basic MCTS. Note that this example is minimal and doesn't include the optimizations you might find in a full-fledged implementation.
import random

class Node:
    def __init__(self, state, parent=None):
        self.state = state
        self.parent = parent
        self.children = []
        self.visits = 0
        self.value = 0

def rollout(node):
    # Simulate a game from this node to a terminal state and return the reward.
    # For now, return a random reward.
    return random.choice([-1, 1])

def select(node):
    # Tree policy: select a child node from the given node.
    # For simplicity, choose a random child.
    return random.choice(node.children) if node.children else None

def expand(node):
    # Generate a child node and add it to the children of the current node.
    # Here the state is an integer and the action is to add 1 or 2.
    new_state = node.state + random.choice([1, 2])
    child = Node(new_state, parent=node)
    node.children.append(child)
    return child

def backpropagate(node, reward):
    # Propagate the reward and visit count back up to the root.
    while node:
        node.visits += 1
        node.value += reward
        node = node.parent

def mcts(root, iterations=1000):
    for _ in range(iterations):
        # Selection: walk down the tree until a leaf is reached.
        node = root
        while node.children:
            node = select(node)
        # Expansion and simulation: roll out directly from unvisited
        # leaves; expand previously visited ones first.
        if node.visits == 0:
            reward = rollout(node)
        else:
            node = expand(node)
            reward = rollout(node)
        # Backpropagation: update statistics along the visited path.
        backpropagate(node, reward)

# Example usage
root = Node(state=0)
mcts(root)
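Once the search finishes, the recommended move is read off the root's statistics. A minimal sketch, assuming (as in the example above) that each node exposes `children` and `visits` attributes; preferring the most-visited child is a common, robust choice:

```python
def best_child(root):
    # Recommend the most-visited child of the root; visit counts are
    # usually a more stable signal than raw average values.
    if not root.children:
        return None
    return max(root.children, key=lambda c: c.visits)
```

With the example above, calling `best_child(root).state` after `mcts(root)` gives the state reached by the most-explored action.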