Newest questions tagged [q-learning]

Deep Q Learning Approach for the card game Schnapsen

So I have a DQN agent that plays the card game Schnapsen. I won't bore you with the details of the game, as they are not closely related to the question I am ...

I am working on the reinforcement learning in Pacman project ('https://berkeleyai.github.io/cs188-website/project3.html')

In this project we are asked to implement value iteration and Q-learning, and test our agents first on Gridworld (from class), then apply them to ...

DQN not converging

I am trying to implement DQN in OpenAI Gym's "lunar lander" environment. It shows no sign of converging after 3000 episodes of training. (for compar ...

Enhancement of Agent Training Q Learning Taxi V3

I was required to enhance this code to showcase a comparison of rewards and penalties. I have to enhance it by making this code disp ...

ValueError: Error when checking input: expected Input_input to have 4 dimensions, but got array with shape (1, 1, 2)

I am trying to create a Flappy Bird AI with Convolutional Layers and Dense Layers, but at the "Train" step (Function fit()) I get the following error ...

How to cast function into a struct in C?

This is my first post on StackOverflow, so I hope the format will be okay. I want to pass functions as parameters to another function. To that end, I ...

How does the is_slippery parameter affect the reward in the FrozenLake environment?

The FrozenLake environment has a parameter named is_slippery, which if se ...

Q-table representation for nested lists as states and tuples as actions

How can I create a Q-table when my states are lists and my actions are tuples? Example of states for N = 3 Example of actions for those states I ...
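A minimal sketch of one common approach to this kind of Q-table, not necessarily what the asker ended up using. It assumes the states are nested lists of hashable values; the helper name `freeze` and the example state and action values are hypothetical, since the original examples are not shown in the excerpt. Lists cannot be dictionary keys, so the sketch converts each state to nested tuples and keys a dictionary on the (state, action) pair.

```python
from collections import defaultdict


def freeze(obj):
    """Recursively convert lists into tuples so the result is hashable."""
    if isinstance(obj, list):
        return tuple(freeze(item) for item in obj)
    return obj


# Q-table mapping (frozen_state, action) -> estimated value.
# Unseen pairs default to 0.0.
q_table = defaultdict(float)

# Hypothetical example values, not taken from the original question.
state = [[0, 1, 0], [1, 0, 0], [0, 0, 1]]
action = (0, 2)

# Store a value for this state-action pair.
q_table[(freeze(state), action)] += 0.5

# An equal state built separately maps to the same entry.
same_state = [[0, 1, 0], [1, 0, 0], [0, 0, 1]]
looked_up = q_table[(freeze(same_state), action)]
print(looked_up)
```

A dictionary only stores the pairs that are actually visited, which may matter when the full state space is too large to enumerate as an array.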

Are there benefits to having Actor and Critic use significantly different models?

In Actor-Critic methods the Actor and Critic are assigned two complementary, but different goals. I'm trying to understand whether the differences bet ...

Learning Curve in Q-learning

I wrote the Q-learning algorithm in C++ with an epsilon-greedy policy, and now I have to plot the learning curve for the Q-values. What exactly ...

How should I code the Gambler's Problem with Q-learning (without any reinforcement learning packages)?

I would like to solve the Gambler's problem as an MDP (Markov Decision Process). Gambler's problem: A gambler has the opportunity to make bets on the ...

How does Deep Reinforcement Learning remove the need to map or explore every state, action pair for an agent?

This question was migrated from Stack Overflow because it can be answered on C ...

Python code using multiprocessing running infinitely

I am trying to execute the following code in a Jupyter notebook using multiprocessing, but the loop is running infinitely. I need help resolving this iss ...

How can I Find Walking Paths for Different People in a Graph With Reinforcement Learning?

I don't know whether it is possible with reinforcement learning, but my question is about finding walking paths for different people in a graph. A sampl ...

ValueError: Model output "Tensor("activation_1/Identity:0", shape=(?, 3), dtype=float32)" has invalid shape

I am trying to run the following GitHub code for stock market prediction: https://github.com/multidqn/deep-q-trading. Using their instructions, I run ...

DQN Pytorch Loss keeps increasing

I am implementing a simple DQN algorithm using PyTorch to solve the CartPole environment from gym. I have been debugging for a while now, and I can't fi ...

Deep Reinforcement Learning - CartPole Problem

I tried to implement the simplest Deep Q Learning algorithm. I think I've implemented it right, and I know that Deep Q Learning struggles with diverg ...

ValueError: Input 0 of layer sequential_5 is incompatible with the layer: : expected min_ndim=4, found ndim=2. Full shape received: [None, 953]

I am making a Q-learning algorithm to play the Chrome dino game. I capture the screen, convert it to a binary image, and convert that to a NumPy array. Then I use model.predi ...

Update DOM from loop in JavaScript

I am making a maze solver via a Q-learning algorithm. I have a width X height maze that is generated randomly. Each cell of the maze is a div. I have CS ...

Variable updating wrong in loop - Python (Q-learning)

Why do position and newposition give the same output and update together in the next loop? for game in range(nr_of_games): # Initialize the ...
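The code in this excerpt is cut off, so the actual cause cannot be confirmed from it. One frequent cause of this symptom in Python is aliasing: if `position` is a mutable object such as a list, `newposition = position` makes both names refer to the same object. The sketch below illustrates that assumption only; the list values are hypothetical and are not taken from the question.

```python
# Case 1: plain assignment creates an alias, not a copy.
position = [0, 0]            # hypothetical grid position
newposition = position       # both names now point at the same list
newposition[0] += 1          # this also changes `position`
alias_changed_original = (position == [1, 0])

# Case 2: an explicit copy keeps the two values independent.
position = [0, 0]
newposition = list(position)  # shallow copy; use copy.deepcopy for nested lists
newposition[0] += 1           # only `newposition` changes
copy_kept_original = (position == [0, 0])

print(alias_changed_original, copy_kept_original)
```

If the positions are stored as tuples instead of lists, this problem cannot occur, because tuples cannot be modified in place.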