簡體 English 中英

我可以對具有非連續觀察空間的問題使用強化學習嗎？

[英]Can I use Reinforcment Learning for a problem that has a non continous observation space?

原文 2023-01-22 07:28:40 5 1 algorithm/ machine-learning/ reinforcement-learning/ machine-learning-model

我想訓練一個代理人在一個 9x9 的字段上放置一個 polyomino（只有一個，例如 2x2 的平方），該字段要么是空的，要么已經包含多個 OTHER（不是 2x2 平方）polyomino。 所以觀察空間不會是連續的。 這是 RL 的正確用例嗎？

1 個解決方案

當然，為什么不呢？ 強化學習算法的最簡單版本使用離散的 state 空間（實際上，為了收斂，假設代理能夠訪問每個 state 足夠多次）。 即使狀態太多並且您必須用學習的近似值（可能是 neural.net）替換 Q function，您也可以對輸入使用 one-hot 編碼。

我可以使用在每個節點上都有一個完整單詞的特里嗎？

[英]Can I use a trie that has a whole word on each node?

嗨，我在嘗試解決此算法問題時遇到 java.lang.OutOfMemoryError: Java heap space，該怎么辦？

[英]Hi, I got java.lang.OutOfMemoryError: Java heap space while trying to solve this Algorithm problem, what can be done?

我們如何在此類數據上使用機器學習算法？

[英]How can we use a machine learning algorithm on this type of data?

葉相似樹 Leetcode 問題的非並發 O(1) 空間解

[英]Non-concurrent O(1) space solution for Leaf-Similar Trees Leetcode Problem

我使用記憶的 countUnivalTrees 問題的時間和空間復雜度是多少

[英]What time and space complexity am I using with memoized countUnivalTrees problem

如何將強化學習應用於連續動作空間？

[英]How can I apply reinforcement learning to continuous action spaces?

組合和約束求解問題。我可以使用什么算法？

[英]A group combination and constraint solving problem. What algorithm can I use?

我可以將std :: nth_element與ValueSwappable迭代器一起使用，但不能使用MoveConstructible取消引用的值嗎？

[英]Can I use std::nth_element with ValueSwappable iterator but non MoveConstructible dereferenced value?

我可以使用k均值對不完整的圖進行聚類嗎？

[英]Can I use k-means to cluster a non-complete graph?

這是一個非多項式問題嗎？如果不是，如何在多項式時間內解決？

[英]Is this a non-polynomial problem? If not, how can it be solved in polynomial time?

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 我可以使用在每個節點上都有一個完整單詞的特里嗎？嗨，我在嘗試解決此算法問題時遇到 java.lang.OutOfMemoryError: Java heap space，該怎么辦？我們如何在此類數據上使用機器學習算法？葉相似樹 Leetcode 問題的非並發 O(1) 空間解我使用記憶的 countUnivalTrees 問題的時間和空間復雜度是多少如何將強化學習應用於連續動作空間？組合和約束求解問題。我可以使用什么算法？我可以將std :: nth_element與ValueSwappable迭代器一起使用，但不能使用MoveConstructible取消引用的值嗎？我可以使用k均值對不完整的圖進行聚類嗎？這是一個非多項式問題嗎？如果不是，如何在多項式時間內解決？

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM