31 – 32 of 32
- show: 10
- |
- sort: year (new to old)
Close
Embed this list
<iframe src=""
width=""
height=""
allowtransparency="true"
frameborder="0">
</iframe>
- « previous
- 1
- 2
- 3
- 4
- next »
- 2012
-
Mark
Nonconvergence to Saddle Boundary Points under Perturbed Reinforcement Learning
(2012) 4th World Congress of the Game Theory Society
- Contribution to conference › Abstract
- 2004
-
Mark
A new Q-learning algorithm based on the Metropolis criterion
(2004) In IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics 34(5). p.2140-2143
- Contribution to journal › Article
- « previous
- 1
- 2
- 3
- 4
- next »
