Q-learning is one of the most popular reinforcement learning algorithms, as it can be used to find an optimal action-selection policy for any given environment.
In the previous years many new methods that uses neural networks for sampling the protein conformational space have been presented. In this article we explore and evaluate these different neural network approaches.