A repository to accompany the paper Quantum Algorithm for Protein Side-Chain Optimisation: Comparing Quantum to Classical Methods by Anastasia Agathangelou, Dilhan Manawadu, and Ivano Tavernelli. This ...
Our code is based on open-r1, with our customized Trainer for mixed SFT+GRPO training. Some other updates focus on the white-box RL (reward function design) and post-completion training (replacement ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results