MATLAB Reinforcement Learning Tutorial

verl: Volcano Engine Reinforcement Learning for LLMs

verl is a flexible, efficient and production-ready RL training library for large language models (LLMs). verl is the open-source version of HybridFlow: A Flexible and Efficient RLHF Framework paper.

IEEE

Continuous-Time Reinforcement Learning: New Design Algorithms With Theoretical Insights and Performance Guarantees

Abstract: Continuous-time reinforcement learning (CT-RL) methods hold great promise in real-world applications. Adaptive dynamic programming (ADP)-based CT-RL algorithms, especially their theoretical ...

IEEE

Multi-Agent Evolutionary Reinforcement Learning Based on Cooperative Games

Abstract: Despite the significant advancements in single-agent evolutionary reinforcement learning, research exploring evolutionary reinforcement learning within multi-agent systems is still in its ...

Microsoft

Agent Lightning: Adding reinforcement learning to AI agents without code rewrites

AI agents are reshaping software development, from writing code to carrying out complex instructions. Yet LLM-based agents are prone to errors and often perform poorly on complicated, multi-step tasks ...

GitHub

reinforcement-learning

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results