aitbc/apps/coordinator-api/systemd
oib 7e9ba75f6c feat(coordinator-api): enhance reinforcement learning service with PyTorch-based PPO, SAC, and Rainbow DQN implementations
- Add PyTorch neural network implementations for PPO, SAC, and Rainbow DQN agents with GPU acceleration
- Implement PPOAgent with actor-critic architecture, clip ratio, and entropy regularization
- Implement SACAgent with separate actor and dual Q-function networks for continuous action spaces
- Implement RainbowDQNAgent with dueling architecture and distributional RL (51 atoms
2026-03-01 00:18:14 +01:00
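
The commit message mentions a PPO clip ratio and entropy regularization. As a rough illustration of how those two terms usually combine into one loss, here is a minimal framework-free sketch. It is not the code from this commit: the function name `ppo_clipped_loss`, its parameters, and the defaults `clip_ratio=0.2` and `entropy_coef=0.01` are assumptions chosen for illustration, and plain Python floats stand in for PyTorch tensors so the arithmetic is easy to follow.

```python
import math


def ppo_clipped_loss(new_log_probs, old_log_probs, advantages, entropies,
                     clip_ratio=0.2, entropy_coef=0.01):
    """Batch-averaged PPO loss to minimize (hypothetical helper, not from the repo).

    Each argument is a list of floats with one entry per sampled action:
    log-probabilities under the new and old policies, advantage estimates,
    and policy entropies at the visited states.
    """
    total = 0.0
    for new_lp, old_lp, adv, ent in zip(new_log_probs, old_log_probs,
                                        advantages, entropies):
        # Probability ratio pi_new(a|s) / pi_old(a|s), computed in log space.
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - clip_ratio, 1 + clip_ratio].
        clipped = max(1.0 - clip_ratio, min(ratio, 1.0 + clip_ratio))
        # Pessimistic surrogate: the smaller of the unclipped and clipped terms.
        surrogate = min(ratio * adv, clipped * adv)
        # Negate to turn the objective into a loss; the entropy bonus lowers it.
        total += -surrogate - entropy_coef * ent
    return total / len(new_log_probs)


if __name__ == "__main__":
    # A ratio of e with a positive advantage is clipped to 1.2.
    print(ppo_clipped_loss([1.0], [0.0], [1.0], [0.0]))
```

With a positive advantage the clip caps how much a large ratio can be rewarded, while with a negative advantage the unclipped term is kept, so the penalty for a bad action is not reduced. In a PyTorch version the same expression would typically be written with tensor operations so that gradients flow through `new_log_probs`.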