Two-level reinforcement learning system for automated beat generation — PPO for discrete arrangement, SAC for hybrid audio effects. CS 5180 Final Project, Northeastern University.
reinforcement-learning pytorch transformer discriminator music-generation northeastern gymnasium drum-machine ppo cs-5180
-
Updated
May 10, 2026 - Jupyter Notebook