Discover eBandit, a kernel-driven reinforcement learning framework that optimizes adaptive video streaming using real-time network metrics for superior QoE...
Discover StructRL, a framework that recovers dynamic programming structure from distributional reinforcement learning to boost efficiency and stability.