Collection of algorithms to learn loss and reward functions via gradient-based bi-level optimization.
Contributions:1 review, 11 commits, 10 PRs in 6 months
bi-level-optimizationoptimizationrewardlossgradient
Collection of algorithms to learn loss and reward functions via gradient-based bi-level optimization.
Contributions:14 pushes in 12 days
bi-level-optimizationoptimizationrewardlossgradient