Cited By
View all- Qi HFei GZhu LKiyavash NMooij J(2024)Graph feedback bandits with similar armsProceedings of the Fortieth Conference on Uncertainty in Artificial Intelligence10.5555/3702676.3702817(3021-3040)Online publication date: 15-Jul-2024
- Khan SSaveski MUgander JSalakhutdinov RKolter ZHeller KWeller AOliver NScarlett JBerkenkamp F(2024)Off-policy evaluation beyond overlapProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3693021(23734-23757)Online publication date: 21-Jul-2024
- Chen RChen XSun YXiao SLi MYu YSalakhutdinov RKolter ZHeller KWeller AOliver NScarlett JBerkenkamp F(2024)Policy-conditioned environment models are more generalizableProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692323(6539-6561)Online publication date: 21-Jul-2024
- Show More Cited By