Title | Journal | Journal Categories | Citations | Publication Date |
---|---|---|---|---|
Using confidence bounds for exploitation-exploration trade-offs | 2003 | |||
Theory-inspired path-regularized differential network architecture search | 2020 | |||
Efficient neural architecture search via parameters sharing | 2018 | |||
Nested Monte-Carlo search | 2009 | |||
Policy gradient methods for reinforcement learning with function approximation | 1999 |