Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning

Article Properties
Journal Categories
Science
Mathematics
Instruments and machines
Electronic computers
Computer science
Technology
Electrical engineering
Electronics
Nuclear engineering
Electric apparatus and materials
Electric circuits
Electric networks
Electronics
Technology
Engineering (General)
Civil engineering (General)
Technology
Mechanical engineering and machinery
References
Title | Publication Date
On the theory of policy gradient methods: Optimality, approximation, and distribution shift | 2021
A comprehensive survey on safe reinforcement learning | 2015
Sur la détermination des polynômes d’approximation de degré donnée | 1934
Policy evaluation with temporal differences: A survey and comparison | 2014
CRPO: A new approach for safe reinforcement learning with convergence guarantee |