Search

Your search keyword '"policy gradient"' showing total 421 results

Search Constraints

Start Over You searched for: Descriptor "policy gradient" Remove constraint Descriptor: "policy gradient"
421 results on '"policy gradient"'

Search Results

101. CodeeGAN: Code Generation via Adversarial Training

102. Conditional GANs for Image Captioning with Sentiments

103. A Reinforcement Learning Approach for Sequential Spatial Transformer Networks

104. A Deep Reinforcement Learning Approach for Autonomous Car Racing

105. Learning Agents with Prioritization and Parameter Noise in Continuous State and Action Space

106. Safe Policy Learning with Constrained Return Variance

107. Trajectory Based Prioritized Double Experience Buffer for Sample-Efficient Policy Optimization

108. An overview on algorithms and applications of deep reinforcement learning

109. Employing reinforcement learning to enhance particle swarm optimization methods.

110. On Diversity in Image Captioning: Metrics and Methods.

111. Energy-Efficient and QoS Guaranteed BBU Aggregation in CRAN Based on Heuristic- Assisted Deep Reinforcement Learning.

112. On the Convergence Rates of Policy Gradient Methods.

113. Policy Gradient and Actor--Critic Learning in Continuous Time and Space: Theory and Algorithms.

114. Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch.

115. Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences.

116. Segment boundary detection directed attention for online end-to-end speech recognition

117. Adaptive Laser Welding Control: A Reinforcement Learning Approach

118. Performance Improvement of Linux CPU Scheduler Using Policy Gradient Reinforcement Learning for Android Smartphones

119. A Knowledge Driven Dialogue Model With Reinforcement Learning

120. On the Design of Tailored Neural Networks for Energy Harvesting Broadcast Channels: A Reinforcement Learning Approach

121. Reinforced knowledge distillation: Multi-class imbalanced classifier based on policy gradient reinforcement learning.

122. Parameter tuning of manipulator motion tracking controller based on Policy Gradient.

123. GLEU-Guided Multi-resolution Network for Short Text Conversation

124. From Plots to Endings: A Reinforced Pointer Generator for Story Ending Generation

125. Learning Heuristics for the TSP by Policy Gradient

126. Interactive Area Topics Extraction with Policy Gradient

127. Relaxation-Free Deep Hashing via Policy Gradient

128. Automatically Designing CNN Architectures for Medical Image Segmentation

130. Joint Communication and Action Learning in Multi-Target Tracking of UAV Swarms with Deep Reinforcement Learning

131. Vehicle Safety Planning Control Method Based on Variable Gauss Safety Field

132. Dynamic Navigation and Area Assignment of Multiple USVs Based on Multi-Agent Deep Reinforcement Learning

133. Reinforcement Learning: Theory and Applications in HEMS

134. AEVRNet: Adaptive exploration network with variance reduced optimization for visual tracking.

135. Reprint of: Automated stem cell production by bio-inspired control.

136. PP-PG: Combining Parameter Perturbation with Policy Gradient Methods for Effective and Efficient Explorations in Deep Reinforcement Learning.

137. Deep reinforcement learning algorithm based on multi-agent parallelism and its application in game environment.

138. ECG Generation With Sequence Generative Adversarial Nets Optimized by Policy Gradient

139. ULMR: An Unsupervised Learning Framework for Mismatch Removal

140. Learning-Based Online QoE Optimization in Multi-Agent Video Streaming

141. Policy Gradient Reinforcement Learning for I/O Reordering on Storage Servers

142. Automated stem cell production by bio-inspired control.

143. Positioning of the Robotic Arm Using Different Reinforcement Learning Algorithms.

144. Novel First Order Bayesian Optimization with an Application to Reinforcement Learning.

145. AGAN: ATTRIBUTE GENERATIVE ADVERSARIAL NETWORK.

146. Multi-Agent Safe Policy Learning for Power Management of Networked Microgrids.

147. Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment

148. Efficient Robot Skills Learning with Weighted Near-Optimal Experiences Policy Optimization.

149. On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift.

150. Modeling on virtual network embedding using reinforcement learning.

Catalog

Books, media, physical & digital resources