Research on an Improved DDPG Algorithm Based on LSTM and an Asymmetric Network (基于 LSTM 与非对称网络的改进 DDPG 算法研究)
- Author
- 何富君, 王晓争, and 刘凯
- Subjects
- Deep learning, Machine learning, Reinforcement learning, Algorithms, Speed, Critics
- Abstract
When a deep reinforcement learning algorithm is trained in a complex dynamic environment, the partial observability of the environment makes it difficult for the agent to obtain useful information, leading to typical problems such as failure to learn a good policy and slow convergence. This paper proposes an improved DDPG algorithm based on LSTM and an asymmetric actor-critic network. The method introduces an LSTM structure into the actor-critic network to learn the hidden state of the partially observable Markov decision process through memory-based reasoning. At the same time, while the actor network uses only RGB images as its partially observable input, the critic network is trained on the complete state of the simulation environment, forming an asymmetric network that speeds up training convergence. A simulated manipulator-grasping experiment in ROS shows that, compared with DDPG, PPO, and LSTM-DDPG, the proposed algorithm achieves a higher success rate and faster convergence.
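The asymmetric structure the abstract describes can be sketched in a few lines of PyTorch: an actor that encodes a sequence of RGB observations with a CNN followed by an LSTM, and a critic that scores actions against the full simulator state. This is a minimal illustration only; the class names (LSTMActor, FullStateCritic) and all layer sizes are assumptions for the sketch, not the authors' implementation.

```python
import torch
import torch.nn as nn

class LSTMActor(nn.Module):
    """Actor: partially observable RGB sequence -> CNN features -> LSTM -> action."""
    def __init__(self, action_dim, feat_dim=256, hidden_dim=128):
        super().__init__()
        self.cnn = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=4, stride=2), nn.ReLU(),
            nn.Flatten(),
        )
        self.proj = nn.LazyLinear(feat_dim)          # infers CNN output size on first call
        self.lstm = nn.LSTM(feat_dim, hidden_dim, batch_first=True)
        self.head = nn.Sequential(nn.Linear(hidden_dim, action_dim), nn.Tanh())

    def forward(self, obs_seq, hidden=None):
        # obs_seq: (B, T, 3, H, W) — a sequence of RGB frames per trajectory
        b, t = obs_seq.shape[:2]
        feats = self.proj(self.cnn(obs_seq.flatten(0, 1))).view(b, t, -1)
        out, hidden = self.lstm(feats, hidden)       # memory over the partial observations
        return self.head(out), hidden                # actions in [-1, 1] for each step

class FullStateCritic(nn.Module):
    """Critic: trained on the complete simulator state (the asymmetric input)."""
    def __init__(self, state_dim, action_dim, hidden_dim=256):
        super().__init__()
        self.q = nn.Sequential(
            nn.Linear(state_dim + action_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, 1),
        )

    def forward(self, state, action):
        return self.q(torch.cat([state, action], dim=-1))

# Smoke test with made-up dimensions: batch of 2 trajectories, 5 frames each.
actor = LSTMActor(action_dim=6)
critic = FullStateCritic(state_dim=20, action_dim=6)
acts, _ = actor(torch.randn(2, 5, 3, 64, 64))        # -> (2, 5, 6)
q = critic(torch.randn(2, 20), acts[:, -1])           # Q-value from the full state
```

In a DDPG-style training loop, only the actor (and its LSTM memory) would be deployed at execution time; the full-state critic exists purely to provide a better-informed training signal in simulation, which is what the abstract credits for the faster convergence.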
- Published
- 2022