Early Access

Display Method:
Research Article
Fault Information Recognition for On-board Equipment of High-speed Railway Based on Multi-Neural Network Collaboration
Lu-Jie Zhou, Jian-Wu Dang, Zhen-Hai Zhang
, Available online  , doi: 10.1007/s11633-021-1298-8
Abstract:
It is of great significance to guarantee the efficient statistics of high-speed railway on-board equipment fault information, which also improves the efficiency of fault analysis. Considering this background, this paper presents an empirical exploration of named entity recognition (NER) of on-board equipment fault information. Based on the historical fault records of on-board equipment, a fault information recognition model based on multi-neural network collaboration is proposed. First, considering Chinese recorded data characteristics, a method of constructing semantic features and additional features based on character granularity is proposed. Then, the two feature representations are concatenated and passed into the gated convolutional layer to extract the dependencies from multiple different subspaces and adjacent characters in parallel. Next, the local features are transmitted to the bidirectional long short-term memory (BiLSTM) to learn long-term dependency information. On top of BiLSTM, the sequential conditional random field (CRF) is used to jointly decode the optimized tag sequence of the whole sentence. The model is tested and compared with other representative baseline models. The results show that the proposed model not only considers the language characteristics of on-board fault records, but also has obvious advantages on the performance of fault information recognition.
A Novel Heterogeneous Actor-Critic Algorithm with Recent Emphasizing Replay Memory
Bao Xi, Rui Wang, Ying-Hao Cai, Tao Lu, Shuo Wang
, Available online  , doi: 10.1007/s11633-021-1296-x
Abstract:
Reinforcement learning (RL) algorithms have been demonstrated to solve a variety of continuous control tasks. However, the training efficiency and performance of such methods limit further applications. In this paper, we propose an off-policy heterogeneous actor-critic (HAC) algorithm, which contains soft Q-function and ordinary Q-function. The soft Q-function encourages the exploration of a Gaussian policy, and the ordinary Q-function optimizes the mean of the Gaussian policy to improve the training efficiency. Experience replay memory is another vital component of off-policy RL methods. We propose a new sampling technique that emphasizes recently experienced transitions to boost the policy training. Besides, we integrate HAC with hindsight experience replay (HER) to deal with sparse reward tasks, which are common in the robotic manipulation domain. Finally, we evaluate our methods on a series of continuous control benchmark tasks and robotic manipulation tasks. The experimental results show that our method outperforms prior state-of-the-art methods in terms of training efficiency and performance, which validate the effectiveness of our method.
Robust Optimal Higher-order-observer-based Dynamic Sliding Mode Control for VTOL Unmanned Aerial Vehicles
Yashar Mousavi, Amin Zarei, Arash Mousavi, Mohsen Biari
, Available online  , doi: 10.1007/s11633-021-1282-3
Abstract:
This paper investigates the precise trajectory tracking of unmanned aerial vehicles (UAV) capable of vertical take-off and landing (VTOL) subjected to external disturbances. For this reason, a robust higher-order-observer-based dynamic sliding mode controller (HOB-DSMC) is developed and optimized using the fractional-order firefly algorithm (FOFA). In the proposed scheme, the sliding surface is defined as a function of output variables, and the higher-order observer is utilized to estimate the unmeasured variables, which effectively alleviate the undesirable effects of the chattering phenomenon. A neighboring point close to the sliding surface is considered, and as the tracking error approaches this point, the second control is activated to reduce the control input. The stability analysis of the closed-loop system is studied based on Lyapunov stability theorem. For a better study of the proposed scheme, various trajectory tracking tests are provided, where accurate tracking and strong robustness can be simultaneously ensured. Comparative simulation results validate the proposed control strategy′s effectiveness and its superiorities over conventional sliding mode controller (SMC) and integral SMC approaches.