Anti-Intelligent UAV Jamming Strategy via Deep Q-Networks

Gao, N. and Qin, Z. and Jing, X. and Ni, Q. and Jin, S. (2020) Anti-Intelligent UAV Jamming Strategy via Deep Q-Networks. IEEE Transactions on Communications, 68 (1). pp. 569-581. ISSN 0090-6778

Text (TCOM-AI-author final)
TCOM_AI_author_final.pdf - Accepted Version
Available under License Creative Commons Attribution-NonCommercial.
Download (2MB)

Abstract

The downlink communications are vulnerable to intelligent unmanned aerial vehicle (UAV) jamming attack. In this paper, we propose a novel anti-intelligent UAV jamming strategy, in which the ground users can learn the optimal trajectory to elude such jamming. The problem is formulated as a stackelberg dynamic game, where the UAV jammer acts as a leader and the ground users act as followers. First, as the UAV jammer is only aware of the incomplete channel state information (CSI) of the ground users, for the first attempt, we model such leader sub-game as a partially observable Markov decision process (POMDP). Then, we obtain the optimal jamming trajectory via the developed deep recurrent Q-networks (DRQN) in the three-dimension space. Next, for the followers sub-game, we use the Markov decision process (MDP) to model it. Then we obtain the optimal communication trajectory via the developed deep Q-networks (DQN) in the two-dimension space. We prove the existence of the stackelberg equilibrium and derive the closed-form expression for the stackelberg equilibrium in a special case. Moreover, some insightful remarks are obtained and the time complexity of the proposed defense strategy is analyzed. The simulations show that the proposed defense strategy outperforms the benchmark strategies.

Item Type:

Journal Article

Journal or Publication Title:

IEEE Transactions on Communications

Additional Information:

©2019 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Uncontrolled Keywords:

/dk/atira/pure/subjectarea/asjc/2200/2208

Subjects:

?? jammingbase stationstrajectorygamesunmanned aerial vehiclesspace stationssecurityuavjammingmarkov decision processdeep q-networkselectrical and electronic engineering ??

Departments:

Faculty of Science and Technology > School of Computing & Communications

ID Code:

138108

Deposited By:

ep_importer_pure

Deposited On:

25 Oct 2019 09:30

Refereed?:

Yes

Published?:

Published

Last Modified:

14 Mar 2026 00:08

URI:

https://eprints.lancs.ac.uk/id/eprint/138108