US 12,346,838 B1
	Autonomous driving test method based on multi-agent swarm adversarial, device and medium
Jian Sun, Shanghai (CN); Peng Hang, Shanghai (CN); and Shiyu Fang, Shanghai (CN)
Assigned to TONGJI UNIVERSITY, Shanghai (CN)
Filed by TONGJI UNIVERSITY, Shanghai (CN)
Filed on Mar. 10, 2025, as Appl. No. 19/074,465.
Claims priority of application No. 202410736612.0 (CN), filed on Jun. 7, 2024.
Int. Cl. G06N 7/01 (2023.01); G06N 3/006 (2023.01); G06N 3/092 (2023.01); G06N 3/094 (2023.01); G06N 20/00 (2019.01)

CPC G06N 7/01 (2023.01) [G06N 3/006 (2013.01); B60K 2360/175 (2024.01); B60Q 2800/10 (2022.05); G05B 2219/39146 (2013.01); G06N 3/092 (2023.01); G06N 3/094 (2023.01); G06N 20/00 (2019.01)]

15 Claims

1. An autonomous driving test method based on multi-agent swarm adversarial, comprising steps of:

step S1: deducing a conflict topological relationship graph between a tested autonomous vehicle and an agent according to a road topology of a test scenario and a conflict relationship of test objects, specifically comprises:

deducing whether a spatial conflict exists between the tested autonomous vehicle and a multi-agent in an environment, and among multi-agents, based on vehicle state information of a multi-agent vehicle group {Veh1, Veh2 . . . Vehn} and the tested autonomous vehicle Veh0, and test map information; and

adopting graph theory to describe a topological relationship of a vehicle conflict to obtain conflict topological relationship graph G=(V,E) of the tested autonomous vehicle and the agent, wherein V represents a set of vehicles, vehicle Vehn had a position of p_n=(x_n(t),y_n(t)) and a speed of v_n(t) at time t; and E represents a set of edges, for edge e_ij, an inference is made according to current position p_iof vehicle Vehi and current position p_jof vehicle Vehj, and if a spatial conflict exists, then the edge is recorded as e_ij=1, and otherwise, the edge is recorded as 0;

step S2: deducing a feasible planning space of the tested autonomous vehicle according to the conflict topological relationship graph;

step S3: establishing a multi-agent swarm adversarial model based on a potential game under the feasible planning space according to a correlation between an individual reward of the agent and a swarm adversarial test effect of a multi-agent system, and solving and obtaining an optimal adversarial strategy of the multi-agent system against the tested autonomous vehicle, wherein in the multi-agent swarm adversarial model, an adversarial intensity is introduced to characterize relative weights of the individual reward of the agent and the swarm adversarial test effect of the multi-agent system, and the adversarial intensity is adaptively adjusted according to an actual response of the tested autonomous vehicle;

for the establishing a multi-agent swarm adversarial model based on a potential game under the feasible planning space according to a correlation between the individual reward of the agent and a swarm adversarial test effect of the multi-agent system, an expression is:

wherein in the expression: P(a_i^x, p_i, p₀) represents a swarm adversarial effect of a multi-agent system of agent i when an adversarial strategy is a_i^x, and U is the feasible planning space; P(a_i⁰, p_i, p₀) represents a swarm adversarial effect of the multi-agent system of the agent i under any initial adversarial strategy; R_i(a_i^x, p_i, p₀) represents an individual reward of the agent i when an adversarial strategy is a_i^x, and R_i(a_i⁰, p_i, p₀) represents an individual reward of the agent i under an initial adversarial strategy; and a_iis an acceleration of the agent i, p_iis a position of the agent i, and p₀is a position of the tested autonomous vehicle;

for the individual reward of the agent, a function expression is:

wherein in the expression: r_self,i^t(a_i, p_i) represents a driving reward of the agent i at the time t, a_iis the acceleration of the agent i, p_iis the position of the agent i, d_des,i^tis a distance between the agent i and an end point, and j_i^tis a jerk of the agent i; r_group,i0^t(a_i, p_i, p₀) represents an adversarial reward of the agent i at the time t, ΔTTCP_i0^trepresents a time difference between the agent i and the tested autonomous vehicle Veh0 reaching a conflict point at the time t, d_cp,i^trepresents a distance between the agent i and the conflict point, v_i^trepresents a speed of the agent i, d_cp,0^trepresents a distance between the tested autonomous vehicle and the conflict point, v₀^trepresents a speed of the tested autonomous vehicle, and p₀is the position of the tested autonomous vehicle; and θ is the adversarial intensity to characterize relative weights of the individual reward of the agent and the swarm adversarial test effect of the multi-agent system; and

for the swarm adversarial test effect of the multi-agent system, a function expression is:

wherein in the expression: γ represents a reward reduction coefficient; T is a planning step size; and φ_iis a contribution generated by the agent i in adversarial; and

step S4: repeatedly executing the steps S1-S3 until an adversarial task is completed.