US 11,677,839 B2
Automatic coalescing of GPU-initiated network communication
James Dinan, Sharon, MA (US); Akhil Langer, Savoy, IL (US); and Sreeram Potluri, Milpitas, CA (US)
Assigned to NVIDIA Corporation, Santa Clara, CA (US)
Filed by NVIDIA Corporation, Santa Clara, CA (US)
Filed on Jun. 17, 2021, as Appl. No. 17/351,002.
Prior Publication US 2022/0407920 A1, Dec. 22, 2022
Int. Cl. H04L 67/141 (2022.01); H04L 67/104 (2022.01); H04L 67/12 (2022.01)
CPC H04L 67/141 (2013.01) [H04L 67/104 (2013.01); H04L 67/12 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method comprising:
receiving, from a shared memory application executing on a first graphics processing unit (GPU), a first communication request having a second GPU as a destination;
determining whether the first communication request satisfies a coalescing criterion for network transport over a network to the second GPU;
transporting, in response to determining the first communication request does not satisfy the coalescing criterion, the first communication request to the second GPU over a peer-to-peer (P2P) connection between the first and second GPUs;
in response to determining the first communication request satisfies the coalescing criterion,
storing the first communication request in association with a group of requests that have a common property;
determining that a timer associated with the group of requests expires or a size of the group satisfies a group size criterion;
coalescing the group of requests into a coalesced request; and
transporting the coalesced request to the second GPU over the network.