US 12,190,886 B2
	Selective inclusion of speech content in documents
Sushain Pandit, Austin, TX (US); and Sarbajit K. Rakshit, Kolkata (IN)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by International Business Machines Corporation, Armonk, NY (US)
Filed on Sep. 27, 2021, as Appl. No. 17/448,967.
Prior Publication US 2023/0267933 A1, Aug. 24, 2023
Int. Cl. G10L 15/02 (2006.01); G06F 40/166 (2020.01); G10L 15/26 (2006.01); H04W 4/029 (2018.01); H04W 4/33 (2018.01); G06T 19/00 (2011.01)

CPC G10L 15/26 (2013.01) [H04W 4/029 (2018.02); H04W 4/33 (2018.02); G06T 19/006 (2013.01)]

17 Claims

1. A computer-implemented method comprising:

capturing, by one or more processors, audio of a spoken content of a first participant of a discussion via an augmented reality (AR) device worn by a user;

prior to capturing the audio of the spoken content of the first participant of the discussion via the AR device worn by the user, identifying, by the one or more processors, a plurality of participants, including the first participant, who are each physically present at a physical location to participate in the discussion, wherein each of the plurality of participants are identified based on the AR device that they are wearing;

analyzing, by the one or more processors, the audio of the spoken content of the first participant;

converting, by the one or more processors, the audio of the spoken content of the first participant to text to create a transcript;

creating, by the one or more processors, a visualization of the transcript;

presenting, by the one or more processors, the visualization of the transcript to the user via the AR device; and

enabling, by the one or more processors, the user to copy one or more parts of the transcript into a document file via a selection support,

wherein enabling the user to copy the one or more parts of the transcript into the document file via the selection support further comprises:

enabling, by the one or more processors, the user to point at a location of the first participant; and

selecting, by the one or more processors, one or more parts of the transcript associated with the first participant,

wherein the AR device provides a view of the physical location of the discussion and includes a computer-generated graphic including the visualization of the transcript overlaid on the view of the physical location.