CPC G10L 17/14 (2013.01) [G06F 3/0482 (2013.01); G06F 3/165 (2013.01); G06F 40/117 (2020.01); G06F 40/279 (2020.01); G06F 40/58 (2020.01); G10L 17/04 (2013.01); G10L 21/028 (2013.01); G06F 2203/04803 (2013.01)] | 13 Claims |
1. A computer-implemented method, performed by a computing system having one or more hardware computer processors and one or more non-transitory computer readable storage devices storing software instructions executable by the computing system to perform the computer-implemented method, the computer-implemented method comprising:
accessing a tracked object database that stores a plurality of tracked objects;
accessing audio data;
generating a target language transcript based on at least the audio data;
analyzing the target language transcript to extract one or more entities;
providing a user interface configured for user analysis of at least the target language transcript, the user interface including at least:
a first panel including the target language transcript;
a second panel including controls for audio playback of the audio data; and
a third panel including one or more suggested tags associated with the one or more entities, wherein the one or more suggested tags are indicated by respective selectable user interface buttons;
receiving, via the third panel of the user interface, a first user input selecting a first selectable user interface button for a first suggested tag of the one or more suggested tags, the first suggested tag associated with a first entity of the one or more entities;
in response to the first user input received via the third panel:
either (1) generating a first tracked object representative of the first entity, or (2) determining a first tracked object of the plurality of tracked objects representative of the first entity; and
tagging the audio data and the target language transcript with the first tracked object, wherein the tagging comprises linking the first tracked object with the audio data and the target language transcript;
receiving, via the third panel of the user interface, a second user input selecting a second selectable user interface button for a second suggested tag of the one or more suggested tags, the second suggested tag associated with a second entity of the one or more entities;
in response to the second user input received via the third panel:
either (1) generating a second tracked object representative of the second entity, or (2) determining a second tracked object of the plurality of tracked objects representative of the second entity; and
tagging the audio data and the target language transcript with the second tracked object, wherein the tagging comprises linking the second tracked object with the audio data and the target language transcript.
|