US 12,079,270 B2
System and method for multi-media search in a content management system
Gajendra Babu Bandhu, Bangalore (IN); and Sharath Babu Pulumati, Warangal (IN)
Assigned to OPEN TEXT HOLDINGS, INC., Menlo Park, CA (US)
Filed by Open Text Holdings, Inc., San Mateo, CA (US)
Filed on Dec. 9, 2019, as Appl. No. 16/708,011.
Prior Publication US 2021/0173865 A1, Jun. 10, 2021
Int. Cl. G06F 16/00 (2019.01); G06F 7/00 (2006.01); G06F 16/61 (2019.01); G06F 16/638 (2019.01); G06F 16/683 (2019.01)
CPC G06F 16/638 (2019.01) [G06F 16/61 (2019.01); G06F 16/683 (2019.01)] 16 Claims
OG exemplary drawing
 
1. A computer program product comprising a non-transitory computer-readable medium storing computer-readable instructions for:
providing a content management system, the content management system comprising a content management repository;
assigning, by the content management system, a unique identifier to a voice assistant device;
receiving audio data to be stored in the content management repository, the audio data associated with the voice assistant device;
converting, by an automatic speech recognition (ASR) processor, the audio data to produce a set of text that includes one or more attributes of the audio data associated with portions of the set of text;
in response to converting the audio data to the set of text:
accessing a configuration that specifies information to be extracted from text produced by the ASR processor, the configuration defining:
a set of target strings to be extracted; and
one or more target attributes to determine for strings matching target strings from the set of target strings;
using the configuration:
extracting, by matching text from the set of text to a specified string defined in the configuration, an extracted string, wherein the extracted string matches the specified string defined in the configuration; and
extracting, from the set of text produced by the ASR processor and based on the configuration, a corresponding attribute of the extracted string, the corresponding attribute included in the one or more target attributes defined by the configuration;
storing a first media file containing the audio data in the content management repository as a first managed object owned by the voice assistant device, the first managed object comprising a set of object properties, including an object identifier (ID) of the first managed object, and the first media file;
configuring the first managed object to be searchable by the extracted string, wherein configuring the first managed object to be searchable by the extracted string further comprises:
storing the extracted string in the set of object properties of the first managed object;
storing the corresponding attribute of the extracted string in the set of object properties of the first managed object;
assigning a search privilege to the first managed object for the voice assistant device using the unique identifier of the voice assistant device; and
indexing the first managed object, including the extracted string;
receiving, via a programmatic interface, a search query from a voice assistant tool service, the search query comprising a search string and the unique identifier of the voice assistant device;
searching a plurality of managed objects in the content management repository based on the search query for managed objects that match the search query;
determining that the extracted string matches the search string and that the voice assistant device is permitted access to the first managed object according to the search privilege;
generating a search result, wherein generating the search result comprises an indication of the first media file, the extracted string and the corresponding attribute of the extracted string in the search result, wherein the indication of the first media file, the extracted string and the corresponding attribute of the extracted string are included in the search result based on a determination that the extracted string matches the search string and that the voice assistant device is permitted access to the first managed object according to the search privilege, and wherein the indication of the first media file includes the object ID of the first managed object, as stored in the content management repository; and
returning the search result to the voice assistant tool service via the programmatic interface.