US 11,989,231 B2
Audio recommendation based on text information and video content
Om Prakash Chinta, Bangalore (IN); Karthik Gayakwad, Bangalore (IN); Shehnaz Mohamed, Bangalore (IN); and Karan Parikh, Bangalore (IN)
Assigned to SONY GROUP CORPORATION, Tokyo (JP)
Filed by SONY GROUP CORPORATION, Tokyo (JP)
Filed on Jul. 29, 2021, as Appl. No. 17/389,209.
Prior Publication US 2023/0031056 A1, Feb. 2, 2023
Int. Cl. G06F 16/683 (2019.01); G06F 16/635 (2019.01); G06F 16/68 (2019.01); G06F 16/783 (2019.01)
CPC G06F 16/685 (2019.01) [G06F 16/635 (2019.01); G06F 16/686 (2019.01); G06F 16/7834 (2019.01); G06F 16/7837 (2019.01)] 21 Claims
OG exemplary drawing
 
1. An electronic device, comprising:
circuitry configured to:
receive textual information that indicates a plurality of scenes for video content;
determine a first plurality of features for the plurality of scenes indicated by the textual information;
determine a first set of positions in the textual information based on the determined first plurality of features for the plurality of scenes, wherein a first set of audio files are to be inserted at the determined first set of positions which are related to a first set of scenes of the plurality of scenes;
determine scene analysis quotient information for each scene of the first set of scenes based on the determined first plurality of features for the plurality of scenes;
determine a genre of an audio file to be inserted for each scene of the first set of scenes based on determined the scene analysis quotient information;
determine, by an artificial intelligent (AI) engine, the first set of audio files for the first set of scenes, based on the determined genre, a second plurality of features and the first plurality of features related to the first set of scenes; and
control a display device to display first information corresponding to the determined first set of positions and second information corresponding to the determined first set of audio files.