US 12,395,708 B2
Providing dynamic media captioning and augmented/virtual reality feedback in home network environments
Rajesh Radhakrishnan, Karnataka (IN)
Assigned to ARRIS ENTERPRISES LLC, Horsham, PA (US)
Filed by ARRIS Enterprises LLC, Suwanee, GA (US)
Filed on Jan. 6, 2022, as Appl. No. 17/647,251.
Claims priority of provisional application 63/164,186, filed on Mar. 22, 2021.
Prior Publication US 2022/0303636 A1, Sep. 22, 2022
Int. Cl. H04N 21/488 (2011.01); G06N 20/00 (2019.01); G10L 15/26 (2006.01); H04N 21/422 (2011.01)
CPC H04N 21/4884 (2013.01) [G06N 20/00 (2019.01); G10L 15/26 (2013.01); H04N 21/42203 (2013.01)] 23 Claims
OG exemplary drawing
 
1. A method for providing captioning for media content performed by a media control device, the media control device being in communication with a user network and the Internet, the method comprising:
receiving, by the media control device, an input indicating at least one request for caption content for at least a part of a media content;
ascertaining, by the media control device, one or more frames of the media content corresponding to the request for caption content using one or more audio analytics algorithms, wherein the ascertaining the one or more frames of the media content corresponding to the request for caption content includes:
identifying, by the media control device, one or more frames substantially similar to the at least one frame of the one or more frames of the media content corresponding to the request for caption content using at least one of: a machine-learning audio algorithm, or a machine-learning video algorithm;
ascertaining, by the media control device, specific content from the one or more frames of the media content corresponding to the request for caption content;
identifying, by the media control device, at least one source of the requested caption content; and
providing, by the media control device, a displayable version of requested caption content in a format such that the requested caption content is displayable with the one or more frames in a modified presentation of the media content.