US 12,243,307 B2
	Scene and activity identification in video summary generation
David A. Newman, San Diego, CA (US); and Adam Silver, San Carlos, CA (US)
Assigned to GoPro, Inc., San Mateo, CA (US)
Filed by GoPro, Inc., San Mateo, CA (US)
Filed on Sep. 12, 2023, as Appl. No. 18/465,619.
Application 18/465,619 is a continuation of application No. 17/378,324, filed on Jul. 16, 2021, granted, now 11,776,579.
Application 17/378,324 is a continuation of application No. 16/382,600, filed on Apr. 12, 2019, granted, now 11,069,380, issued on Jul. 20, 2021.
Application 16/382,600 is a continuation of application No. 15/091,547, filed on Apr. 5, 2016, granted, now 10,262,695, issued on Apr. 16, 2019.
Application 15/091,547 is a continuation of application No. 14/513,149, filed on Oct. 13, 2014, granted, now 10,074,013, issued on Sep. 11, 2018.
Claims priority of provisional application 62/039,849, filed on Aug. 20, 2014.
Claims priority of provisional application 62/028,254, filed on Jul. 23, 2014.
Prior Publication US 2023/0419999 A1, Dec. 28, 2023
Int. Cl. G06V 20/40 (2022.01); G06T 7/246 (2017.01); G10L 15/06 (2013.01); G10L 15/22 (2006.01); G10L 25/57 (2013.01); G11B 27/00 (2006.01); G11B 27/031 (2006.01); G11B 27/10 (2006.01); G11B 27/13 (2006.01); G11B 27/22 (2006.01); G11B 27/28 (2006.01); G11B 27/30 (2006.01); G11B 27/34 (2006.01); H04N 5/77 (2006.01); H04N 5/91 (2006.01); H04N 9/82 (2006.01); H04N 13/172 (2018.01); H04N 13/178 (2018.01); H04N 19/513 (2014.01); H04N 21/235 (2011.01); H04N 21/8549 (2011.01); H04N 23/60 (2023.01); H04N 23/62 (2023.01); G06V 40/10 (2022.01); G06V 40/16 (2022.01); G10L 25/54 (2013.01)

CPC G06V 20/47 (2022.01) [G06T 7/246 (2017.01); G06V 20/41 (2022.01); G10L 15/063 (2013.01); G10L 15/22 (2013.01); G10L 25/57 (2013.01); G11B 27/002 (2013.01); G11B 27/031 (2013.01); G11B 27/10 (2013.01); G11B 27/13 (2013.01); G11B 27/22 (2013.01); G11B 27/28 (2013.01); G11B 27/3081 (2013.01); G11B 27/34 (2013.01); H04N 5/77 (2013.01); H04N 5/772 (2013.01); H04N 5/91 (2013.01); H04N 9/8205 (2013.01); H04N 13/172 (2018.05); H04N 13/178 (2018.05); H04N 19/513 (2014.11); H04N 21/2353 (2013.01); H04N 21/8549 (2013.01); H04N 23/60 (2023.01); H04N 23/62 (2023.01); G06V 20/44 (2022.01); G06V 40/15 (2022.01); G06V 40/179 (2022.01); G06V 2201/10 (2022.01); G10L 2015/0631 (2013.01); G10L 2015/223 (2013.01); G10L 25/54 (2013.01); H04N 2201/3228 (2013.01); H04N 2201/3229 (2013.01); H04N 2201/3256 (2013.01)]

20 Claims

1. A system configured to generate a video summary, the system comprising:

a video server connected to a client device over a network, the video server configured to:

access metadata for a video captured by a camera, the metadata for the video including motion data of the camera generated by a motion sensor of the camera, the motion data including speed and/or acceleration vectors representative of motion of the camera during the capture of the video, the camera carried by a user during the capture of the video;

analyze the metadata for the video to identify scenes of the video, wherein the analysis of the metadata for the video includes:

generation of scores for the scenes based on the speed and/or acceleration vectors of the camera associated with the scenes of the video; and

identification of jumps and/or crashes by the user carrying the camera during the capture of the video based on the speed and/or acceleration vectors of the camera associated with the scenes of the video;

rank the scenes of the video for potential inclusion in the video summary based on the scores for the scenes and the identification of jumps and/or crashes by the user carrying the camera during the capture of the video, wherein a given scene is ranked among other scenes for potential inclusion in the video summary based on a given score for the given scene and whether the user jumped or crashed within the given scene;

select one or more of the scenes of the video for inclusion in the video summary based on the ranking of the scenes of the video;

generate a video file for the video summary that includes the one or more selected scenes of the video; and

provide the video file of the video summary over the network to the client device for playback of the video summary on the client device.