US 12,323,643 B2
File parser, file generator, encoder, decoder, client, server, and methods using parameter sets for coded video sequences
Yago Sánchez De La Fuente, Berlin (DE); Robert Skupin, Berlin (DE); Cornelius Hellge, Berlin (DE); Thomas Schierl, Berlin (DE); Karsten Grueneberg, Berlin (DE); and Thomas Wiegand, Berlin (DE)
Assigned to Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V., Munich (DE)
Filed by Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V., Munich (DE)
Filed on Jun. 20, 2023, as Appl. No. 18/338,160.
Application 18/338,160 is a continuation of application No. PCT/EP2021/086875, filed on Dec. 20, 2021.
Claims priority of application No. 20216252 (EP), filed on Dec. 21, 2020.
Prior Publication US 2023/0336798 A1, Oct. 19, 2023
Int. Cl. H04N 21/2343 (2011.01); H04N 21/845 (2011.01)
CPC H04N 21/234309 (2013.01) [H04N 21/8456 (2013.01)]
26 Claims
1. A processor implemented file parser for reconstructing a video bitstream from a video file, wherein the video file comprises different tracks comprising at least two input video bitstreams embedded therein, wherein a video content is coded differently in said at least two input video bitstreams, and wherein each input video bitstream comprises one or more coded video sequences with random access points,
wherein the file parser is configured to
retrieve from the video file
switching information which indicates whether the different tracks comprise, and/or which of the different tracks are, one or more switch-to candidates for a respective track, and
one or more initialization parameter sets to be inserted into a reconstructed bitstream and to be referenced by samples thereof, wherein said reconstructed bitstream comprises either one of
the input video bitstream that is embedded in the respective track, and
a mixed video bitstream which results from switching between the input video bitstream that is embedded in the respective track and an input video bitstream that is embedded in the one or more switch-to candidates belonging to the respective track,
and wherein the file parser is further configured to
provide a decoder with a predetermined initialization parameter set retrieved from the video file for a track of interest and feed the decoder
with the input video bitstream that is embedded in the respective track, and/or
with a mixed video bitstream resulting from switching between the input video bitstream that is embedded in the respective track and the input video bitstream that is embedded in the one or more switch-to candidates belonging to the respective track,
wherein the file parser is further configured to retrieve an Entity Group comprising references to one or more tracks that can be switched among each other under continuing re-use of the predetermined initialization parameter set that was initially used to reconstruct the bitstream at the start of the coded video sequence,
wherein each track belonging to the Entity Group comprises a track switching hierarchy ID indicating a switching hierarchy level of the respective track, and wherein,
if the bitstream was reconstructed at the start of the coded video sequence by using the predetermined initialization parameter set of a predetermined track belonging to the Entity Group, which predetermined track comprises a predetermined track switching hierarchy ID indicating a predetermined switching hierarchy level,
then a switch from any switch-from track of the Entity Group to any potential switch-to candidate of the Entity Group is allowed,
if the respective switch-to candidate comprises a track switching hierarchy ID indicating a switching hierarchy level being the same as, or being smaller than, said predetermined switching hierarchy level of said predetermined track (used for initialization).
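
The switching condition recited at the end of claim 1 can be illustrated with a minimal sketch. It is not the claimed file syntax: the names `Track`, `EntityGroup`, `switching_hierarchy_id` and `is_switch_allowed` are hypothetical and chosen only for illustration. The sketch assumes that the decoder was initialized with the parameter set of one track of the Entity Group, and it permits a switch whenever both tracks belong to the group and the switching hierarchy level of the switch-to candidate is the same as, or smaller than, the level of the track used for initialization.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Track:
    """Hypothetical view of a track; field names are assumptions, not file syntax."""
    track_id: int
    switching_hierarchy_id: int  # switching hierarchy level of this track


@dataclass
class EntityGroup:
    """Hypothetical Entity Group: tracks that may share one initialization parameter set."""
    tracks: List[Track]

    def by_id(self) -> Dict[int, Track]:
        return {t.track_id: t for t in self.tracks}


def is_switch_allowed(group: EntityGroup, init_track_id: int,
                      switch_from_id: int, switch_to_id: int) -> bool:
    """Return True if switching is allowed under the rule recited in claim 1.

    init_track_id identifies the track whose initialization parameter set was
    used to reconstruct the bitstream at the start of the coded video sequence.
    """
    tracks = group.by_id()
    # All three tracks must belong to the Entity Group.
    if not all(i in tracks for i in (init_track_id, switch_from_id, switch_to_id)):
        return False
    init_level = tracks[init_track_id].switching_hierarchy_id
    target_level = tracks[switch_to_id].switching_hierarchy_id
    # The switch-from track may be any track of the group; only the level of
    # the switch-to candidate is compared against the initialization level.
    return target_level <= init_level


group = EntityGroup(tracks=[Track(1, 0), Track(2, 1), Track(3, 2)])
print(is_switch_allowed(group, init_track_id=2, switch_from_id=2, switch_to_id=1))
print(is_switch_allowed(group, init_track_id=2, switch_from_id=1, switch_to_id=3))
```

In this example the decoder was initialized from track 2 (level 1), so tracks 1 and 2 are valid switch-to candidates while track 3 (level 2) is not. Initializing from track 3 instead would make every track of the group a valid candidate.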