US 12,438,995 B1
	Integration of video language models with AI for filmmaking
Benjamin Geza Affleck-Boldt, West Hollywood, CA (US)
Assigned to FIN BONE, LLC, Los Angeles, CA (US)
Filed by FIN BONE, LLC, Los Angeles, CA (US)
Filed on Nov. 25, 2024, as Appl. No. 18/959,339.
Claims priority of provisional application 63/657,756, filed on Jun. 7, 2024.
Int. Cl. H04N 5/222 (2006.01); G06F 16/783 (2019.01); G06T 7/521 (2017.01)

CPC H04N 5/2224 (2013.01) [G06F 16/7837 (2019.01); G06T 7/521 (2017.01)]

19 Claims

1. A computer-implemented method for integrating one or more existing video large language models (LLMs) with custom AI algorithms for filmmaking, the method comprising:

interfacing with an existing video LLM;

receiving detailed metadata related to professional filmmaking techniques, including camera settings, shot composition, and lighting setups;

processing the received metadata to adapt the existing video LLM to generate video content that simulates professional filmmaking techniques;

receiving Lidar data captured from a lidar sensor, the Lidar data including spatial coordinates, distance measurements and relative positional information of objects within a scene; and

integrating the Lidar data with the processed metadata by combining the spatial coordinates and distance measurements from the Lidar data with the filmmaking metadata to enhance the generated video content by providing a three-dimensional spatial understanding of the scene indicating depths and positional relationships among objects; and

applying transfer learning techniques to the existing video LLM based on the processed metadata and Lidar data to refine its video content generation capabilities.