US 12,437,448 B2
System and methods for multimodal series transformation for optimal compressibility with neural upsampling
Brian Galvin, Silverdale, WA (US)
Assigned to ATOMBEAM TECHNOLOGIES INC., Moraga, CA (US)
Filed by AtomBeam Technologies Inc., Moraga, CA (US)
Filed on Mar. 22, 2025, as Appl. No. 19/087,497.
Application 19/087,497 is a continuation in part of application No. 18/915,030, filed on Oct. 14, 2024, granted, now 12,262,036.
Application 18/915,030 is a continuation in part of application No. 18/668,163, filed on May 18, 2024, granted, now 12,167,031, issued on Dec. 10, 2024.
Application 18/668,163 is a continuation in part of application No. 18/537,728, filed on Dec. 12, 2023, granted, now 12,058,333, issued on Aug. 6, 2024.
Prior Publication US 2025/0218053 A1, Jul. 3, 2025
This patent is subject to a terminal disclaimer.
Int. Cl. G06T 9/00 (2006.01); G06T 7/33 (2017.01); G06T 7/38 (2017.01); G06T 11/00 (2006.01); H04N 19/66 (2014.01); H04N 19/86 (2014.01)
CPC G06T 9/002 (2013.01) [G06T 7/337 (2017.01); G06T 7/38 (2017.01); G06T 11/00 (2013.01); G06T 2207/10016 (2013.01); G06T 2207/10044 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/20084 (2013.01); H04N 19/66 (2014.11); H04N 19/86 (2014.11)]
16 Claims
1. A computer system comprising:
a hardware memory, wherein the computer system is configured to execute software instructions stored on nontransitory machine-readable storage media that:
collects a plurality of multimodal data;
processes each modality through a corresponding specialized preprocessor;
aligns and registers the preprocessed multimodal data to a common spatial-temporal reference frame;
trains an angle optimizer using the aligned multimodal data to determine optimal slicing angles that maximize compressibility while preserving cross-modal relationships;
slices the multimodal data along an optimal angle, as determined by the angle optimizer;
reconstructs the sliced multimodal data into a plurality of reconstructed representations;
encodes the plurality of reconstructed multimodal data into a plurality of compressed representations;
applies error resilience techniques to the compressed representations; and
decodes the plurality of compressed representations into a plurality of decompressed representations.
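
The slicing, encoding, error-resilience, and decoding steps recited above can be illustrated with a small sketch. It is not the patented implementation, and every name in it is hypothetical. It assumes a single 2-D grid of byte values stands in for the aligned multimodal data, replaces the trained angle optimizer with an exhaustive search over three candidate angles (0, 45, and 90 degrees), uses zlib's DEFLATE as the encoder, and uses a CRC-32 checksum as a stand-in for error resilience (it detects corruption but cannot repair it). Preprocessing and alignment are omitted.

```python
import zlib


def slice_order(height, width, angle):
    """Return the (row, col) visiting order for a hypothetical slicing angle."""
    cells = [(r, c) for r in range(height) for c in range(width)]
    if angle == 0:    # row by row
        return cells
    if angle == 90:   # column by column
        return sorted(cells, key=lambda rc: (rc[1], rc[0]))
    if angle == 45:   # along anti-diagonals
        return sorted(cells, key=lambda rc: (rc[0] + rc[1], rc[0]))
    raise ValueError(f"unsupported angle: {angle}")


def slice_grid(grid, angle):
    """Flatten the grid into bytes by walking it in the order for `angle`."""
    order = slice_order(len(grid), len(grid[0]), angle)
    return bytes(grid[r][c] for r, c in order)


def best_angle(grid, angles=(0, 45, 90)):
    """Exhaustive search (an assumption, not a trained optimizer):
    pick the angle whose sliced stream compresses to the fewest bytes."""
    return min(angles, key=lambda a: len(zlib.compress(slice_grid(grid, a), 9)))


def encode(grid):
    """Slice along the best angle, compress, and attach a CRC-32 checksum."""
    angle = best_angle(grid)
    payload = zlib.compress(slice_grid(grid, angle), 9)
    return {
        "angle": angle,
        "shape": (len(grid), len(grid[0])),
        "payload": payload,
        "crc": zlib.crc32(payload),
    }


def decode(packet):
    """Verify the checksum, decompress, and undo the slicing order."""
    if zlib.crc32(packet["payload"]) != packet["crc"]:
        raise ValueError("payload failed CRC-32 check")
    flat = zlib.decompress(packet["payload"])
    height, width = packet["shape"]
    grid = [[0] * width for _ in range(height)]
    for value, (r, c) in zip(flat, slice_order(height, width, packet["angle"])):
        grid[r][c] = value
    return grid
```

Calling `decode(encode(grid))` on a grid of integers in the range 0 to 255 returns the original grid. The chosen angle travels with the compressed payload because the decoder needs it to reverse the traversal; a learned optimizer could replace `best_angle` without changing the rest of the sketch.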