| CPC G10H 1/0008 (2013.01) [G10H 2210/076 (2013.01); G10H 2210/265 (2013.01); G10H 2210/391 (2013.01); G10H 2220/116 (2013.01); G10H 2240/325 (2013.01)] | 20 Claims |

|
1. A computer-implemented method for combining audio tracks, the method comprising:
receiving a first audio track and a second audio track;
separating the first audio track into a vocal component and one or more accompaniment components;
separating the second audio track into a vocal component and one or more accompaniment components;
determining a structure of the first audio track and a structure of the second audio track;
aligning the vocal component of the first audio track and one of the one or more accompaniment components of the second audio track based on the determined structures of the tracks;
displaying, on a user interface, a visualization of the vocal component of the first audio track and the one or more accompaniment components of the second audio track at a first alignment;
adjusting the first alignment upon receiving, via the user interface, a user input corresponding to a change in an alignment between sections of the vocal component of the first audio track and sections of the one or more accompaniment components of the second audio track;
stretching the vocal component of the first audio track to match a tempo of the second audio track; and
generating a mixed audio by adding the stretched vocal component of the first audio track to the one or more accompaniment components of the second audio track.
|
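The last two steps of claim 1 (stretching the vocal component to the tempo of the second track, then adding it to the accompaniment) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the patent: the function names `stretch_to_tempo` and `mix_at_offset` are hypothetical, audio is modeled as plain lists of float samples, tempos are assumed to be already known in BPM, and the alignment is reduced to a single sample offset. The stretch is naive linear-interpolation resampling, which also shifts pitch; a real system would more likely use a pitch-preserving time-stretch such as a phase vocoder.

```python
def stretch_to_tempo(samples, source_bpm, target_bpm):
    """Naively resample so material at source_bpm plays at target_bpm.

    Assumption: linear interpolation stands in for a real time-stretch.
    Unlike a phase vocoder, this also changes pitch.
    """
    if not samples:
        return []
    rate = target_bpm / source_bpm            # > 1 speeds up, so output is shorter
    out_len = max(1, round(len(samples) / rate))
    out = []
    for i in range(out_len):
        pos = i * rate                        # fractional read position in the input
        lo = min(int(pos), len(samples) - 1)
        hi = min(lo + 1, len(samples) - 1)    # clamp at the final sample
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


def mix_at_offset(vocal, accompaniment, offset):
    """Add vocal onto accompaniment, starting at sample index offset (>= 0).

    Assumption: the offset represents the (possibly user-adjusted) alignment.
    """
    length = max(len(accompaniment), offset + len(vocal))
    mixed = [0.0] * length
    for i, sample in enumerate(accompaniment):
        mixed[i] += sample
    for i, sample in enumerate(vocal):
        mixed[offset + i] += sample
    return mixed


# Hypothetical usage: a 120 BPM vocal slowed to a 60 BPM accompaniment.
vocal = stretch_to_tempo([0.0, 1.0, 2.0, 3.0], source_bpm=120, target_bpm=60)
mixed = mix_at_offset(vocal, [0.5] * 4, offset=2)
```

Halving the tempo doubles the length of the vocal, so the mixed output extends past the end of the shorter accompaniment rather than being truncated. No gain normalization or clipping protection is applied in this sketch.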