| CPC H04N 7/15 (2013.01) [H04N 21/4318 (2013.01); H04N 21/4394 (2013.01); H04N 21/4396 (2013.01); H04N 21/45457 (2013.01)] | 24 Claims |

|
1. An apparatus comprising:
at least one memory;
machine-readable instructions; and
at least one processor circuit to be programmed by the machine-readable instructions to:
detect a first visual event associated with an activity in first image data of a video stream output by a camera associated with a user device, the activity associated with a likelihood of noise, the first image data associated with a first time;
invoke a neural network to process the first image data to classify the first visual event as one of a first stage of the activity or a second stage of the activity, the first stage preceding generation of the noise, the second stage associated with the generation of the noise;
cause application of an audio filter to a first portion of an audio stream corresponding to the first image data based on classification of the first visual event as the first stage or the second stage;
detect a second visual event associated with the activity based on second image data of the video stream, the second image data associated with a second time, the second time after the first time;
invoke the neural network to process the second image data to classify the second visual event as a third stage of the activity, the third stage different than the first stage and the second stage; and
cause presentation of the second image data without audio filtering based on classification of the second visual event as the third stage.
|
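The control flow recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation. The names `Stage`, `should_filter`, `attenuate`, `process`, and `stub_classifier` are hypothetical. The neural network is replaced by a lookup-table stub, and the "audio filter" is assumed to be a simple gain reduction, since the claim does not specify the filter type.

```python
from enum import Enum
from typing import Callable, List, Sequence


class Stage(Enum):
    """Hypothetical labels for the three stages recited in the claim."""
    PRE_NOISE = 1  # first stage: precedes generation of the noise
    NOISE = 2      # second stage: associated with generation of the noise
    OTHER = 3      # third stage: different from the first and second stages


def should_filter(stage: Stage) -> bool:
    """Filter audio only when the visual event is in the first or second stage."""
    return stage in (Stage.PRE_NOISE, Stage.NOISE)


def attenuate(samples: Sequence[float], gain: float = 0.0) -> List[float]:
    """Assumed stand-in for the audio filter: scale each sample by a gain."""
    return [s * gain for s in samples]


def process(
    frames: Sequence[str],
    audio_chunks: Sequence[Sequence[float]],
    classify: Callable[[str], Stage],
) -> List[List[float]]:
    """Pair each frame with its time-aligned audio chunk and filter by stage."""
    output = []
    for frame, chunk in zip(frames, audio_chunks):
        stage = classify(frame)  # stands in for invoking the neural network
        if should_filter(stage):
            output.append(attenuate(chunk))
        else:
            output.append(list(chunk))  # passed through without audio filtering
    return output


def stub_classifier(frame: str) -> Stage:
    """Hypothetical classifier: maps a frame label to a stage via a lookup table."""
    table = {"inhale": Stage.PRE_NOISE, "sneeze": Stage.NOISE, "recover": Stage.OTHER}
    return table[frame]
```

In this sketch, frames classified as the first or second stage have their time-aligned audio attenuated, while a later frame classified as the third stage is passed through unfiltered, mirroring the first-time and second-time limitations of the claim.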