CPC G06F 16/215 (2019.01) [H04N 21/44222 (2013.01)] | 20 Claims |
1. A system for processing multiple data sets to generate deduplicated audience measurement data, the system comprising:
a processor; and
at least one memory storing instructions that, when executed by the processor, cause the system to perform operations comprising:
receiving, via first network communications with one or more computing devices, a first set of data obtained by meter devices having a first meter device type,
receiving, via second network communications with the one or more computing devices, a second set of data obtained by meter devices having a second meter device type, wherein the first meter device type is different from the second meter device type;
processing the first set of data and the second set of data to identify a first media presentation device represented by the first set of data and a second media presentation device represented by the second set of data as a possible common media presentation device;
calculating at least one of a station duration metric, a time match metric or a station path metric, wherein:
i) the station duration metric is based on a first set of durations of time that the first media presentation device tuned to a first set of stations and a second set of durations of time that the second media presentation device tuned to the first set of stations,
ii) the time match metric is based on a first set of times of day that the first media presentation device tuned to a second set of stations and a second set of times of day that the second media presentation device tuned to the second set of stations, and
iii) the station path metric based on a first sequence of stations tuned to by the first media presentation device and a second sequence of stations tuned to by the second media presentation device;
determining a score based on the at least one of the station duration metric, the time match metric, or the station path metric;
determining that the first media presentation device and the second media presentation device are a common media presentation device based on the score;
processing the first set of data by removing data corresponding to the common media presentation device to generate a final data set; and
storing, in the at least one memory, the final data set as deduplicated audience measurement data.
|