CPC G16B 40/00 (2019.02) [G16B 5/00 (2019.02)] | 15 Claims |
1. A method, comprising:
featurizing data from raw MS data, wherein the raw MS data comprises a plurality of analog samples, each representing mass/charge intensity, the featurizing comprising, for each of the analog samples, the steps of
centering the sample within a retention time window;
discretizing the sample into a sequence of points representing intensity values;
standardizing the intensity values; and
assigning labels to points among the sequence of points corresponding to peak start and peak stop times to produce a labeled sequence of points;
wherein the labeled sequence of points are configured for training a machine learning model to predict an abundance in unseen MS data.
|