US 12,216,707 B1
System and method for managing storage space in a data management system
Prem Pradeep Motgi, Austin, TX (US); Dharmesh M. Patel, Round Rock, TX (US); and Manpreet Singh Sokhi, Santa Clara, CA (US)
Assigned to Dell Products L.P., Round Rock, TX (US)
Filed by Dell Products L.P., Round Rock, TX (US)
Filed on Aug. 30, 2023, as Appl. No. 18/458,412.
Int. Cl. G06F 16/00 (2019.01); G06F 16/65 (2019.01); G06F 16/683 (2019.01)
CPC G06F 16/65 (2019.01) [G06F 16/685 (2019.01)]
20 Claims
 
1. A method for managing storage space in a data management system, the method comprising:
identifying an occurrence of a storage space management event indicating limited storage space availability in the data management system;
remediating the storage space management event by at least:
identifying a portion of data managed by the data management system for deletion using topic classifications for the data and topic rankings for the topic classifications, each of the data is associated with one or more of the topic classifications and the identifying comprises:
determining a topic ranking of the topic rankings for each of the topic classifications, the topic ranking indicating a rank of each of the topic classifications,
obtaining a quantification for each of the data using the ranks of all of the one or more topic classifications associated with each respective one of the data,
generating, based on the quantification, a rank order of the data from a highest value quantification to a lowest value quantification, the rank order indicating a relevancy of the data to a user associated with the data with the highest value quantification indicating highest relevancy data to the user and the lowest value quantification indicating lowest relevancy data to the user, and
identifying the portion of the data using the rank order starting from data associated with the lowest value quantification; and
deleting the identified portion of the data to resolve the limited storage space availability in the data management system by deleting the lowest relevancy data to the user while retaining the highest relevancy data to the user; and
prior to the data being managed by the data management system:
obtaining, by the data management system, the data from one or more data sources and a file comprising data based on at least one conversation between two people;
after obtaining the data and by the data management system:
generating a first set of topics for each of the data using a classification model hosted by the data management system;
generating a second set of topics for each of the data based on topics identified in the at least one conversation between the two people; and
filtering out from the first set of topics any topics not included in the second set of topics to obtain a third set of topics for each of the data, the third set of topics being the topic classifications; and
storing the data with the topic classifications in a storage of the data management system.
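
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation, and several details are assumptions the claim does not specify: the names `Item`, `filter_topics`, `quantify`, and `select_for_deletion` are hypothetical, a larger topic rank is taken to mean a more relevant topic, the quantification is taken to be the sum of the ranks of an item's topics, and the storage shortfall is expressed as a number of bytes to free.

```python
from dataclasses import dataclass


@dataclass
class Item:
    """A stored piece of data with its topic classifications (hypothetical type)."""
    name: str
    size: int    # bytes occupied in storage
    topics: set  # the "third set of topics" kept after filtering


def filter_topics(model_topics, conversation_topics):
    """Drop model-generated topics that do not appear in the conversation topics."""
    return set(model_topics) & set(conversation_topics)


def quantify(item, topic_rank):
    """Assumed quantification: sum of the ranks of all topics on the item."""
    return sum(topic_rank[t] for t in item.topics)


def select_for_deletion(items, topic_rank, bytes_needed):
    """Pick the lowest-relevancy items until enough space would be freed."""
    # Rank order from highest to lowest quantification, as in the claim.
    ordered = sorted(items, key=lambda it: quantify(it, topic_rank), reverse=True)
    selected, freed = [], 0
    # Walk the rank order starting from the lowest quantification.
    for item in reversed(ordered):
        if freed >= bytes_needed:
            break
        selected.append(item)
        freed += item.size
    return selected


# Illustrative data only; the topic names and rank values are invented.
topic_rank = {"finance": 3, "health": 2, "travel": 1}
items = [
    Item("a.txt", 100, filter_topics(["finance", "travel", "sports"],
                                     ["finance", "travel"])),
    Item("b.txt", 300, filter_topics(["health"], ["health", "finance"])),
    Item("c.txt", 200, filter_topics(["travel", "music"], ["travel"])),
]
to_delete = select_for_deletion(items, topic_rank, bytes_needed=400)
print([it.name for it in to_delete])
```

In this sketch the selection stops as soon as the requested space is covered, so the items with the highest quantification are retained. A different aggregate (for example, the maximum rank rather than the sum) would fit the claim language equally well.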