| CPC G06F 16/2379 (2019.01) [G06F 16/2246 (2019.01); G06F 16/2474 (2019.01); G06F 16/285 (2019.01); G06F 16/36 (2019.01); G16B 20/00 (2019.02); G16B 30/10 (2019.02)] | 20 Claims |

|
1. A computer-implemented method comprising:
detecting, at a computing system, that a remote taxonomy database has been updated to include an organism name that is not represented in a particular database, the particular database including organism-name data organized using a set of buckets, wherein each bucket of the set of buckets corresponds to an organism category;
accessing, by the computing system, metadata corresponding to the organism name, wherein:
the metadata includes two or more taxonomical classifications being associated with the organism name, and
the two or more taxonomical classifications indicate two or more hierarchy levels associated with the organism name within a taxonomy defined in the remote taxonomy database;
determining, by the computing system, whether the two or more hierarchy levels are represented on a same branch within the taxonomy;
identifying, by the computing system based on the determination that the two or more hierarchy levels are represented on the same branch within the taxonomy, that a particular bucket from amongst the set of buckets corresponds to the organism name;
updating, by the computing system, the particular database to associate the organism name with the particular bucket;
receiving, at the computing system, biological sequence data; and
determining, at the computing system, that the biological sequence data corresponds to a reference biological sequence associated with the particular bucket.
|
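As an illustration only, the same-branch determination and the bucket update recited in claim 1 could be sketched as follows. The claim does not specify any data structure or rule for selecting the bucket. The child-to-parent map `PARENT`, the node-to-bucket map `BUCKET_OF_NODE`, the function names, and the "nearest ancestor that has a bucket" rule are all assumptions made for this sketch.

```python
from typing import Dict, List, Optional, Set

# Hypothetical toy taxonomy: child -> parent (None marks the root).
PARENT: Dict[str, Optional[str]] = {
    "Bacteria": None,
    "Proteobacteria": "Bacteria",
    "Escherichia": "Proteobacteria",
    "Firmicutes": "Bacteria",
    "Bacillus": "Firmicutes",
}

# Hypothetical mapping from a taxonomy node to a bucket (organism category).
BUCKET_OF_NODE: Dict[str, str] = {
    "Proteobacteria": "bucket_proteobacteria",
    "Firmicutes": "bucket_firmicutes",
}


def lineage(node: str) -> List[str]:
    """Return the path from `node` up to the root, starting with `node`."""
    path = []
    current: Optional[str] = node
    while current is not None:
        path.append(current)
        current = PARENT[current]
    return path


def on_same_branch(levels: List[str]) -> bool:
    """True if all hierarchy levels lie on a single root-to-leaf path."""
    # The deepest level must have every other level among its ancestors.
    deepest = max(levels, key=lambda n: len(lineage(n)))
    return set(levels) <= set(lineage(deepest))


def identify_bucket(levels: List[str]) -> Optional[str]:
    """Assumed rule: use the nearest ancestor that is mapped to a bucket."""
    if not on_same_branch(levels):
        return None
    deepest = max(levels, key=lambda n: len(lineage(n)))
    for node in lineage(deepest):
        if node in BUCKET_OF_NODE:
            return BUCKET_OF_NODE[node]
    return None


def register_organism(
    db: Dict[str, Set[str]], name: str, levels: List[str]
) -> Optional[str]:
    """Associate `name` with its bucket in `db` if one can be identified."""
    bucket = identify_bucket(levels)
    if bucket is not None:
        db.setdefault(bucket, set()).add(name)
    return bucket
```

In this sketch, classifications that fall on different branches (for example `Escherichia` together with `Firmicutes`) yield no bucket, so the local database is left unchanged for that organism name.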