| CPC G16B 40/00 (2019.02) [G06F 18/232 (2023.01); G16B 30/00 (2019.02)] | 11 Claims |

|
1. A method of a metagenomic analysis of a bacterial flora, comprising:
performing clustering on base sequences obtained by DNA sequencing of a sample containing the bacterial flora, which will be hereinafter referred to as a sample sequence, wherein the clustering comprises
grouping base sequences having 100% similarity with each other, for the sample sequences thereby to generate a primary OTU (Operational Taxonomic Unit); and
performing the clustering on the sample sequences thereby to generate a secondary OTU composed of a sequence of a predetermined centroid, and sample sequences that have a similarity larger than or equal to a predetermined threshold value with the sequence of the centroid, on condition that the predetermined threshold value is less than 100%, and
setting a sequence of a primary OTU having a largest number of sequences among the primary OTUs included in each secondary OTU, as a representative sequence of the secondary OTU, collating the representative sequence with a database, and thereby estimating a lineage of the secondary OTU.
|