assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002240055.1_ASM224005v1	NZ_CP016753	Leptotrichia sp. oral taxon 498 strain F0590 chromosome, complete genome	1	146094-147323	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no		cas14j,cas14k,cas3,csa3,WYL,DEDDh,csx1,csx20,cas1,cas2,cas4,cas10,csx10gr5,csm3gr7,cas6,PD-DExK,DinG	Orphan	GTTAAAGTAGTGAATCCATTAAAATAAGGATTGAAAC,GTTAAAGTAGTGAATCCATTAAAATAAGGATTGAAAC,GTTAAAGTAGTGAATCCATTAAAATAAGGATTGAAAC	37,37,37	0	0	NA	NA	I-B:I-B:I-B	10,16,16	16	Orphan	cas14j,cas14k,cas3,csa3,WYL,DEDDh,csx1,csx20,cas1,cas2,cas4,cas10,csx10gr5,csm3gr7,cas6,PD-DExK,DinG	NA,NA	NA|393aa|up_9|NZ_CP016753.1_128148_129327_-	PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated	NA|71aa|up_8|NZ_CP016753.1_131265_131478_-	PRK00392, rpoZ, DNA-directed RNA polymerase subunit omega; Reviewed	NA|182aa|up_7|NZ_CP016753.1_131521_132067_-	PRK00300, gmk, guanylate kinase; Provisional	NA|1340aa|up_6|NZ_CP016753.1_132170_136190_-	PRK00566, PRK00566, DNA-directed RNA polymerase subunit beta'; Provisional	NA|1149aa|up_5|NZ_CP016753.1_136259_139706_-	PRK00405, rpoB, DNA-directed RNA polymerase subunit beta; Reviewed	NA|123aa|up_4|NZ_CP016753.1_140046_140415_-	PRK00157, rplL, 50S ribosomal protein L7/L12; Reviewed	NA|170aa|up_3|NZ_CP016753.1_140462_140972_-	PRK00099, rplJ, 50S ribosomal protein L10; Reviewed	NA|557aa|up_2|NZ_CP016753.1_141361_143032_+	pfam01268, FTHFS, Formate--tetrahydrofolate ligase	NA|428aa|up_1|NZ_CP016753.1_143244_144528_-	TIGR03025, EPS_sugtrans, exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase	NA|342aa|up_0|NZ_CP016753.1_144577_145603_-	COG3274, COG3274, Predicted O-acyltransferase [General function prediction only]	NA|145aa|down_0|NZ_CP016753.1_147630_148065_+	PRK09216, rplM, 50S ribosomal protein L13; Reviewed	NA|133aa|down_1|NZ_CP016753.1_148078_148477_+	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|454aa|down_2|NZ_CP016753.1_148705_150067_-	cd05802, GlmM, GlmM is a bacterial phosphoglucosamine mutase (PNGM) that belongs to the alpha-D-phosphohexomutase superfamily	NA|531aa|down_3|NZ_CP016753.1_150104_151697_-	TIGR01386, Probable_sensor_protein_PcoS, heavy metal sensor kinase	NA|228aa|down_4|NZ_CP016753.1_151708_152392_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|423aa|down_5|NZ_CP016753.1_152617_153886_-	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|481aa|down_6|NZ_CP016753.1_153930_155373_-	TIGR00665, DnaB, replicative DNA helicase	NA|151aa|down_7|NZ_CP016753.1_155397_155850_-	PRK00137, rplI, 50S ribosomal protein L9; Reviewed	NA|521aa|down_8|NZ_CP016753.1_155891_157454_-	TIGR02397, DNA_polymerase_III_subunit_gamma, DNA polymerase III, subunit gamma and tau	NA|214aa|down_9|NZ_CP016753.1_157613_158255_-	cd03014, PRX_Atyp2cys, Peroxiredoxin (PRX) family, Atypical 2-cys PRX subfamily; composed of PRXs containing peroxidatic and resolving cysteines, similar to the homodimeric thiol specific antioxidant (TSA) protein also known as TRX-dependent thiol peroxidase (Tpx)
GCF_002240055.1_ASM224005v1	NZ_CP016753	Leptotrichia sp. oral taxon 498 strain F0590 chromosome, complete genome	2	1309513-1310360	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no	csx1,csx20,cas1,cas2,cas4,cas10	cas14j,cas14k,cas3,csa3,WYL,DEDDh,csx1,csx20,cas1,cas2,cas4,cas10,csx10gr5,csm3gr7,cas6,PD-DExK,DinG	Type III-C,Type III-B,Type III-A,,Type III-D	GTTAAAGTAGTTTATCCATTAAAACAAGGATTGAAAC,GTTAAAGTAGTTTATCCATTAAAACAAGGATTGAAAC,GTTAAAGTAGTTTATCCATTAAAACAAGGATTGAAAC	37,37,37	0	0	NA	NA	I-B:I-B:I-B	10,11,10	11	,TypeIII-C,TypeIII-B,TypeIII-A,TypeIII-D	cas14j,cas14k,cas3,csa3,WYL,DEDDh,csx1,csx20,cas1,cas2,cas4,cas10,csx10gr5,csm3gr7,cas6,PD-DExK,DinG	NA|213aa|up_7|NZ_CP016753.1_1302219_1302858_+,NA|191aa|up_6|NZ_CP016753.1_1302884_1303457_+,NA|195aa|up_5|NZ_CP016753.1_1303453_1304038_+,NA|73aa|up_3|NZ_CP016753.1_1305543_1305762_+,NA|67aa|down_2|NZ_CP016753.1_1312736_1312937_+,csx20|129aa|down_4|NZ_CP016753.1_1315325_1315712_+,NA|711aa|down_6|NZ_CP016753.1_1318001_1320134_+	NA|488aa|up_9|NZ_CP016753.1_1299784_1301248_+	COG1686, DacC, D-alanyl-D-alanine carboxypeptidase [Cell envelope biogenesis, outer membrane]	NA|249aa|up_8|NZ_CP016753.1_1301373_1302120_+	PRK05450, PRK05450, 3-deoxy-manno-octulosonate cytidylyltransferase; Provisional	NA|213aa|up_7|NZ_CP016753.1_1302219_1302858_+	NA	NA|191aa|up_6|NZ_CP016753.1_1302884_1303457_+	NA	NA|195aa|up_5|NZ_CP016753.1_1303453_1304038_+	NA	NA|483aa|up_4|NZ_CP016753.1_1304047_1305496_+	PRK09441, PRK09441, cytoplasmic alpha-amylase; Reviewed	NA|73aa|up_3|NZ_CP016753.1_1305543_1305762_+	NA	NA|381aa|up_2|NZ_CP016753.1_1305806_1306949_+	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional	NA|420aa|up_1|NZ_CP016753.1_1307095_1308355_+	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|269aa|up_0|NZ_CP016753.1_1308413_1309220_+	PRK11880, PRK11880, pyrroline-5-carboxylate reductase; Reviewed	NA|231aa|down_0|NZ_CP016753.1_1310601_1311294_+	COG3596, COG3596, Predicted GTPase [General function prediction only]	NA|335aa|down_1|NZ_CP016753.1_1311618_1312623_+	COG3596, COG3596, Predicted GTPase [General function prediction only]	NA|67aa|down_2|NZ_CP016753.1_1312736_1312937_+	NA	csx1|705aa|down_3|NZ_CP016753.1_1313106_1315221_+	cd09732, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	csx20|129aa|down_4|NZ_CP016753.1_1315325_1315712_+	NA	csx1|711aa|down_5|NZ_CP016753.1_1315757_1317890_+	pfam09455, Cas_DxTHG, CRISPR-associated (Cas) DxTHG family	NA|711aa|down_6|NZ_CP016753.1_1318001_1320134_+	NA	NA|154aa|down_7|NZ_CP016753.1_1320321_1320783_+	pfam13353, Fer4_12, 4Fe-4S single cluster domain	NA|533aa|down_8|NZ_CP016753.1_1320772_1322371_+	pfam13597, NRDD, Anaerobic ribonucleoside-triphosphate reductase	NA|371aa|down_9|NZ_CP016753.1_1322533_1323646_+	COG3596, COG3596, Predicted GTPase [General function prediction only]
GCF_002240055.1_ASM224005v1	NZ_CP016753	Leptotrichia sp. oral taxon 498 strain F0590 chromosome, complete genome	3	1326856-1326965	3	CRISPRCasFinder	no	csx1,csx20,cas1,cas2,cas4,cas10,csx10gr5,csm3gr7,cas6	cas14j,cas14k,cas3,csa3,WYL,DEDDh,csx1,csx20,cas1,cas2,cas4,cas10,csx10gr5,csm3gr7,cas6,PD-DExK,DinG	Type III-C,Type III-B,Type III-A,,Type III-D	GTTAAAGAAGTGAATCCATTAAAACAAGGATTGAAA	36	0	0	NA	NA	I-B	1	1	,TypeIII-C,TypeIII-B,TypeIII-A,TypeIII-D	cas14j,cas14k,cas3,csa3,WYL,DEDDh,csx1,csx20,cas1,cas2,cas4,cas10,csx10gr5,csm3gr7,cas6,PD-DExK,DinG	csx20|129aa|up_9|NZ_CP016753.1_1315325_1315712_+,NA|711aa|up_7|NZ_CP016753.1_1318001_1320134_+,NA|251aa|down_0|NZ_CP016753.1_1327301_1328054_+,NA|98aa|down_3|NZ_CP016753.1_1331935_1332229_+,NA|144aa|down_6|NZ_CP016753.1_1334225_1334657_+,NA|130aa|down_9|NZ_CP016753.1_1337695_1338085_-	csx20|129aa|up_9|NZ_CP016753.1_1315325_1315712_+	NA	csx1|711aa|up_8|NZ_CP016753.1_1315757_1317890_+	pfam09455, Cas_DxTHG, CRISPR-associated (Cas) DxTHG family	NA|711aa|up_7|NZ_CP016753.1_1318001_1320134_+	NA	NA|154aa|up_6|NZ_CP016753.1_1320321_1320783_+	pfam13353, Fer4_12, 4Fe-4S single cluster domain	NA|533aa|up_5|NZ_CP016753.1_1320772_1322371_+	pfam13597, NRDD, Anaerobic ribonucleoside-triphosphate reductase	NA|371aa|up_4|NZ_CP016753.1_1322533_1323646_+	COG3596, COG3596, Predicted GTPase [General function prediction only]	NA|205aa|up_3|NZ_CP016753.1_1323792_1324407_+	pfam12787, EcsC, EcsC protein family	cas1|324aa|up_2|NZ_CP016753.1_1324546_1325518_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|97aa|up_1|NZ_CP016753.1_1325577_1325868_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	cas4|216aa|up_0|NZ_CP016753.1_1325883_1326531_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	NA|251aa|down_0|NZ_CP016753.1_1327301_1328054_+	NA	cas10|656aa|down_1|NZ_CP016753.1_1328131_1330099_+	TIGR02577, thermophile-specific_DNA_repair_system, CRISPR-associated protein Cas10/Cmr2, subtype III-B	csx10gr5|608aa|down_2|NZ_CP016753.1_1330088_1331912_+	pfam03787, RAMPs, RAMP superfamily	NA|98aa|down_3|NZ_CP016753.1_1331935_1332229_+	NA	csm3gr7|277aa|down_4|NZ_CP016753.1_1332479_1333310_+	cd09683, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm3gr7|302aa|down_5|NZ_CP016753.1_1333320_1334226_+	pfam03787, RAMPs, RAMP superfamily	NA|144aa|down_6|NZ_CP016753.1_1334225_1334657_+	NA	csm3gr7|705aa|down_7|NZ_CP016753.1_1334637_1336752_+	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	cas6|217aa|down_8|NZ_CP016753.1_1336772_1337423_+	pfam17262, DUF5328, Family of unknown function (DUF5328)	NA|130aa|down_9|NZ_CP016753.1_1337695_1338085_-	NA
GCF_002240055.1_ASM224005v1	NZ_CP016753	Leptotrichia sp. oral taxon 498 strain F0590 chromosome, complete genome	4	1930812-1931450	4,3,3	CRISPRCasFinder,CRT,PILER-CR	no		cas14j,cas14k,cas3,csa3,WYL,DEDDh,csx1,csx20,cas1,cas2,cas4,cas10,csx10gr5,csm3gr7,cas6,PD-DExK,DinG	Orphan	GTTTCAATCCTTGTTTTAATGGATACACTACTTCAAC,GTTTCAATCCTTGTTTTAATGGATACACTACTTCAAC,GTTGAAGTAGTGTATCCATTAAAACAAGGATTGAAAC	37,37,37	0	0	NA	NA	I-B:I-B:I-B	8,8,8	8	Orphan	cas14j,cas14k,cas3,csa3,WYL,DEDDh,csx1,csx20,cas1,cas2,cas4,cas10,csx10gr5,csm3gr7,cas6,PD-DExK,DinG	NA|468aa|up_8|NZ_CP016753.1_1919899_1921303_+,NA|462aa|up_6|NZ_CP016753.1_1922355_1923741_+,NA|162aa|down_1|NZ_CP016753.1_1934018_1934504_-,NA|167aa|down_3|NZ_CP016753.1_1936992_1937493_-	NA|501aa|up_9|NZ_CP016753.1_1918227_1919730_-	PRK09225, PRK09225, threonine synthase; Validated	NA|468aa|up_8|NZ_CP016753.1_1919899_1921303_+	NA	NA|333aa|up_7|NZ_CP016753.1_1921305_1922304_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|462aa|up_6|NZ_CP016753.1_1922355_1923741_+	NA	NA|179aa|up_5|NZ_CP016753.1_1924079_1924616_+	pfam06210, DUF1003, Protein of unknown function (DUF1003)	NA|105aa|up_4|NZ_CP016753.1_1924623_1924938_+	cd06981, cupin_reut_a1446, Cupriavidus pinatubonensis reut_a1446 and related proteins, cupin domain	NA|285aa|up_3|NZ_CP016753.1_1924967_1925822_+	PLN02829, PLN02829, Probable galacturonosyltransferase	NA|1170aa|up_2|NZ_CP016753.1_1925936_1929446_-	TIGR02082, Methionine_synthase, 5-methyltetrahydrofolate--homocysteine methyltransferase	NA|188aa|up_1|NZ_CP016753.1_1929499_1930063_-	pfam02417, Chromate_transp, Chromate transporter	NA|198aa|up_0|NZ_CP016753.1_1930074_1930668_-	pfam02417, Chromate_transp, Chromate transporter	NA|673aa|down_0|NZ_CP016753.1_1931829_1933848_+	PRK07956, ligA, NAD-dependent DNA ligase LigA; Validated	NA|162aa|down_1|NZ_CP016753.1_1934018_1934504_-	NA	NA|336aa|down_2|NZ_CP016753.1_1934626_1935634_-	TIGR04171, ribonucleotide-diphosphate_reductase_subunit_beta, ribonucleoside-diphosphate reductase, class 1b, beta subunit	NA|167aa|down_3|NZ_CP016753.1_1936992_1937493_-	NA	NA|836aa|down_4|NZ_CP016753.1_1937530_1940038_-	COG0270, Dcm, Site-specific DNA methylase [DNA replication, recombination, and repair]	NA|700aa|down_5|NZ_CP016753.1_1940084_1942184_-	PRK07632, PRK07632, ribonucleotide-diphosphate reductase subunit alpha; Validated	NA|124aa|down_6|NZ_CP016753.1_1942266_1942638_-	PRK03600, nrdI, class Ib ribonucleoside-diphosphate reductase assembly flavoprotein NrdI	NA|80aa|down_7|NZ_CP016753.1_1942649_1942889_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|479aa|down_8|NZ_CP016753.1_1943217_1944654_-	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|383aa|down_9|NZ_CP016753.1_1944779_1945928_-	PRK07683, PRK07683, aminotransferase A; Validated
