assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000238215.1_ASM23821v1	NC_016610	Tannerella forsythia 92A2, complete sequence	1	612054-612207	1	CRISPRCasFinder	no		cas10,csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,DinG	Orphan	AGGGTCATTCAGAGCACTTCCTTTGTCATTCAGAGCGTCAGGGATGAATCT	51	0	0	NA	NA	NA	1	1	Orphan	cas10,csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,DinG	NA,NA|50aa|down_1|NC_016610.1_613018_613168_-,NA|561aa|down_3|NC_016610.1_623018_624701_-,NA|193aa|down_6|NC_016610.1_631632_632211_-,NA|86aa|down_7|NC_016610.1_632676_632934_-,NA|443aa|down_9|NC_016610.1_633998_635327_-	NA|154aa|up_9|NC_016610.1_596732_597194_-	PRK06558, PRK06558, V-type ATP synthase subunit K; Validated	NA|624aa|up_8|NC_016610.1_597246_599118_-	PRK05771, PRK05771, V-type ATP synthase subunit I; Validated	NA|207aa|up_7|NC_016610.1_599114_599735_-	PRK02195, PRK02195, V-type ATP synthase subunit D; Provisional	NA|442aa|up_6|NC_016610.1_599766_601092_-	PRK02118, PRK02118, V-type ATP synthase subunit B; Provisional	NA|586aa|up_5|NC_016610.1_601098_602856_-	PRK04192, PRK04192, V-type ATP synthase subunit A; Provisional	NA|290aa|up_4|NC_016610.1_602859_603729_-	pfam10962, DUF2764, Protein of unknown function (DUF2764)	NA|197aa|up_3|NC_016610.1_603728_604319_-	PRK01558, PRK01558, V-type ATP synthase subunit E; Provisional	NA|1054aa|up_2|NC_016610.1_604583_607745_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|214aa|up_1|NC_016610.1_607953_608595_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|662aa|up_0|NC_016610.1_609960_611946_-	cd00063, FN3, Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin	NA|102aa|down_0|NC_016610.1_612218_612524_-	cd10448, GIY-YIG_unchar_3, GIY-YIG domain of uncharacterized hypothetical protein found in bacteria	NA|50aa|down_1|NC_016610.1_613018_613168_-	NA	NA|3097aa|down_2|NC_016610.1_613744_623035_-	TIGR02542, T_forsyth_147, TANFOR domain	NA|561aa|down_3|NC_016610.1_623018_624701_-	NA	NA|827aa|down_4|NC_016610.1_625962_628443_-	sd00036, LRR_3, leucine-rich repeats	NA|599aa|down_5|NC_016610.1_629160_630957_-	sd00036, LRR_3, leucine-rich repeats	NA|193aa|down_6|NC_016610.1_631632_632211_-	NA	NA|86aa|down_7|NC_016610.1_632676_632934_-	NA	NA|334aa|down_8|NC_016610.1_632954_633956_-	cd03408, SPFH_like_u1, Uncharacterized family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|443aa|down_9|NC_016610.1_633998_635327_-	NA
GCF_000238215.1_ASM23821v1	NC_016610	Tannerella forsythia 92A2, complete sequence	2	2133979-2134062	2	CRISPRCasFinder	no		cas10,csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,DinG	Orphan	CATGATTCGGCGTGTACCACGCATAAC	27	1	2	2134006-2134035|2134006-2134035	NC_016610.1_2134359-2134388|NC_016610.1_2815462-2815433	NA	1	1	Orphan	cas10,csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,DinG	NA|81aa|up_0|NC_016610.1_2133578_2133821_+,NA	NA|117aa|up_9|NC_016610.1_2124367_2124718_+	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|446aa|up_8|NC_016610.1_2124693_2126031_+	cd03823, GT4_ExpE7-like, glycosyltransferase ExpE7 and similar proteins	NA|378aa|up_7|NC_016610.1_2126025_2127159_-	cd04955, GT4-like, glycosyltransferase family 4 proteins	NA|158aa|up_6|NC_016610.1_2127175_2127649_-	pfam02590, SPOUT_MTase, Predicted SPOUT methyltransferase	NA|124aa|up_5|NC_016610.1_2127845_2128217_+	pfam16022, DUF4783, Domain of unknown function (DUF4783)	NA|283aa|up_4|NC_016610.1_2128240_2129089_+	cd01572, QPRTase, Quinolinate phosphoribosyl transferase (QAPRTase or QPRTase), also called nicotinate-nucleotide pyrophosphorylase, is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes	NA|905aa|up_3|NC_016610.1_2129092_2131807_+	pfam13715, CarbopepD_reg_2, CarboxypepD_reg-like domain	NA|191aa|up_2|NC_016610.1_2131815_2132388_-	cd03358, LbH_WxcM_N_like, WcxM-like, Left-handed parallel beta-Helix (LbH) N-terminal domain: This group is composed of Xanthomonas campestris WcxM and proteins with similarity to the WcxM N-terminal domain	NA|235aa|up_1|NC_016610.1_2132409_2133114_-	cd11649, RsmI_like, uncharacterized subfamily of the tetrapyrrole methylase family similar to Ribosomal RNA small subunit methyltransferase I (RsmI)	NA|81aa|up_0|NC_016610.1_2133578_2133821_+	NA	NA|1078aa|down_0|NC_016610.1_2134584_2137818_-	sd00036, LRR_3, leucine-rich repeats	NA|83aa|down_1|NC_016610.1_2138339_2138588_-	cd12843, Bvu_2165_C_like, The C-terminal domain of uncharacterized bacterial proteins	NA|720aa|down_2|NC_016610.1_2140586_2142746_+	cd18037, DEXSc_Pif1_like, DEAD-box helicase domain of Pif1	NA|1227aa|down_3|NC_016610.1_2143595_2147276_+	PRK05297, PRK05297, phosphoribosylformylglycinamidine synthase; Provisional	NA|512aa|down_4|NC_016610.1_2147581_2149117_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|242aa|down_5|NC_016610.1_2149136_2149862_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|217aa|down_6|NC_016610.1_2150029_2150680_-	pfam16961, OmpA_like, Putative OmpA-OmpF-like porin family	NA|233aa|down_7|NC_016610.1_2150814_2151513_-	pfam01863, DUF45, Protein of unknown function DUF45	NA|180aa|down_8|NC_016610.1_2151869_2152409_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|332aa|down_9|NC_016610.1_2152592_2153588_+	pfam04773, FecR, FecR protein
GCF_000238215.1_ASM23821v1	NC_016610	Tannerella forsythia 92A2, complete sequence	3	2508338-2513170	1,3,1	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas5,cas8b4,cas7b,cas3,cas4,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2	cas10,csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,DinG	Type III-C,Type III-A,Type III-B,Type III-D	CTTTTAATCGGACTATCATAGAATTGAAA,CTTTTAATCGGACTATCATAGAATTGAAAC,CTTTTAATCGGACTATCATAGAATTGAAAC	29,30,30	0	0	NA	NA	I-A,II-B,III-A:I-A,II-B,III-A:I-A,II-B,III-A	72,72,72	72	TypeIII-C,TypeIII-A,TypeIII-B,TypeIII-D	cas10,csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,DinG	csx20|122aa|up_2|NC_016610.1_2506457_2506823_+,NA|61aa|down_0|NC_016610.1_2514817_2515000_+	cas4|177aa|up_9|NC_016610.1_2499903_2500434_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas10|481aa|up_8|NC_016610.1_2500480_2501923_+	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csm2gr11|157aa|up_7|NC_016610.1_2501928_2502399_+	pfam03750, Csm2_III-A, Csm2 Type III-A	csm3gr7|213aa|up_6|NC_016610.1_2502408_2503047_+	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm4gr5|332aa|up_5|NC_016610.1_2503046_2504042_+	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm5gr7|385aa|up_4|NC_016610.1_2504054_2505209_+	TIGR01899, cas_TM1807_csm5, CRISPR type III-A/MTUBE-associated RAMP protein Csm5	csx1|415aa|up_3|NC_016610.1_2505210_2506455_+	pfam09455, Cas_DxTHG, CRISPR-associated (Cas) DxTHG family	csx20|122aa|up_2|NC_016610.1_2506457_2506823_+	NA	cas1|339aa|up_1|NC_016610.1_2506847_2507864_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|88aa|up_0|NC_016610.1_2507863_2508127_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|61aa|down_0|NC_016610.1_2514817_2515000_+	NA	NA|175aa|down_1|NC_016610.1_2515919_2516444_+	cd01055, Nonheme_Ferritin, nonheme-containing ferritins	NA|734aa|down_2|NC_016610.1_2516707_2518909_-	pfam03030, H_PPase, Inorganic H+ pyrophosphatase	NA|379aa|down_3|NC_016610.1_2519561_2520698_-	cd06829, PLPDE_III_CANSDC, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Carboxynorspermidine Decarboxylase	NA|771aa|down_4|NC_016610.1_2520702_2523015_-	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|166aa|down_5|NC_016610.1_2523251_2523749_+	cd13831, HU, histone-like DNA-binding protein HU	NA|119aa|down_6|NC_016610.1_2523923_2524280_-	pfam02152, FolB, Dihydroneopterin aldolase	NA|140aa|down_7|NC_016610.1_2524368_2524788_+	pfam04519, Bactofilin, Polymer-forming cytoskeletal	NA|898aa|down_8|NC_016610.1_2524780_2527474_-	PLN02950, PLN02950, 4-alpha-glucanotransferase	NA|846aa|down_9|NC_016610.1_2527492_2530030_-	TIGR02504, ribonucleotide_reductase, ribonucleoside-diphosphate reductase, adenosylcobalamin-dependent
