assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000816145.1_ASM81614v1	NZ_CP007141	Pseudothermotoga hypogea DSM 11164 = NBRC 106472 strain DSM 11164 chromosome, complete genome	1	1057312-1058399	1,1,1,2	CRT,PILER-CR,CRISPRCasFinder,PILER-CR	no	csx1,cas2,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5	cas3,cas14k,csx1,cas2,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cmr1gr7,DEDDh,cas5,cas7,cas8b1,cas4,csa3,Cas14b_CAS-V-F	Type III-D,Type III-C,Type III-A,Type III-B	GTTTCCATCCCTCATAGGAGCCTTCTAAAC,AGTTTCCATCCCTCATAGGAGCCTTCTAAAC,GTTTCCATCCCTCATAGGAGCCTTCTAAAC,GTTTCCATCCCTCATAGGAGCCTTCTAAAC	30,31,30,30	0	0	NA	NA	NA:NA:NA:NA	15,11,14,11	15	TypeIII-D,TypeIII-C,TypeIII-A,TypeIII-B	cas3,cas14k,csx1,cas2,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cmr1gr7,DEDDh,cas5,cas7,cas8b1,cas4,csa3,Cas14b_CAS-V-F	NA|82aa|up_4|NZ_CP007141.1_1050055_1050301_+,NA|266aa|down_1|NZ_CP007141.1_1060098_1060896_-,NA|60aa|down_3|NZ_CP007141.1_1061587_1061767_-,cas6|144aa|down_7|NZ_CP007141.1_1063413_1063845_-	NA|593aa|up_9|NZ_CP007141.1_1041677_1043456_+	COG3408, GDB1, Glycogen debranching enzyme [Carbohydrate transport and metabolism]	NA|610aa|up_8|NZ_CP007141.1_1043472_1045302_+	pfam14871, GHL6, Hypothetical glycosyl hydrolase 6	NA|684aa|up_7|NZ_CP007141.1_1045355_1047407_-	COG1033, COG1033, Predicted exporters of the RND superfamily [General function prediction only]	NA|641aa|up_6|NZ_CP007141.1_1047431_1049354_-	COG0744, MrcB, Membrane carboxypeptidase (penicillin-binding protein) [Cell envelope biogenesis, outer membrane]	NA|181aa|up_5|NZ_CP007141.1_1049335_1049878_-	COG0350, Ada, Methylated DNA-protein cysteine methyltransferase [DNA replication, recombination, and repair]	NA|82aa|up_4|NZ_CP007141.1_1050055_1050301_+	NA	NA|113aa|up_3|NZ_CP007141.1_1050297_1050636_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|782aa|up_2|NZ_CP007141.1_1052366_1054712_+	PRK05261, PRK05261, phosphoketolase	NA|468aa|up_1|NZ_CP007141.1_1054758_1056162_-	COG2211, MelB, Na+/melibiose symporter and related transporters [Carbohydrate transport and metabolism]	NA|305aa|up_0|NZ_CP007141.1_1056191_1057106_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	csx1|476aa|down_0|NZ_CP007141.1_1058674_1060102_-	cd09747, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	NA|266aa|down_1|NZ_CP007141.1_1060098_1060896_-	NA	NA|229aa|down_2|NZ_CP007141.1_1060923_1061610_-	TIGR02985, Sig70_bacteroi1, RNA polymerase sigma-70 factor, Bacteroides expansion family 1	NA|60aa|down_3|NZ_CP007141.1_1061587_1061767_-	NA	cas2|88aa|down_4|NZ_CP007141.1_1062118_1062382_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas2|91aa|down_5|NZ_CP007141.1_1062378_1062651_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas6|251aa|down_6|NZ_CP007141.1_1062664_1063417_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas6|144aa|down_7|NZ_CP007141.1_1063413_1063845_-	NA	csm5gr7|387aa|down_8|NZ_CP007141.1_1064039_1065200_-	TIGR01899, cas_TM1807_csm5, CRISPR type III-A/MTUBE-associated RAMP protein Csm5	csm4gr5|320aa|down_9|NZ_CP007141.1_1065189_1066149_-	TIGR01903, Hypothetical_protein
GCF_000816145.1_ASM81614v1	NZ_CP007141	Pseudothermotoga hypogea DSM 11164 = NBRC 106472 strain DSM 11164 chromosome, complete genome	2	1072723-1073604	2,3,2	CRT,PILER-CR,CRISPRCasFinder	no	csx1,cas2,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cmr1gr7,DEDDh	cas3,cas14k,csx1,cas2,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cmr1gr7,DEDDh,cas5,cas7,cas8b1,cas4,csa3,Cas14b_CAS-V-F	Type III-D,Type III-B, Type III-B?,Type III-C,Type III-A	GTTTCCATCCCTCATAGGACCTCTCTAAAC,GTTTCCATCCCTCATAGGACCTCTCTAAAC,GTTTCCATCCCTCATAGGACCTCTCTAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	12,11,11	12	TypeIII-D,TypeIII-B,TypeIII-B?,TypeIII-C,TypeIII-A	cas3,cas14k,csx1,cas2,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cmr1gr7,DEDDh,cas5,cas7,cas8b1,cas4,csa3,Cas14b_CAS-V-F	cas6|144aa|up_7|NZ_CP007141.1_1063413_1063845_-,NA|53aa|down_0|NZ_CP007141.1_1074599_1074758_+,NA|104aa|down_8|NZ_CP007141.1_1083389_1083701_-,NA|132aa|down_9|NZ_CP007141.1_1083716_1084112_-	cas2|91aa|up_9|NZ_CP007141.1_1062378_1062651_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas6|251aa|up_8|NZ_CP007141.1_1062664_1063417_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas6|144aa|up_7|NZ_CP007141.1_1063413_1063845_-	NA	csm5gr7|387aa|up_6|NZ_CP007141.1_1064039_1065200_-	TIGR01899, cas_TM1807_csm5, CRISPR type III-A/MTUBE-associated RAMP protein Csm5	csm4gr5|320aa|up_5|NZ_CP007141.1_1065189_1066149_-	TIGR01903, Hypothetical_protein	csm3gr7|264aa|up_4|NZ_CP007141.1_1066164_1066956_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|162aa|up_3|NZ_CP007141.1_1066952_1067438_-	pfam03750, Csm2_III-A, Csm2 Type III-A	cas10|815aa|up_2|NZ_CP007141.1_1067452_1069897_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas1|330aa|up_1|NZ_CP007141.1_1069910_1070900_-	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	NA|468aa|up_0|NZ_CP007141.1_1070940_1072344_-	pfam18145, SAVED, SMODS-associated and fused to various effectors sensor domain	NA|53aa|down_0|NZ_CP007141.1_1074599_1074758_+	NA	cmr6gr7|254aa|down_1|NZ_CP007141.1_1074883_1075645_-	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	cmr5gr11|113aa|down_2|NZ_CP007141.1_1075651_1075990_-	pfam09701, Cas_Cmr5, CRISPR-associated protein (Cas_Cmr5)	cmr4gr7|290aa|down_3|NZ_CP007141.1_1075970_1076840_-	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr3gr5|345aa|down_4|NZ_CP007141.1_1076846_1077881_-	COG1769, COG1769, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cas10|797aa|down_5|NZ_CP007141.1_1077877_1080268_-	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr1gr7|462aa|down_6|NZ_CP007141.1_1080264_1081650_-	COG1367, COG1367, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csx1|457aa|down_7|NZ_CP007141.1_1081706_1083077_-	cd09728, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	NA|104aa|down_8|NZ_CP007141.1_1083389_1083701_-	NA	NA|132aa|down_9|NZ_CP007141.1_1083716_1084112_-	NA
GCF_000816145.1_ASM81614v1	NZ_CP007141	Pseudothermotoga hypogea DSM 11164 = NBRC 106472 strain DSM 11164 chromosome, complete genome	3	1134832-1139908	4,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas7,cas8b1,cas2,cas1,cas4,cas6	cas3,cas14k,csx1,cas2,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cmr1gr7,DEDDh,cas5,cas7,cas8b1,cas4,csa3,Cas14b_CAS-V-F	Type I-B	GTTTCCATTCCTCATAGATTCGATTGAAC,GTTTCCATTCCTCATAGATTCGATTGAAC,GTTTCCATTCCTCATAGATTCGATTGAAC	29,29,29	0	0	NA	NA	NA:NA:NA	77,77,77	77	TypeI-B	cas3,cas14k,csx1,cas2,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cmr1gr7,DEDDh,cas5,cas7,cas8b1,cas4,csa3,Cas14b_CAS-V-F	NA,NA	NA|92aa|up_9|NZ_CP007141.1_1125586_1125862_+	TIGR01215, Cell_division_topological_specificity_factor, cell division topological specificity factor MinE	NA|134aa|up_8|NZ_CP007141.1_1125869_1126271_+	pfam01894, UPF0047, Uncharacterized protein family UPF0047	NA|380aa|up_7|NZ_CP007141.1_1126246_1127386_-	cd00622, PLPDE_III_ODC, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Ornithine Decarboxylase	NA|319aa|up_6|NZ_CP007141.1_1127382_1128339_-	COG1956, COG1956, GAF domain-containing protein [Signal transduction mechanisms]	NA|139aa|up_5|NZ_CP007141.1_1128335_1128752_-	COG1959, COG1959, Predicted transcriptional regulator [Transcription]	NA|572aa|up_4|NZ_CP007141.1_1128757_1130473_-	cd14486, 3D_domain, 3D domain, named for 3 conserved aspartate residues, is found in mltA-like lytic transglycosylases and numerous other contexts	NA|219aa|up_3|NZ_CP007141.1_1130469_1131126_-	pfam09986, DUF2225, Uncharacterized protein conserved in bacteria (DUF2225)	NA|294aa|up_2|NZ_CP007141.1_1131169_1132051_-	PRK10416, PRK10416, signal recognition particle-docking protein FtsY; Provisional	NA|575aa|up_1|NZ_CP007141.1_1132054_1133779_-	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|167aa|up_0|NZ_CP007141.1_1134136_1134637_+	pfam03961, FapA, Flagellar Assembly Protein A	cas3|746aa|down_0|NZ_CP007141.1_1140342_1142580_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|233aa|down_1|NZ_CP007141.1_1142564_1143263_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas7|295aa|down_2|NZ_CP007141.1_1143259_1144144_-	TIGR02590, hypothetical_protein_MM_0563, CRISPR-associated protein Cas7/Csh2, subtype I-B/HMARI	cas8b1|578aa|down_3|NZ_CP007141.1_1144133_1145867_-	cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas2|88aa|down_4|NZ_CP007141.1_1145882_1146146_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|328aa|down_5|NZ_CP007141.1_1146219_1147203_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas4|163aa|down_6|NZ_CP007141.1_1147212_1147701_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas6|261aa|down_7|NZ_CP007141.1_1147709_1148492_-	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	NA|353aa|down_8|NZ_CP007141.1_1148850_1149909_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|340aa|down_9|NZ_CP007141.1_1149910_1150930_-	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region
GCF_000816145.1_ASM81614v1	NZ_CP007141	Pseudothermotoga hypogea DSM 11164 = NBRC 106472 strain DSM 11164 chromosome, complete genome	4	1233417-1235361	5,4,4	PILER-CR,CRISPRCasFinder,CRT	no	Cas14b_CAS-V-F,cas6,cas7,cas5,cas3,cas4,cas1,cas2,cas14k	cas3,cas14k,csx1,cas2,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cmr1gr7,DEDDh,cas5,cas7,cas8b1,cas4,csa3,Cas14b_CAS-V-F	Unclear	GTTTGATCTGAACTATGTGGGATGTGAAC,GTTTGATCTGAACTATGTGGGATGTGAAC,GTTTGATCTGAACTATGTGGGATGTGAAC	29,29,29	0	0	NA	NA	NA:NA:NA	28,29,29	29	TypeV	cas3,cas14k,csx1,cas2,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cmr1gr7,DEDDh,cas5,cas7,cas8b1,cas4,csa3,Cas14b_CAS-V-F	NA|185aa|up_6|NZ_CP007141.1_1226395_1226950_+,NA|86aa|down_0|NZ_CP007141.1_1235961_1236219_+,NA|50aa|down_8|NZ_CP007141.1_1245994_1246144_+,NA|105aa|down_9|NZ_CP007141.1_1246173_1246488_+	NA|191aa|up_9|NZ_CP007141.1_1223461_1224034_-	cd17517, RMtype1_S_EcoKI_StySPI-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR),similar to Escherichia coli str	cas6|273aa|up_8|NZ_CP007141.1_1224427_1225246_+	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|384aa|up_7|NZ_CP007141.1_1225247_1226399_+	cd09754, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	NA|185aa|up_6|NZ_CP007141.1_1226395_1226950_+	NA	cas7|385aa|up_5|NZ_CP007141.1_1226933_1228088_+	pfam01905, DevR, CRISPR-associated negative auto-regulator DevR/Csa2	cas5|268aa|up_4|NZ_CP007141.1_1228105_1228909_+	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas3|835aa|up_3|NZ_CP007141.1_1228910_1231415_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas4|171aa|up_2|NZ_CP007141.1_1231434_1231947_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|328aa|up_1|NZ_CP007141.1_1231953_1232937_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|89aa|up_0|NZ_CP007141.1_1232936_1233203_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|86aa|down_0|NZ_CP007141.1_1235961_1236219_+	NA	NA|570aa|down_1|NZ_CP007141.1_1236515_1238225_+	COG0579, COG0579, Predicted dehydrogenase [General function prediction only]	NA|551aa|down_2|NZ_CP007141.1_1238217_1239870_+	TIGR01372, sarcosine_oxidase_alpha_subunit, sarcosine oxidase, alpha subunit family, heterotetrameric form	NA|519aa|down_3|NZ_CP007141.1_1239844_1241401_+	cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins	NA|568aa|down_4|NZ_CP007141.1_1241411_1243115_+	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|239aa|down_5|NZ_CP007141.1_1243130_1243847_+	COG2188, PhnF, Transcriptional regulators [Transcription]	NA|197aa|down_6|NZ_CP007141.1_1243865_1244456_+	pfam06283, ThuA, Trehalose utilisation	NA|353aa|down_7|NZ_CP007141.1_1244474_1245533_+	pfam11175, DUF2961, Protein of unknown function (DUF2961)	NA|50aa|down_8|NZ_CP007141.1_1245994_1246144_+	NA	NA|105aa|down_9|NZ_CP007141.1_1246173_1246488_+	NA
