assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_007165405.1_ASM716540v1	NZ_AP019795	Thermus thermophilus strain AA2-29 plasmid pAA229, complete sequence	1	4188-5812	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,c2c9_V-U4,Cas14u_CAS-V	Type I-E	CGGTCCATCCCCACGTGCGTGGGGACTAC,CGGTCCATCCCCACGTGCGTGGGGACTAC,CGGTCCATCCCCACGTGCGTGGGGACTAC	29,29,29	0	0	NA	NA	I-E,II-B:I-E,II-B:I-E,II-B	26,26,26	26	TypeI-E	csa3,cas2,DEDDh,cas3,Cas9_archaeal,WYL,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,c2c9_V-U4,Cas14u_CAS-V	NA|222aa|up_1|NZ_AP019795.1_2184_2850_-,NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|386aa|up_2|NZ_AP019795.1_1034_2192_-	cd01465, vWA_subgroup, VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|222aa|up_1|NZ_AP019795.1_2184_2850_-	NA	NA|315aa|up_0|NZ_AP019795.1_3056_4001_-	pfam01555, N6_N4_Mtase, DNA methylase	WYL|330aa|down_0|NZ_AP019795.1_5999_6989_+	pfam13280, WYL, WYL domain	cas3|920aa|down_1|NZ_AP019795.1_6985_9745_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|494aa|down_2|NZ_AP019795.1_9794_11276_+	TIGR02547, CRISPR_system_Cascade_subunit_CasA, CRISPR type I-E/ECOLI-associated protein CasA/Cse1	cse2gr11|164aa|down_3|NZ_AP019795.1_11272_11764_+	PRK13921, PRK13921, CRISPR-associated Cse2 family protein; Provisional	cas7|372aa|down_4|NZ_AP019795.1_11767_12883_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|225aa|down_5|NZ_AP019795.1_12884_13559_+	pfam09704, Cas_Cas5d, CRISPR-associated protein (Cas_Cas5)	cas6e|212aa|down_6|NZ_AP019795.1_13545_14181_+	cd09664, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	cas1|326aa|down_7|NZ_AP019795.1_14190_15168_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|126aa|down_8|NZ_AP019795.1_15121_15499_+	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	NA|251aa|down_9|NZ_AP019795.1_16512_17265_-	pfam03746, LamB_YcsF, LamB/YcsF family
GCF_007165405.1_ASM716540v1	NZ_AP019795	Thermus thermophilus strain AA2-29 plasmid pAA229, complete sequence	2	15560-16509	2,2,2	CRISPRCasFinder,CRT,PILER-CR	no	WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,c2c9_V-U4,Cas14u_CAS-V	Type I-E	GTAGTCCCCACGCGTGTGGGGATGGACCG,GTAGTCCCCACGCGTGTGGGGATGGACCG,GTAGTCCCCACGCGTGTGGGGATGGACCG	29,29,29	0	0	NA	NA	I-E,II-B:I-E,II-B:I-E,II-B	15,15,10	15	TypeI-E	csa3,cas2,DEDDh,cas3,Cas9_archaeal,WYL,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,c2c9_V-U4,Cas14u_CAS-V	NA,NA	NA|315aa|up_9|NZ_AP019795.1_3056_4001_-	pfam01555, N6_N4_Mtase, DNA methylase	WYL|330aa|up_8|NZ_AP019795.1_5999_6989_+	pfam13280, WYL, WYL domain	cas3|920aa|up_7|NZ_AP019795.1_6985_9745_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|494aa|up_6|NZ_AP019795.1_9794_11276_+	TIGR02547, CRISPR_system_Cascade_subunit_CasA, CRISPR type I-E/ECOLI-associated protein CasA/Cse1	cse2gr11|164aa|up_5|NZ_AP019795.1_11272_11764_+	PRK13921, PRK13921, CRISPR-associated Cse2 family protein; Provisional	cas7|372aa|up_4|NZ_AP019795.1_11767_12883_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|225aa|up_3|NZ_AP019795.1_12884_13559_+	pfam09704, Cas_Cas5d, CRISPR-associated protein (Cas_Cas5)	cas6e|212aa|up_2|NZ_AP019795.1_13545_14181_+	cd09664, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	cas1|326aa|up_1|NZ_AP019795.1_14190_15168_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|126aa|up_0|NZ_AP019795.1_15121_15499_+	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	NA|251aa|down_0|NZ_AP019795.1_16512_17265_-	pfam03746, LamB_YcsF, LamB/YcsF family	NA|229aa|down_1|NZ_AP019795.1_17277_17964_-	cd07729, AHL_lactonase_MBL-fold, quorum-quenching N-acyl-homoserine lactonase, MBL-fold metallo-hydrolase domain	NA|434aa|down_2|NZ_AP019795.1_17966_19268_-	TIGR00786, TRAP_transporter_permease_protein_SiaT, TRAP transporter, DctM subunit	NA|152aa|down_3|NZ_AP019795.1_19264_19720_-	pfam04290, DctQ, Tripartite ATP-independent periplasmic transporters, DctQ component	NA|319aa|down_4|NZ_AP019795.1_19719_20676_-	cd13602, PBP2_TRAP_BpDctp6_7, Substrate-binding domain of a pyroglutamic acid binding DctP subfamily of the tripartite ATP-independent periplasmic transporters; contains the type 2 periplasmic binding protein fold	NA|259aa|down_5|NZ_AP019795.1_21694_22471_-	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]	NA|258aa|down_6|NZ_AP019795.1_22457_23231_-	COG1116, TauB, ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|270aa|down_7|NZ_AP019795.1_23529_24339_-	COG1878, COG1878, Kynurenine formamidase [Amino acid transport and metabolism]	NA|278aa|down_8|NZ_AP019795.1_24437_25271_-	PRK00724, PRK00724, formate dehydrogenase accessory sulfurtransferase FdhD	NA|762aa|down_9|NZ_AP019795.1_25267_27553_-	cd02767, MopB_ydeP, The MopB_ydeP CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative molybdopterin cofactor binding site
GCF_007165405.1_ASM716540v1	NZ_AP019795	Thermus thermophilus strain AA2-29 plasmid pAA229, complete sequence	3	34770-35110	3,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas2	WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,c2c9_V-U4,Cas14u_CAS-V	Unclear	GTTGCAAGGGATTGAGCCCCGTAAGGGGATTGCGAC,GTTGCAAGGGATTGAGCCCCGTAAGGGGATTGCGAC,GTTGCAAGGGATTGAGCCCCGTAAGGGGATTGCGAC	36,36,36	0	0	NA	NA	III-A:III-A:III-A	3,4,4	4	Unclear	csa3,cas2,DEDDh,cas3,Cas9_archaeal,WYL,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,c2c9_V-U4,Cas14u_CAS-V	NA|60aa|up_0|NZ_AP019795.1_34538_34718_-,NA|67aa|down_7|NZ_AP019795.1_43390_43591_+	NA|762aa|up_9|NZ_AP019795.1_25267_27553_-	cd02767, MopB_ydeP, The MopB_ydeP CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative molybdopterin cofactor binding site	NA|444aa|up_8|NZ_AP019795.1_28424_29756_-	cd05379, CAP_bacterial, Bacterial CAP (cysteine-rich secretory proteins, antigen 5, and pathogenesis-related 1 proteins) domain proteins	NA|407aa|up_7|NZ_AP019795.1_29877_31098_+	pfam00872, Transposase_mut, Transposase, Mutator family	NA|132aa|up_6|NZ_AP019795.1_31142_31538_-	cd18683, PIN_VapC-like, Uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|87aa|up_5|NZ_AP019795.1_31534_31795_-	TIGR01439, Uncharacterized_protein_Mb2626, looped-hinge helix DNA binding domain, AbrB family	NA|135aa|up_4|NZ_AP019795.1_32285_32690_-	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|77aa|up_3|NZ_AP019795.1_32686_32917_-	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|81aa|up_2|NZ_AP019795.1_33248_33491_+	TIGR01439, Uncharacterized_protein_Mb2626, looped-hinge helix DNA binding domain, AbrB family	NA|152aa|up_1|NZ_AP019795.1_33471_33927_+	pfam13470, PIN_3, PIN domain	NA|60aa|up_0|NZ_AP019795.1_34538_34718_-	NA	NA|424aa|down_0|NZ_AP019795.1_35872_37144_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|220aa|down_1|NZ_AP019795.1_37140_37800_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|151aa|down_2|NZ_AP019795.1_37800_38253_-	pfam00034, Cytochrom_C, Cytochrome c	NA|308aa|down_3|NZ_AP019795.1_38359_39283_+	cd05819, NHL, NHL repeat unit of beta-propeller proteins	NA|375aa|down_4|NZ_AP019795.1_39292_40417_+	cd10917, CE4_NodB_like_6s_7s, Catalytic NodB homology domain of rhizobial NodB-like proteins	NA|619aa|down_5|NZ_AP019795.1_40491_42348_+	pfam13520, AA_permease_2, Amino acid permease	NA|309aa|down_6|NZ_AP019795.1_42344_43271_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|67aa|down_7|NZ_AP019795.1_43390_43591_+	NA	NA|403aa|down_8|NZ_AP019795.1_43625_44834_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|274aa|down_9|NZ_AP019795.1_44845_45667_+	pfam01972, SDH_sah, Serine dehydrogenase proteinase
GCF_007165405.1_ASM716540v1	NZ_AP019794	Thermus thermophilus strain AA2-29	1	1197048-1197125	1	CRISPRCasFinder	no		csa3,cas2,DEDDh,cas3,Cas9_archaeal	Orphan	ATCCAAAGCCTGCGGCAGGAGATG	24	0	0	NA	NA	NA	1	1	Orphan	csa3,cas2,DEDDh,cas3,Cas9_archaeal,WYL,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,c2c9_V-U4,Cas14u_CAS-V	NA,NA|54aa|down_4|NZ_AP019794.1_1201255_1201417_+	NA|338aa|up_9|NZ_AP019794.1_1187564_1188578_-	PRK05479, PRK05479, ketol-acid reductoisomerase; Provisional	NA|171aa|up_8|NZ_AP019794.1_1188574_1189087_-	PRK11895, ilvH, acetolactate synthase 3 regulatory subunit; Reviewed	NA|563aa|up_7|NZ_AP019794.1_1189083_1190772_-	TIGR00118, Probable_acetolactate_synthase_large_subunit, acetolactate synthase, large subunit, biosynthetic type	NA|327aa|up_6|NZ_AP019794.1_1190927_1191908_-	TIGR04018, thioredoxin_reductase, putative bacillithiol system oxidoreductase, YpdA family	NA|157aa|up_5|NZ_AP019794.1_1192073_1192544_+	pfam12019, GspH, Type II transport protein GspH	NA|158aa|up_4|NZ_AP019794.1_1192599_1193073_+	pfam12019, GspH, Type II transport protein GspH	NA|194aa|up_3|NZ_AP019794.1_1193069_1193651_+	COG2165, PulG, Type II secretory pathway, pseudopilin PulG [Cell motility and secretion / Intracellular trafficking and secretion]	NA|234aa|up_2|NZ_AP019794.1_1193647_1194349_+	COG4795, PulJ, Type II secretory pathway, component PulJ [Intracellular trafficking and secretion]	NA|559aa|up_1|NZ_AP019794.1_1194359_1196036_+	pfam14341, PilX_N, PilX N-terminal	NA|117aa|up_0|NZ_AP019794.1_1196256_1196607_+	COG2165, PulG, Type II secretory pathway, pseudopilin PulG [Cell motility and secretion / Intracellular trafficking and secretion]	NA|181aa|down_0|NZ_AP019794.1_1197311_1197854_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|258aa|down_1|NZ_AP019794.1_1197875_1198649_-	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|367aa|down_2|NZ_AP019794.1_1198726_1199827_+	pfam13546, DDE_5, DDE superfamily endonuclease	NA|407aa|down_3|NZ_AP019794.1_1199886_1201107_+	pfam00872, Transposase_mut, Transposase, Mutator family	NA|54aa|down_4|NZ_AP019794.1_1201255_1201417_+	NA	NA|186aa|down_5|NZ_AP019794.1_1201515_1202073_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|219aa|down_6|NZ_AP019794.1_1202091_1202748_-	COG1354, scpA, Rec8/ScpA/Scc1-like protein (kleisin family) [Replication,    recombination, and repair]	NA|338aa|down_7|NZ_AP019794.1_1202744_1203758_-	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	NA|473aa|down_8|NZ_AP019794.1_1204082_1205501_+	PRK05478, PRK05478, 3-isopropylmalate dehydratase large subunit	NA|202aa|down_9|NZ_AP019794.1_1205514_1206120_+	PRK01641, leuD, 3-isopropylmalate dehydratase small subunit
GCF_007165405.1_ASM716540v1	NZ_AP019794	Thermus thermophilus strain AA2-29	2	1754953-1755043	2	CRISPRCasFinder	no		csa3,cas2,DEDDh,cas3,Cas9_archaeal	Orphan	TCCTAAAGGGGGGTAAAGGGGGG	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas2,DEDDh,cas3,Cas9_archaeal,WYL,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,c2c9_V-U4,Cas14u_CAS-V	NA|94aa|up_4|NZ_AP019794.1_1750116_1750398_+,NA|164aa|up_3|NZ_AP019794.1_1750512_1751004_+,NA|50aa|down_1|NZ_AP019794.1_1755970_1756120_-,NA|236aa|down_3|NZ_AP019794.1_1758876_1759584_+	NA|76aa|up_9|NZ_AP019794.1_1747454_1747682_-	PRK06870, secG, preprotein translocase subunit SecG; Reviewed	NA|396aa|up_8|NZ_AP019794.1_1747901_1749089_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|127aa|up_7|NZ_AP019794.1_1749100_1749481_-	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|85aa|up_6|NZ_AP019794.1_1749584_1749839_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|94aa|up_5|NZ_AP019794.1_1749825_1750107_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|94aa|up_4|NZ_AP019794.1_1750116_1750398_+	NA	NA|164aa|up_3|NZ_AP019794.1_1750512_1751004_+	NA	NA|407aa|up_2|NZ_AP019794.1_1751119_1752340_-	pfam00872, Transposase_mut, Transposase, Mutator family	NA|289aa|up_1|NZ_AP019794.1_1752356_1753223_+	pfam13362, Toprim_3, Toprim domain	NA|494aa|up_0|NZ_AP019794.1_1753209_1754691_+	pfam13148, DUF3987, Protein of unknown function (DUF3987)	NA|149aa|down_0|NZ_AP019794.1_1755527_1755974_-	cd09874, PIN_MT3492-like, VapC-like PIN domain of the hypothetical protein MT3492 of Mycobacterium tuberculosis CDC1551 and other uncharacterized, annotated PilT protein domain proteins	NA|50aa|down_1|NZ_AP019794.1_1755970_1756120_-	NA	NA|367aa|down_2|NZ_AP019794.1_1757521_1758622_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|236aa|down_3|NZ_AP019794.1_1758876_1759584_+	NA	NA|959aa|down_4|NZ_AP019794.1_1759894_1762771_+	pfam01139, RtcB, tRNA-splicing ligase RtcB	NA|488aa|down_5|NZ_AP019794.1_1762905_1764369_+	pfam09992, NAGPA, Phosphodiester glycosidase	NA|307aa|down_6|NZ_AP019794.1_1764385_1765306_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|110aa|down_7|NZ_AP019794.1_1765283_1765613_-	cd00562, NifX_NifB, This CD represents a family of iron-molybdenum cluster-binding proteins that includes NifB, NifX, and NifY, all of which are involved in the synthesis of an iron-molybdenum cofactor (FeMo-co) that binds the active site of the dinitrogenase enzyme	NA|158aa|down_8|NZ_AP019794.1_1765641_1766115_-	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|200aa|down_9|NZ_AP019794.1_1766124_1766724_-	COG2860, COG2860, Predicted membrane protein [Function unknown]
