assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_011399095.1_ASM1139909v1	NZ_AP022843	Halomonas hydrothermalis strain Slthf2	1	240140-242670	1,1,1,2	CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	WYL,cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2	WYL,cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,csx1,DEDDh,Cas9_archaeal,cas14j,csa3,PD-DExK,DinG,RT	Type I-E	GTCTTCCCCACGCCCGTGGGGGTGTTTCT,GTCTTCCCCACGCCCGTGGGGGTGTTTC,GTCTTCCCCACGCCCGTGGGGGTGTTTC,GTCTTCCCCACGCCCGTGGGGGTGTTTCC	29,28,28,29	0	0	NA	NA	I-B,III-A,III-B:I-B,III-A,III-B:I-B,III-A,III-B:I-B,III-A,III-B	41,41,37,37	41	TypeI-E	WYL,cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,csx1,DEDDh,Cas9_archaeal,cas14j,csa3,PD-DExK,DinG,RT	NA,NA|101aa|down_1|NZ_AP022843.1_243959_244262_-,NA|208aa|down_2|NZ_AP022843.1_244625_245249_-,NA|59aa|down_7|NZ_AP022843.1_250901_251078_-	NA|245aa|up_9|NZ_AP022843.1_229498_230233_-	cd03235, ABC_Metallic_Cations, ATP-binding cassette domain of the metal-type transporters	WYL|322aa|up_8|NZ_AP022843.1_230823_231789_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	cas3|860aa|up_7|NZ_AP022843.1_231890_234470_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|499aa|up_6|NZ_AP022843.1_234473_235970_+	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cse2gr11|184aa|up_5|NZ_AP022843.1_235975_236527_+	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas6e|215aa|up_4|NZ_AP022843.1_236523_237168_+	cd09727, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	cas7|349aa|up_3|NZ_AP022843.1_237184_238231_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|237aa|up_2|NZ_AP022843.1_238239_238950_+	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas1|292aa|up_1|NZ_AP022843.1_238952_239828_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|96aa|up_0|NZ_AP022843.1_239805_240093_+	cd09648, Cas2_I-E, CRISPR/Cas system-associated protein Cas2	NA|269aa|down_0|NZ_AP022843.1_243071_243878_+	PRK10334, PRK10334, small-conductance mechanosensitive channel MscS	NA|101aa|down_1|NZ_AP022843.1_243959_244262_-	NA	NA|208aa|down_2|NZ_AP022843.1_244625_245249_-	NA	NA|469aa|down_3|NZ_AP022843.1_245721_247128_+	PRK09469, glnA, glutamate--ammonia ligase	NA|353aa|down_4|NZ_AP022843.1_247500_248559_+	COG3852, NtrB, Signal transduction histidine kinase, nitrogen specific [Signal transduction mechanisms]	NA|477aa|down_5|NZ_AP022843.1_248555_249986_+	TIGR01818, Nitrogen_assimilation_regulatory_protein, nitrogen regulation protein NR(I)	NA|283aa|down_6|NZ_AP022843.1_250040_250889_-	pfam04187, Cofac_haem_bdg, Haem-binding uptake, Tiki superfamily, ChaN	NA|59aa|down_7|NZ_AP022843.1_250901_251078_-	NA	NA|171aa|down_8|NZ_AP022843.1_251617_252130_-	COG2945, COG2945, Predicted hydrolase of the alpha/beta superfamily [General function prediction only]	NA|45aa|down_9|NZ_AP022843.1_252135_252270_-	COG5510, COG5510, Predicted small secreted protein [Function unknown]
GCF_011399095.1_ASM1139909v1	NZ_AP022843	Halomonas hydrothermalis strain Slthf2	2	2562300-2562406	2	CRISPRCasFinder	no		WYL,cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,csx1,DEDDh,Cas9_archaeal,cas14j,csa3,PD-DExK,DinG,RT	Orphan	TGTTCAGTACTACATTTTTCCAATCTCTCCTGAAC	35	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,csx1,DEDDh,Cas9_archaeal,cas14j,csa3,PD-DExK,DinG,RT	NA|321aa|up_7|NZ_AP022843.1_2549092_2550055_-,NA|303aa|up_5|NZ_AP022843.1_2551398_2552307_-,NA|339aa|up_4|NZ_AP022843.1_2553953_2554970_-,NA|228aa|up_0|NZ_AP022843.1_2561599_2562283_-,NA|236aa|down_1|NZ_AP022843.1_2564096_2564804_-	NA|576aa|up_9|NZ_AP022843.1_2546402_2548130_+	cd01115, SLC13_permease, Permease SLC13 (solute carrier 13)	NA|159aa|up_8|NZ_AP022843.1_2548173_2548650_-	cd17874, FtsY, signal recognition particle receptor FtsY	NA|321aa|up_7|NZ_AP022843.1_2549092_2550055_-	NA	NA|263aa|up_6|NZ_AP022843.1_2550264_2551053_-	pfam13578, Methyltransf_24, Methyltransferase domain	NA|303aa|up_5|NZ_AP022843.1_2551398_2552307_-	NA	NA|339aa|up_4|NZ_AP022843.1_2553953_2554970_-	NA	NA|813aa|up_3|NZ_AP022843.1_2555042_2557481_-	cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold	NA|426aa|up_2|NZ_AP022843.1_2557498_2558776_-	PRK15182, PRK15182, Vi polysaccharide biosynthesis UDP-N-acetylglucosamine C-6 dehydrogenase TviB	NA|729aa|up_1|NZ_AP022843.1_2559416_2561603_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|228aa|up_0|NZ_AP022843.1_2561599_2562283_-	NA	NA|460aa|down_0|NZ_AP022843.1_2562638_2564018_-	pfam13701, DDE_Tnp_1_4, Transposase DDE domain group 1	NA|236aa|down_1|NZ_AP022843.1_2564096_2564804_-	NA	NA|441aa|down_2|NZ_AP022843.1_2565084_2566407_-	COG1004, Ugd, Predicted UDP-glucose 6-dehydrogenase [Cell envelope biogenesis, outer membrane]	NA|182aa|down_3|NZ_AP022843.1_2566448_2566994_-	pfam00908, dTDP_sugar_isom, dTDP-4-dehydrorhamnose 3,5-epimerase	NA|299aa|down_4|NZ_AP022843.1_2567076_2567973_-	TIGR01207, Glucose-1-phosphate_thymidylyltransferase_1, glucose-1-phosphate thymidylyltransferase, short form	NA|306aa|down_5|NZ_AP022843.1_2568646_2569564_-	pfam04321, RmlD_sub_bind, RmlD substrate binding domain	NA|375aa|down_6|NZ_AP022843.1_2570220_2571345_-	COG1088, RfbB, dTDP-D-glucose 4,6-dehydratase [Cell envelope biogenesis, outer membrane]	NA|294aa|down_7|NZ_AP022843.1_2571393_2572275_-	cd03220, ABC_KpsT_Wzt, ATP-binding cassette component of polysaccharide transport system	NA|117aa|down_8|NZ_AP022843.1_2572350_2572701_-	cd16377, 23S_rRNA_IVP_like, 23S rRNA-intervening sequence protein and similar proteins	NA|474aa|down_9|NZ_AP022843.1_2572893_2574315_-	COG1236, YSH1, Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis]
