assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000015585.1_ASM1558v1	NC_008789	Halorhodospira halophila SL1, complete sequence	1	695980-696726	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas1,cas2,csx1,csx16,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6	csa3,DEDDh,DinG,cas1,cas2,csx1,csx16,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,Cas14u_CAS-V,RT,WYL,cas3,Cas9_archaeal	Type III-D,Type III-B,Type III-A,Type III-C	GTCTTAATCCCTTTTCCAACAGGGCTAGGTTCGGAC,GTCTTAATCCCTTTTCCAACAGGGCTAGGTTCGGAC,GTCTTAATCCCTTTTCCAACAGGGCTAGGTTCGGAC	36,36,36	0	0	NA	NA	NA:NA:NA	10,10,9	10	TypeIII-D,TypeIII-B,TypeIII-A,TypeIII-C	csa3,DEDDh,DinG,cas1,cas2,csx1,csx16,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,Cas14u_CAS-V,RT,WYL,cas3,Cas9_archaeal	NA|275aa|up_9|NC_008789.1_684796_685621_+,NA	NA|275aa|up_9|NC_008789.1_684796_685621_+	NA	NA|160aa|up_8|NC_008789.1_685674_686154_+	pfam03748, FliL, Flagellar basal body-associated protein FliL	NA|457aa|up_7|NC_008789.1_686229_687600_-	COG1115, AlsT, Na+/alanine symporter [Amino acid transport and metabolism]	NA|199aa|up_6|NC_008789.1_687665_688262_-	PRK03767, PRK03767, NAD(P)H:quinone oxidoreductase; Provisional	NA|126aa|up_5|NC_008789.1_688553_688931_-	pfam07238, PilZ, PilZ domain	NA|291aa|up_4|NC_008789.1_688968_689841_-	PRK10792, PRK10792, bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase FolD	NA|63aa|up_3|NC_008789.1_690768_690957_-	TIGR02224, Tyrosine_recombinase_XerC, tyrosine recombinase XerC	NA|255aa|up_2|NC_008789.1_693167_693932_+	pfam01695, IstB_IS21, IstB-like ATP binding protein	NA|251aa|up_1|NC_008789.1_694910_695663_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|107aa|up_0|NC_008789.1_695605_695926_-	COG2963, COG2963, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|79aa|down_0|NC_008789.1_696949_697186_-	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	cas1|321aa|down_1|NC_008789.1_697187_698150_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|105aa|down_2|NC_008789.1_698260_698575_-	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|266aa|down_3|NC_008789.1_698567_699365_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|94aa|down_4|NC_008789.1_699378_699660_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	csx1|386aa|down_5|NC_008789.1_699884_701042_-	pfam09002, DUF1887, Domain of unknown function (DUF1887)	csx16|104aa|down_6|NC_008789.1_701071_701383_-	cd09743, Csx16_III-U, CRISPR/Cas system-associated protein Csx16	csx1|378aa|down_7|NC_008789.1_701615_702749_+	cd09741, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	cmr1gr7|330aa|down_8|NC_008789.1_702826_703816_+	cd09657, Cmr1_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr1	cas10|972aa|down_9|NC_008789.1_703830_706746_+	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10
GCF_000015585.1_ASM1558v1	NC_008789	Halorhodospira halophila SL1, complete sequence	2	711566-711671	2	CRISPRCasFinder	no	cas1,cas2,csx1,csx16,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,csa3	csa3,DEDDh,DinG,cas1,cas2,csx1,csx16,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,Cas14u_CAS-V,RT,WYL,cas3,Cas9_archaeal	Type III-D,Type III-B,Type III-A,Type III-C	GTCTGAGACGAGCCCTGCTTGAAAAGGGATTAAGAC	36	0	0	NA	NA	NA	1	1	TypeIII-D,TypeIII-B,TypeIII-A,TypeIII-C	csa3,DEDDh,DinG,cas1,cas2,csx1,csx16,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,Cas14u_CAS-V,RT,WYL,cas3,Cas9_archaeal	NA,NA|177aa|down_3|NC_008789.1_713544_714075_+,NA|327aa|down_7|NC_008789.1_717263_718244_-,NA|106aa|down_9|NC_008789.1_718800_719118_-	csx1|386aa|up_9|NC_008789.1_699884_701042_-	pfam09002, DUF1887, Domain of unknown function (DUF1887)	csx16|104aa|up_8|NC_008789.1_701071_701383_-	cd09743, Csx16_III-U, CRISPR/Cas system-associated protein Csx16	csx1|378aa|up_7|NC_008789.1_701615_702749_+	cd09741, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	cmr1gr7|330aa|up_6|NC_008789.1_702826_703816_+	cd09657, Cmr1_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr1	cas10|972aa|up_5|NC_008789.1_703830_706746_+	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr3gr5|384aa|up_4|NC_008789.1_706749_707901_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cmr4gr7|305aa|up_3|NC_008789.1_707915_708830_+	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr5gr11|126aa|up_2|NC_008789.1_708838_709216_+	cd09749, Cmr5_III-B, CRISPR/Cas system-associated protein Cmr5	cmr6gr7|404aa|up_1|NC_008789.1_709212_710424_+	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	cas6|305aa|up_0|NC_008789.1_710420_711335_+	pfam10040, CRISPR_Cas6, CRISPR-associated endoribonuclease Cas6	NA|99aa|down_0|NC_008789.1_711876_712173_+	pfam01610, DDE_Tnp_ISL3, Transposase	NA|293aa|down_1|NC_008789.1_712132_713011_-	pfam09670, Cas_Cas02710, CRISPR-associated protein (Cas_Cas02710)	csx16|101aa|down_2|NC_008789.1_713007_713310_-	cd09743, Csx16_III-U, CRISPR/Cas system-associated protein Csx16	NA|177aa|down_3|NC_008789.1_713544_714075_+	NA	NA|79aa|down_4|NC_008789.1_715470_715707_-	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|234aa|down_5|NC_008789.1_715690_716392_-	pfam02643, DUF192, Uncharacterized ACR, COG1430	NA|258aa|down_6|NC_008789.1_716481_717255_-	COG1484, DnaC, DNA replication protein [DNA replication, recombination, and repair]	NA|327aa|down_7|NC_008789.1_717263_718244_-	NA	NA|185aa|down_8|NC_008789.1_718243_718798_-	pfam11198, DUF2857, Protein of unknown function (DUF2857)	NA|106aa|down_9|NC_008789.1_718800_719118_-	NA
GCF_000015585.1_ASM1558v1	NC_008789	Halorhodospira halophila SL1, complete sequence	3	714988-715237	3,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas1,cas2,csx1,csx16,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,csa3	csa3,DEDDh,DinG,cas1,cas2,csx1,csx16,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,Cas14u_CAS-V,RT,WYL,cas3,Cas9_archaeal	Type III-D,Type III-B,Type III-A,Type III-C	GTCTTAATCCCTTTTCCAACAGGGCTAGGTTCGGAC,GTCTTAATCCCTTTTCCAACAGGGCTAGGTTCGGAC,GTCTTAATCCCTTTTCCAACAGGGCTAGGTTCGGAC	36,36,36	0	0	NA	NA	NA:NA:NA	3,3,2	3	TypeIII-D,TypeIII-B,TypeIII-A,TypeIII-C	csa3,DEDDh,DinG,cas1,cas2,csx1,csx16,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,Cas14u_CAS-V,RT,WYL,cas3,Cas9_archaeal	NA|177aa|up_0|NC_008789.1_713544_714075_+,NA|327aa|down_3|NC_008789.1_717263_718244_-,NA|106aa|down_5|NC_008789.1_718800_719118_-	cmr3gr5|384aa|up_9|NC_008789.1_706749_707901_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cmr4gr7|305aa|up_8|NC_008789.1_707915_708830_+	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr5gr11|126aa|up_7|NC_008789.1_708838_709216_+	cd09749, Cmr5_III-B, CRISPR/Cas system-associated protein Cmr5	cmr6gr7|404aa|up_6|NC_008789.1_709212_710424_+	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	cas6|305aa|up_5|NC_008789.1_710420_711335_+	pfam10040, CRISPR_Cas6, CRISPR-associated endoribonuclease Cas6	NA|73aa|up_4|NC_008789.1_711664_711883_-	pfam02643, DUF192, Uncharacterized ACR, COG1430	NA|99aa|up_3|NC_008789.1_711876_712173_+	pfam01610, DDE_Tnp_ISL3, Transposase	NA|293aa|up_2|NC_008789.1_712132_713011_-	pfam09670, Cas_Cas02710, CRISPR-associated protein (Cas_Cas02710)	csx16|101aa|up_1|NC_008789.1_713007_713310_-	cd09743, Csx16_III-U, CRISPR/Cas system-associated protein Csx16	NA|177aa|up_0|NC_008789.1_713544_714075_+	NA	NA|79aa|down_0|NC_008789.1_715470_715707_-	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|234aa|down_1|NC_008789.1_715690_716392_-	pfam02643, DUF192, Uncharacterized ACR, COG1430	NA|258aa|down_2|NC_008789.1_716481_717255_-	COG1484, DnaC, DNA replication protein [DNA replication, recombination, and repair]	NA|327aa|down_3|NC_008789.1_717263_718244_-	NA	NA|185aa|down_4|NC_008789.1_718243_718798_-	pfam11198, DUF2857, Protein of unknown function (DUF2857)	NA|106aa|down_5|NC_008789.1_718800_719118_-	NA	NA|332aa|down_6|NC_008789.1_719386_720382_-	TIGR03764, ICE_PFGI_1_parB, integrating conjugative element, PFGI_1 class, ParB family protein	NA|274aa|down_7|NC_008789.1_720418_721240_-	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|121aa|down_8|NC_008789.1_721700_722063_-	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|463aa|down_9|NC_008789.1_724625_726014_+	PRK06292, PRK06292, dihydrolipoamide dehydrogenase; Validated
GCF_000015585.1_ASM1558v1	NC_008789	Halorhodospira halophila SL1, complete sequence	4	795393-795492	4	CRISPRCasFinder	no	DEDDh	csa3,DEDDh,DinG,cas1,cas2,csx1,csx16,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,Cas14u_CAS-V,RT,WYL,cas3,Cas9_archaeal	Unclear	GGGGCGCGTCCTTCGCTCGACCCACGGGGC	30	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,DinG,cas1,cas2,csx1,csx16,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,Cas14u_CAS-V,RT,WYL,cas3,Cas9_archaeal	NA,NA|241aa|down_4|NC_008789.1_799825_800548_-	NA|293aa|up_9|NC_008789.1_786936_787815_+	cd05266, SDR_a4, atypical (a) SDRs, subgroup 4	NA|93aa|up_8|NC_008789.1_787845_788124_-	COG1254, AcyP, Acylphosphatases [Energy production and conversion]	NA|441aa|up_7|NC_008789.1_788120_789443_-	PRK01637, PRK01637, virulence factor BrkB family protein	NA|201aa|up_6|NC_008789.1_789570_790173_+	PRK03767, PRK03767, NAD(P)H:quinone oxidoreductase; Provisional	NA|126aa|up_5|NC_008789.1_790169_790547_+	pfam09842, DUF2069, Predicted membrane protein (DUF2069)	NA|242aa|up_4|NC_008789.1_790473_791199_-	TIGR03420, DnaA_homol_Hda, DnaA regulatory inactivator Hda	NA|240aa|up_3|NC_008789.1_791372_792092_+	pfam04338, DUF481, Protein of unknown function, DUF481	NA|369aa|up_2|NC_008789.1_792113_793220_-	pfam09839, DUF2066, Uncharacterized protein conserved in bacteria (DUF2066)	NA|348aa|up_1|NC_008789.1_793439_794483_+	PRK05385, PRK05385, phosphoribosylaminoimidazole synthetase; Provisional	NA|223aa|up_0|NC_008789.1_794479_795148_+	PRK05647, purN, phosphoribosylglycinamide formyltransferase; Reviewed	NA|256aa|down_0|NC_008789.1_795610_796378_+	pfam11306, DUF3108, Protein of unknown function (DUF3108)	NA|190aa|down_1|NC_008789.1_796387_796957_-	PRK00416, dcd, deoxycytidine triphosphate deaminase; Reviewed	NA|450aa|down_2|NC_008789.1_797067_798417_-	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|288aa|down_3|NC_008789.1_798852_799716_+	pfam02673, BacA, Bacitracin resistance protein BacA	NA|241aa|down_4|NC_008789.1_799825_800548_-	NA	NA|680aa|down_5|NC_008789.1_800620_802660_+	PRK00133, metG, methionyl-tRNA synthetase; Reviewed	NA|425aa|down_6|NC_008789.1_802971_804246_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|361aa|down_7|NC_008789.1_804357_805440_+	PRK05151, PRK05151, electron transport complex protein RsxA; Provisional	NA|682aa|down_8|NC_008789.1_805471_807517_+	PRK05035, PRK05035, electron transport complex protein RnfC; Provisional	NA|324aa|down_9|NC_008789.1_807519_808491_+	PRK00816, rnfD, electron transport complex protein RnfD; Reviewed
GCF_000015585.1_ASM1558v1	NC_008789	Halorhodospira halophila SL1, complete sequence	5	1261384-1262117	5,3	CRISPRCasFinder,PILER-CR	no	DinG	csa3,DEDDh,DinG,cas1,cas2,csx1,csx16,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,Cas14u_CAS-V,RT,WYL,cas3,Cas9_archaeal	Type IV-A	TCGATACCCCCCTTGTATGAGCCGGGTGG,CGATACCCCCCTTGTATGAGCCGGGTGG	29,28	0	0	NA	NA	NA:NA	8,8	8	Orphan	csa3,DEDDh,DinG,cas1,cas2,csx1,csx16,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,Cas14u_CAS-V,RT,WYL,cas3,Cas9_archaeal	NA|230aa|up_9|NC_008789.1_1245780_1246470_+,NA|262aa|up_6|NC_008789.1_1249927_1250713_-,NA|393aa|up_5|NC_008789.1_1250924_1252103_-,NA|894aa|down_1|NC_008789.1_1263623_1266305_+,NA|104aa|down_2|NC_008789.1_1267024_1267336_-	NA|230aa|up_9|NC_008789.1_1245780_1246470_+	NA	DinG|831aa|up_8|NC_008789.1_1246568_1249061_-	COG1204, COG1204, Superfamily II helicase [General function prediction only]	NA|58aa|up_7|NC_008789.1_1249658_1249832_+	pfam01610, DDE_Tnp_ISL3, Transposase	NA|262aa|up_6|NC_008789.1_1249927_1250713_-	NA	NA|393aa|up_5|NC_008789.1_1250924_1252103_-	NA	NA|1060aa|up_4|NC_008789.1_1252133_1255313_-	COG0610, COG0610, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|335aa|up_3|NC_008789.1_1255432_1256437_-	cd10447, GIY-YIG_unchar_2, GIY-YIG domain of uncharacterized hypothetical protein found in bacteria and archaea	NA|430aa|up_2|NC_008789.1_1256433_1257723_-	cd17266, RMtype1_S_Sau1132ORF3780P-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Staphylococcus aureus subsp	NA|316aa|up_1|NC_008789.1_1257719_1258667_-	pfam07751, Abi_2, Abi-like protein	NA|660aa|up_0|NC_008789.1_1258836_1260816_-	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|72aa|down_0|NC_008789.1_1263351_1263567_+	COG3311, AlpA, Predicted transcriptional regulator [Transcription]	NA|894aa|down_1|NC_008789.1_1263623_1266305_+	NA	NA|104aa|down_2|NC_008789.1_1267024_1267336_-	NA	NA|76aa|down_3|NC_008789.1_1267398_1267626_-	cd03169, GATase1_PfpI_1, Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus	NA|98aa|down_4|NC_008789.1_1268152_1268446_-	cd19138, AKR_YeaE, Escherichia coli YeaE and similar proteins	NA|175aa|down_5|NC_008789.1_1268608_1269133_+	COG1514, LigT, 2'-5' RNA ligase [Translation, ribosomal structure and biogenesis]	NA|108aa|down_6|NC_008789.1_1269367_1269691_+	COG2863, COG2863, Cytochrome c553 [Energy production and conversion]	NA|435aa|down_7|NC_008789.1_1269699_1271004_+	pfam09242, FCSD-flav_bind, Flavocytochrome c sulphide dehydrogenase, flavin-binding	NA|262aa|down_8|NC_008789.1_1271065_1271851_-	TIGR01583, Formate_dehydrogenase_cytochrome_b556_subunit, formate dehydrogenase, gamma subunit	NA|260aa|down_9|NC_008789.1_1271843_1272623_-	cd10551, PsrB, polysulfide reductase beta (PsrB) subunit
