assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000024725.1_ASM2472v1	NC_013960	Nitrosococcus halophilus Nc 4, complete sequence	1	148071-148167	1	CRISPRCasFinder	no		cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	Orphan	GCATGGGGCGTAATCCTGTTTTCCT	25	0	0	NA	NA	NA	1	1	Orphan	cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	NA|215aa|up_7|NC_013960.1_135230_135875_-,NA|89aa|up_6|NC_013960.1_135957_136224_-,NA|121aa|down_0|NC_013960.1_148174_148537_-,NA|96aa|down_1|NC_013960.1_148590_148878_-,NA|156aa|down_2|NC_013960.1_149057_149525_-,NA|539aa|down_4|NC_013960.1_151547_153164_-,NA|92aa|down_7|NC_013960.1_155861_156137_+,NA|95aa|down_9|NC_013960.1_158016_158301_-	NA|80aa|up_9|NC_013960.1_133756_133996_-	pfam13711, DUF4160, Domain of unknown function (DUF4160)	NA|89aa|up_8|NC_013960.1_134131_134398_-	pfam10387, DUF2442, Protein of unknown function (DUF2442)	NA|215aa|up_7|NC_013960.1_135230_135875_-	NA	NA|89aa|up_6|NC_013960.1_135957_136224_-	NA	NA|202aa|up_5|NC_013960.1_137749_138355_+	cd00540, AAG, Alkyladenine DNA glycosylase catalyzes the first step in base excision repair	NA|1855aa|up_4|NC_013960.1_138487_144052_+	COG0178, UvrA, Excinuclease ATPase subunit [DNA replication, recombination, and repair]	NA|262aa|up_3|NC_013960.1_144074_144860_+	COG2343, COG2343, Uncharacterized protein conserved in bacteria [Function unknown]	NA|202aa|up_2|NC_013960.1_144964_145570_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|420aa|up_1|NC_013960.1_145833_147093_-	COG0810, TonB, Periplasmic protein TonB, links inner and outer membranes [Cell envelope biogenesis, outer membrane]	NA|174aa|up_0|NC_013960.1_147133_147655_-	cd02423, Peptidase_C39G, A sub-family of peptidase family C39	NA|121aa|down_0|NC_013960.1_148174_148537_-	NA	NA|96aa|down_1|NC_013960.1_148590_148878_-	NA	NA|156aa|down_2|NC_013960.1_149057_149525_-	NA	NA|159aa|down_3|NC_013960.1_150119_150596_+	pfam05110, AF-4, AF-4 proto-oncoprotein	NA|539aa|down_4|NC_013960.1_151547_153164_-	NA	NA|429aa|down_5|NC_013960.1_153448_154735_+	pfam08014, DUF1704, Domain of unknown function (DUF1704)	NA|293aa|down_6|NC_013960.1_154798_155677_-	cd05266, SDR_a4, atypical (a) SDRs, subgroup 4	NA|92aa|down_7|NC_013960.1_155861_156137_+	NA	NA|390aa|down_8|NC_013960.1_156133_157303_-	COG1398, OLE1, Fatty-acid desaturase [Lipid metabolism]	NA|95aa|down_9|NC_013960.1_158016_158301_-	NA
GCF_000024725.1_ASM2472v1	NC_013960	Nitrosococcus halophilus Nc 4, complete sequence	2	1192984-1194875	2,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas3f,cas1,cas8f,cas5f,cas7f,cas6f	cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	Type I-F	GTTCGCCGCCGCGTAGGCGGTTTAGAAA,GTTCGCCGCCGCGTAGGCGGTTTAGAAA,GTTCGCCGCCGCGTAGGCGGTTTAGAAA	28,28,28	0	0	NA	NA	I-F:I-F:I-F	31,31,23	31	TypeI-F	cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	NA|148aa|up_9|NC_013960.1_1178630_1179074_-,NA	NA|148aa|up_9|NC_013960.1_1178630_1179074_-	NA	NA|847aa|up_8|NC_013960.1_1179756_1182297_-	pfam11854, MtrB_PioB, Putative outer membrane beta-barrel porin, MtrB/PioB	NA|326aa|up_7|NC_013960.1_1182244_1183222_-	TIGR03508, decahem_SO, decaheme c-type cytochrome, DmsE family	NA|284aa|up_6|NC_013960.1_1183259_1184111_+	PLN02839, PLN02839, nudix hydrolase	cas3f|1161aa|up_5|NC_013960.1_1184183_1187666_-	cd09673, Cas3_Cas2_I-F, CRISPR/Cas system-associated protein Cas3/Cas2	cas1|296aa|up_4|NC_013960.1_1187662_1188550_-	cd09718, Cas1_I-F, CRISPR/Cas system-associated protein Cas1	cas8f|453aa|up_3|NC_013960.1_1188870_1190229_+	pfam09611, Cas_Csy1, CRISPR-associated protein (Cas_Csy1)	cas5f|310aa|up_2|NC_013960.1_1190225_1191155_+	cd09736, Csy2_I-F, CRISPR/Cas system-associated RAMP superfamily protein Csy2	cas7f|353aa|up_1|NC_013960.1_1191177_1192236_+	cd09737, Csy3_I-F, CRISPR/Cas system-associated RAMP superfamily protein Csy3	cas6f|200aa|up_0|NC_013960.1_1192250_1192850_+	pfam09618, Cas_Csy4, CRISPR-associated protein (Cas_Csy4)	NA|389aa|down_0|NC_013960.1_1195088_1196255_-	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|387aa|down_1|NC_013960.1_1196257_1197418_-	COG0577, SalY, ABC-type antimicrobial peptide transport system, permease component [Defense mechanisms]	NA|232aa|down_2|NC_013960.1_1197445_1198141_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|408aa|down_3|NC_013960.1_1198145_1199369_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|171aa|down_4|NC_013960.1_1199567_1200080_+	COG1183, PssA, Phosphatidylserine synthase [Lipid metabolism]	NA|146aa|down_5|NC_013960.1_1200145_1200583_+	cd00293, USP_Like, Usp: Universal stress protein family	NA|297aa|down_6|NC_013960.1_1200720_1201611_+	PRK11320, prpB, 2-methylisocitrate lyase; Provisional	NA|375aa|down_7|NC_013960.1_1201607_1202732_+	cd06108, Ec2MCS_like, Escherichia coli (Ec) 2-methylcitrate synthase (2MCS)_like	NA|484aa|down_8|NC_013960.1_1202744_1204196_+	PRK09425, prpD, bifunctional 2-methylcitrate dehydratase/aconitate hydratase	NA|81aa|down_9|NC_013960.1_1204442_1204685_+	smart00966, SpoVT_AbrB, SpoVT / AbrB like domain
GCF_000024725.1_ASM2472v1	NC_013960	Nitrosococcus halophilus Nc 4, complete sequence	3	2010353-2010683	2,3,2	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	Type I-C,Type I-U, Type I-U?	GCATCACCCGTCCTTCGGGGCGGGTGTGGATTGAAAC,GCATCACCCGTCCTTCGGGGCGGGTGTGGATTGAAAC,GCATCACCCGTCCTTCGGGGCGGGTGTGGATTGAAAC	37,37,37	1	1	2010390-2010424	NC_013960.1_3820185-3820219	I-C:I-C:I-C	4,4,4	4	TypeI-C,TypeI-U,TypeI-U?	cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	NA,NA	NA|423aa|up_9|NC_013960.1_1998581_1999850_+	COG3211, PhoX, Predicted phosphatase [General function prediction only]	NA|562aa|up_8|NC_013960.1_1999965_2001651_-	cd05804, StaR_like, StaR_like; a well-conserved protein found in bacteria, plants, and animals	NA|156aa|up_7|NC_013960.1_2001775_2002243_+	pfam07486, Hydrolase_2, Cell Wall Hydrolase	cas3|743aa|up_6|NC_013960.1_2002591_2004820_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|226aa|up_5|NC_013960.1_2004875_2005553_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|583aa|up_4|NC_013960.1_2005549_2007298_+	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas7|295aa|up_3|NC_013960.1_2007312_2008197_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas4|210aa|up_2|NC_013960.1_2008221_2008851_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas1|345aa|up_1|NC_013960.1_2008835_2009870_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_0|NC_013960.1_2009884_2010175_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|410aa|down_0|NC_013960.1_2010702_2011932_-	COG0475, KefB, Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]	NA|107aa|down_1|NC_013960.1_2011921_2012242_-	TIGR02230, H+-transporting_ATP_synthase_gene_1, F0F1-ATPase subunit, putative	NA|128aa|down_2|NC_013960.1_2012246_2012630_-	PRK06228, PRK06228, F0F1 ATP synthase subunit epsilon; Validated	NA|471aa|down_3|NC_013960.1_2012641_2014054_-	PRK09280, PRK09280, F0F1 ATP synthase subunit beta; Validated	NA|170aa|down_4|NC_013960.1_2014363_2014873_+	cd00518, H2MP, Hydrogenase specific C-terminal endopeptidases, also called Hydrogen Maturation Proteases (H2MP)	NA|477aa|down_5|NC_013960.1_2014876_2016307_+	pfam06965, Na_H_antiport_1, Na+/H+ antiporter 1	NA|426aa|down_6|NC_013960.1_2016334_2017612_-	COG2270, COG2270, Permeases of the major facilitator superfamily [General function prediction only]	NA|405aa|down_7|NC_013960.1_2017659_2018874_-	cd06164, S2P-M50_SpoIVFB_CBS, SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation	NA|337aa|down_8|NC_013960.1_2019271_2020282_-	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]	NA|111aa|down_9|NC_013960.1_2020336_2020669_-	cd08976, BaFpgNei_N_4, Uncharacterized bacterial subgroup of the N-terminal domain of Fpg (formamidopyrimidine-DNA glycosylase, MutM)_Nei  (endonuclease VIII) base-excision repair DNA glycosylases
GCF_000024725.1_ASM2472v1	NC_013960	Nitrosococcus halophilus Nc 4, complete sequence	4	2905751-2906562	3,4,3	PILER-CR,CRISPRCasFinder,CRT	no	cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1	cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	Type III-B,Type III-C,Type III-A,Type III-D	GTTTGAAACGTAGACCTGATAAAGAAGGGATTAAGAC,GTTTGAAACGTAGACCTGATAAAGAAGGGATTAAGAC,GTTTGAAACGTAGACCTGATANAAGAAGGGATTAAGAC	37,37,38	0	0	NA	NA	NA:NA:NA	10,10,11	11	TypeIII-B,TypeIII-C,TypeIII-A,TypeIII-D	cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	NA,NA|263aa|down_3|NC_013960.1_2910367_2911156_+	cas10|862aa|up_9|NC_013960.1_2895100_2897686_+	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csm2gr11|154aa|up_8|NC_013960.1_2897700_2898162_+	pfam03750, Csm2_III-A, Csm2 Type III-A	csm3gr7|247aa|up_7|NC_013960.1_2898177_2898918_+	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm4gr5|321aa|up_6|NC_013960.1_2898914_2899877_+	TIGR01903, Hypothetical_protein	csm5gr7|386aa|up_5|NC_013960.1_2899876_2901034_+	TIGR01899, cas_TM1807_csm5, CRISPR type III-A/MTUBE-associated RAMP protein Csm5	NA|198aa|up_4|NC_013960.1_2901051_2901645_+	cd06260, DUF820, Domain of unknown function (DUF820)	csx1|430aa|up_3|NC_013960.1_2901690_2902980_+	cd09747, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	NA|179aa|up_2|NC_013960.1_2903143_2903680_+	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	csx1|207aa|up_1|NC_013960.1_2903792_2904413_+	cd09747, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	csx1|386aa|up_0|NC_013960.1_2904419_2905577_+	pfam09002, DUF1887, Domain of unknown function (DUF1887)	NA|538aa|down_0|NC_013960.1_2906624_2908238_+	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	cas2|95aa|down_1|NC_013960.1_2908804_2909089_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|326aa|down_2|NC_013960.1_2909393_2910371_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|263aa|down_3|NC_013960.1_2910367_2911156_+	NA	cas2|113aa|down_4|NC_013960.1_2911155_2911494_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|432aa|down_5|NC_013960.1_2911599_2912895_+	cd11313, AmyAc_arch_bac_AmyA, Alpha amylase catalytic domain found in archaeal and bacterial Alpha-amylases (also called 1,4-alpha-D-glucan-4-glucanohydrolase)	NA|603aa|down_6|NC_013960.1_2912930_2914739_+	COG3387, SGA1, Glucoamylase and related glycosyl hydrolases [Carbohydrate transport and metabolism]	NA|521aa|down_7|NC_013960.1_2914720_2916283_-	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|481aa|down_8|NC_013960.1_2916283_2917726_-	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|617aa|down_9|NC_013960.1_2917871_2919722_+	COG3387, SGA1, Glucoamylase and related glycosyl hydrolases [Carbohydrate transport and metabolism]
GCF_000024725.1_ASM2472v1	NC_013960	Nitrosococcus halophilus Nc 4, complete sequence	5	2908266-2908654	5,4,4	CRISPRCasFinder,CRT,PILER-CR	no	cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1	cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	Type III-B,Type III-C,Type III-A,Type III-D	GTTTGAAACGTAGACCTGATAAAGAAGGGATTAAGAC,GTTTGAAACGTAGACCTGATAAAGAAGGGATTAAGAC,GTTTGAAACGTAGACCTGATAAAGAAGGGATTAAGAC	37,37,37	0	0	NA	NA	NA:NA:NA	5,5,4	5	TypeIII-B,TypeIII-C,TypeIII-A,TypeIII-D	cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	NA,NA|263aa|down_2|NC_013960.1_2910367_2911156_+	csm2gr11|154aa|up_9|NC_013960.1_2897700_2898162_+	pfam03750, Csm2_III-A, Csm2 Type III-A	csm3gr7|247aa|up_8|NC_013960.1_2898177_2898918_+	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm4gr5|321aa|up_7|NC_013960.1_2898914_2899877_+	TIGR01903, Hypothetical_protein	csm5gr7|386aa|up_6|NC_013960.1_2899876_2901034_+	TIGR01899, cas_TM1807_csm5, CRISPR type III-A/MTUBE-associated RAMP protein Csm5	NA|198aa|up_5|NC_013960.1_2901051_2901645_+	cd06260, DUF820, Domain of unknown function (DUF820)	csx1|430aa|up_4|NC_013960.1_2901690_2902980_+	cd09747, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	NA|179aa|up_3|NC_013960.1_2903143_2903680_+	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	csx1|207aa|up_2|NC_013960.1_2903792_2904413_+	cd09747, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	csx1|386aa|up_1|NC_013960.1_2904419_2905577_+	pfam09002, DUF1887, Domain of unknown function (DUF1887)	NA|538aa|up_0|NC_013960.1_2906624_2908238_+	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	cas2|95aa|down_0|NC_013960.1_2908804_2909089_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|326aa|down_1|NC_013960.1_2909393_2910371_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|263aa|down_2|NC_013960.1_2910367_2911156_+	NA	cas2|113aa|down_3|NC_013960.1_2911155_2911494_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|432aa|down_4|NC_013960.1_2911599_2912895_+	cd11313, AmyAc_arch_bac_AmyA, Alpha amylase catalytic domain found in archaeal and bacterial Alpha-amylases (also called 1,4-alpha-D-glucan-4-glucanohydrolase)	NA|603aa|down_5|NC_013960.1_2912930_2914739_+	COG3387, SGA1, Glucoamylase and related glycosyl hydrolases [Carbohydrate transport and metabolism]	NA|521aa|down_6|NC_013960.1_2914720_2916283_-	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|481aa|down_7|NC_013960.1_2916283_2917726_-	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|617aa|down_8|NC_013960.1_2917871_2919722_+	COG3387, SGA1, Glucoamylase and related glycosyl hydrolases [Carbohydrate transport and metabolism]	NA|424aa|down_9|NC_013960.1_2919724_2920996_-	COG0334, GdhA, Glutamate dehydrogenase/leucine dehydrogenase [Amino acid transport and metabolism]
GCF_000024725.1_ASM2472v1	NC_013960	Nitrosococcus halophilus Nc 4, complete sequence	6	3003701-3005981	5,6,5	PILER-CR,CRISPRCasFinder,CRT	no	WYL,cas10d,csc2gr7,csc1gr5,cas3,cas4,cas1,cas2,cas14j	cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	Type I-D	GTTACGAATCCCTATAAGGGGTTATGAG,GTTACGAATCCCTATAAGGGGTTATGAG,GTTACGAATCCCTATAAGGGGTTATGAG	28,28,28	1	1	3005917-3005952	NC_013960.1_999736-999701	NA:NA:NA	35,35,35	35	TypeI-D,TypeV	cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	csc1gr5|190aa|up_4|NC_013960.1_2999220_2999790_+,NA	NA|391aa|up_9|NC_013960.1_2992488_2993661_-	COG3503, COG3503, Predicted membrane protein [Function unknown]	WYL|262aa|up_8|NC_013960.1_2993668_2994454_-	pfam13280, WYL, WYL domain	NA|243aa|up_7|NC_013960.1_2994528_2995257_+	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	cas10d|952aa|up_6|NC_013960.1_2995253_2998109_+	cd09712, Cas10d_I-D, CRISPR/Cas system-associated protein Cas10d	csc2gr7|333aa|up_5|NC_013960.1_2998111_2999110_+	cd09709, Csc2_I-D, CRISPR/Cas system-associated protein Csc2	csc1gr5|190aa|up_4|NC_013960.1_2999220_2999790_+	NA	cas3|663aa|up_3|NC_013960.1_2999780_3001769_+	smart00487, DEXDc, DEAD-like helicases superfamily	cas4|183aa|up_2|NC_013960.1_3001725_3002274_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|323aa|up_1|NC_013960.1_3002283_3003252_+	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas2|101aa|up_0|NC_013960.1_3003258_3003561_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|131aa|down_0|NC_013960.1_3008038_3008431_+	COG4190, COG4190, Predicted transcriptional regulator [Transcription]	NA|96aa|down_1|NC_013960.1_3008709_3008997_+	pfam14076, DUF4258, Domain of unknown function (DUF4258)	NA|87aa|down_2|NC_013960.1_3008999_3009260_+	TIGR03831, hypothetical_protein, YgiT-type zinc finger domain	NA|138aa|down_3|NC_013960.1_3009464_3009878_-	pfam01797, Y1_Tnp, Transposase IS200 like	cas14j|392aa|down_4|NC_013960.1_3009934_3011110_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|569aa|down_5|NC_013960.1_3011466_3013173_+	pfam02028, BCCT, BCCT, betaine/carnitine/choline family transporter	NA|228aa|down_6|NC_013960.1_3013258_3013942_-	COG0705, COG0705, Membrane associated serine protease [Amino acid transport and metabolism]	NA|253aa|down_7|NC_013960.1_3014145_3014904_+	COG1073, COG1073, Hydrolases of the alpha/beta superfamily [General function prediction only]	NA|254aa|down_8|NC_013960.1_3014863_3015625_-	pfam11249, DUF3047, Protein of unknown function (DUF3047)	NA|720aa|down_9|NC_013960.1_3015899_3018059_+	PRK06370, PRK06370, FAD-containing oxidoreductase
GCF_000024725.1_ASM2472v1	NC_013960	Nitrosococcus halophilus Nc 4, complete sequence	7	3007065-3007353	6,7,6	PILER-CR,CRISPRCasFinder,CRT	no	WYL,cas10d,csc2gr7,csc1gr5,cas3,cas4,cas1,cas2,cas14j	cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	Type I-D	GTTACGAATCCCTATAAGGGGTTATGAG,GTTACGAATCCCTATAAGGGGTTATGAG,GTTACGAATCCCTATAAGGGGTTATGAG	28,28,28	0	0	NA	NA	NA:NA:NA	4,4,4	4	TypeI-D,TypeV	cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	csc1gr5|190aa|up_4|NC_013960.1_2999220_2999790_+,NA	NA|391aa|up_9|NC_013960.1_2992488_2993661_-	COG3503, COG3503, Predicted membrane protein [Function unknown]	WYL|262aa|up_8|NC_013960.1_2993668_2994454_-	pfam13280, WYL, WYL domain	NA|243aa|up_7|NC_013960.1_2994528_2995257_+	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	cas10d|952aa|up_6|NC_013960.1_2995253_2998109_+	cd09712, Cas10d_I-D, CRISPR/Cas system-associated protein Cas10d	csc2gr7|333aa|up_5|NC_013960.1_2998111_2999110_+	cd09709, Csc2_I-D, CRISPR/Cas system-associated protein Csc2	csc1gr5|190aa|up_4|NC_013960.1_2999220_2999790_+	NA	cas3|663aa|up_3|NC_013960.1_2999780_3001769_+	smart00487, DEXDc, DEAD-like helicases superfamily	cas4|183aa|up_2|NC_013960.1_3001725_3002274_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|323aa|up_1|NC_013960.1_3002283_3003252_+	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas2|101aa|up_0|NC_013960.1_3003258_3003561_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|131aa|down_0|NC_013960.1_3008038_3008431_+	COG4190, COG4190, Predicted transcriptional regulator [Transcription]	NA|96aa|down_1|NC_013960.1_3008709_3008997_+	pfam14076, DUF4258, Domain of unknown function (DUF4258)	NA|87aa|down_2|NC_013960.1_3008999_3009260_+	TIGR03831, hypothetical_protein, YgiT-type zinc finger domain	NA|138aa|down_3|NC_013960.1_3009464_3009878_-	pfam01797, Y1_Tnp, Transposase IS200 like	cas14j|392aa|down_4|NC_013960.1_3009934_3011110_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|569aa|down_5|NC_013960.1_3011466_3013173_+	pfam02028, BCCT, BCCT, betaine/carnitine/choline family transporter	NA|228aa|down_6|NC_013960.1_3013258_3013942_-	COG0705, COG0705, Membrane associated serine protease [Amino acid transport and metabolism]	NA|253aa|down_7|NC_013960.1_3014145_3014904_+	COG1073, COG1073, Hydrolases of the alpha/beta superfamily [General function prediction only]	NA|254aa|down_8|NC_013960.1_3014863_3015625_-	pfam11249, DUF3047, Protein of unknown function (DUF3047)	NA|720aa|down_9|NC_013960.1_3015899_3018059_+	PRK06370, PRK06370, FAD-containing oxidoreductase
GCF_000024725.1_ASM2472v1	NC_013960	Nitrosococcus halophilus Nc 4, complete sequence	8	3128237-3128342	8	CRISPRCasFinder	no		cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	Orphan	GAGGTCCCGATTCGCTCCCGGCGAATCGGTG	31	0	0	NA	NA	NA	1	1	Orphan	cas14j,csa3,Cas9_archaeal,c2c9_V-U4,cas3,cas3f,cas1,cas8f,cas5f,cas7f,cas6f,Cas14u_CAS-V,cas5,cas8c,cas7,cas4,cas2,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,WYL,cas10d,csc2gr7,csc1gr5,RT,DEDDh	NA,NA|132aa|down_6|NC_013960.1_3135367_3135763_+,NA|260aa|down_7|NC_013960.1_3135755_3136535_-,NA|64aa|down_8|NC_013960.1_3136554_3136746_-	NA|370aa|up_9|NC_013960.1_3120664_3121774_+	COG2863, COG2863, Cytochrome c553 [Energy production and conversion]	NA|442aa|up_8|NC_013960.1_3121824_3123150_-	cd16954, HATPase_PhoQ-like, Histidine kinase-like ATPase domain of two-component sensor histidine kinases similar to Escherichia coli PhoQ and Providencia stuartii AarG	NA|220aa|up_7|NC_013960.1_3123155_3123815_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|111aa|up_6|NC_013960.1_3123814_3124147_-	COG3212, COG3212, Predicted membrane protein [Function unknown]	NA|104aa|up_5|NC_013960.1_3124118_3124430_-	COG3212, COG3212, Predicted membrane protein [Function unknown]	NA|97aa|up_4|NC_013960.1_3124652_3124943_+	pfam14242, DUF4342, Domain of unknown function (DUF4342)	NA|246aa|up_3|NC_013960.1_3125028_3125766_+	cd01400, 6PGL, 6PGL: 6-Phosphogluconolactonase (6PGL) subfamily; 6PGL catalyzes the second step of the oxidative phase of the pentose phosphate pathway, the hydrolyzation of 6-phosphoglucono-1,5-lactone (delta form) to 6-phosphogluconate	NA|336aa|up_2|NC_013960.1_3125762_3126770_+	pfam02685, Glucokinase, Glucokinase	NA|128aa|up_1|NC_013960.1_3126802_3127186_+	PRK05461, apaG, CO2+/MG2+ efflux protein ApaG; Reviewed	NA|279aa|up_0|NC_013960.1_3127292_3128129_+	PRK00166, apaH, symmetrical bis(5'-nucleosyl)-tetraphosphatase	NA|224aa|down_0|NC_013960.1_3129808_3130480_-	pfam09948, DUF2182, Predicted metal-binding integral membrane protein (DUF2182)	NA|245aa|down_1|NC_013960.1_3130572_3131307_-	cd05373, SDR_c10, classical (c) SDR, subgroup  10	NA|194aa|down_2|NC_013960.1_3131313_3131895_-	cd03379, beta_CA_cladeD, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|217aa|down_3|NC_013960.1_3131917_3132568_-	pfam06080, DUF938, Protein of unknown function (DUF938)	NA|532aa|down_4|NC_013960.1_3132818_3134414_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|299aa|down_5|NC_013960.1_3134425_3135322_-	smart00892, Endonuclease_NS, DNA/RNA non-specific endonuclease	NA|132aa|down_6|NC_013960.1_3135367_3135763_+	NA	NA|260aa|down_7|NC_013960.1_3135755_3136535_-	NA	NA|64aa|down_8|NC_013960.1_3136554_3136746_-	NA	NA|95aa|down_9|NC_013960.1_3137344_3137629_+	TIGR02684, conserved_hypothetical_protein, probable addiction module antidote protein
