assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000018265.1_ASM1826v1	NC_009953	Salinispora arenicola CNS-205, complete genome	1	2092637-2092738	1	CRISPRCasFinder	no		csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	Orphan	CGCCCGGCATGGTCGACGTGCGGCGCC	27	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	NA|72aa|up_5|NC_009953.1_2085029_2085245_+,NA|222aa|down_0|NC_009953.1_2093897_2094563_+,NA|102aa|down_2|NC_009953.1_2095204_2095510_+,NA|647aa|down_7|NC_009953.1_2100975_2102916_+	NA|278aa|up_9|NC_009953.1_2080939_2081773_-	cd00317, cyclophilin, cyclophilin: cyclophilin-type peptidylprolyl cis- trans isomerases	NA|295aa|up_8|NC_009953.1_2081812_2082697_-	cd00317, cyclophilin, cyclophilin: cyclophilin-type peptidylprolyl cis- trans isomerases	NA|239aa|up_7|NC_009953.1_2082938_2083655_+	cd06262, metallo-hydrolase-like_MBL-fold, mainly hydrolytic enzymes and related proteins which carry out various biological functions; MBL-fold metallohydrolase domain	NA|441aa|up_6|NC_009953.1_2083707_2085030_+	PRK00037, hisS, histidyl-tRNA synthetase; Reviewed	NA|72aa|up_5|NC_009953.1_2085029_2085245_+	NA	NA|312aa|up_4|NC_009953.1_2085241_2086177_+	cd01050, Acyl_ACP_Desat, Acyl ACP desaturase, ferritin-like diiron-binding domain	NA|605aa|up_3|NC_009953.1_2086345_2088160_+	PRK00476, aspS, aspartyl-tRNA synthetase; Validated	NA|199aa|up_2|NC_009953.1_2088330_2088927_+	pfam12840, HTH_20, Helix-turn-helix domain	NA|457aa|up_1|NC_009953.1_2088923_2090294_+	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|503aa|up_0|NC_009953.1_2090409_2091918_+	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|222aa|down_0|NC_009953.1_2093897_2094563_+	NA	NA|154aa|down_1|NC_009953.1_2094713_2095175_+	pfam06103, DUF948, Bacterial protein of unknown function (DUF948)	NA|102aa|down_2|NC_009953.1_2095204_2095510_+	NA	NA|893aa|down_3|NC_009953.1_2095506_2098185_+	PRK00252, alaS, alanyl-tRNA synthetase; Reviewed	NA|154aa|down_4|NC_009953.1_2098255_2098717_+	pfam03652, RuvX, Holliday junction resolvase	NA|402aa|down_5|NC_009953.1_2098713_2099919_+	pfam02618, YceG, YceG-like family	NA|276aa|down_6|NC_009953.1_2099908_2100736_+	PRK00258, aroE, shikimate 5-dehydrogenase; Reviewed	NA|647aa|down_7|NC_009953.1_2100975_2102916_+	NA	NA|181aa|down_8|NC_009953.1_2103007_2103550_+	cd04647, LbH_MAT_like, Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins	NA|557aa|down_9|NC_009953.1_2103542_2105213_+	TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
GCF_000018265.1_ASM1826v1	NC_009953	Salinispora arenicola CNS-205, complete genome	2	2220232-2220330	2	CRISPRCasFinder	no		csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	Orphan	GCTGTCAACGGGGGGCGTTGACA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	NA,NA|99aa|down_2|NC_009953.1_2223886_2224183_-,NA|83aa|down_3|NC_009953.1_2224179_2224428_-,NA|65aa|down_5|NC_009953.1_2226874_2227069_-,NA|336aa|down_6|NC_009953.1_2227399_2228407_-,NA|125aa|down_8|NC_009953.1_2229087_2229462_-,NA|89aa|down_9|NC_009953.1_2229461_2229728_-	NA|372aa|up_9|NC_009953.1_2209974_2211090_+	cd05305, L-AlaDH, Alanine dehydrogenase NAD-binding and catalytic domains	NA|346aa|up_8|NC_009953.1_2211134_2212172_+	PRK00283, xerD, tyrosine recombinase	NA|308aa|up_7|NC_009953.1_2212341_2213265_+	pfam13614, AAA_31, AAA domain	NA|343aa|up_6|NC_009953.1_2213285_2214314_+	COG1354, scpA, Rec8/ScpA/Scc1-like protein (kleisin family) [Replication,    recombination, and repair]	NA|270aa|up_5|NC_009953.1_2214306_2215116_+	pfam04079, SMC_ScpB, Segregation and condensation complex subunit ScpB	NA|257aa|up_4|NC_009953.1_2215102_2215873_+	COG1187, RsuA, 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases [Translation, ribosomal structure and biogenesis]	NA|227aa|up_3|NC_009953.1_2216045_2216726_+	PRK00023, cmk, (d)CMP kinase	NA|468aa|up_2|NC_009953.1_2216722_2218126_+	PRK03003, PRK03003, GTP-binding protein Der; Reviewed	NA|402aa|up_1|NC_009953.1_2218414_2219620_-	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|81aa|up_0|NC_009953.1_2219619_2219862_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|534aa|down_0|NC_009953.1_2220393_2221995_-	pfam13148, DUF3987, Protein of unknown function (DUF3987)	NA|378aa|down_1|NC_009953.1_2222748_2223882_-	pfam09250, Prim-Pol, Bifunctional DNA primase/polymerase, N-terminal	NA|99aa|down_2|NC_009953.1_2223886_2224183_-	NA	NA|83aa|down_3|NC_009953.1_2224179_2224428_-	NA	NA|738aa|down_4|NC_009953.1_2224570_2226784_-	COG1674, FtsK, DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning]	NA|65aa|down_5|NC_009953.1_2226874_2227069_-	NA	NA|336aa|down_6|NC_009953.1_2227399_2228407_-	NA	NA|222aa|down_7|NC_009953.1_2228425_2229091_-	cd03670, ADPRase_NUDT9, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose to AMP and ribose-5-P	NA|125aa|down_8|NC_009953.1_2229087_2229462_-	NA	NA|89aa|down_9|NC_009953.1_2229461_2229728_-	NA
GCF_000018265.1_ASM1826v1	NC_009953	Salinispora arenicola CNS-205, complete genome	3	2266659-2268213	3,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	Type I-E	CGGAGCACCCCCACGTGCGTGGGGAGGAC,CGGAGCACCCCCACGTGCGTGGGGAGGAC,GGAGCACCCCCACGTGCGTGGGGAGGAC	29,29,28	2	2	2267116-2267147|2267116-2267148	NC_009953.1_1369204-1369235|NC_009953.1_1369204-1369236	I-B,III-A,III-B:I-B,III-A,III-B:I-B,III-A,III-B	25,25,25	25	TypeI-E	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	NA|66aa|up_3|NC_009953.1_2262520_2262718_-,NA|104aa|down_9|NC_009953.1_2279905_2280217_-	NA|396aa|up_9|NC_009953.1_2253978_2255166_-	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|239aa|up_8|NC_009953.1_2256049_2256766_-	pfam02633, Creatininase, Creatinine amidohydrolase	NA|411aa|up_7|NC_009953.1_2256769_2258002_-	pfam13560, HTH_31, Helix-turn-helix domain	NA|469aa|up_6|NC_009953.1_2258280_2259687_+	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|351aa|up_5|NC_009953.1_2259683_2260736_+	COG2421, COG2421, Predicted acetamidase/formamidase [Energy production and conversion]	NA|227aa|up_4|NC_009953.1_2260732_2261413_+	pfam05719, GPP34, Golgi phosphoprotein 3 (GPP34)	NA|66aa|up_3|NC_009953.1_2262520_2262718_-	NA	NA|409aa|up_2|NC_009953.1_2262873_2264100_+	pfam00665, rve, Integrase core domain	NA|263aa|up_1|NC_009953.1_2264099_2264888_+	PRK06526, PRK06526, transposase; Provisional	NA|256aa|up_0|NC_009953.1_2265499_2266267_-	cd15457, NADAR, Escherichia coli swarming motility protein YbiA and related proteins	cas3|930aa|down_0|NC_009953.1_2268335_2271125_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|496aa|down_1|NC_009953.1_2271292_2272780_+	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cse2gr11|221aa|down_2|NC_009953.1_2272779_2273442_+	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas7|381aa|down_3|NC_009953.1_2273438_2274581_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|258aa|down_4|NC_009953.1_2274577_2275351_+	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas6e|207aa|down_5|NC_009953.1_2275347_2275968_+	cd09727, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	cas1|321aa|down_6|NC_009953.1_2275964_2276927_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|114aa|down_7|NC_009953.1_2276928_2277270_+	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	NA|305aa|down_8|NC_009953.1_2277750_2278665_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|104aa|down_9|NC_009953.1_2279905_2280217_-	NA
GCF_000018265.1_ASM1826v1	NC_009953	Salinispora arenicola CNS-205, complete genome	4	2277289-2277745	2,4,2	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	Type I-E	CGGAGCACCCCCACGTGCGTGGGGAGGAC,GTCCTCCCCACGCACGTGGGGGTGCTCCG,GTCCTCCCCACGCACGTGGGGGTGCTCCG	29,29,29	0	0	NA	NA	I-B,III-A,III-B:I-B,III-A,III-B:I-B,III-A,III-B	7,7,7	7	TypeI-E	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	NA,NA|104aa|down_1|NC_009953.1_2279905_2280217_-,NA|135aa|down_4|NC_009953.1_2285128_2285533_-,NA|81aa|down_5|NC_009953.1_2286643_2286886_+,NA|81aa|down_6|NC_009953.1_2287081_2287324_+,NA|192aa|down_7|NC_009953.1_2287466_2288042_-,NA|127aa|down_8|NC_009953.1_2288732_2289113_+	NA|263aa|up_9|NC_009953.1_2264099_2264888_+	PRK06526, PRK06526, transposase; Provisional	NA|256aa|up_8|NC_009953.1_2265499_2266267_-	cd15457, NADAR, Escherichia coli swarming motility protein YbiA and related proteins	cas3|930aa|up_7|NC_009953.1_2268335_2271125_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|496aa|up_6|NC_009953.1_2271292_2272780_+	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cse2gr11|221aa|up_5|NC_009953.1_2272779_2273442_+	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas7|381aa|up_4|NC_009953.1_2273438_2274581_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|258aa|up_3|NC_009953.1_2274577_2275351_+	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas6e|207aa|up_2|NC_009953.1_2275347_2275968_+	cd09727, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	cas1|321aa|up_1|NC_009953.1_2275964_2276927_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|114aa|up_0|NC_009953.1_2276928_2277270_+	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	NA|305aa|down_0|NC_009953.1_2277750_2278665_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|104aa|down_1|NC_009953.1_2279905_2280217_-	NA	NA|720aa|down_2|NC_009953.1_2281348_2283508_+	cd06974, TerD_like, Uncharacterized proteins involved in stress response, similar to tellurium resistance terD	NA|196aa|down_3|NC_009953.1_2283910_2284498_+	pfam13546, DDE_5, DDE superfamily endonuclease	NA|135aa|down_4|NC_009953.1_2285128_2285533_-	NA	NA|81aa|down_5|NC_009953.1_2286643_2286886_+	NA	NA|81aa|down_6|NC_009953.1_2287081_2287324_+	NA	NA|192aa|down_7|NC_009953.1_2287466_2288042_-	NA	NA|127aa|down_8|NC_009953.1_2288732_2289113_+	NA	NA|124aa|down_9|NC_009953.1_2289378_2289750_-	cd10170, HSP70_NBD, Nucleotide-binding domain of the HSP70 family
GCF_000018265.1_ASM1826v1	NC_009953	Salinispora arenicola CNS-205, complete genome	5	2278794-2279003	5,3	CRISPRCasFinder,CRT	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	Type I-E	GTCCTCCCCACGCACGTGGGGGTGCTCCG,GTCCTCCCCACGCACGTGGGGGTGCTCCGNN	29,31	0	0	NA	NA	I-B,III-A,III-B:I-B,III-A,III-B	3,3	3	TypeI-E	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	NA,NA|104aa|down_0|NC_009953.1_2279905_2280217_-,NA|135aa|down_3|NC_009953.1_2285128_2285533_-,NA|81aa|down_4|NC_009953.1_2286643_2286886_+,NA|81aa|down_5|NC_009953.1_2287081_2287324_+,NA|192aa|down_6|NC_009953.1_2287466_2288042_-,NA|127aa|down_7|NC_009953.1_2288732_2289113_+,NA|146aa|down_9|NC_009953.1_2291116_2291554_-	NA|256aa|up_9|NC_009953.1_2265499_2266267_-	cd15457, NADAR, Escherichia coli swarming motility protein YbiA and related proteins	cas3|930aa|up_8|NC_009953.1_2268335_2271125_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|496aa|up_7|NC_009953.1_2271292_2272780_+	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cse2gr11|221aa|up_6|NC_009953.1_2272779_2273442_+	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas7|381aa|up_5|NC_009953.1_2273438_2274581_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|258aa|up_4|NC_009953.1_2274577_2275351_+	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas6e|207aa|up_3|NC_009953.1_2275347_2275968_+	cd09727, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	cas1|321aa|up_2|NC_009953.1_2275964_2276927_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|114aa|up_1|NC_009953.1_2276928_2277270_+	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	NA|305aa|up_0|NC_009953.1_2277750_2278665_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|104aa|down_0|NC_009953.1_2279905_2280217_-	NA	NA|720aa|down_1|NC_009953.1_2281348_2283508_+	cd06974, TerD_like, Uncharacterized proteins involved in stress response, similar to tellurium resistance terD	NA|196aa|down_2|NC_009953.1_2283910_2284498_+	pfam13546, DDE_5, DDE superfamily endonuclease	NA|135aa|down_3|NC_009953.1_2285128_2285533_-	NA	NA|81aa|down_4|NC_009953.1_2286643_2286886_+	NA	NA|81aa|down_5|NC_009953.1_2287081_2287324_+	NA	NA|192aa|down_6|NC_009953.1_2287466_2288042_-	NA	NA|127aa|down_7|NC_009953.1_2288732_2289113_+	NA	NA|124aa|down_8|NC_009953.1_2289378_2289750_-	cd10170, HSP70_NBD, Nucleotide-binding domain of the HSP70 family	NA|146aa|down_9|NC_009953.1_2291116_2291554_-	NA
GCF_000018265.1_ASM1826v1	NC_009953	Salinispora arenicola CNS-205, complete genome	6	2394434-2395806	4,6,3	CRT,CRISPRCasFinder,PILER-CR	no		csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	Orphan	NNGGATCACCCCCGCCTGC,GGATCACCCCCGCCTGCGCGGGGAACAG,GGATCACCCCCGCCTGCGCGGGGAACAG	19,28,28	0	0	NA	NA	NA:I-E:I-E	22,22,21	22	Orphan	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	NA,NA	NA|2244aa|up_9|NC_009953.1_2375651_2382383_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|518aa|up_8|NC_009953.1_2382399_2383953_+	cd08503, PBP2_NikA_DppA_OppA_like_17, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|319aa|up_7|NC_009953.1_2383949_2384906_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|266aa|up_6|NC_009953.1_2384902_2385700_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|515aa|up_5|NC_009953.1_2385696_2387241_+	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|457aa|up_4|NC_009953.1_2387237_2388608_+	pfam08242, Methyltransf_12, Methyltransferase domain	NA|494aa|up_3|NC_009953.1_2388604_2390086_+	pfam08242, Methyltransf_12, Methyltransferase domain	NA|671aa|up_2|NC_009953.1_2390082_2392095_+	cd19535, Cyc_NRPS, Cyc (heterocyclization) domain of nonribosomal peptide synthetases (NRPSs); belongs to the Condensation-domain family	NA|307aa|up_1|NC_009953.1_2392175_2393096_-	COG0492, TrxB, Thioredoxin reductase [Posttranslational modification, protein turnover, chaperones]	NA|372aa|up_0|NC_009953.1_2393197_2394313_+	COG1748, LYS9, Saccharopine dehydrogenase and related proteins [Amino acid transport and metabolism]	NA|340aa|down_0|NC_009953.1_2396221_2397241_+	TIGR02716, C-20_methyltransferase, C-20 methyltransferase BchU	NA|416aa|down_1|NC_009953.1_2397978_2399226_+	cd01118, ArsB_permease, Anion permease ArsB	NA|142aa|down_2|NC_009953.1_2399528_2399954_-	COG4319, COG4319, Ketosteroid isomerase homolog [Function unknown]	NA|110aa|down_3|NC_009953.1_2400068_2400398_+	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|175aa|down_4|NC_009953.1_2400596_2401121_+	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|319aa|down_5|NC_009953.1_2401344_2402301_-	TIGR03395, Sphingomyelinase_C, sphingomyelin phosphodiesterase	NA|137aa|down_6|NC_009953.1_2402594_2403005_-	TIGR02246, hypothetical_protein, conserved hypothetical protein	NA|393aa|down_7|NC_009953.1_2403038_2404217_-	cd01159, NcnH, Naphthocyclinone hydroxylase	NA|262aa|down_8|NC_009953.1_2404362_2405148_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|437aa|down_9|NC_009953.1_2406011_2407322_+	PRK06185, PRK06185, FAD-dependent oxidoreductase
GCF_000018265.1_ASM1826v1	NC_009953	Salinispora arenicola CNS-205, complete genome	7	2935990-2937300	4,7,5	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	Type I-E	GGGATCACCCCCGCATGCGCGGGGAGCAG,CTGCTCCCCGCGCATGCGGGGGTGATCCC,CTGCTCCCCGCGCATGCGGGGGTGATCCC	29,29,29	0	0	NA	NA	I-E:I-E:I-E	21,21,21	21	TypeI-E	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	NA,NA	NA|231aa|up_9|NC_009953.1_2925243_2925936_+	PRK05625, PRK05625, 5-amino-6-(5-phosphoribosylamino)uracil reductase; Validated	NA|167aa|up_8|NC_009953.1_2925920_2926421_+	cd01284, Riboflavin_deaminase-reductase, Riboflavin-specific deaminase	cas3|964aa|up_7|NC_009953.1_2926539_2929431_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|563aa|up_6|NC_009953.1_2929646_2931335_+	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cse2gr11|227aa|up_5|NC_009953.1_2931331_2932012_+	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas7|388aa|up_4|NC_009953.1_2932014_2933178_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|241aa|up_3|NC_009953.1_2933174_2933897_+	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|228aa|up_2|NC_009953.1_2933905_2934589_+	pfam08798, CRISPR_assoc, CRISPR associated protein	cas1|323aa|up_1|NC_009953.1_2934585_2935554_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|137aa|up_0|NC_009953.1_2935553_2935964_+	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	NA|409aa|down_0|NC_009953.1_2937404_2938631_+	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|263aa|down_1|NC_009953.1_2938630_2939419_+	PRK06526, PRK06526, transposase; Provisional	NA|136aa|down_2|NC_009953.1_2940966_2941374_+	TIGR04188, conserved_hypothetical_protein, methyltransferase, ATP-grasp peptide maturase system	NA|647aa|down_3|NC_009953.1_2941477_2943418_-	TIGR03604, hypothetical_protein, thiazole/oxazole-forming peptide maturase, SagD family component	NA|252aa|down_4|NC_009953.1_2943414_2944170_-	COG0842, COG0842, ABC-type multidrug transport system, permease component [Defense mechanisms]	NA|309aa|down_5|NC_009953.1_2944166_2945093_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|399aa|down_6|NC_009953.1_2945089_2946286_-	cd06159, S2P-M50_PDZ_Arch, Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|356aa|down_7|NC_009953.1_2946282_2947350_-	pfam14028, Lant_dehydr_C, Lantibiotic biosynthesis dehydratase C-term	NA|920aa|down_8|NC_009953.1_2947346_2950106_-	pfam04738, Lant_dehydr_N, Lantibiotic dehydratase, C-terminus	NA|538aa|down_9|NC_009953.1_2950105_2951719_-	TIGR03605, antibiot_sagB, SagB-type dehydrogenase domain
GCF_000018265.1_ASM1826v1	NC_009953	Salinispora arenicola CNS-205, complete genome	8	2939492-2940920	5,8,6	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	Type I-E	GGATCACCCCCGCATGCGCGGGGAGCAG,CTGCTCCCCGCGCATGCGGGGGTGATCCC,CTGCTCCCCGCGCATGCGGGGGTGATCC	28,29,28	0	0	NA	NA	I-E:I-E:I-E	20,23,23	23	TypeI-E	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	NA,NA	cas3|964aa|up_9|NC_009953.1_2926539_2929431_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|563aa|up_8|NC_009953.1_2929646_2931335_+	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cse2gr11|227aa|up_7|NC_009953.1_2931331_2932012_+	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas7|388aa|up_6|NC_009953.1_2932014_2933178_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|241aa|up_5|NC_009953.1_2933174_2933897_+	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|228aa|up_4|NC_009953.1_2933905_2934589_+	pfam08798, CRISPR_assoc, CRISPR associated protein	cas1|323aa|up_3|NC_009953.1_2934585_2935554_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|137aa|up_2|NC_009953.1_2935553_2935964_+	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	NA|409aa|up_1|NC_009953.1_2937404_2938631_+	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|263aa|up_0|NC_009953.1_2938630_2939419_+	PRK06526, PRK06526, transposase; Provisional	NA|136aa|down_0|NC_009953.1_2940966_2941374_+	TIGR04188, conserved_hypothetical_protein, methyltransferase, ATP-grasp peptide maturase system	NA|647aa|down_1|NC_009953.1_2941477_2943418_-	TIGR03604, hypothetical_protein, thiazole/oxazole-forming peptide maturase, SagD family component	NA|252aa|down_2|NC_009953.1_2943414_2944170_-	COG0842, COG0842, ABC-type multidrug transport system, permease component [Defense mechanisms]	NA|309aa|down_3|NC_009953.1_2944166_2945093_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|399aa|down_4|NC_009953.1_2945089_2946286_-	cd06159, S2P-M50_PDZ_Arch, Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|356aa|down_5|NC_009953.1_2946282_2947350_-	pfam14028, Lant_dehydr_C, Lantibiotic biosynthesis dehydratase C-term	NA|920aa|down_6|NC_009953.1_2947346_2950106_-	pfam04738, Lant_dehydr_N, Lantibiotic dehydratase, C-terminus	NA|538aa|down_7|NC_009953.1_2950105_2951719_-	TIGR03605, antibiot_sagB, SagB-type dehydrogenase domain	NA|631aa|down_8|NC_009953.1_2951728_2953621_-	TIGR03604, hypothetical_protein, thiazole/oxazole-forming peptide maturase, SagD family component	NA|727aa|down_9|NC_009953.1_2953617_2955798_-	TIGR03693, ocin_ThiF_like, putative thiazole-containing bacteriocin maturation protein
GCF_000018265.1_ASM1826v1	NC_009953	Salinispora arenicola CNS-205, complete genome	9	3251231-3251337	9	CRISPRCasFinder	no		csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	Orphan	AGCCGGGTGCTCTACCACTGAGCTAC	26	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	NA|47aa|up_5|NC_009953.1_3243363_3243504_-,NA|176aa|up_2|NC_009953.1_3247868_3248396_-,NA|474aa|down_0|NC_009953.1_3251361_3252783_-,NA|128aa|down_2|NC_009953.1_3256532_3256916_-	NA|409aa|up_9|NC_009953.1_3238113_3239340_+	pfam00665, rve, Integrase core domain	NA|263aa|up_8|NC_009953.1_3239339_3240128_+	PRK06526, PRK06526, transposase; Provisional	NA|271aa|up_7|NC_009953.1_3240582_3241395_+	COG2159, COG2159, Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]	NA|269aa|up_6|NC_009953.1_3242225_3243032_-	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|47aa|up_5|NC_009953.1_3243363_3243504_-	NA	NA|339aa|up_4|NC_009953.1_3243937_3244954_-	pfam11271, DUF3068, Protein of unknown function (DUF3068)	NA|509aa|up_3|NC_009953.1_3245272_3246799_+	PRK07656, PRK07656, long-chain-fatty-acid--CoA ligase; Validated	NA|176aa|up_2|NC_009953.1_3247868_3248396_-	NA	NA|287aa|up_1|NC_009953.1_3248465_3249326_+	TIGR03339, phn_lysR, aminoethylphosphonate catabolism associated LysR family transcriptional regulator	NA|182aa|up_0|NC_009953.1_3249318_3249864_+	PRK00819, PRK00819, RNA 2'-phosphotransferase; Reviewed	NA|474aa|down_0|NC_009953.1_3251361_3252783_-	NA	NA|1204aa|down_1|NC_009953.1_3252921_3256533_-	COG1112, COG1112, Superfamily I DNA and RNA helicases and helicase subunits [DNA replication, recombination, and repair]	NA|128aa|down_2|NC_009953.1_3256532_3256916_-	NA	NA|257aa|down_3|NC_009953.1_3257177_3257948_-	pfam10127, Nuc-transf, Predicted nucleotidyltransferase	NA|224aa|down_4|NC_009953.1_3257947_3258619_-	pfam10127, Nuc-transf, Predicted nucleotidyltransferase	NA|186aa|down_5|NC_009953.1_3259578_3260136_-	smart00903, Flavin_Reduct, Flavin reductase like domain	NA|300aa|down_6|NC_009953.1_3260132_3261032_-	TIGR03213, Biphenyl-23-diol_12-dioxygenase_1, 2,3-dihydroxybiphenyl 1,2-dioxygenase	NA|284aa|down_7|NC_009953.1_3261028_3261880_-	TIGR03343, 2-hydroxy-6-oxo-6-phenylhexa-24-dienoate_hydrolase, 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase	NA|391aa|down_8|NC_009953.1_3261887_3263060_-	cd01159, NcnH, Naphthocyclinone hydroxylase	NA|378aa|down_9|NC_009953.1_3263184_3264318_+	cd03531, Rieske_RO_Alpha_KSH, The alignment model represents the N-terminal rieske iron-sulfur domain of KshA, the oxygenase component of 3-ketosteroid 9-alpha-hydroxylase (KSH)
GCF_000018265.1_ASM1826v1	NC_009953	Salinispora arenicola CNS-205, complete genome	10	5064111-5066458	6,10,7	PILER-CR,CRISPRCasFinder,CRT	no	csb1gr7,csb2gr5,cas3,cas8u1	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	Unclear	CCTTCACCGACCCAACGCGGTCGGTCTCCGTTGCGG,CCTTCACCGACCCAACGCGGTCGGTCTCCGTTGCGGG,CCTTCACCGACCCAACGCGGTCGGTCTCCGTTGCGG	36,37,36	0	0	NA	NA	NA:NA:NA	31,31,31	31	Unclear	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	NA|312aa|up_6|NC_009953.1_5053973_5054909_+,cas8u1|322aa|up_0|NC_009953.1_5062940_5063906_+,NA|107aa|down_0|NC_009953.1_5067348_5067669_+,NA|79aa|down_2|NC_009953.1_5069972_5070209_-,NA|172aa|down_3|NC_009953.1_5071265_5071781_-,NA|77aa|down_4|NC_009953.1_5072230_5072461_-,NA|468aa|down_9|NC_009953.1_5077065_5078469_+	NA|254aa|up_9|NC_009953.1_5051508_5052270_-	pfam05138, PaaA_PaaC, Phenylacetic acid catabolic protein	NA|107aa|up_8|NC_009953.1_5052266_5052587_-	PRK13781, paaB, phenylacetate-CoA oxygenase subunit PaaB; Provisional	NA|352aa|up_7|NC_009953.1_5052583_5053639_-	PRK13778, paaA, phenylacetate-CoA oxygenase subunit PaaA; Provisional	NA|312aa|up_6|NC_009953.1_5053973_5054909_+	NA	NA|313aa|up_5|NC_009953.1_5054908_5055847_+	pfam13354, Beta-lactamase2, Beta-lactamase enzyme family	NA|284aa|up_4|NC_009953.1_5055867_5056719_-	cd13634, PBP2_Sco4506, The conserved hypothetical protein SCO4506 exhibits the type 2 periplasmic-binidng protein fold	csb1gr7|385aa|up_3|NC_009953.1_5057489_5058644_+	cd09738, Csb1_I-U, CRISPR/Cas system-associated protein Csb1	csb2gr5|468aa|up_2|NC_009953.1_5058647_5060051_+	cd09734, Csb2_I-U, CRISPR/Cas system-associated protein Csb2	cas3|968aa|up_1|NC_009953.1_5060037_5062941_+	TIGR02621, CRISPR-associated_helicase_Cas3, CRISPR-associated helicase Cas3, subtype Dpsyc	cas8u1|322aa|up_0|NC_009953.1_5062940_5063906_+	NA	NA|107aa|down_0|NC_009953.1_5067348_5067669_+	NA	NA|681aa|down_1|NC_009953.1_5067784_5069827_+	COG3973, COG3973, Superfamily I DNA and RNA helicases [General function prediction only]	NA|79aa|down_2|NC_009953.1_5069972_5070209_-	NA	NA|172aa|down_3|NC_009953.1_5071265_5071781_-	NA	NA|77aa|down_4|NC_009953.1_5072230_5072461_-	NA	NA|220aa|down_5|NC_009953.1_5072734_5073394_+	COG1011, COG1011, Predicted hydrolase (HAD superfamily) [General function prediction only]	NA|441aa|down_6|NC_009953.1_5073390_5074713_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|285aa|down_7|NC_009953.1_5075051_5075906_+	COG0438, RfaG, Glycosyltransferase [Cell envelope biogenesis, outer membrane]	NA|379aa|down_8|NC_009953.1_5075932_5077069_+	pfam07883, Cupin_2, Cupin domain	NA|468aa|down_9|NC_009953.1_5077065_5078469_+	NA
GCF_000018265.1_ASM1826v1	NC_009953	Salinispora arenicola CNS-205, complete genome	11	5587840-5588355	11,8,7	CRISPRCasFinder,CRT,PILER-CR	no		csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	Orphan	GGATCACCCCCGCGTGTGCGGGGAACAG,GGATCACCCCCGCGTGTGCGGGGAACAG,GGATCACCCCCGCGTGTGCGGGGAACAG	28,28,28	0	0	NA	NA	I-C,I-E,II-B:I-C,I-E,II-B:I-C,I-E,II-B	8,8,6	8	Orphan	csa3,DinG,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas3HD,WYL,csb1gr7,csb2gr5,cas8u1	NA|68aa|up_8|NC_009953.1_5581296_5581500_+,NA|77aa|up_7|NC_009953.1_5581499_5581730_+,NA|97aa|up_3|NC_009953.1_5584433_5584724_+,NA|460aa|up_2|NC_009953.1_5585215_5586595_+,NA|95aa|down_0|NC_009953.1_5588647_5588932_-	NA|257aa|up_9|NC_009953.1_5580391_5581162_-	COG2944, COG2944, Predicted transcriptional regulator [Transcription]	NA|68aa|up_8|NC_009953.1_5581296_5581500_+	NA	NA|77aa|up_7|NC_009953.1_5581499_5581730_+	NA	NA|99aa|up_6|NC_009953.1_5581809_5582106_+	TIGR04186, conserved_hypothetical_protein, putative ATP-grasp target RiPP	NA|319aa|up_5|NC_009953.1_5582110_5583067_+	TIGR04187, conserved_hypothetical_protein, ATP-grasp ribosomal peptide maturase, SAV_5884 family	NA|410aa|up_4|NC_009953.1_5583063_5584293_+	TIGR04188, conserved_hypothetical_protein, methyltransferase, ATP-grasp peptide maturase system	NA|97aa|up_3|NC_009953.1_5584433_5584724_+	NA	NA|460aa|up_2|NC_009953.1_5585215_5586595_+	NA	NA|72aa|up_1|NC_009953.1_5586974_5587190_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|116aa|up_0|NC_009953.1_5587402_5587750_-	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|95aa|down_0|NC_009953.1_5588647_5588932_-	NA	NA|263aa|down_1|NC_009953.1_5589012_5589801_-	PRK06526, PRK06526, transposase; Provisional	NA|409aa|down_2|NC_009953.1_5589800_5591027_-	pfam00665, rve, Integrase core domain	NA|315aa|down_3|NC_009953.1_5592236_5593181_-	cd01104, HTH_MlrA-CarA, Helix-Turn-Helix DNA binding domain of the transcription regulators MlrA and CarA	NA|200aa|down_4|NC_009953.1_5593170_5593770_-	PRK03759, PRK03759, isopentenyl-diphosphate Delta-isomerase	NA|493aa|down_5|NC_009953.1_5593766_5595245_-	TIGR02734, Phytoene_desaturase_lycopene-forming, phytoene desaturase	NA|385aa|down_6|NC_009953.1_5595274_5596429_-	cd00685, Trans_IPPS_HT, Trans-Isoprenyl Diphosphate Synthases, head-to-tail	NA|304aa|down_7|NC_009953.1_5596492_5597404_+	cd00683, Trans_IPPS_HH, Trans-Isoprenyl Diphosphate Synthases, head-to-head	NA|424aa|down_8|NC_009953.1_5597562_5598834_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|556aa|down_9|NC_009953.1_5599142_5600810_+	cd05920, 23DHB-AMP_lg, 2,3-dihydroxybenzoate-AMP ligase
