assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000018105.1_ASM1810v1	NC_009925	Acaryochloris marina MBIC11017, complete sequence	1	2028067-2028372	1	CRISPRCasFinder	no		RT,csa3,PD-DExK,DEDDh,DinG,cas3,c2c5_V-U5,cas4	Orphan	AGTAGCCTGCCACCCGAAATCGGAAAGCTC	30	0	0	NA	NA	NA	4	4	Orphan	RT,csa3,PD-DExK,DEDDh,DinG,cas3,c2c5_V-U5,cas4,c2c9_V-U4,c2c8_V-U2,cas14j,Cas9_archaeal	NA|31aa|up_9|NC_009925.1_2018008_2018101_+,NA|172aa|up_4|NC_009925.1_2021520_2022036_-,NA|177aa|up_0|NC_009925.1_2025630_2026161_-,NA|105aa|down_8|NC_009925.1_2041459_2041774_-	NA|31aa|up_9|NC_009925.1_2018008_2018101_+	NA	NA|193aa|up_8|NC_009925.1_2018203_2018782_-	PRK00122, rimM, 16S rRNA-processing protein RimM; Provisional	NA|426aa|up_7|NC_009925.1_2018816_2020094_-	PRK09440, avtA, valine--pyruvate transaminase; Provisional	NA|255aa|up_6|NC_009925.1_2020227_2020992_-	COG2112, COG2112, Predicted Ser/Thr protein kinase [Signal transduction mechanisms]	NA|155aa|up_5|NC_009925.1_2021010_2021475_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|172aa|up_4|NC_009925.1_2021520_2022036_-	NA	NA|458aa|up_3|NC_009925.1_2022125_2023499_-	PRK09201, PRK09201, AtzE family amidohydrolase	NA|49aa|up_2|NC_009925.1_2023495_2023642_-	pfam13318, DUF4089, Protein of unknown function (DUF4089)	NA|504aa|up_1|NC_009925.1_2023867_2025379_+	COG0043, UbiD, 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases [Coenzyme metabolism]	NA|177aa|up_0|NC_009925.1_2025630_2026161_-	NA	NA|370aa|down_0|NC_009925.1_2028642_2029752_-	pfam13531, SBP_bac_11, Bacterial extracellular solute-binding protein	NA|243aa|down_1|NC_009925.1_2029760_2030489_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|574aa|down_2|NC_009925.1_2030522_2032244_-	cd01461, vWA_interalpha_trypsin_inhibitor, vWA_interalpha trypsin inhibitor (ITI): ITI is a glycoprotein composed of three polypeptides- two heavy chains and one light chain (bikunin)	NA|446aa|down_3|NC_009925.1_2032357_2033695_-	COG4402, COG4402, Uncharacterized protein conserved in bacteria [Function unknown]	NA|893aa|down_4|NC_009925.1_2033874_2036553_-	PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated	NA|522aa|down_5|NC_009925.1_2036952_2038518_+	PRK02504, PRK02504, NAD(P)H-quinone oxidoreductase subunit N	NA|350aa|down_6|NC_009925.1_2038698_2039748_-	cd06208, CYPOR_like_FNR, These ferredoxin reductases are related to the NADPH cytochrome p450 reductases (CYPOR), but lack the FAD-binding region connecting sub-domain	NA|407aa|down_7|NC_009925.1_2040113_2041334_+	PLN00093, PLN00093, geranylgeranyl diphosphate reductase; Provisional	NA|105aa|down_8|NC_009925.1_2041459_2041774_-	NA	NA|260aa|down_9|NC_009925.1_2042065_2042845_+	pfam02548, Pantoate_transf, Ketopantoate hydroxymethyltransferase
GCF_000018105.1_ASM1810v1	NC_009925	Acaryochloris marina MBIC11017, complete sequence	2	3218306-3218413	2	CRISPRCasFinder	no		RT,csa3,PD-DExK,DEDDh,DinG,cas3,c2c5_V-U5,cas4	Orphan	GTATCGTTGCCAGACCCACCATCTAAG	27	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,PD-DExK,DEDDh,DinG,cas3,c2c5_V-U5,cas4,c2c9_V-U4,c2c8_V-U2,cas14j,Cas9_archaeal	NA|84aa|up_8|NC_009925.1_3203032_3203284_+,NA|99aa|up_7|NC_009925.1_3203302_3203599_-,NA|104aa|down_1|NC_009925.1_3223879_3224191_-,NA|176aa|down_4|NC_009925.1_3228142_3228670_-,NA|118aa|down_8|NC_009925.1_3233261_3233615_-	NA|142aa|up_9|NC_009925.1_3202520_3202946_+	sd00006, TPR, Tetratricopeptide repeat	NA|84aa|up_8|NC_009925.1_3203032_3203284_+	NA	NA|99aa|up_7|NC_009925.1_3203302_3203599_-	NA	NA|296aa|up_6|NC_009925.1_3203973_3204861_+	PLN02578, PLN02578, hydrolase	NA|288aa|up_5|NC_009925.1_3204870_3205734_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|981aa|up_4|NC_009925.1_3206527_3209470_+	pfam08548, Peptidase_M10_C, Peptidase M10 serralysin C terminal	NA|251aa|up_3|NC_009925.1_3210192_3210945_+	COG0760, SurA, Parvulin-like peptidyl-prolyl isomerase [Posttranslational modification, protein turnover, chaperones]	NA|986aa|up_2|NC_009925.1_3210941_3213899_+	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]	NA|532aa|up_1|NC_009925.1_3213895_3215491_+	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|291aa|up_0|NC_009925.1_3216016_3216890_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|319aa|down_0|NC_009925.1_3222765_3223722_-	cd04770, HTH_HMRTR, Helix-Turn-Helix DNA binding domain of Heavy Metal Resistance transcription regulators	NA|104aa|down_1|NC_009925.1_3223879_3224191_-	NA	NA|551aa|down_2|NC_009925.1_3224596_3226249_-	cd07302, CHD, cyclase homology domain	NA|510aa|down_3|NC_009925.1_3226616_3228146_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|176aa|down_4|NC_009925.1_3228142_3228670_-	NA	NA|319aa|down_5|NC_009925.1_3229135_3230092_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|51aa|down_6|NC_009925.1_3230481_3230634_+	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|761aa|down_7|NC_009925.1_3230746_3233029_-	cd08023, GH16_laminarinase_like, Laminarinase, member of the glycosyl hydrolase family 16	NA|118aa|down_8|NC_009925.1_3233261_3233615_-	NA	NA|423aa|down_9|NC_009925.1_3233800_3235069_-	cd03800, GT4_sucrose_synthase, sucrose-phosphate synthase and similar proteins
GCF_000018105.1_ASM1810v1	NC_009925	Acaryochloris marina MBIC11017, complete sequence	3	4815174-4815426	1,3,1	PILER-CR,CRISPRCasFinder,CRT	no	c2c5_V-U5,DEDDh,RT	RT,csa3,PD-DExK,DEDDh,DinG,cas3,c2c5_V-U5,cas4	Unclear	CTTTCAACCCCCAACACCCTCAAAAGGACGTTGCGAC,CTTTCAACCCCCAACACCCTCAAAAGGACGTTGCGAC,CTTTCAACCCCCAACACCCTCAAAAGGACGTTGCGAC	37,37,37	0	0	NA	NA	NA:NA:NA	3,3,3	3	Unclear	RT,csa3,PD-DExK,DEDDh,DinG,cas3,c2c5_V-U5,cas4,c2c9_V-U4,c2c8_V-U2,cas14j,Cas9_archaeal	NA|185aa|up_8|NC_009925.1_4805853_4806408_+,NA|197aa|up_7|NC_009925.1_4806667_4807258_+,NA|51aa|up_2|NC_009925.1_4812134_4812287_-,c2c5_V-U5|206aa|down_0|NC_009925.1_4815957_4816575_-,NA|87aa|down_2|NC_009925.1_4817710_4817971_-,NA|134aa|down_5|NC_009925.1_4819574_4819976_-,NA|86aa|down_9|NC_009925.1_4823402_4823660_+	NA|508aa|up_9|NC_009925.1_4804067_4805591_-	CHL00195, ycf46, Ycf46; Provisional	NA|185aa|up_8|NC_009925.1_4805853_4806408_+	NA	NA|197aa|up_7|NC_009925.1_4806667_4807258_+	NA	NA|194aa|up_6|NC_009925.1_4807217_4807799_-	cd02969, PRX_like1, Peroxiredoxin (PRX)-like 1 family; hypothetical proteins that show sequence similarity to PRXs	NA|557aa|up_5|NC_009925.1_4808065_4809736_+	COG0578, GlpA, Glycerol-3-phosphate dehydrogenase [Energy production and conversion]	NA|311aa|up_4|NC_009925.1_4809764_4810697_+	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|386aa|up_3|NC_009925.1_4810963_4812121_+	COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]	NA|51aa|up_2|NC_009925.1_4812134_4812287_-	NA	NA|169aa|up_1|NC_009925.1_4812419_4812926_-	cd06661, GGCT_like, GGCT-like domains, also called AIG2-like family	NA|526aa|up_0|NC_009925.1_4812925_4814503_-	pfam16927, HisKA_7TM, N-terminal 7TM region of histidine kinase	c2c5_V-U5|206aa|down_0|NC_009925.1_4815957_4816575_-	NA	NA|112aa|down_1|NC_009925.1_4817378_4817714_-	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|87aa|down_2|NC_009925.1_4817710_4817971_-	NA	NA|184aa|down_3|NC_009925.1_4818075_4818627_-	pfam05685, Uma2, Putative restriction endonuclease	NA|314aa|down_4|NC_009925.1_4818693_4819635_-	cd17282, RMtype1_S_Eco16444ORF1681_TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Escherichia coli G4/9 S subunit (S	NA|134aa|down_5|NC_009925.1_4819574_4819976_-	NA	NA|113aa|down_6|NC_009925.1_4820071_4820410_-	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|138aa|down_7|NC_009925.1_4820397_4820811_-	pfam08814, XisH, XisH protein	NA|808aa|down_8|NC_009925.1_4820803_4823227_-	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|86aa|down_9|NC_009925.1_4823402_4823660_+	NA
GCF_000018105.1_ASM1810v1	NC_009932	Acaryochloris marina MBIC11017 plasmid pREB7, complete sequence	1	59551-59687	1	CRISPRCasFinder	no		DEDDh	Orphan	TAAGGTTTTCGGACCCTCTGGTTGTTTGATCGCGGTTAGAGGG	43	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,PD-DExK,DEDDh,DinG,cas3,c2c5_V-U5,cas4,c2c9_V-U4,c2c8_V-U2,cas14j,Cas9_archaeal	NA|207aa|up_7|NC_009932.1_45663_46284_-,NA|329aa|up_6|NC_009932.1_46574_47561_-,NA|237aa|up_3|NC_009932.1_51315_52026_+,NA|288aa|down_1|NC_009932.1_60765_61629_+,NA|219aa|down_2|NC_009932.1_61713_62370_+,NA|1042aa|down_3|NC_009932.1_62381_65507_+,NA|382aa|down_4|NC_009932.1_65510_66656_+,NA|426aa|down_5|NC_009932.1_66681_67959_+,NA|160aa|down_6|NC_009932.1_68025_68505_+,NA|143aa|down_7|NC_009932.1_68497_68926_+,NA|459aa|down_9|NC_009932.1_70604_71981_+	NA|605aa|up_9|NC_009932.1_41566_43381_+	TIGR00836, Ammonium_transporter, ammonium transporter	NA|501aa|up_8|NC_009932.1_44109_45612_+	cd11474, SLC5sbd_CHT, Na(+)- and Cl(-)-dependent choline cotransporter CHT and related proteins; solute-binding domain	NA|207aa|up_7|NC_009932.1_45663_46284_-	NA	NA|329aa|up_6|NC_009932.1_46574_47561_-	NA	NA|486aa|up_5|NC_009932.1_47868_49326_+	COG1596, Wza, Periplasmic protein involved in polysaccharide export, contains    SLBB domain of b-grasp fold [Cell wall/membrane/envelope biogenesis]	NA|491aa|up_4|NC_009932.1_49504_50977_-	pfam17815, PDZ_3, PDZ domain	NA|237aa|up_3|NC_009932.1_51315_52026_+	NA	NA|382aa|up_2|NC_009932.1_52368_53514_+	cd10283, MnuA_DNase1-like, Mycoplasma pulmonis MnuA nuclease-like	NA|1252aa|up_1|NC_009932.1_53965_57721_-	COG4372, COG4372, Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]	NA|424aa|up_0|NC_009932.1_58247_59519_-	TIGR02393, RNA_polymerase_sigma_factor_RpoD, RNA polymerase sigma factor RpoD, C-terminal domain	NA|79aa|down_0|NC_009932.1_60468_60705_+	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|288aa|down_1|NC_009932.1_60765_61629_+	NA	NA|219aa|down_2|NC_009932.1_61713_62370_+	NA	NA|1042aa|down_3|NC_009932.1_62381_65507_+	NA	NA|382aa|down_4|NC_009932.1_65510_66656_+	NA	NA|426aa|down_5|NC_009932.1_66681_67959_+	NA	NA|160aa|down_6|NC_009932.1_68025_68505_+	NA	NA|143aa|down_7|NC_009932.1_68497_68926_+	NA	NA|437aa|down_8|NC_009932.1_68922_70233_-	COG5659, COG5659, FOG: Transposase [DNA replication, recombination, and repair]	NA|459aa|down_9|NC_009932.1_70604_71981_+	NA
GCF_000018105.1_ASM1810v1	NC_009933	Acaryochloris marina MBIC11017 plasmid pREB8, complete sequence	1	90274-90411	1	CRISPRCasFinder	no		cas4	Orphan	TAAGGTTTTCGGACCCTCTGGTTGTTTGATCGCGGTTAGAGGG	43	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,PD-DExK,DEDDh,DinG,cas3,c2c5_V-U5,cas4,c2c9_V-U4,c2c8_V-U2,cas14j,Cas9_archaeal	NA|100aa|up_9|NC_009933.1_70372_70672_+,NA|207aa|up_7|NC_009933.1_77484_78105_-,NA|195aa|up_3|NC_009933.1_82217_82802_-,NA|288aa|down_1|NC_009933.1_91424_92288_+,NA|219aa|down_2|NC_009933.1_92372_93029_+,NA|296aa|down_4|NC_009933.1_96427_97315_+,NA|424aa|down_5|NC_009933.1_97346_98618_+,NA|158aa|down_6|NC_009933.1_98693_99167_+,NA|226aa|down_7|NC_009933.1_99173_99851_+,NA|451aa|down_8|NC_009933.1_99884_101237_+,NA|133aa|down_9|NC_009933.1_101348_101747_-	NA|100aa|up_9|NC_009933.1_70372_70672_+	NA	NA|697aa|up_8|NC_009933.1_70934_73025_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|207aa|up_7|NC_009933.1_77484_78105_-	NA	NA|508aa|up_6|NC_009933.1_78234_79758_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|425aa|up_5|NC_009933.1_79837_81112_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|83aa|up_4|NC_009933.1_81116_81365_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|195aa|up_3|NC_009933.1_82217_82802_-	NA	NA|518aa|up_2|NC_009933.1_82959_84513_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|1252aa|up_1|NC_009933.1_84678_88434_-	COG4372, COG4372, Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]	NA|424aa|up_0|NC_009933.1_88970_90242_-	TIGR02393, RNA_polymerase_sigma_factor_RpoD, RNA polymerase sigma factor RpoD, C-terminal domain	NA|94aa|down_0|NC_009933.1_91082_91364_+	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|288aa|down_1|NC_009933.1_91424_92288_+	NA	NA|219aa|down_2|NC_009933.1_92372_93029_+	NA	NA|1042aa|down_3|NC_009933.1_93040_96166_+	pfam13401, AAA_22, AAA domain	NA|296aa|down_4|NC_009933.1_96427_97315_+	NA	NA|424aa|down_5|NC_009933.1_97346_98618_+	NA	NA|158aa|down_6|NC_009933.1_98693_99167_+	NA	NA|226aa|down_7|NC_009933.1_99173_99851_+	NA	NA|451aa|down_8|NC_009933.1_99884_101237_+	NA	NA|133aa|down_9|NC_009933.1_101348_101747_-	NA
