assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000021565.1_ASM2156v1	NC_012440	Persephonella marina EX-H1, complete sequence	1	120836-121244	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no		csa3,cas3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2	Orphan	CTGTTGAAGTTTTCCCGAGATATAAGGGAATTGCGAC,GTTGAAGTTTTCCCGAGATATAAGGGAATTGCGAC,TTTCCCGAGANATAAGGGAATTGCGAC	37,35,27	0	0	NA	NA	NA:NA:NA	3,5,5	5	Orphan	csa3,cas3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2	NA|144aa|up_2|NC_012440.1_117558_117990_-,NA|72aa|down_0|NC_012440.1_121420_121636_+,NA|134aa|down_4|NC_012440.1_122764_123166_+	NA|320aa|up_9|NC_012440.1_113408_114368_-	TIGR00557, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase	NA|81aa|up_8|NC_012440.1_114371_114614_-	TIGR03011, sulf_tusB_dsrH, sulfur relay protein TusB/DsrH	NA|117aa|up_7|NC_012440.1_114607_114958_-	pfam02635, DrsE, DsrE/DsrF-like family	NA|101aa|up_6|NC_012440.1_114967_115270_-	pfam02635, DrsE, DsrE/DsrF-like family	NA|81aa|up_5|NC_012440.1_115266_115509_-	pfam01206, TusA, Sulfurtransferase TusA	NA|273aa|up_4|NC_012440.1_115519_116338_-	PRK08762, PRK08762, molybdopterin-synthase adenylyltransferase MoeB	NA|383aa|up_3|NC_012440.1_116413_117562_-	COG2070, COG2070, Dioxygenases related to 2-nitropropane dioxygenase [General function prediction only]	NA|144aa|up_2|NC_012440.1_117558_117990_-	NA	NA|567aa|up_1|NC_012440.1_118089_119790_+	TIGR00644, recJ, single-stranded-DNA-specific exonuclease RecJ	NA|201aa|up_0|NC_012440.1_119871_120474_+	cd03015, PRX_Typ2cys, Peroxiredoxin (PRX) family, Typical 2-Cys PRX subfamily; PRXs are thiol-specific antioxidant (TSA) proteins, which confer a protective role in cells through its peroxidase activity by reducing hydrogen peroxide, peroxynitrite, and organic hydroperoxides	NA|72aa|down_0|NC_012440.1_121420_121636_+	NA	NA|67aa|down_1|NC_012440.1_121682_121883_+	PRK00270, rpsU, 30S ribosomal protein S21; Reviewed	NA|149aa|down_2|NC_012440.1_121893_122340_+	pfam09424, YqeY, Yqey-like protein	NA|145aa|down_3|NC_012440.1_122342_122777_+	pfam02674, Colicin_V, Colicin V production protein	NA|134aa|down_4|NC_012440.1_122764_123166_+	NA	NA|160aa|down_5|NC_012440.1_123162_123642_-	pfam13686, DrsE_2, DsrE/DsrF/DrsH-like family	NA|440aa|down_6|NC_012440.1_123678_124998_-	TIGR00054, Putative_zinc_metalloprotease_slr1821, RIP metalloprotease RseP	NA|374aa|down_7|NC_012440.1_125012_126134_-	PRK05447, PRK05447, 1-deoxy-D-xylulose 5-phosphate reductoisomerase; Provisional	NA|202aa|down_8|NC_012440.1_126144_126750_-	TIGR02187, glutaredoxin-like_protein, Glutaredoxin-like domain protein	NA|234aa|down_9|NC_012440.1_126833_127535_-	PRK05888, PRK05888, NADH-quinone oxidoreductase subunit NuoI
GCF_000021565.1_ASM2156v1	NC_012440	Persephonella marina EX-H1, complete sequence	2	1254083-1254830	2,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2	csa3,cas3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2	Unclear	CTTTCTATCCCACTTAGTTCAAAGAAAAC,CTTTCTATCCCACTTAGTTCAAAGAAAAC,CTTTCTATCCCACTTAGTTCAAAGAAAAC	29,29,29	0	0	NA	NA	NA:NA:NA	11,11,10	11	Unclear	csa3,cas3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2	NA|90aa|up_2|NC_012440.1_1249940_1250210_+,NA|310aa|up_1|NC_012440.1_1250251_1251181_+,NA	NA|447aa|up_9|NC_012440.1_1243220_1244561_+	PRK01490, tig, trigger factor; Provisional	NA|203aa|up_8|NC_012440.1_1244564_1245173_+	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|412aa|up_7|NC_012440.1_1245175_1246411_+	PRK05342, clpX, ATP-dependent Clp protease ATP-binding subunit ClpX	NA|204aa|up_6|NC_012440.1_1246412_1247024_+	PRK00454, engB, GTP-binding protein YsxC; Reviewed	NA|239aa|up_5|NC_012440.1_1247131_1247848_+	PRK13111, trpA, tryptophan synthase subunit alpha; Provisional	NA|207aa|up_4|NC_012440.1_1247844_1248465_+	PRK13141, hisH, imidazole glycerol phosphate synthase subunit HisH; Provisional	NA|462aa|up_3|NC_012440.1_1248554_1249940_+	PRK09249, PRK09249, coproporphyrinogen dehydrogenase	NA|90aa|up_2|NC_012440.1_1249940_1250210_+	NA	NA|310aa|up_1|NC_012440.1_1250251_1251181_+	NA	NA|922aa|up_0|NC_012440.1_1251247_1254013_+	PRK00349, uvrA, excinuclease ABC subunit UvrA	cas6|266aa|down_0|NC_012440.1_1255187_1255985_+	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	cas8b2|568aa|down_1|NC_012440.1_1255984_1257688_+	cd09665, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas7|311aa|down_2|NC_012440.1_1257680_1258613_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	NA|83aa|down_3|NC_012440.1_1258683_1258932_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|93aa|down_4|NC_012440.1_1258932_1259211_+	COG3041, COG3041, Uncharacterized protein conserved in bacteria [Function unknown]	cas5|241aa|down_5|NC_012440.1_1259227_1259950_+	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas3|750aa|down_6|NC_012440.1_1259968_1262218_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|263aa|down_7|NC_012440.1_1262346_1263135_+	pfam10127, Nuc-transf, Predicted nucleotidyltransferase	cas4|171aa|down_8|NC_012440.1_1263119_1263632_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	NA|782aa|down_9|NC_012440.1_1263635_1265981_-	COG1401, McrB, GTPase subunit of restriction endonuclease [Defense mechanisms]
GCF_000021565.1_ASM2156v1	NC_012440	Persephonella marina EX-H1, complete sequence	3	1269028-1269447	3,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2	csa3,cas3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2	Unclear	AATTTATACCCCACTTGGTTCAGAAAAAAC,GTTTTTTCTGAACCAAGTGGGGTATAAA,GTTTTTTCTGAACCAAGTGGGGTATAAATT	30,28,30	0	0	NA	NA	NA:NA:NA	4,6,6	6	Unclear	csa3,cas3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2	NA,NA|126aa|down_0|NC_012440.1_1269585_1269963_+,NA|97aa|down_1|NC_012440.1_1270050_1270341_-,NA|484aa|down_2|NC_012440.1_1270402_1271854_-,NA|88aa|down_3|NC_012440.1_1272069_1272333_-,NA|69aa|down_4|NC_012440.1_1272492_1272699_-	NA|83aa|up_9|NC_012440.1_1258683_1258932_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|93aa|up_8|NC_012440.1_1258932_1259211_+	COG3041, COG3041, Uncharacterized protein conserved in bacteria [Function unknown]	cas5|241aa|up_7|NC_012440.1_1259227_1259950_+	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas3|750aa|up_6|NC_012440.1_1259968_1262218_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|263aa|up_5|NC_012440.1_1262346_1263135_+	pfam10127, Nuc-transf, Predicted nucleotidyltransferase	cas4|171aa|up_4|NC_012440.1_1263119_1263632_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	NA|782aa|up_3|NC_012440.1_1263635_1265981_-	COG1401, McrB, GTPase subunit of restriction endonuclease [Defense mechanisms]	NA|464aa|up_2|NC_012440.1_1266003_1267395_-	COG1700, COG1700, Uncharacterized conserved protein [Function unknown]	cas1|332aa|up_1|NC_012440.1_1267467_1268463_+	cd09722, Cas1_I-B, CRISPR/Cas system-associated protein Cas1	cas2|89aa|up_0|NC_012440.1_1268464_1268731_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|126aa|down_0|NC_012440.1_1269585_1269963_+	NA	NA|97aa|down_1|NC_012440.1_1270050_1270341_-	NA	NA|484aa|down_2|NC_012440.1_1270402_1271854_-	NA	NA|88aa|down_3|NC_012440.1_1272069_1272333_-	NA	NA|69aa|down_4|NC_012440.1_1272492_1272699_-	NA	NA|210aa|down_5|NC_012440.1_1272815_1273445_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|286aa|down_6|NC_012440.1_1273445_1274303_-	PRK11820, PRK11820, YicC family protein	NA|363aa|down_7|NC_012440.1_1274313_1275402_-	pfam01988, VIT1, VIT family	NA|399aa|down_8|NC_012440.1_1275398_1276595_-	PRK05749, PRK05749, 3-deoxy-D-manno-octulosonic-acid transferase; Reviewed	NA|245aa|down_9|NC_012440.1_1276594_1277329_-	pfam03649, UPF0014, Uncharacterized protein family (UPF0014)
GCF_000021565.1_ASM2156v1	NC_012440	Persephonella marina EX-H1, complete sequence	4	1302353-1303310	4,4,4	PILER-CR,CRISPRCasFinder,CRT	no		csa3,cas3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2	Orphan	CTTTCTATCCCACTTGGTTCAAAGAAAAC,GTTTTCTTTGAACCAAGTGGGATAGAAAG,GTTTTCTTTGAACCAAGTGGGATAGAAAG	29,29,29	0	0	NA	NA	NA:NA:NA	14,14,14	14	Orphan	csa3,cas3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2	NA|168aa|up_9|NC_012440.1_1292428_1292932_-,NA|475aa|up_0|NC_012440.1_1300584_1302009_-,NA|115aa|down_0|NC_012440.1_1304526_1304871_-,NA|321aa|down_1|NC_012440.1_1304901_1305864_-,NA|306aa|down_2|NC_012440.1_1305986_1306904_-	NA|168aa|up_9|NC_012440.1_1292428_1292932_-	NA	NA|296aa|up_8|NC_012440.1_1292961_1293849_-	cd16283, RomA-like_MBL-fold, Enterobacter cloacae RomA and related proteins; MBL-fold metallo hydrolase domain	NA|262aa|up_7|NC_012440.1_1293802_1294588_-	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|209aa|up_6|NC_012440.1_1294647_1295274_+	TIGR01091, Uracil_phosphoribosyltransferase, uracil phosphoribosyltransferase	NA|202aa|up_5|NC_012440.1_1295273_1295879_+	pfam01810, LysE, LysE type translocator	NA|288aa|up_4|NC_012440.1_1295880_1296744_+	cd05271, NDUFA9_like_SDR_a, NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, subunit 9, 39 kDa, (NDUFA9) -like, atypical (a) SDRs	NA|464aa|up_3|NC_012440.1_1297056_1298448_+	TIGR03015, pepcterm_ATPase, putative secretion ATPase, PEP-CTERM locus subfamily	NA|362aa|up_2|NC_012440.1_1298444_1299530_+	COG3034, COG3034, Uncharacterized protein conserved in bacteria [Function unknown]	NA|363aa|up_1|NC_012440.1_1299499_1300588_+	cd06243, M14_CP_Csd4-like, Peptidase M14 carboxypeptidase Csd4 and similar proteins	NA|475aa|up_0|NC_012440.1_1300584_1302009_-	NA	NA|115aa|down_0|NC_012440.1_1304526_1304871_-	NA	NA|321aa|down_1|NC_012440.1_1304901_1305864_-	NA	NA|306aa|down_2|NC_012440.1_1305986_1306904_-	NA	NA|258aa|down_3|NC_012440.1_1307385_1308159_-	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional	NA|97aa|down_4|NC_012440.1_1308305_1308596_-	cd13836, IHF_B, Beta subunit of integration host factor (IHFB)	NA|173aa|down_5|NC_012440.1_1308636_1309155_-	cd04645, LbH_gamma_CA_like, Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E	NA|162aa|down_6|NC_012440.1_1309261_1309747_-	pfam01242, PTPS, 6-pyruvoyl tetrahydropterin synthase	NA|260aa|down_7|NC_012440.1_1309758_1310538_-	cd06218, DHOD_e_trans, FAD/NAD binding domain in the electron transfer subunit of dihydroorotate dehydrogenase	NA|570aa|down_8|NC_012440.1_1310642_1312352_-	TIGR01812, Fumarate_reductase_flavoprotein_subunit, succinate dehydrogenase or fumarate reductase, flavoprotein subunitGram-negative/mitochondrial subgroup	NA|430aa|down_9|NC_012440.1_1312447_1313737_-	PRK00885, PRK00885, phosphoribosylamine--glycine ligase; Provisional
GCF_000021565.1_ASM2156v1	NC_012440	Persephonella marina EX-H1, complete sequence	5	1307009-1307096	5	CRISPRCasFinder	no		csa3,cas3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2	Orphan	CTTTGAACTAAGCGGGATAGAAA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2	NA|475aa|up_3|NC_012440.1_1300584_1302009_-,NA|115aa|up_2|NC_012440.1_1304526_1304871_-,NA|321aa|up_1|NC_012440.1_1304901_1305864_-,NA|306aa|up_0|NC_012440.1_1305986_1306904_-,NA	NA|209aa|up_9|NC_012440.1_1294647_1295274_+	TIGR01091, Uracil_phosphoribosyltransferase, uracil phosphoribosyltransferase	NA|202aa|up_8|NC_012440.1_1295273_1295879_+	pfam01810, LysE, LysE type translocator	NA|288aa|up_7|NC_012440.1_1295880_1296744_+	cd05271, NDUFA9_like_SDR_a, NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, subunit 9, 39 kDa, (NDUFA9) -like, atypical (a) SDRs	NA|464aa|up_6|NC_012440.1_1297056_1298448_+	TIGR03015, pepcterm_ATPase, putative secretion ATPase, PEP-CTERM locus subfamily	NA|362aa|up_5|NC_012440.1_1298444_1299530_+	COG3034, COG3034, Uncharacterized protein conserved in bacteria [Function unknown]	NA|363aa|up_4|NC_012440.1_1299499_1300588_+	cd06243, M14_CP_Csd4-like, Peptidase M14 carboxypeptidase Csd4 and similar proteins	NA|475aa|up_3|NC_012440.1_1300584_1302009_-	NA	NA|115aa|up_2|NC_012440.1_1304526_1304871_-	NA	NA|321aa|up_1|NC_012440.1_1304901_1305864_-	NA	NA|306aa|up_0|NC_012440.1_1305986_1306904_-	NA	NA|258aa|down_0|NC_012440.1_1307385_1308159_-	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional	NA|97aa|down_1|NC_012440.1_1308305_1308596_-	cd13836, IHF_B, Beta subunit of integration host factor (IHFB)	NA|173aa|down_2|NC_012440.1_1308636_1309155_-	cd04645, LbH_gamma_CA_like, Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E	NA|162aa|down_3|NC_012440.1_1309261_1309747_-	pfam01242, PTPS, 6-pyruvoyl tetrahydropterin synthase	NA|260aa|down_4|NC_012440.1_1309758_1310538_-	cd06218, DHOD_e_trans, FAD/NAD binding domain in the electron transfer subunit of dihydroorotate dehydrogenase	NA|570aa|down_5|NC_012440.1_1310642_1312352_-	TIGR01812, Fumarate_reductase_flavoprotein_subunit, succinate dehydrogenase or fumarate reductase, flavoprotein subunitGram-negative/mitochondrial subgroup	NA|430aa|down_6|NC_012440.1_1312447_1313737_-	PRK00885, PRK00885, phosphoribosylamine--glycine ligase; Provisional	NA|427aa|down_7|NC_012440.1_1313763_1315044_-	cd06828, PLPDE_III_DapDC, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Diaminopimelate Decarboxylase	NA|200aa|down_8|NC_012440.1_1315127_1315727_+	PRK00513, minC, septum formation inhibitor; Reviewed	NA|270aa|down_9|NC_012440.1_1315749_1316559_+	COG2894, MinD, Septum formation inhibitor-activating ATPase [Cell division and chromosome partitioning]
