assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003031405.1_ASM303140v1	CP020469	Halomonas sp. 'Soap Lake #6' chromosome, complete genome	1	849351-849451	1	CRISPRCasFinder	no		csa3,cas14j,WYL,RT,DEDDh,DinG,cas5f,cas7f,cas6f,cas3,cas3HD,cas5,cas8c,cas7,cas4,cas1,cas2	Orphan	ATCTCCTTTCCCGCTCCATAGCA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas14j,WYL,RT,DEDDh,DinG,cas5f,cas7f,cas6f,cas3,cas3HD,cas5,cas8c,cas7,cas4,cas1,cas2	NA,NA	NA|364aa|up_9|CP020469.1_838280_839372_+	cd03311, CIMS_C_terminal_like, CIMS - Cobalamine-independent methonine synthase, or MetE, C-terminal domain_like	NA|316aa|up_8|CP020469.1_839364_840312_+	pfam00990, GGDEF, Diguanylate cyclase, GGDEF domain	NA|302aa|up_7|CP020469.1_840375_841281_-	PRK05457, PRK05457, protease HtpX	NA|416aa|up_6|CP020469.1_841370_842618_-	PRK09265, PRK09265, aminotransferase AlaT; Validated	NA|318aa|up_5|CP020469.1_842675_843629_-	TIGR03596, GTPase_YlqF, ribosome biogenesis GTP-binding protein YlqF	NA|133aa|up_4|CP020469.1_843758_844157_-	PRK00222, PRK00222, peptide-methionine (R)-S-oxide reductase MsrB	NA|529aa|up_3|CP020469.1_844278_845865_-	COG0405, Ggt, Gamma-glutamyltransferase [Amino acid transport and metabolism]	NA|442aa|up_2|CP020469.1_845892_847218_-	COG4664, FcbT3, TRAP-type mannitol/chloroaromatic compound transport system, large permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|170aa|up_1|CP020469.1_847221_847731_-	COG4665, FcbT2, TRAP-type mannitol/chloroaromatic compound transport system, small permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|368aa|up_0|CP020469.1_847732_848836_-	cd13604, PBP2_TRAP_ketoacid_lactate_like, Substrate-binding domain of an alpha-keto acid binding Tripartite ATP-independent Periplasmic transporter and related proteins; the type 2 periplasmic-binding protein fold	NA|321aa|down_0|CP020469.1_850179_851142_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|258aa|down_1|CP020469.1_851138_851912_+	PRK15066, PRK15066, inner membrane transport permease; Provisional	NA|278aa|down_2|CP020469.1_851946_852780_+	PRK11792, queF, 7-cyano-7-deazaguanine reductase; Provisional	NA|187aa|down_3|CP020469.1_852829_853390_+	cd02135, YdjA-like, nitroreductase family protein similar to Escherichia coli YdjA	NA|169aa|down_4|CP020469.1_853427_853934_+	pfam09500, YiiD_C, Putative thioesterase (yiiD_Cterm)	NA|483aa|down_5|CP020469.1_854517_855966_+	cd05245, SDR_a2, atypical (a) SDRs, subgroup 2	NA|313aa|down_6|CP020469.1_855947_856886_-	COG1090, COG1090, Predicted nucleoside-diphosphate sugar epimerase [General function prediction only]	NA|166aa|down_7|CP020469.1_857001_857499_+	pfam11066, DUF2867, Protein of unknown function (DUF2867)	NA|303aa|down_8|CP020469.1_857491_858400_-	COG0385, COG0385, Predicted Na+-dependent transporter [General function prediction only]	NA|51aa|down_9|CP020469.1_858842_858995_-	pfam02069, Metallothio_Pro, Prokaryotic metallothionein
GCA_003031405.1_ASM303140v1	CP020469	Halomonas sp. 'Soap Lake #6' chromosome, complete genome	2	1488195-1488278	2	CRISPRCasFinder	no	cas5f,cas7f,cas6f	csa3,cas14j,WYL,RT,DEDDh,DinG,cas5f,cas7f,cas6f,cas3,cas3HD,cas5,cas8c,cas7,cas4,cas1,cas2	Unclear	GTAAACCGCCGAATAGGCAGCTGA	24	0	0	NA	NA	NA	1	1	Unclear	csa3,cas14j,WYL,RT,DEDDh,DinG,cas5f,cas7f,cas6f,cas3,cas3HD,cas5,cas8c,cas7,cas4,cas1,cas2	NA|108aa|up_9|CP020469.1_1478162_1478486_+,NA|166aa|up_8|CP020469.1_1478757_1479255_+,NA|197aa|up_7|CP020469.1_1479312_1479903_+,NA|238aa|up_6|CP020469.1_1480014_1480728_+,NA|174aa|up_5|CP020469.1_1481101_1481623_+,NA|166aa|down_1|CP020469.1_1490068_1490566_+,NA|121aa|down_4|CP020469.1_1492003_1492366_-,NA|133aa|down_6|CP020469.1_1493633_1494032_+	NA|108aa|up_9|CP020469.1_1478162_1478486_+	NA	NA|166aa|up_8|CP020469.1_1478757_1479255_+	NA	NA|197aa|up_7|CP020469.1_1479312_1479903_+	NA	NA|238aa|up_6|CP020469.1_1480014_1480728_+	NA	NA|174aa|up_5|CP020469.1_1481101_1481623_+	NA	NA|408aa|up_4|CP020469.1_1481651_1482875_+	pfam14020, DUF4236, Protein of unknown function (DUF4236)	NA|413aa|up_3|CP020469.1_1483103_1484342_+	pfam06527, TniQ, TniQ	cas5f|687aa|up_2|CP020469.1_1484346_1486407_+	cd09736, Csy2_I-F, CRISPR/Cas system-associated RAMP superfamily protein Csy2	cas7f|350aa|up_1|CP020469.1_1486417_1487467_+	cd09737, Csy3_I-F, CRISPR/Cas system-associated RAMP superfamily protein Csy3	cas6f|203aa|up_0|CP020469.1_1487463_1488072_+	cd09739, Cas6_I-F, CRISPR/Cas system-associated RAMP superfamily protein Cas6f	NA|157aa|down_0|CP020469.1_1488463_1488934_-	TIGR02293, conserved_protein_of_unknown_function, putative toxin-antitoxin system antitoxin component, TIGR02293 family	NA|166aa|down_1|CP020469.1_1490068_1490566_+	NA	NA|74aa|down_2|CP020469.1_1491097_1491319_+	COG2199, COG2199, c-di-GMP synthetase (diguanylate cyclase, GGDEF domain) [Signal    transduction mechanisms]	NA|86aa|down_3|CP020469.1_1491363_1491621_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|121aa|down_4|CP020469.1_1492003_1492366_-	NA	NA|253aa|down_5|CP020469.1_1492749_1493508_+	pfam12570, DUF3750, Protein of unknown function (DUF3750)	NA|133aa|down_6|CP020469.1_1493633_1494032_+	NA	NA|232aa|down_7|CP020469.1_1494025_1494721_+	pfam08808, RES, RES domain	NA|159aa|down_8|CP020469.1_1494733_1495210_-	PRK11050, PRK11050, manganese-binding transcriptional regulator MntR	NA|305aa|down_9|CP020469.1_1495363_1496278_+	cd01137, PsaA, Metal binding protein PsaA
GCA_003031405.1_ASM303140v1	CP020469	Halomonas sp. 'Soap Lake #6' chromosome, complete genome	4	1948767-1948902	4	CRISPRCasFinder	no		csa3,cas14j,WYL,RT,DEDDh,DinG,cas5f,cas7f,cas6f,cas3,cas3HD,cas5,cas8c,cas7,cas4,cas1,cas2	Orphan	GTTACCACGCGCTACAAAATGTGGCTATGAGGTCATCGTGGCTGC	45	0	0	NA	NA	NA	1	1	Orphan	csa3,cas14j,WYL,RT,DEDDh,DinG,cas5f,cas7f,cas6f,cas3,cas3HD,cas5,cas8c,cas7,cas4,cas1,cas2	NA|206aa|up_9|CP020469.1_1936614_1937232_-,NA|273aa|up_8|CP020469.1_1937328_1938147_-,NA|774aa|up_6|CP020469.1_1938818_1941140_-,NA|218aa|up_4|CP020469.1_1942388_1943042_-,NA|444aa|up_3|CP020469.1_1943138_1944470_-,NA|438aa|down_0|CP020469.1_1949077_1950391_+	NA|206aa|up_9|CP020469.1_1936614_1937232_-	NA	NA|273aa|up_8|CP020469.1_1937328_1938147_-	NA	NA|198aa|up_7|CP020469.1_1938232_1938826_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|774aa|up_6|CP020469.1_1938818_1941140_-	NA	NA|345aa|up_5|CP020469.1_1941248_1942283_-	pfam08239, SH3_3, Bacterial SH3 domain	NA|218aa|up_4|CP020469.1_1942388_1943042_-	NA	NA|444aa|up_3|CP020469.1_1943138_1944470_-	NA	NA|344aa|up_2|CP020469.1_1944549_1945581_-	cd17932, DEXQc_UvrD, DEXQD-box helicase domain of UvrD	NA|522aa|up_1|CP020469.1_1945565_1947131_-	cd01026, TOPRIM_OLD, TOPRIM_OLD: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in bacterial and archaeal nucleases of the OLD (overcome lysogenization defect) family	NA|321aa|up_0|CP020469.1_1947516_1948479_+	TIGR02249, Integrase/recombinase_E2_protein	NA|438aa|down_0|CP020469.1_1949077_1950391_+	NA	NA|173aa|down_1|CP020469.1_1950411_1950930_-	pfam07295, DUF1451, Zinc-ribbon containing domain	NA|862aa|down_2|CP020469.1_1951093_1953679_+	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|171aa|down_3|CP020469.1_1953675_1954188_+	COG2980, RlpB, Rare lipoprotein B [Cell envelope biogenesis, outer membrane]	NA|356aa|down_4|CP020469.1_1954184_1955252_+	PRK05574, holA, DNA polymerase III subunit delta; Reviewed	NA|133aa|down_5|CP020469.1_1955352_1955751_+	pfam14828, Amnionless, Amnionless	NA|310aa|down_6|CP020469.1_1955761_1956691_+	cd02549, Peptidase_C39A, A sub-family of peptidase family C39	NA|464aa|down_7|CP020469.1_1956744_1958136_-	PRK05249, PRK05249, Si-specific NAD(P)(+) transhydrogenase	NA|85aa|down_8|CP020469.1_1958244_1958499_-	COG2991, COG2991, Uncharacterized protein conserved in bacteria [Function unknown]	NA|411aa|down_9|CP020469.1_1958690_1959923_-	PRK05464, PRK05464, Na(+)-translocating NADH-quinone reductase subunit F; Provisional
GCA_003031405.1_ASM303140v1	CP020469	Halomonas sp. 'Soap Lake #6' chromosome, complete genome	5	2057457-2057549	5	CRISPRCasFinder	no	csa3	csa3,cas14j,WYL,RT,DEDDh,DinG,cas5f,cas7f,cas6f,cas3,cas3HD,cas5,cas8c,cas7,cas4,cas1,cas2	Type I-A	AGCCCGGCATACAATAGATTGAT	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas14j,WYL,RT,DEDDh,DinG,cas5f,cas7f,cas6f,cas3,cas3HD,cas5,cas8c,cas7,cas4,cas1,cas2	NA|186aa|up_3|CP020469.1_2054199_2054757_-,NA|60aa|down_6|CP020469.1_2065103_2065283_-	NA|155aa|up_9|CP020469.1_2050715_2051180_-	cd04685, Nudix_Hydrolase_26, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|144aa|up_8|CP020469.1_2051309_2051741_-	COG3153, COG3153, Predicted acetyltransferase [General function prediction only]	NA|141aa|up_7|CP020469.1_2051909_2052332_-	cd06587, VOC, vicinal oxygen chelate (VOC) family	NA|195aa|up_6|CP020469.1_2052382_2052967_-	COG0655, WrbA, Multimeric flavodoxin WrbA [General function prediction only]	NA|157aa|up_5|CP020469.1_2052966_2053437_-	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|161aa|up_4|CP020469.1_2053542_2054025_+	cd01110, HTH_SoxR, Helix-Turn-Helix DNA binding domain of the SoxR transcription regulator	NA|186aa|up_3|CP020469.1_2054199_2054757_-	NA	NA|194aa|up_2|CP020469.1_2054848_2055430_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|207aa|up_1|CP020469.1_2055509_2056130_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|186aa|up_0|CP020469.1_2056469_2057027_-	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|251aa|down_0|CP020469.1_2057789_2058542_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|361aa|down_1|CP020469.1_2058606_2059689_-	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|126aa|down_2|CP020469.1_2061873_2062251_-	COG2963, COG2963, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|133aa|down_3|CP020469.1_2062347_2062746_-	COG3791, COG3791, Uncharacterized conserved protein [Function unknown]	NA|125aa|down_4|CP020469.1_2062791_2063166_-	cd16362, TflA, Toxoflavin Lyase	NA|315aa|down_5|CP020469.1_2064030_2064975_+	PRK11727, PRK11727, 23S rRNA (adenine(1618)-N(6))-methyltransferase RlmF	NA|60aa|down_6|CP020469.1_2065103_2065283_-	NA	NA|300aa|down_7|CP020469.1_2065293_2066193_-	pfam00797, Acetyltransf_2, N-acetyltransferase	NA|152aa|down_8|CP020469.1_2066430_2066886_+	cd01110, HTH_SoxR, Helix-Turn-Helix DNA binding domain of the SoxR transcription regulator	NA|237aa|down_9|CP020469.1_2067058_2067769_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]
GCA_003031405.1_ASM303140v1	CP020469	Halomonas sp. 'Soap Lake #6' chromosome, complete genome	6	4445124-4451832	1,6,1	PILER-CR,CRISPRCasFinder,CRT	no	WYL,cas3HD,cas3,cas5,cas8c,cas7,cas4,cas1,cas2	csa3,cas14j,WYL,RT,DEDDh,DinG,cas5f,cas7f,cas6f,cas3,cas3HD,cas5,cas8c,cas7,cas4,cas1,cas2	 Type I-U?,Type I-U,Type I-C	GTCGCGCCCCACGCGGGCGCGTGGATTGAAAC,GTCGCGCCCCACGCGGGCGCGTGGATTGAAAC,GTCGCGCCCCACGCGGGCGCGTGGATTGAAAC	32,32,32	0	0	NA	NA	I-C:I-C:I-C	100,101,101	101	TypeI-U,TypeI-U?,TypeI-C	csa3,cas14j,WYL,RT,DEDDh,DinG,cas5f,cas7f,cas6f,cas3,cas3HD,cas5,cas8c,cas7,cas4,cas1,cas2	NA|224aa|up_9|CP020469.1_4434344_4435016_-,NA|751aa|down_0|CP020469.1_4452350_4454603_-	NA|224aa|up_9|CP020469.1_4434344_4435016_-	NA	WYL|322aa|up_8|CP020469.1_4435225_4436191_+	pfam13280, WYL, WYL domain	cas3HD|124aa|up_7|CP020469.1_4436272_4436644_+	cd09641, Cas3''_I, CRISPR/Cas system-associated protein Cas3''	cas3|822aa|up_6|CP020469.1_4436630_4439096_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|235aa|up_5|CP020469.1_4439106_4439811_+	cd09651, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|661aa|up_4|CP020469.1_4439807_4441790_+	cd09757, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas7|278aa|up_3|CP020469.1_4441789_4442623_+	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas4|214aa|up_2|CP020469.1_4442995_4443637_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas1|339aa|up_1|CP020469.1_4443617_4444634_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|98aa|up_0|CP020469.1_4444653_4444947_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|751aa|down_0|CP020469.1_4452350_4454603_-	NA	NA|887aa|down_1|CP020469.1_4454831_4457492_+	pfam00171, Aldedh, Aldehyde dehydrogenase family	NA|400aa|down_2|CP020469.1_4457554_4458754_-	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|698aa|down_3|CP020469.1_4458880_4460974_-	PRK11154, fadJ, fatty acid oxidation complex subunit alpha FadJ	NA|601aa|down_4|CP020469.1_4461366_4463169_+	cd01153, ACAD_fadE5, Putative acyl-CoA dehydrogenases similar to fadE5	NA|332aa|down_5|CP020469.1_4463279_4464275_+	cd08241, QOR1, Quinone oxidoreductase (QOR)	NA|418aa|down_6|CP020469.1_4464338_4465592_+	cd01155, ACAD_FadE2, Acyl-CoA dehydrogenases similar to fadE2	NA|399aa|down_7|CP020469.1_4465694_4466891_+	TIGR03204, pimC_large, pimeloyl-CoA dehydrogenase, large subunit	NA|377aa|down_8|CP020469.1_4466902_4468033_+	TIGR03203, pimD_small, pimeloyl-CoA dehydrogenase, small subunit	NA|259aa|down_9|CP020469.1_4468145_4468922_+	COG2086, FixA, Electron transfer flavoprotein, beta subunit [Energy production and conversion]
