assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002549795.1_ASM254979v1	NZ_CP021435	Halomonas beimenensis strain NTU-111 chromosome, complete genome	1	559648-559746	1	CRISPRCasFinder	no		DinG,DEDDh,csa3,WYL,RT,cas3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Orphan	TGGCGGCACCCAGACAGGCGATGGCCTCGAGCCCTG	36	0	0	NA	NA	NA	1	1	Orphan	DinG,DEDDh,csa3,WYL,RT,cas3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA,NA	NA|256aa|up_9|NZ_CP021435.1_550160_550928_-	pfam07386, DUF1499, Protein of unknown function (DUF1499)	NA|199aa|up_8|NZ_CP021435.1_551125_551722_+	pfam09997, DUF2238, Predicted membrane protein (DUF2238)	NA|428aa|up_7|NZ_CP021435.1_552115_553399_+	COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport and metabolism]	NA|461aa|up_6|NZ_CP021435.1_553582_554965_+	cd17477, MFS_YcaD_like, YcaD and similar transporters of the Major Facilitator Superfamily	NA|180aa|up_5|NZ_CP021435.1_555105_555645_-	pfam10688, Imp-YgjV, Bacterial inner membrane protein	NA|190aa|up_4|NZ_CP021435.1_555754_556324_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|96aa|up_3|NZ_CP021435.1_556331_556619_-	TIGR01554, prophage_Lp3_protein_18, phage major capsid protein, HK97 family	NA|314aa|up_2|NZ_CP021435.1_556735_557677_-	PRK11139, PRK11139, DNA-binding transcriptional activator GcvA; Provisional	NA|236aa|up_1|NZ_CP021435.1_557799_558507_+	COG1182, AcpD, Acyl carrier protein phosphodiesterase [Lipid metabolism]	NA|332aa|up_0|NZ_CP021435.1_558543_559539_-	pfam00520, Ion_trans, Ion transport protein	NA|219aa|down_0|NZ_CP021435.1_559774_560431_-	PRK06026, PRK06026, 5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase; Validated	NA|724aa|down_1|NZ_CP021435.1_560845_563017_+	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|338aa|down_2|NZ_CP021435.1_563013_564027_+	pfam13310, Virulence_RhuM, Virulence protein RhuM family	NA|430aa|down_3|NZ_CP021435.1_564059_565349_+	cd17249, RMtype1_S_EcoR124I-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to S	NA|1013aa|down_4|NZ_CP021435.1_565362_568401_+	COG0610, COG0610, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|772aa|down_5|NZ_CP021435.1_568401_570717_-	PRK10875, recD, exodeoxyribonuclease V subunit alpha	NA|1314aa|down_6|NZ_CP021435.1_570713_574655_-	PRK10876, recB, exonuclease V subunit beta; Provisional	NA|1239aa|down_7|NZ_CP021435.1_574651_578368_-	TIGR01450, recC, exodeoxyribonuclease V, gamma subunit	NA|163aa|down_8|NZ_CP021435.1_578504_578993_+	pfam12850, Metallophos_2, Calcineurin-like phosphoesterase superfamily domain	NA|386aa|down_9|NZ_CP021435.1_579128_580286_+	pfam02371, Transposase_20, Transposase IS116/IS110/IS902 family
GCF_002549795.1_ASM254979v1	NZ_CP021435	Halomonas beimenensis strain NTU-111 chromosome, complete genome	2	959733-959831	2	CRISPRCasFinder	no		DinG,DEDDh,csa3,WYL,RT,cas3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Orphan	GCTCAGTTGGTTAGAGCGCACCCC	24	0	0	NA	NA	NA	1	1	Orphan	DinG,DEDDh,csa3,WYL,RT,cas3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA,NA|235aa|down_5|NZ_CP021435.1_967511_968216_-	NA|465aa|up_9|NZ_CP021435.1_943986_945381_+	COG1236, YSH1, Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis]	NA|428aa|up_8|NZ_CP021435.1_945431_946715_+	pfam13535, ATP-grasp_4, ATP-grasp domain	NA|207aa|up_7|NZ_CP021435.1_946806_947427_-	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase	NA|456aa|up_6|NZ_CP021435.1_947540_948908_-	COG0147, TrpE, Anthranilate/para-aminobenzoate synthases component I [Amino acid transport and metabolism / Coenzyme metabolism]	NA|238aa|up_5|NZ_CP021435.1_949147_949861_+	COG1802, GntR, Transcriptional regulators [Transcription]	NA|297aa|up_4|NZ_CP021435.1_949890_950781_+	PRK11320, prpB, 2-methylisocitrate lyase; Provisional	NA|376aa|up_3|NZ_CP021435.1_950820_951948_+	PRK12351, PRK12351, methylcitrate synthase; Provisional	NA|872aa|up_2|NZ_CP021435.1_952129_954745_+	PRK09277, PRK09277, aconitate hydratase AcnA	NA|390aa|up_1|NZ_CP021435.1_954741_955911_+	TIGR02334, conserved_hypothetical_protein, probable AcnD-accessory protein PrpF	NA|495aa|up_0|NZ_CP021435.1_956026_957511_+	PRK09425, prpD, bifunctional 2-methylcitrate dehydratase/aconitate hydratase	NA|530aa|down_0|NZ_CP021435.1_963738_965328_+	pfam02028, BCCT, BCCT, betaine/carnitine/choline family transporter	NA|146aa|down_1|NZ_CP021435.1_965401_965839_+	pfam11026, DUF2721, Protein of unknown function (DUF2721)	NA|356aa|down_2|NZ_CP021435.1_965835_966903_+	PRK11760, PRK11760, putative 23S rRNA C2498 ribose 2'-O-ribose methyltransferase; Provisional	NA|96aa|down_3|NZ_CP021435.1_966907_967195_-	COG2329, COG2329, Uncharacterized enzyme involved in biosynthesis of extracellular polysaccharides [General function prediction only]	NA|84aa|down_4|NZ_CP021435.1_967239_967491_+	cd03423, SirA, SirA (also known as UvrY,  and YhhP) belongs to a family of two-component response regulators that controls secondary metabolism and virulence	NA|235aa|down_5|NZ_CP021435.1_967511_968216_-	NA	NA|343aa|down_6|NZ_CP021435.1_968335_969364_-	pfam11279, DUF3080, Protein of unknown function (DUF3080)	NA|486aa|down_7|NZ_CP021435.1_969338_970796_-	PRK01766, PRK01766, multidrug efflux protein; Reviewed	NA|231aa|down_8|NZ_CP021435.1_970958_971651_-	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|473aa|down_9|NZ_CP021435.1_971970_973389_+	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]
GCF_002549795.1_ASM254979v1	NZ_CP021435	Halomonas beimenensis strain NTU-111 chromosome, complete genome	3	3515832-3515930	3	CRISPRCasFinder	no		DinG,DEDDh,csa3,WYL,RT,cas3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Orphan	GGGGTGCGCTCCAACCAACTGAGC	24	0	0	NA	NA	NA	1	1	Orphan	DinG,DEDDh,csa3,WYL,RT,cas3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA|164aa|up_9|NZ_CP021435.1_3504446_3504938_-,NA|68aa|up_3|NZ_CP021435.1_3509220_3509424_-,NA	NA|164aa|up_9|NZ_CP021435.1_3504446_3504938_-	NA	NA|172aa|up_8|NZ_CP021435.1_3505018_3505534_-	sd00010, SLR, Sel1-like repeat	NA|344aa|up_7|NZ_CP021435.1_3505753_3506785_+	cd19094, AKR_Tas-like, Escherichia coli Tas protein and similar proteins	NA|260aa|up_6|NZ_CP021435.1_3506881_3507661_+	PRK02101, PRK02101, peroxide stress protein YaaA	NA|192aa|up_5|NZ_CP021435.1_3507657_3508233_-	cd01012, YcaC_related, YcaC related amidohydrolases; E	NA|271aa|up_4|NZ_CP021435.1_3508339_3509152_+	smart00978, Tim44, Tim44 is an essential component of the machinery that mediates the translocation of nuclear-encoded proteins across the mitochondrial inner membrane	NA|68aa|up_3|NZ_CP021435.1_3509220_3509424_-	NA	NA|123aa|up_2|NZ_CP021435.1_3509783_3510152_+	COG2963, COG2963, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|112aa|up_1|NZ_CP021435.1_3510148_3510484_+	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|517aa|up_0|NZ_CP021435.1_3510564_3512115_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|273aa|down_0|NZ_CP021435.1_3523687_3524506_-	COG1183, PssA, Phosphatidylserine synthase [Lipid metabolism]	NA|339aa|down_1|NZ_CP021435.1_3524634_3525651_-	PRK05479, PRK05479, ketol-acid reductoisomerase; Provisional	NA|297aa|down_2|NZ_CP021435.1_3525842_3526733_+	PRK11716, PRK11716, HTH-type transcriptional activator IlvY	NA|164aa|down_3|NZ_CP021435.1_3526846_3527338_-	PRK11895, ilvH, acetolactate synthase 3 regulatory subunit; Reviewed	NA|575aa|down_4|NZ_CP021435.1_3527337_3529062_-	PRK06466, PRK06466, acetolactate synthase 3 large subunit	NA|489aa|down_5|NZ_CP021435.1_3529561_3531028_+	cd17359, MFS_XylE_like, D-xylose-proton symporter and similar transporters of the Major Facilitator Superfamily	NA|863aa|down_6|NZ_CP021435.1_3531135_3533724_-	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB	NA|391aa|down_7|NZ_CP021435.1_3534034_3535206_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|252aa|down_8|NZ_CP021435.1_3535320_3536076_-	PRK10723, PRK10723, polyphenol oxidase	NA|320aa|down_9|NZ_CP021435.1_3536072_3537032_-	PRK11180, rluD, 23S rRNA pseudouridine(1911/1915/1917) synthase RluD
GCF_002549795.1_ASM254979v1	NZ_CP021435	Halomonas beimenensis strain NTU-111 chromosome, complete genome	4	3745912-3752524	4,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,WYL	DinG,DEDDh,csa3,WYL,RT,cas3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Type I-E	CGGGTTATCCCCGCGCCTGCGGGGATCGG,CGGGTTATCCCCGCGCCTGCGGGGATCGG,CGGGTTATCCCCGCGCCTGCGGGGATCGG	29,29,29	0	0	NA	NA	NA:NA:NA	108,107,99	108	TypeI-E	DinG,DEDDh,csa3,WYL,RT,cas3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA|85aa|up_2|NZ_CP021435.1_3744257_3744512_-,NA|123aa|up_1|NZ_CP021435.1_3744742_3745111_+,NA	NA|235aa|up_9|NZ_CP021435.1_3736232_3736937_+	cd03224, ABC_TM1139_LivF_branched, ATP-binding cassette domain of branched-chain amino acid transporter	NA|335aa|up_8|NZ_CP021435.1_3736968_3737973_+	cd06582, TM_PBP1_LivH_like, Transmembrane subunit (TM) of Escherichia coli LivH and related proteins	NA|434aa|up_7|NZ_CP021435.1_3737972_3739274_+	cd06581, TM_PBP1_LivM_like, Transmembrane subunit (TM) of Escherichia coli LivM and related proteins	NA|405aa|up_6|NZ_CP021435.1_3739531_3740746_-	cd06346, PBP1_ABC_ligand_binding-like, type 1 periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|246aa|up_5|NZ_CP021435.1_3740987_3741725_+	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|419aa|up_4|NZ_CP021435.1_3741844_3743101_+	pfam00375, SDF, Sodium:dicarboxylate symporter family	NA|359aa|up_3|NZ_CP021435.1_3743184_3744261_+	pfam03417, AAT, Acyl-coenzyme A:6-aminopenicillanic acid acyl-transferase	NA|85aa|up_2|NZ_CP021435.1_3744257_3744512_-	NA	NA|123aa|up_1|NZ_CP021435.1_3744742_3745111_+	NA	NA|197aa|up_0|NZ_CP021435.1_3745210_3745801_-	pfam13875, DUF4202, Domain of unknown function (DUF4202)	cas2|99aa|down_0|NZ_CP021435.1_3752630_3752927_-	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	cas1|307aa|down_1|NZ_CP021435.1_3752928_3753849_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|231aa|down_2|NZ_CP021435.1_3753876_3754569_-	smart01101, CRISPR_assoc, This domain forms an anti-parallel beta strand structure with flanking alpha helical regions	cas5|252aa|down_3|NZ_CP021435.1_3754568_3755324_-	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|358aa|down_4|NZ_CP021435.1_3755333_3756407_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|218aa|down_5|NZ_CP021435.1_3756436_3757090_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas8e|561aa|down_6|NZ_CP021435.1_3757086_3758769_-	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cas3|897aa|down_7|NZ_CP021435.1_3758781_3761472_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	WYL|290aa|down_8|NZ_CP021435.1_3761607_3762477_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|199aa|down_9|NZ_CP021435.1_3762537_3763134_+	pfam13649, Methyltransf_25, Methyltransferase domain
