assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000055785.1_ASM5578v1	NC_007963	Chromohalobacter salexigens DSM 3043, complete sequence	1	252373-253682	1,1,1,2	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,RT,csa3,DinG,WYL,cas14j	Type I-E	CCGTTCCCCGCAGGCGCGGGGATCAACCG,CCGTTCCCCGCAGGCGCGGGGATCAACCG,CCGTTCCCCGCAGGCGCGGGGATCAACCG,CCGTTCCCCGCAGGCGCGGGGATCAACCG	29,29,29,29	0	0	NA	NA	NA:NA:NA:NA	13,21,21,13	21	TypeI-E	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,RT,csa3,DinG,WYL,cas14j	NA|47aa|up_1|NC_007963.1_249459_249600_-,NA	NA|76aa|up_9|NC_007963.1_240732_240960_-	pfam07869, DUF1656, Protein of unknown function (DUF1656)	NA|700aa|up_8|NC_007963.1_240949_243049_-	pfam04632, FUSC, Fusaric acid resistance protein family	NA|178aa|up_7|NC_007963.1_243254_243788_+	cd07233, GlxI_Zn, Glyoxalase I that uses Zn(++) as cofactor	NA|551aa|up_6|NC_007963.1_243981_245634_+	PRK13273, mdoD, glucan biosynthesis protein D; Provisional	NA|322aa|up_5|NC_007963.1_245722_246688_+	cd13640, PBP2_ChoX, Substrate binding domain of ABC-type choline transport system; the type 2 periplasmic binding protein fold	NA|334aa|up_4|NC_007963.1_246763_247765_-	cd01137, PsaA, Metal binding protein PsaA	NA|305aa|up_3|NC_007963.1_247814_248729_-	COG1108, ZnuB, ABC-type Mn2+/Zn2+ transport systems, permease components [Inorganic ion transport and metabolism]	NA|250aa|up_2|NC_007963.1_248713_249463_-	COG1121, ZnuC, ABC-type Mn/Zn transport systems, ATPase component [Inorganic ion transport and metabolism]	NA|47aa|up_1|NC_007963.1_249459_249600_-	NA	NA|687aa|up_0|NC_007963.1_249947_252008_+	PRK09928, PRK09928, choline transport protein BetT; Provisional	cas3|880aa|down_0|NC_007963.1_253826_256466_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|112aa|down_1|NC_007963.1_256544_256880_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|416aa|down_2|NC_007963.1_256872_258120_+	pfam07804, HipA_C, HipA-like C-terminal domain	cas8e|563aa|down_3|NC_007963.1_258299_259988_+	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cse2gr11|231aa|down_4|NC_007963.1_259984_260677_+	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas7|369aa|down_5|NC_007963.1_260736_261843_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|261aa|down_6|NC_007963.1_261854_262637_+	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|231aa|down_7|NC_007963.1_262636_263329_+	smart01101, CRISPR_assoc, This domain forms an anti-parallel beta strand structure with flanking alpha helical regions	cas1|307aa|down_8|NC_007963.1_263338_264259_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|99aa|down_9|NC_007963.1_264260_264557_+	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional
GCF_000055785.1_ASM5578v1	NC_007963	Chromohalobacter salexigens DSM 3043, complete sequence	2	264668-266588	3,2,2,4	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,RT,csa3,DinG,WYL,cas14j	Type I-E	CCGTTCCCCGCAGGCGCGGGGATCAACCG,CCGTTCCCCGCAGGCGCGGGGATCAACCG,CCGTTCCCCGCAGGCGCGGGGATCAACCG,TCCGTTCCCCGCAGGCGCGGGGATCAACCGT	29,29,29,31	0	0	NA	NA	NA:NA:NA:NA	26,31,31,26	31	TypeI-E	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,RT,csa3,DinG,WYL,cas14j	NA,NA|104aa|down_3|NC_007963.1_270652_270964_-	cas3|880aa|up_9|NC_007963.1_253826_256466_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|112aa|up_8|NC_007963.1_256544_256880_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|416aa|up_7|NC_007963.1_256872_258120_+	pfam07804, HipA_C, HipA-like C-terminal domain	cas8e|563aa|up_6|NC_007963.1_258299_259988_+	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cse2gr11|231aa|up_5|NC_007963.1_259984_260677_+	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas7|369aa|up_4|NC_007963.1_260736_261843_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|261aa|up_3|NC_007963.1_261854_262637_+	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|231aa|up_2|NC_007963.1_262636_263329_+	smart01101, CRISPR_assoc, This domain forms an anti-parallel beta strand structure with flanking alpha helical regions	cas1|307aa|up_1|NC_007963.1_263338_264259_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|99aa|up_0|NC_007963.1_264260_264557_+	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	NA|459aa|down_0|NC_007963.1_266624_268001_-	TIGR02400, Alphaalpha-trehalose-phosphate_synthase, alpha,alpha-trehalose-phosphate synthase [UDP-forming]	NA|594aa|down_1|NC_007963.1_267997_269779_-	COG3387, SGA1, Glucoamylase and related glycosyl hydrolases [Carbohydrate transport and metabolism]	NA|248aa|down_2|NC_007963.1_269793_270537_-	cd01627, HAD_TPP, trehalose-phosphate phosphatase similar to Escherichia coli trehalose-6-phosphate phosphatase OtsB and Saccharomyces cerevisiae trehalose-phosphatase TPS2	NA|104aa|down_3|NC_007963.1_270652_270964_-	NA	NA|210aa|down_4|NC_007963.1_271037_271667_-	PRK10959, PRK10959, outer membrane protein W; Provisional	NA|609aa|down_5|NC_007963.1_271845_273672_-	COG1217, TypA, Predicted membrane GTPase involved in stress response [Signal transduction mechanisms]	NA|331aa|down_6|NC_007963.1_273928_274921_-	cd10918, CE4_NodB_like_5s_6s, Putative catalytic NodB homology domain of PgaB, IcaB, and similar proteins which consist of a deformed (beta/alpha)8 barrel fold with 5- or 6-strands	NA|322aa|down_7|NC_007963.1_275062_276028_-	cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan	NA|470aa|down_8|NC_007963.1_276485_277895_+	PRK09469, glnA, glutamate--ammonia ligase	NA|187aa|down_9|NC_007963.1_278131_278692_+	pfam13511, DUF4124, Domain of unknown function (DUF4124)
GCF_000055785.1_ASM5578v1	NC_007963	Chromohalobacter salexigens DSM 3043, complete sequence	3	558966-559063	3	CRISPRCasFinder	no		cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,RT,csa3,DinG,WYL,cas14j	Orphan	GCTCAGTTGGTTAGAGCGCACCCC	24	0	0	NA	NA	NA	1	1	Orphan	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,RT,csa3,DinG,WYL,cas14j	NA,NA	NA|269aa|up_9|NC_007963.1_544102_544909_-	TIGR03302, OM_YfiO, outer membrane assembly lipoprotein YfiO	NA|320aa|up_8|NC_007963.1_545014_545974_+	PRK11180, rluD, 23S rRNA pseudouridine(1911/1915/1917) synthase RluD	NA|252aa|up_7|NC_007963.1_545985_546741_+	PRK10723, PRK10723, polyphenol oxidase	NA|861aa|up_6|NC_007963.1_546882_549465_+	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB	NA|446aa|up_5|NC_007963.1_549561_550899_-	cd17359, MFS_XylE_like, D-xylose-proton symporter and similar transporters of the Major Facilitator Superfamily	NA|575aa|up_4|NC_007963.1_551421_553146_+	PRK06466, PRK06466, acetolactate synthase 3 large subunit	NA|164aa|up_3|NC_007963.1_553145_553637_+	PRK11895, ilvH, acetolactate synthase 3 regulatory subunit; Reviewed	NA|294aa|up_2|NC_007963.1_553710_554592_-	PRK11716, PRK11716, HTH-type transcriptional activator IlvY	NA|339aa|up_1|NC_007963.1_554759_555776_+	PRK05479, PRK05479, ketol-acid reductoisomerase; Provisional	NA|275aa|up_0|NC_007963.1_555931_556756_+	COG1183, PssA, Phosphatidylserine synthase [Lipid metabolism]	NA|442aa|down_0|NC_007963.1_562850_564176_+	PRK08032, fliD, flagellar capping protein; Reviewed	NA|352aa|down_1|NC_007963.1_564282_565338_-	PRK05387, PRK05387, histidinol-phosphate aminotransferase; Provisional	NA|366aa|down_2|NC_007963.1_565521_566619_+	cd02933, OYE_like_FMN, Old yellow enzyme (OYE)-like FMN binding domain	NA|153aa|down_3|NC_007963.1_566720_567179_+	pfam04170, NlpE, NlpE N-terminal domain	NA|459aa|down_4|NC_007963.1_567305_568682_+	PRK11823, PRK11823, DNA repair protein RadA; Provisional	NA|346aa|down_5|NC_007963.1_568736_569774_+	PRK00943, PRK00943, selenide, water dikinase SelD	NA|362aa|down_6|NC_007963.1_569770_570856_+	PRK11784, PRK11784, tRNA 2-selenouridine synthase; Provisional	NA|235aa|down_7|NC_007963.1_570865_571570_-	cd01741, GATase1_1, Subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain	NA|99aa|down_8|NC_007963.1_571634_571931_+	COG3695, COG3695, Predicted methylated DNA-protein cysteine methyltransferase [DNA replication, recombination, and repair]	NA|200aa|down_9|NC_007963.1_571954_572554_+	pfam04222, DUF416, Protein of unknown function (DUF416)
GCF_000055785.1_ASM5578v1	NC_007963	Chromohalobacter salexigens DSM 3043, complete sequence	4	1727227-1727484	5	PILER-CR	no		cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,RT,csa3,DinG,WYL,cas14j	Orphan	CGTTCCCCGATAGCTCAGTTGGTAGAGCAAATGACTGTTAATCATTGGGTCGCAGG	56	0	0	NA	NA	NA	2	2	Orphan	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,RT,csa3,DinG,WYL,cas14j	NA,NA	NA|326aa|up_9|NC_007963.1_1710055_1711033_+	PRK10754, PRK10754, NADPH:quinone reductase	NA|124aa|up_8|NC_007963.1_1711056_1711428_+	COG0797, RlpA, Lipoproteins [Cell envelope biogenesis, outer membrane]	NA|816aa|up_7|NC_007963.1_1711481_1713929_-	PRK09463, fadE, acyl-CoA dehydrogenase; Reviewed	NA|1038aa|up_6|NC_007963.1_1714044_1717158_-	PRK10614, PRK10614, multidrug efflux system subunit MdtC; Provisional	NA|1043aa|up_5|NC_007963.1_1717154_1720283_-	PRK10503, PRK10503, MdtB/MuxB family multidrug efflux RND transporter permease subunit	NA|405aa|up_4|NC_007963.1_1720279_1721494_-	PRK11556, PRK11556, MdtA/MuxA family multidrug efflux RND transporter periplasmic adaptor subunit	NA|561aa|up_3|NC_007963.1_1721737_1723420_-	PRK02106, PRK02106, choline dehydrogenase; Validated	NA|495aa|up_2|NC_007963.1_1723498_1724983_-	PRK13252, PRK13252, betaine aldehyde dehydrogenase; Provisional	NA|203aa|up_1|NC_007963.1_1725024_1725633_-	PRK00767, PRK00767, transcriptional regulator BetI; Validated	NA|317aa|up_0|NC_007963.1_1725862_1726813_+	TIGR03414, ABC_choline_bnd, choline ABC transporter, periplasmic binding protein	NA|699aa|down_0|NC_007963.1_1729562_1731659_-	PRK11186, PRK11186, carboxy terminal-processing peptidase	NA|433aa|down_1|NC_007963.1_1731953_1733252_+	PRK02813, PRK02813, putative aminopeptidase 2; Provisional	NA|364aa|down_2|NC_007963.1_1733524_1734616_-	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|195aa|down_3|NC_007963.1_1734729_1735314_-	PRK05426, PRK05426, peptidyl-tRNA hydrolase; Provisional	NA|216aa|down_4|NC_007963.1_1735335_1735983_-	PRK05618, PRK05618, 50S ribosomal protein L25/general stress protein Ctc; Reviewed	NA|314aa|down_5|NC_007963.1_1736120_1737062_-	PRK01259, PRK01259, ribose-phosphate diphosphokinase	NA|299aa|down_6|NC_007963.1_1737248_1738145_-	PRK00343, ipk, 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional	NA|215aa|down_7|NC_007963.1_1738144_1738789_-	PRK00022, lolB, lipoprotein localization protein LolB	NA|589aa|down_8|NC_007963.1_1738785_1740552_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|425aa|down_9|NC_007963.1_1740808_1742083_+	PRK00045, hemA, glutamyl-tRNA reductase; Reviewed
GCF_000055785.1_ASM5578v1	NC_007963	Chromohalobacter salexigens DSM 3043, complete sequence	5	3396424-3396520	4	CRISPRCasFinder	no		cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,RT,csa3,DinG,WYL,cas14j	Orphan	GGGTGCGCTCTAACCAACCGAGC	23	0	0	NA	NA	NA	1	1	Orphan	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,RT,csa3,DinG,WYL,cas14j	NA|105aa|up_9|NC_007963.1_3385409_3385724_+,NA|166aa|up_5|NC_007963.1_3387904_3388402_-,NA|67aa|up_0|NC_007963.1_3392267_3392468_-,NA	NA|105aa|up_9|NC_007963.1_3385409_3385724_+	NA	NA|174aa|up_8|NC_007963.1_3385707_3386229_+	pfam04417, DUF501, Protein of unknown function (DUF501)	NA|188aa|up_7|NC_007963.1_3386257_3386821_+	TIGR00730, LOG_family_protein_YJL055W, TIGR00730 family protein	NA|324aa|up_6|NC_007963.1_3386890_3387862_+	cd05276, p53_inducible_oxidoreductase, PIG3 p53-inducible quinone oxidoreductase	NA|166aa|up_5|NC_007963.1_3387904_3388402_-	NA	NA|170aa|up_4|NC_007963.1_3388521_3389031_-	sd00010, SLR, Sel1-like repeat	NA|344aa|up_3|NC_007963.1_3389291_3390323_+	cd19094, AKR_Tas-like, Escherichia coli Tas protein and similar proteins	NA|263aa|up_2|NC_007963.1_3390434_3391223_+	PRK02101, PRK02101, peroxide stress protein YaaA	NA|273aa|up_1|NC_007963.1_3391353_3392172_+	smart00978, Tim44, Tim44 is an essential component of the machinery that mediates the translocation of nuclear-encoded proteins across the mitochondrial inner membrane	NA|67aa|up_0|NC_007963.1_3392267_3392468_-	NA	NA|399aa|down_0|NC_007963.1_3398730_3399927_-	PRK05912, PRK05912, tyrosyl-tRNA synthetase; Validated	NA|372aa|down_1|NC_007963.1_3400027_3401143_+	PRK09585, anmK, anhydro-N-acetylmuramic acid kinase; Reviewed	NA|386aa|down_2|NC_007963.1_3401148_3402306_-	COG5008, PilU, Tfp pilus assembly protein, ATPase PilU [Cell motility and secretion / Intracellular trafficking and secretion]	NA|347aa|down_3|NC_007963.1_3402347_3403388_-	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|236aa|down_4|NC_007963.1_3403423_3404131_+	cd06824, PLPDE_III_Yggs_like, Pyridoxal 5-phosphate (PLP)-binding TIM barrel domain of Type III PLP-Dependent Enzymes, Yggs-like proteins	NA|275aa|down_5|NC_007963.1_3404180_3405005_+	PRK11880, PRK11880, pyrroline-5-carboxylate reductase; Reviewed	NA|197aa|down_6|NC_007963.1_3405042_3405633_+	pfam02325, YGGT, YGGT family	NA|387aa|down_7|NC_007963.1_3405691_3406852_+	PRK00175, metX, homoserine O-acetyltransferase; Provisional	NA|204aa|down_8|NC_007963.1_3406851_3407463_+	TIGR02081, conserved_hypothetical_protein, methionine biosynthesis protein MetW	NA|254aa|down_9|NC_007963.1_3407452_3408214_-	PRK00347, PRK00347, DNA/RNA nuclease SfsA
GCF_000055785.1_ASM5578v1	NC_007963	Chromohalobacter salexigens DSM 3043, complete sequence	6	3446528-3446616	5	CRISPRCasFinder	no		cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,RT,csa3,DinG,WYL,cas14j	Orphan	GCCATGACAGGTCGGGCAGGTCT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,RT,csa3,DinG,WYL,cas14j	NA,NA|70aa|down_9|NC_007963.1_3455760_3455970_-	NA|109aa|up_9|NC_007963.1_3434887_3435214_+	smart01103, CRS1_YhbY, Escherichia coli YhbY is associated with pre-50S ribosomal subunits, which implies a function in ribosome assembly	NA|159aa|up_8|NC_007963.1_3435309_3435786_-	PRK00226, greA, transcription elongation factor GreA; Reviewed	NA|1077aa|up_7|NC_007963.1_3435782_3439013_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|382aa|up_6|NC_007963.1_3439085_3440231_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|270aa|up_5|NC_007963.1_3440502_3441312_-	PRK00048, PRK00048, dihydrodipicolinate reductase; Provisional	NA|97aa|up_4|NC_007963.1_3441412_3441703_-	pfam09981, DUF2218, Uncharacterized protein conserved in bacteria (DUF2218)	NA|376aa|up_3|NC_007963.1_3441952_3443080_+	cd01139, TroA_f, Periplasmic binding protein TroA_f	NA|354aa|up_2|NC_007963.1_3443102_3444164_+	COG0609, FepD, ABC-type Fe3+-siderophore transport system, permease component [Inorganic ion transport and metabolism]	NA|250aa|up_1|NC_007963.1_3444157_3444907_+	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|298aa|up_0|NC_007963.1_3444884_3445778_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|645aa|down_0|NC_007963.1_3447199_3449134_-	PRK00290, dnaK, molecular chaperone DnaK; Provisional	NA|211aa|down_1|NC_007963.1_3449267_3449900_-	PRK14150, PRK14150, heat shock protein GrpE; Provisional	NA|558aa|down_2|NC_007963.1_3450087_3451761_+	COG0497, RecN, ATPase involved in DNA repair [DNA replication, recombination, and repair]	NA|151aa|down_3|NC_007963.1_3451890_3452343_-	PRK09462, fur, ferric uptake regulator; Provisional	NA|144aa|down_4|NC_007963.1_3452421_3452853_+	pfam04355, SmpA_OmlA, SmpA / OmlA family	NA|106aa|down_5|NC_007963.1_3452868_3453186_-	pfam03658, Ub-RnfH, RnfH family Ubiquitin	NA|147aa|down_6|NC_007963.1_3453172_3453613_-	cd07813, COQ10p_like, Coenzyme Q-binding protein COQ10p and similar proteins	NA|158aa|down_7|NC_007963.1_3453743_3454217_+	PRK05422, smpB, SsrA-binding protein SmpB	NA|287aa|down_8|NC_007963.1_3454746_3455607_-	cd13639, PBP2_OpuAC_like, Substrate binding domain of Lactococcus lactis ABC-type transporter OpuA and related proteins; the type 2 periplasmic binding protein fold	NA|70aa|down_9|NC_007963.1_3455760_3455970_-	NA
