assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001675145.1_ASM167514v1	NZ_CP014667	Escherichia coli strain ECONIH2 chromosome, complete genome	1	430169-430304	1	CRISPRCasFinder	no		DEDDh,cas14j,DinG,cas3,csa3,RT,PD-DExK,WYL	Orphan	GCCGGATGCGGCGTGAACGCCTTATCCGGCCTACGAATGGCGC	43	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas14j,DinG,cas3,csa3,RT,PD-DExK,WYL	NA,NA	NA|491aa|up_9|NZ_CP014667.1_417661_419134_-	PRK13252, PRK13252, betaine aldehyde dehydrogenase; Provisional	NA|196aa|up_8|NZ_CP014667.1_419147_419735_-	PRK00767, PRK00767, transcriptional regulator BetI; Validated	NA|678aa|up_7|NZ_CP014667.1_419863_421897_+	PRK09928, PRK09928, choline transport protein BetT; Provisional	NA|363aa|up_6|NZ_CP014667.1_422772_423861_+	smart00052, EAL, Putative diguanylate phosphodiesterase	NA|311aa|up_5|NZ_CP014667.1_423902_424835_-	PRK10094, PRK10094, HTH-type transcriptional activator AllS	NA|166aa|up_4|NZ_CP014667.1_424926_425424_-	pfam06496, DUF1097, Protein of unknown function (DUF1097)	NA|202aa|up_3|NZ_CP014667.1_425681_426287_+	sd00045, ANK, ankyrin repeats	NA|288aa|up_2|NZ_CP014667.1_426326_427190_+	pfam11392, DUF2877, Protein of unknown function (DUF2877)	NA|516aa|up_1|NZ_CP014667.1_427179_428727_+	PRK06091, PRK06091, membrane protein FdrA; Validated	NA|473aa|up_0|NZ_CP014667.1_428726_430145_+	pfam06545, DUF1116, Protein of unknown function (DUF1116)	NA|317aa|down_0|NZ_CP014667.1_430391_431342_+	PRK12352, PRK12352, putative carbamate kinase; Reviewed	NA|461aa|down_1|NZ_CP014667.1_431351_432734_+	PRK06846, PRK06846, putative deaminase; Validated	NA|350aa|down_2|NZ_CP014667.1_433110_434160_+	cd05283, CAD1, Cinnamyl alcohol dehydrogenases (CAD)	NA|225aa|down_3|NZ_CP014667.1_435041_435716_-	TIGR00949, Uncharacterized_membrane_protein_YahN, The Resistance to Homoserine/Threonine (RhtB) Family protein	NA|92aa|down_4|NZ_CP014667.1_435862_436138_+	PRK09929, PRK09929, hypothetical protein; Provisional	NA|529aa|down_5|NZ_CP014667.1_436238_437825_-	PRK15424, PRK15424, propionate catabolism operon regulatory protein PrpR; Provisional	NA|297aa|down_6|NZ_CP014667.1_438063_438954_+	PRK11320, prpB, 2-methylisocitrate lyase; Provisional	NA|390aa|down_7|NZ_CP014667.1_439113_440283_+	PRK12351, PRK12351, methylcitrate synthase; Provisional	NA|484aa|down_8|NZ_CP014667.1_440316_441768_+	PRK09425, prpD, bifunctional 2-methylcitrate dehydratase/aconitate hydratase	NA|629aa|down_9|NZ_CP014667.1_441807_443694_+	PRK10524, prpE, propionyl-CoA synthetase; Provisional
GCF_001675145.1_ASM167514v1	NZ_CP014667	Escherichia coli strain ECONIH2 chromosome, complete genome	2	958517-958664	2	CRISPRCasFinder	no		DEDDh,cas14j,DinG,cas3,csa3,RT,PD-DExK,WYL	Orphan	GTTCACTGCCGTACAGGCAGCTTAGAAA	28	0	0	NA	NA	I-F	2	2	Orphan	DEDDh,cas14j,DinG,cas3,csa3,RT,PD-DExK,WYL	NA|86aa|up_8|NZ_CP014667.1_950290_950548_+,NA|58aa|up_6|NZ_CP014667.1_951330_951504_-,NA|94aa|up_3|NZ_CP014667.1_954254_954536_+,NA|59aa|up_2|NZ_CP014667.1_954883_955060_+,NA	NA|585aa|up_9|NZ_CP014667.1_948247_950002_+	COG3378, COG3378, Phage associated DNA primase [General function prediction only]	NA|86aa|up_8|NZ_CP014667.1_950290_950548_+	NA	NA|141aa|up_7|NZ_CP014667.1_950544_950967_+	cd04496, SSB_OBF, SSB_OBF: A subfamily of OB folds similar to the OB fold of ssDNA-binding protein (SSB)	NA|58aa|up_6|NZ_CP014667.1_951330_951504_-	NA	NA|638aa|up_5|NZ_CP014667.1_951596_953510_+	cd07016, S14_ClpP_1, Caseinolytic protease (ClpP) is an ATP-dependent, highly conserved serine protease	NA|162aa|up_4|NZ_CP014667.1_953766_954252_+	pfam07278, DUF1441, Protein of unknown function (DUF1441)	NA|94aa|up_3|NZ_CP014667.1_954254_954536_+	NA	NA|59aa|up_2|NZ_CP014667.1_954883_955060_+	NA	NA|107aa|up_1|NZ_CP014667.1_955676_955997_+	PRK00033, clpS, ATP-dependent Clp protease adaptor protein ClpS; Reviewed	NA|759aa|up_0|NZ_CP014667.1_956027_958304_+	PRK11034, clpA, ATP-dependent Clp protease ATP-binding subunit; Provisional	NA|73aa|down_0|NZ_CP014667.1_959047_959266_-	PRK00276, infA, translation initiation factor IF-1; Validated	NA|235aa|down_1|NZ_CP014667.1_959550_960255_-	PRK00301, aat, leucyl/phenylalanyl-tRNA--protein transferase; Reviewed	NA|574aa|down_2|NZ_CP014667.1_960296_962018_-	PRK11160, PRK11160, cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed	NA|589aa|down_3|NZ_CP014667.1_962018_963785_-	PRK11174, PRK11174, cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed	NA|322aa|down_4|NZ_CP014667.1_963907_964873_-	PRK10262, PRK10262, thioredoxin reductase; Provisional	NA|165aa|down_5|NZ_CP014667.1_965416_965911_+	PRK11169, PRK11169, leucine-responsive transcriptional regulator Lrp	NA|1340aa|down_6|NZ_CP014667.1_966045_970065_+	PRK10263, PRK10263, DNA translocase FtsK; Provisional	NA|204aa|down_7|NZ_CP014667.1_970223_970835_+	TIGR00547, Outer-membrane_lipoprotein_carrier_protein, periplasmic chaperone LolA	NA|448aa|down_8|NZ_CP014667.1_970845_972189_+	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|431aa|down_9|NZ_CP014667.1_972279_973572_+	PRK05431, PRK05431, seryl-tRNA synthetase; Provisional
GCF_001675145.1_ASM167514v1	NZ_CP014667	Escherichia coli strain ECONIH2 chromosome, complete genome	3	2451420-2451533	3	CRISPRCasFinder	no		DEDDh,cas14j,DinG,cas3,csa3,RT,PD-DExK,WYL	Orphan	TTTGTAGGCCGGATAAGCGAAGCGCATCCGGCA	33	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas14j,DinG,cas3,csa3,RT,PD-DExK,WYL	NA,NA	NA|395aa|up_9|NZ_CP014667.1_2430322_2431507_+	PRK05790, PRK05790, putative acyltransferase; Provisional	NA|259aa|up_8|NZ_CP014667.1_2431580_2432357_-	COG4676, COG4676, Uncharacterized protein conserved in bacteria [Function unknown]	NA|550aa|up_7|NZ_CP014667.1_2432361_2434011_-	COG5445, COG5445, Predicted secreted protein [Function unknown]	NA|1505aa|up_6|NZ_CP014667.1_2434011_2438526_-	COG2373, COG2373, Large extracellular alpha-helical protein [General function prediction only]	NA|208aa|up_5|NZ_CP014667.1_2438549_2439173_-	COG3234, COG3234, Uncharacterized protein conserved in bacteria [Function unknown]	NA|563aa|up_4|NZ_CP014667.1_2439169_2440858_-	COG4685, COG4685, Uncharacterized protein conserved in bacteria [Function unknown]	NA|876aa|up_3|NZ_CP014667.1_2441006_2443634_-	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|241aa|up_2|NZ_CP014667.1_2443780_2444503_+	PRK05134, PRK05134, bifunctional 2-polyprenyl-6-hydroxyphenol methylase/3-demethylubiquinol 3-O-methyltransferase UbiG	NA|1253aa|up_1|NZ_CP014667.1_2444642_2448401_-	PRK09752, PRK09752, AIDA-I family autotransporter YfaL	NA|762aa|up_0|NZ_CP014667.1_2449082_2451368_+	PRK09103, PRK09103, ribonucleoside-diphosphate reductase subunit alpha	NA|377aa|down_0|NZ_CP014667.1_2451557_2452688_+	PRK09101, nrdB, ribonucleotide-diphosphate reductase subunit beta; Reviewed	NA|85aa|down_1|NZ_CP014667.1_2452687_2452942_+	PRK10713, PRK10713, 2Fe-2S ferredoxin-like protein	NA|217aa|down_2|NZ_CP014667.1_2452995_2453646_-	PRK09902, PRK09902, lipopolysaccharide kinase InaA	NA|359aa|down_3|NZ_CP014667.1_2453848_2454925_-	PRK11143, glpQ, glycerophosphodiester phosphodiesterase; Provisional	NA|453aa|down_4|NZ_CP014667.1_2454929_2456288_-	PRK11273, glpT, glycerol-3-phosphate transporter	NA|543aa|down_5|NZ_CP014667.1_2456560_2458189_+	PRK11101, glpA, anaerobic glycerol-3-phosphate dehydrogenase subunit A	NA|420aa|down_6|NZ_CP014667.1_2458178_2459438_+	COG3075, GlpB, Anaerobic glycerol-3-phosphate dehydrogenase [Amino acid transport and metabolism]	NA|397aa|down_7|NZ_CP014667.1_2459434_2460625_+	TIGR03379, glycerol3P_GlpC, glycerol-3-phosphate dehydrogenase, anaerobic, C subunit	NA|320aa|down_8|NZ_CP014667.1_2460817_2461777_+	PRK09956, PRK09956, ISNCY family transposase	NA|62aa|down_9|NZ_CP014667.1_2461789_2461975_+	PRK09956, PRK09956, ISNCY family transposase
GCF_001675145.1_ASM167514v1	NZ_CP014667	Escherichia coli strain ECONIH2 chromosome, complete genome	4	4349390-4349591	1	PILER-CR	no		DEDDh,cas14j,DinG,cas3,csa3,RT,PD-DExK,WYL	Orphan	GTAGACCGGATAAGGCGTTCACGCCGCATCCGGCAA	36	0	0	NA	NA	NA	2	2	Orphan	DEDDh,cas14j,DinG,cas3,csa3,RT,PD-DExK,WYL	NA,NA	NA|252aa|up_9|NZ_CP014667.1_4340940_4341696_+	PRK00216, ubiE, bifunctional demethylmenaquinone methyltransferase/2-methoxy-6-polyprenyl-1,4-benzoquinol methylase UbiE	NA|202aa|up_8|NZ_CP014667.1_4341709_4342315_+	COG3165, COG3165, Uncharacterized protein conserved in bacteria [Function unknown]	NA|547aa|up_7|NZ_CP014667.1_4342311_4343952_+	PRK04750, ubiB, putative ubiquinone biosynthesis protein UbiB; Reviewed	NA|90aa|up_6|NZ_CP014667.1_4344030_4344300_+	PRK03554, tatA, Sec-independent protein translocase subunit TatA	NA|172aa|up_5|NZ_CP014667.1_4344303_4344819_+	PRK01770, PRK01770, Sec-independent protein translocase subunit TatB	NA|259aa|up_4|NZ_CP014667.1_4344821_4345598_+	PRK10921, PRK10921, Sec-independent protein translocase subunit TatC	NA|261aa|up_3|NZ_CP014667.1_4345639_4346422_+	PRK10425, PRK10425, 3'-5' ssDNA/RNA exonuclease TatD	NA|163aa|up_2|NZ_CP014667.1_4346418_4346907_-	PRK09014, rfaH, transcription/translation regulatory transformer protein RfaH	NA|498aa|up_1|NZ_CP014667.1_4347073_4348567_+	PRK10922, PRK10922, 4-hydroxy-3-polyprenylbenzoate decarboxylase	NA|234aa|up_0|NZ_CP014667.1_4348612_4349314_+	PRK08051, fre, FMN reductase; Validated	NA|388aa|down_0|NZ_CP014667.1_4349595_4350759_-	PRK08947, fadA, 3-ketoacyl-CoA thiolase; Reviewed	NA|730aa|down_1|NZ_CP014667.1_4350768_4352958_-	PRK11730, fadB, fatty acid oxidation complex subunit alpha FadB	NA|444aa|down_2|NZ_CP014667.1_4353147_4354479_+	PRK13607, PRK13607, proline dipeptidase; Provisional	NA|205aa|down_3|NZ_CP014667.1_4354478_4355093_+	PRK11568, PRK11568, IMPACT family protein	NA|484aa|down_4|NZ_CP014667.1_4355131_4356583_+	PRK10750, PRK10750, Trk system potassium transporter TrkH	NA|182aa|down_5|NZ_CP014667.1_4356594_4357140_+	PRK11104, hemG, menaquinone-dependent protoporphyrinogen IX dehydrogenase	NA|176aa|down_6|NZ_CP014667.1_4362723_4363251_-	PRK10751, PRK10751, molybdopterin-guanine dinucleotide biosynthesis protein B; Provisional	NA|195aa|down_7|NZ_CP014667.1_4363232_4363817_-	PRK00317, mobA, molybdopterin-guanine dinucleotide biosynthesis protein MobA; Reviewed	NA|90aa|down_8|NZ_CP014667.1_4363886_4364156_+	pfam06288, DUF1040, Protein of unknown function (DUF1040)	NA|329aa|down_9|NZ_CP014667.1_4364232_4365219_+	PRK11768, PRK11768, serine/threonine protein kinase
GCF_001675145.1_ASM167514v1	NZ_CP014667	Escherichia coli strain ECONIH2 chromosome, complete genome	5	4410018-4410144	4	CRISPRCasFinder	no		DEDDh,cas14j,DinG,cas3,csa3,RT,PD-DExK,WYL	Orphan	GTAGGCCGGATAAGGCACTCGTGCCGCATCCGGCA	35	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas14j,DinG,cas3,csa3,RT,PD-DExK,WYL	NA,NA	NA|73aa|up_9|NZ_CP014667.1_4399274_4399493_+	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|310aa|up_8|NZ_CP014667.1_4399707_4400637_-	PRK03564, PRK03564, formate dehydrogenase accessory protein FdhE; Provisional	NA|212aa|up_7|NZ_CP014667.1_4400633_4401269_-	PRK10639, PRK10639, formate dehydrogenase cytochrome b556 subunit	NA|301aa|up_6|NZ_CP014667.1_4401265_4402168_-	TIGR01582, Formate_dehydrogenase_iron-sulfur_subunit, formate dehydrogenase, beta subunit, Fe-S containing	NA|1017aa|up_5|NZ_CP014667.1_4402180_4405231_-	TIGR01553, Formate_dehydrogenase_major_subunit, formate dehydrogenase-N alpha subunit	NA|278aa|up_4|NZ_CP014667.1_4405424_4406258_+	PRK00724, PRK00724, formate dehydrogenase accessory sulfurtransferase FdhD	NA|228aa|up_3|NZ_CP014667.1_4406286_4406970_+	pfam12889, DUF3829, Protein of unknown function (DUF3829)	NA|465aa|up_2|NZ_CP014667.1_4407390_4408785_+	pfam16966, Porin_8, Porin-like glycoporin RafY	NA|105aa|up_1|NZ_CP014667.1_4408825_4409140_-	TIGR02625, L-rhamnose_mutarotase, L-rhamnose mutarotase	NA|275aa|up_0|NZ_CP014667.1_4409149_4409974_-	PRK03634, PRK03634, rhamnulose-1-phosphate aldolase; Provisional	NA|420aa|down_0|NZ_CP014667.1_4410148_4411408_-	PRK01076, PRK01076, L-rhamnose isomerase; Provisional	NA|490aa|down_1|NZ_CP014667.1_4411404_4412874_-	PRK10640, rhaB, rhamnulokinase; Provisional	NA|279aa|down_2|NZ_CP014667.1_4413161_4413998_+	PRK13503, PRK13503, HTH-type transcriptional activator RhaS	NA|313aa|down_3|NZ_CP014667.1_4413981_4414920_+	PRK13500, PRK13500, HTH-type transcriptional activator RhaR	NA|345aa|down_4|NZ_CP014667.1_4414916_4415951_-	pfam06379, RhaT, L-rhamnose-proton symport protein (RhaT)	NA|207aa|down_5|NZ_CP014667.1_4416236_4416857_+	PRK10925, PRK10925, superoxide dismutase [Mn]	NA|328aa|down_6|NZ_CP014667.1_4417116_4418100_+	TIGR00793, 2-keto-3-deoxygluconate_permease, 2-keto-3-deoxygluconate transporter	NA|225aa|down_7|NZ_CP014667.1_4418248_4418923_+	PRK11536, PRK11536, 6-N-hydroxylaminopurine resistance protein; Provisional	NA|458aa|down_8|NZ_CP014667.1_4419094_4420468_-	PRK09470, cpxA, envelope stress sensor histidine kinase CpxA	NA|233aa|down_9|NZ_CP014667.1_4420464_4421163_-	PRK10955, PRK10955, envelope stress response regulator transcription factor CpxR
GCF_001675145.1_ASM167514v1	NZ_CP014667	Escherichia coli strain ECONIH2 chromosome, complete genome	6	4838854-4838993	5	CRISPRCasFinder	no		DEDDh,cas14j,DinG,cas3,csa3,RT,PD-DExK,WYL	Orphan	TGTGTAGGTCGGATAAGGCGTTCACGCCGCATCCGACAATAACA	44	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas14j,DinG,cas3,csa3,RT,PD-DExK,WYL	NA,NA	NA|335aa|up_9|NZ_CP014667.1_4827399_4828404_-	PRK02102, PRK02102, ornithine carbamoyltransferase; Validated	NA|315aa|up_8|NZ_CP014667.1_4828414_4829359_-	PRK12354, PRK12354, carbamate kinase; Reviewed	NA|407aa|up_7|NZ_CP014667.1_4829369_4830590_-	PRK01388, PRK01388, arginine deiminase; Provisional	NA|151aa|up_6|NZ_CP014667.1_4831267_4831720_+	COG2731, EbgC, Beta-galactosidase, beta subunit [Carbohydrate transport and metabolism]	NA|335aa|up_5|NZ_CP014667.1_4831764_4832769_-	PRK03515, PRK03515, ornithine carbamoyltransferase subunit I; Provisional	NA|139aa|up_4|NZ_CP014667.1_4832930_4833347_+	PRK11191, PRK11191, ribonuclease E inhibitor RraB	NA|168aa|up_3|NZ_CP014667.1_4833523_4834027_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|396aa|up_2|NZ_CP014667.1_4834219_4835407_+	COG4269, COG4269, Predicted membrane protein [Function unknown]	NA|952aa|up_1|NZ_CP014667.1_4835453_4838309_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|148aa|up_0|NZ_CP014667.1_4838308_4838752_-	PRK05728, PRK05728, DNA polymerase III subunit chi; Validated	NA|504aa|down_0|NZ_CP014667.1_4839009_4840521_-	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|367aa|down_1|NZ_CP014667.1_4840787_4841888_+	PRK15120, PRK15120, lipopolysaccharide ABC transporter permease LptF; Provisional	NA|361aa|down_2|NZ_CP014667.1_4841887_4842970_+	PRK15071, PRK15071, lipopolysaccharide ABC transporter permease; Provisional	NA|501aa|down_3|NZ_CP014667.1_4843130_4844633_-	pfam05872, DUF853, Bacterial protein of unknown function (DUF853)	NA|333aa|down_4|NZ_CP014667.1_4844710_4845709_-	cd01575, PBP1_GntR, ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators	NA|440aa|down_5|NZ_CP014667.1_4845775_4847095_-	TIGR00791, Gluconate_permease, gluconate transporter	NA|255aa|down_6|NZ_CP014667.1_4847159_4847924_-	PRK08085, PRK08085, gluconate 5-dehydrogenase; Provisional	NA|344aa|down_7|NZ_CP014667.1_4847947_4848979_-	PRK09880, PRK09880, L-idonate 5-dehydrogenase; Provisional	NA|188aa|down_8|NZ_CP014667.1_4849195_4849759_+	PRK09825, idnK, gluconokinase	NA|340aa|down_9|NZ_CP014667.1_4849762_4850782_-	cd05283, CAD1, Cinnamyl alcohol dehydrogenases (CAD)
