assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_002853805.1_ASM285380v1	CP024141	Escherichia coli strain 14EC029 chromosome, complete genome	1	661154-661271	1	CRISPRCasFinder	no		WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	Orphan	TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGG	40	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	NA,NA|55aa|down_0|CP024141.1_661344_661509_-	NA|115aa|up_9|CP024141.1_648875_649220_-	PRK11424, PRK11424, DNA-binding transcriptional activator TdcR; Provisional	NA|313aa|up_8|CP024141.1_649408_650347_+	PRK10341, PRK10341, transcriptional regulator TdcA	NA|330aa|up_7|CP024141.1_650445_651435_+	PRK08638, PRK08638, bifunctional threonine ammonia-lyase/L-serine ammonia-lyase TdcB	NA|444aa|up_6|CP024141.1_651456_652788_+	PRK13629, PRK13629, threonine/serine transporter TdcC; Provisional	NA|403aa|up_5|CP024141.1_652813_654022_+	PRK12379, PRK12379, propionate kinase	NA|765aa|up_4|CP024141.1_654055_656350_+	cd01678, PFL1, Pyruvate formate lyase 1	NA|130aa|up_3|CP024141.1_656363_656753_+	PRK11401, PRK11401, enamine/imine deaminase	NA|455aa|up_2|CP024141.1_656824_658189_+	PRK15040, PRK15040, L-serine ammonia-lyase	NA|444aa|up_1|CP024141.1_658463_659795_+	TIGR00814, membrane_transport_protein_YhjV, serine transporter	NA|437aa|up_0|CP024141.1_659822_661133_+	COG3681, COG3681, L-cysteine desulfidase [Amino acid transport and metabolism]	NA|55aa|down_0|CP024141.1_661344_661509_-	NA	NA|234aa|down_1|CP024141.1_661531_662233_-	COG1741, COG1741, Pirin-related protein [General function prediction only]	NA|299aa|down_2|CP024141.1_662337_663234_+	cd08431, PBP2_HupR, The C-terminal substrate binding domain of LysR-type transcriptional regulator, HupR, which regulates expression of the heme uptake receptor HupA; contains the type 2 periplasmic binding fold	NA|119aa|down_3|CP024141.1_663284_663641_-	COG3152, COG3152, Predicted membrane protein [Function unknown]	NA|122aa|down_4|CP024141.1_663882_664248_-	COG3152, COG3152, Predicted membrane protein [Function unknown]	NA|329aa|down_5|CP024141.1_664540_665527_-	COG0435, ECM4, Predicted glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|161aa|down_6|CP024141.1_665596_666079_-	COG2259, COG2259, Predicted membrane protein [Function unknown]	NA|100aa|down_7|CP024141.1_666174_666474_-	pfam13997, YqjK, YqjK-like protein	NA|135aa|down_8|CP024141.1_666463_666868_-	COG5393, COG5393, Predicted membrane protein [Function unknown]	NA|102aa|down_9|CP024141.1_666870_667176_-	COG4575, ElaB, Uncharacterized conserved protein [Function unknown]
GCA_002853805.1_ASM285380v1	CP024141	Escherichia coli strain 14EC029 chromosome, complete genome	2	1053284-1053800	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no		WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	Orphan	GAGTTCCCCGCGCCAGCGGGGATAAACCG,GAGTTCCCCGCGCCAGCGGGGATAAACCG,GAGTTCCCCGCGCCAGCGGGGATAAACCG	29,29,29	0	0	NA	NA	I-E:I-E:I-E	8,8,8	8	Orphan	WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	NA|47aa|up_1|CP024141.1_1051993_1052134_-,NA	NA|434aa|up_9|CP024141.1_1042690_1043992_+	PRK13168, rumA, 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD	NA|745aa|up_8|CP024141.1_1044039_1046274_+	PRK10872, relA, (p)ppGpp synthetase I/GTP pyrophosphokinase; Provisional	NA|83aa|up_7|CP024141.1_1046351_1046600_+	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE	NA|112aa|up_6|CP024141.1_1046599_1046935_+	PRK09907, PRK09907, endoribonuclease MazF	NA|264aa|up_5|CP024141.1_1047005_1047797_+	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|546aa|up_4|CP024141.1_1048024_1049662_+	PRK05380, pyrG, CTP synthetase; Validated	NA|433aa|up_3|CP024141.1_1049749_1051048_+	PRK00077, eno, enolase; Provisional	NA|291aa|up_2|CP024141.1_1051107_1051980_-	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|47aa|up_1|CP024141.1_1051993_1052134_-	NA	NA|224aa|up_0|CP024141.1_1052272_1052944_+	TIGR04322, organic_radical_activating_enzyme, putative 7-cyano-7-deazaguanosine (preQ0) biosynthesis protein QueE	NA|493aa|down_0|CP024141.1_1054437_1055916_-	cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins	NA|426aa|down_1|CP024141.1_1055942_1057220_-	cd06174, MFS, Major Facilitator Superfamily	NA|262aa|down_2|CP024141.1_1057538_1058324_+	cd05347, Ga5DH-like_SDR_c, gluconate 5-dehydrogenase (Ga5DH)-like, classical (c) SDRs	NA|485aa|down_3|CP024141.1_1058393_1059848_+	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|446aa|down_4|CP024141.1_1059941_1061279_+	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|260aa|down_5|CP024141.1_1061256_1062036_+	COG2086, FixA, Electron transfer flavoprotein, beta subunit [Energy production and conversion]	NA|287aa|down_6|CP024141.1_1062032_1062893_+	COG2025, FixB, Electron transfer flavoprotein, alpha subunit [Energy production and conversion]	NA|192aa|down_7|CP024141.1_1063040_1063616_-	COG1954, GlpP, Glycerol-3-phosphate responsive antiterminator (mRNA-binding) [Transcription]	NA|87aa|down_8|CP024141.1_1063632_1063893_-	COG2440, FixX, Ferredoxin-like protein [Energy production and conversion]	NA|424aa|down_9|CP024141.1_1063883_1065155_-	PRK10015, PRK10015, oxidoreductase; Provisional
GCA_002853805.1_ASM285380v1	CP024141	Escherichia coli strain 14EC029 chromosome, complete genome	3	1080848-1081241	2,2,3	PILER-CR,CRT,CRISPRCasFinder	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	Type I-E	GTGTTCCCCGCGCCAGCGGGGATAAACCG,GTGTTCCCCGCGCCAGCGGGGATAAACC,GTTCCCCGCGCCAGCGGGGATAAACCG	29,28,27	0	0	NA	NA	I-E:I-E:I-E	5,6,5	6	TypeI-E	WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	NA,NA	NA|51aa|up_9|CP024141.1_1070497_1070650_+	pfam01848, HOK_GEF, Hok/gef family	NA|371aa|up_8|CP024141.1_1070726_1071839_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	cas3|900aa|up_7|CP024141.1_1072192_1074892_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas8e|521aa|up_6|CP024141.1_1074989_1076552_+	TIGR02547, CRISPR_system_Cascade_subunit_CasA, CRISPR type I-E/ECOLI-associated protein CasA/Cse1	cse2gr11|179aa|up_5|CP024141.1_1076548_1077085_+	TIGR02548, CRISPR_system_Cascade_subunit_CasB, CRISPR type I-E/ECOLI-associated protein CasB/Cse2	cas7|352aa|up_4|CP024141.1_1077096_1078152_+	TIGR01869, CRISPR_system_Cascade_subunit_CasC, CRISPR-associated protein Cas7/Cse4/CasC, subtype I-E/ECOLI	cas5|249aa|up_3|CP024141.1_1078162_1078909_+	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|217aa|up_2|CP024141.1_1078890_1079541_+	TIGR01907, CRISPR_system_Cascade_subunit_CasE, CRISPR-associated protein Cas6/Cse3/CasE, subtype I-E/ECOLI	cas1|308aa|up_1|CP024141.1_1079537_1080461_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|98aa|up_0|CP024141.1_1080457_1080751_+	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	NA|346aa|down_0|CP024141.1_1081262_1082300_-	PRK10199, PRK10199, alkaline phosphatase isozyme conversion aminopeptidase; Provisional	NA|303aa|down_1|CP024141.1_1082551_1083460_+	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|476aa|down_2|CP024141.1_1083461_1084889_+	PRK05124, cysN, sulfate adenylyltransferase subunit 1; Provisional	NA|202aa|down_3|CP024141.1_1084888_1085494_+	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|108aa|down_4|CP024141.1_1085543_1085867_+	pfam12084, DUF3561, Protein of unknown function (DUF3561)	NA|104aa|down_5|CP024141.1_1086060_1086372_+	PRK00888, ftsB, cell division protein FtsB; Reviewed	NA|237aa|down_6|CP024141.1_1086390_1087101_+	PRK00155, ispD, D-ribitol-5-phosphate cytidylyltransferase	NA|160aa|down_7|CP024141.1_1087100_1087580_+	PRK00084, ispF, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; Reviewed	NA|350aa|down_8|CP024141.1_1087576_1088626_+	PRK00984, truD, tRNA pseudouridine synthase D; Reviewed	NA|254aa|down_9|CP024141.1_1088606_1089368_+	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional
GCA_002853805.1_ASM285380v1	CP024141	Escherichia coli strain 14EC029 chromosome, complete genome	4	1629791-1629932	4	CRISPRCasFinder	no		WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	Orphan	CGCGTCTTATCAGGCCTACAAATCCGAGCCGTAGGCCGGATAAGGCGTTCACGC	54	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	NA|66aa|up_7|CP024141.1_1622765_1622963_-,NA|357aa|up_4|CP024141.1_1625671_1626742_-,NA	NA|420aa|up_9|CP024141.1_1619863_1621123_-	COG3075, GlpB, Anaerobic glycerol-3-phosphate dehydrogenase [Amino acid transport and metabolism]	NA|543aa|up_8|CP024141.1_1621112_1622741_-	PRK11101, glpA, anaerobic glycerol-3-phosphate dehydrogenase subunit A	NA|66aa|up_7|CP024141.1_1622765_1622963_-	NA	NA|453aa|up_6|CP024141.1_1623013_1624372_+	PRK11273, glpT, glycerol-3-phosphate transporter	NA|359aa|up_5|CP024141.1_1624376_1625453_+	PRK11143, glpQ, glycerophosphodiester phosphodiesterase; Provisional	NA|357aa|up_4|CP024141.1_1625671_1626742_-	NA	NA|69aa|up_3|CP024141.1_1627207_1627414_-	PRK09729, PRK09729, hypothetical protein; Provisional	NA|217aa|up_2|CP024141.1_1627628_1628279_+	PRK09902, PRK09902, lipopolysaccharide kinase InaA	NA|85aa|up_1|CP024141.1_1628332_1628587_-	PRK10713, PRK10713, 2Fe-2S ferredoxin-like protein	NA|377aa|up_0|CP024141.1_1628586_1629717_-	PRK09101, nrdB, ribonucleotide-diphosphate reductase subunit beta; Reviewed	NA|762aa|down_0|CP024141.1_1629951_1632237_-	PRK09103, PRK09103, ribonucleoside-diphosphate reductase subunit alpha	NA|1251aa|down_1|CP024141.1_1632932_1636685_+	PRK09752, PRK09752, AIDA-I family autotransporter YfaL	NA|241aa|down_2|CP024141.1_1636812_1637535_-	PRK05134, PRK05134, bifunctional 2-polyprenyl-6-hydroxyphenol methylase/3-demethylubiquinol 3-O-methyltransferase UbiG	NA|876aa|down_3|CP024141.1_1637681_1640309_+	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|563aa|down_4|CP024141.1_1640457_1642146_+	COG4685, COG4685, Uncharacterized protein conserved in bacteria [Function unknown]	NA|208aa|down_5|CP024141.1_1642142_1642766_+	COG3234, COG3234, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1535aa|down_6|CP024141.1_1642699_1647304_+	COG2373, COG2373, Large extracellular alpha-helical protein [General function prediction only]	NA|550aa|down_7|CP024141.1_1647304_1648954_+	COG5445, COG5445, Predicted secreted protein [Function unknown]	NA|259aa|down_8|CP024141.1_1648958_1649735_+	COG4676, COG4676, Uncharacterized protein conserved in bacteria [Function unknown]	NA|395aa|down_9|CP024141.1_1649808_1650993_-	PRK05790, PRK05790, putative acyltransferase; Provisional
GCA_002853805.1_ASM285380v1	CP024141	Escherichia coli strain 14EC029 chromosome, complete genome	5	1856242-1856377	5	CRISPRCasFinder	no		WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	Orphan	CTTGTAGGCCTGATAAGACGCGCCAGCGTCGCATCAGGCA	40	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	NA,NA|402aa|down_8|CP024141.1_1865844_1867050_+	NA|249aa|up_9|CP024141.1_1845104_1845851_+	PRK10063, PRK10063, colanic acid biosynthesis glycosyltransferase WcaE	NA|183aa|up_8|CP024141.1_1845866_1846415_+	TIGR04008, WcaF, colanic acid biosynthesis acetyltransferase WcaF	NA|374aa|up_7|CP024141.1_1846440_1847562_+	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|322aa|up_6|CP024141.1_1847564_1848530_+	cd05239, GDP_FS_SDR_e, GDP-fucose synthetase, extended (e) SDRs	NA|160aa|up_5|CP024141.1_1848532_1849012_+	PRK15434, PRK15434, GDP-mannose mannosyl hydrolase	NA|408aa|up_4|CP024141.1_1849008_1850232_+	TIGR04007, wcaI, colanic acid biosynthesis glycosyl transferase WcaI	NA|479aa|up_3|CP024141.1_1850234_1851671_+	PRK15460, cpsB, mannose-1-phosphate guanyltransferase; Provisional	NA|457aa|up_2|CP024141.1_1851863_1853234_+	PRK15414, PRK15414, phosphomannomutase	NA|465aa|up_1|CP024141.1_1853288_1854683_+	PRK10124, PRK10124, putative UDP-glucose lipid carrier transferase; Provisional	NA|493aa|up_0|CP024141.1_1854684_1856163_+	PRK10459, PRK10459, MOP flippase family protein	NA|427aa|down_0|CP024141.1_1856437_1857718_+	TIGR04006, wcaK, colanic acid biosynthesis pyruvyl transferase WcaK	NA|407aa|down_1|CP024141.1_1857714_1858935_+	TIGR04005, wcaL, colanic acid biosynthesis glycosyltransferase WcaL	NA|465aa|down_2|CP024141.1_1858945_1860340_+	PRK10123, wcaM, putative colanic acid biosynthesis protein; Provisional	NA|298aa|down_3|CP024141.1_1860514_1861408_+	PRK10122, PRK10122, UTP--glucose-1-phosphate uridylyltransferase GalF	NA|207aa|down_4|CP024141.1_1861762_1862383_+	TIGR03570, NeuD_NnaD, sugar O-acyltransferase, sialic acid O-acetyltransferase NeuD family	NA|347aa|down_5|CP024141.1_1862382_1863423_+	TIGR03569, ORF_8_similar_to_NeuB_family, N-acetylneuraminate synthase	NA|420aa|down_6|CP024141.1_1863422_1864682_+	cd02513, CMP-NeuAc_Synthase, CMP-NeuAc_Synthase activates N-acetylneuraminic acid by adding CMP moiety	NA|385aa|down_7|CP024141.1_1864678_1865833_+	TIGR03568, Polysialic_acid_biosynthesis_protein_P7, UDP-N-acetyl-D-glucosamine 2-epimerase, UDP-hydrolysing	NA|402aa|down_8|CP024141.1_1865844_1867050_+	NA	NA|416aa|down_9|CP024141.1_1867046_1868294_+	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins
GCA_002853805.1_ASM285380v1	CP024141	Escherichia coli strain 14EC029 chromosome, complete genome	6	2305030-2305153	6	CRISPRCasFinder	no	DEDDh	WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	Unclear	CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA	43	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	NA,NA|75aa|down_7|CP024141.1_2313738_2313963_-,NA|30aa|down_8|CP024141.1_2314049_2314139_+	NA|79aa|up_9|CP024141.1_2293937_2294174_-	PRK15396, PRK15396, major outer membrane lipoprotein	NA|471aa|up_8|CP024141.1_2294484_2295897_-	PRK09206, PRK09206, pyruvate kinase PykF	NA|70aa|up_7|CP024141.1_2296453_2296663_+	PRK10292, PRK10292, fumarate hydratase FumD	NA|209aa|up_6|CP024141.1_2297118_2297745_+	PRK09898, PRK09898, ferredoxin-like protein	NA|701aa|up_5|CP024141.1_2297765_2299868_+	PRK09849, PRK09849, putative oxidoreductase; Provisional	NA|216aa|up_4|CP024141.1_2299871_2300519_+	PRK09947, PRK09947, YdhW family putative oxidoreductase system protein	NA|223aa|up_3|CP024141.1_2300582_2301251_+	TIGR03149, cyt_nit_nrfC, cytochrome c nitrite reductase, Fe-S protein	NA|271aa|up_2|CP024141.1_2302035_2302848_+	PRK09946, PRK09946, hypothetical protein; Provisional	NA|535aa|up_1|CP024141.1_2302859_2304464_-	PRK09897, PRK09897, FAD-NAD(P)-binding protein	NA|102aa|up_0|CP024141.1_2304589_2304895_-	PRK11118, PRK11118, putative monooxygenase; Provisional	NA|419aa|down_0|CP024141.1_2305467_2306724_+	PRK09945, PRK09945, hypothetical protein; Provisional	NA|458aa|down_1|CP024141.1_2306764_2308138_-	PRK01766, PRK01766, multidrug efflux protein; Reviewed	NA|214aa|down_2|CP024141.1_2308352_2308994_+	PRK13020, PRK13020, riboflavin synthase subunit alpha; Provisional	NA|383aa|down_3|CP024141.1_2309033_2310182_-	PRK11705, PRK11705, cyclopropane fatty acyl phospholipid synthase	NA|404aa|down_4|CP024141.1_2310472_2311684_-	PRK11043, PRK11043, Bcr/CflA family multidrug efflux MFS transporter	NA|311aa|down_5|CP024141.1_2311796_2312729_+	PRK11074, PRK11074, putative DNA-binding transcriptional regulator; Provisional	NA|342aa|down_6|CP024141.1_2312725_2313751_-	PRK10703, PRK10703, HTH-type transcriptional repressor PurR	NA|75aa|down_7|CP024141.1_2313738_2313963_-	NA	NA|30aa|down_8|CP024141.1_2314049_2314139_+	NA	NA|390aa|down_9|CP024141.1_2314304_2315474_+	COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport and metabolism]
GCA_002853805.1_ASM285380v1	CP024141	Escherichia coli strain 14EC029 chromosome, complete genome	8	3362723-3362867	8	CRISPRCasFinder	no		WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	Orphan	GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC	52	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	NA|60aa|up_1|CP024141.1_3361136_3361316_+,NA|70aa|down_9|CP024141.1_3373587_3373797_-	NA|303aa|up_9|CP024141.1_3352007_3352916_+	TIGR01826, Putative_gluconeogenesis_factor, conserved hypothetical protein, cofD-related	NA|674aa|up_8|CP024141.1_3353107_3355129_-	PRK05298, PRK05298, excinuclease ABC subunit UvrB	NA|226aa|up_7|CP024141.1_3355707_3356385_-	PRK00090, bioD, ATP-dependent dethiobiotin synthetase BioD	NA|252aa|up_6|CP024141.1_3356377_3357133_-	PRK10258, PRK10258, biotin biosynthesis protein BioC; Provisional	NA|385aa|up_5|CP024141.1_3357119_3358274_-	PRK05958, PRK05958, 8-amino-7-oxononanoate synthase; Reviewed	NA|347aa|up_4|CP024141.1_3358270_3359311_-	PRK15108, PRK15108, biotin synthase; Provisional	NA|430aa|up_3|CP024141.1_3359397_3360687_+	PRK07986, PRK07986, adenosylmethionine--8-amino-7-oxononanoate transaminase; Validated	NA|159aa|up_2|CP024141.1_3360745_3361222_+	PRK10257, PRK10257, putative kinase inhibitor protein; Provisional	NA|60aa|up_1|CP024141.1_3361136_3361316_+	NA	NA|428aa|up_0|CP024141.1_3361373_3362657_+	PRK10531, PRK10531, putative acyl-CoA thioester hydrolase	NA|754aa|down_0|CP024141.1_3362890_3365152_-	PRK11413, PRK11413, putative hydratase; Provisional	NA|478aa|down_1|CP024141.1_3365334_3366768_-	pfam00939, Na_sulph_symp, Sodium:sulfate symporter transmembrane region	NA|351aa|down_2|CP024141.1_3366843_3367896_-	NF033377, OMA_tautomer, 4-oxalomesaconate tautomerase	NA|318aa|down_3|CP024141.1_3368079_3369033_+	cd08440, PBP2_LTTR_like_4, TThe C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold	NA|332aa|down_4|CP024141.1_3369073_3370069_-	PRK11028, PRK11028, 6-phosphogluconolactonase; Provisional	NA|273aa|down_5|CP024141.1_3370223_3371042_+	PRK10530, PRK10530, pyridoxal phosphate (PLP) phosphatase; Provisional	NA|353aa|down_6|CP024141.1_3371042_3372101_-	PRK11144, modC, molybdenum ABC transporter ATP-binding protein ModC	NA|230aa|down_7|CP024141.1_3372103_3372793_-	PRK09421, modB, molybdate ABC transporter permease subunit	NA|258aa|down_8|CP024141.1_3372792_3373566_-	PRK10677, modA, molybdate transporter periplasmic protein; Provisional	NA|70aa|down_9|CP024141.1_3373587_3373797_-	NA
GCA_002853805.1_ASM285380v1	CP024141	Escherichia coli strain 14EC029 chromosome, complete genome	9	3859616-3859712	9	CRISPRCasFinder	no		WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	Orphan	TAGTTCACAGTGACCTACCCCCTAGCTCAC	30	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	NA|68aa|up_9|CP024141.1_3853845_3854049_-,NA|121aa|up_7|CP024141.1_3855182_3855545_-,NA|71aa|up_6|CP024141.1_3855541_3855754_-,NA|112aa|up_5|CP024141.1_3856097_3856433_-,NA|137aa|up_4|CP024141.1_3856434_3856845_-,NA|92aa|up_3|CP024141.1_3856974_3857250_-,NA|63aa|up_1|CP024141.1_3858192_3858381_-,NA|80aa|up_0|CP024141.1_3858912_3859152_-,NA|79aa|down_0|CP024141.1_3860277_3860514_-,NA|118aa|down_1|CP024141.1_3861384_3861738_+	NA|68aa|up_9|CP024141.1_3853845_3854049_-	NA	NA|343aa|up_8|CP024141.1_3854135_3855164_-	pfam03864, Phage_cap_E, Phage major capsid protein E	NA|121aa|up_7|CP024141.1_3855182_3855545_-	NA	NA|71aa|up_6|CP024141.1_3855541_3855754_-	NA	NA|112aa|up_5|CP024141.1_3856097_3856433_-	NA	NA|137aa|up_4|CP024141.1_3856434_3856845_-	NA	NA|92aa|up_3|CP024141.1_3856974_3857250_-	NA	NA|231aa|up_2|CP024141.1_3857261_3857954_-	pfam13730, HTH_36, Helix-turn-helix domain	NA|63aa|up_1|CP024141.1_3858192_3858381_-	NA	NA|80aa|up_0|CP024141.1_3858912_3859152_-	NA	NA|79aa|down_0|CP024141.1_3860277_3860514_-	NA	NA|118aa|down_1|CP024141.1_3861384_3861738_+	NA	NA|418aa|down_2|CP024141.1_3862011_3863265_-	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|368aa|down_3|CP024141.1_3863276_3864380_-	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional	NA|352aa|down_4|CP024141.1_3864667_3865723_+	PRK10159, PRK10159, phosphoporin PhoE	NA|134aa|down_5|CP024141.1_3865761_3866163_-	PRK10984, PRK10984, sigma factor-binding protein Crl	NA|415aa|down_6|CP024141.1_3866220_3867465_-	PRK05077, frsA, esterase FrsA	NA|153aa|down_7|CP024141.1_3867556_3868015_-	PRK09177, PRK09177, xanthine-guanine phosphoribosyltransferase; Validated	NA|486aa|down_8|CP024141.1_3868275_3869733_+	PRK15026, PRK15026, aminoacyl-histidine dipeptidase; Provisional	NA|89aa|down_9|CP024141.1_3870089_3870356_-	PRK09588, PRK09588, hypothetical protein; Reviewed
GCA_002853805.1_ASM285380v1	CP024141	Escherichia coli strain 14EC029 chromosome, complete genome	10	3874842-3874995	10	CRISPRCasFinder	no		WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	Orphan	CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCG	53	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	NA|76aa|up_4|CP024141.1_3870342_3870570_+,NA|150aa|down_8|CP024141.1_3884572_3885022_-	NA|134aa|up_9|CP024141.1_3865761_3866163_-	PRK10984, PRK10984, sigma factor-binding protein Crl	NA|415aa|up_8|CP024141.1_3866220_3867465_-	PRK05077, frsA, esterase FrsA	NA|153aa|up_7|CP024141.1_3867556_3868015_-	PRK09177, PRK09177, xanthine-guanine phosphoribosyltransferase; Validated	NA|486aa|up_6|CP024141.1_3868275_3869733_+	PRK15026, PRK15026, aminoacyl-histidine dipeptidase; Provisional	NA|89aa|up_5|CP024141.1_3870089_3870356_-	PRK09588, PRK09588, hypothetical protein; Reviewed	NA|76aa|up_4|CP024141.1_3870342_3870570_+	NA	NA|151aa|up_3|CP024141.1_3870662_3871115_-	PRK09831, PRK09831, GNAT family N-acetyltransferase	NA|352aa|up_2|CP024141.1_3871111_3872167_-	PRK02406, PRK02406, DNA polymerase IV; Validated	NA|263aa|up_1|CP024141.1_3872237_3873026_-	PRK06778, PRK06778, hypothetical protein; Validated	NA|580aa|up_0|CP024141.1_3872967_3874707_+	COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell motility and secretion / Intracellular trafficking and secretion]	NA|166aa|down_0|CP024141.1_3875024_3875522_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|253aa|down_1|CP024141.1_3875697_3876456_-	COG0791, Spr, Cell wall-associated hydrolases (invasion-associated proteins) [Cell envelope biogenesis, outer membrane]	NA|247aa|down_2|CP024141.1_3876747_3877488_+	COG3034, COG3034, Uncharacterized protein conserved in bacteria [Function unknown]	NA|256aa|down_3|CP024141.1_3877458_3878226_-	pfam13230, GATase_4, Glutamine amidotransferases class-II	NA|193aa|down_4|CP024141.1_3878431_3879010_-	PRK00414, gmhA, D-sedoheptulose 7-phosphate isomerase	NA|815aa|down_5|CP024141.1_3879249_3881694_+	PRK09463, fadE, acyl-CoA dehydrogenase; Reviewed	NA|158aa|down_6|CP024141.1_3881736_3882210_-	PRK09993, PRK09993, C-lysozyme inhibitor; Provisional	NA|257aa|down_7|CP024141.1_3882363_3883134_+	PRK10438, PRK10438, C-N hydrolase family amidase; Provisional	NA|150aa|down_8|CP024141.1_3884572_3885022_-	NA	NA|1418aa|down_9|CP024141.1_3885033_3889287_-	COG3209, RhsA, Rhs family protein [Cell envelope biogenesis, outer membrane]
GCA_002853805.1_ASM285380v1	CP024141	Escherichia coli strain 14EC029 chromosome, complete genome	11	4084713-4084828	11	CRISPRCasFinder	no		WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	Orphan	AACGCCTGATGCGACGCTGACGCGTCTTATC	31	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	NA,NA	NA|393aa|up_9|CP024141.1_4072728_4073907_-	TIGR00899, Sugar_efflux_transporter_A, sugar efflux transporter	NA|44aa|up_8|CP024141.1_4074008_4074140_-	pfam15894, SgrT, Inhibitor of glucose uptake transporter SgrT	NA|552aa|up_7|CP024141.1_4074228_4075884_+	PRK13626, PRK13626, HTH-type transcriptional regulator SgrR	NA|328aa|up_6|CP024141.1_4076047_4077031_+	PRK11205, tbpA, thiamine transporter substrate binding subunit; Provisional	NA|537aa|up_5|CP024141.1_4077006_4078617_+	PRK09433, thiP, thiamine transporter membrane protein; Reviewed	NA|233aa|up_4|CP024141.1_4078600_4079299_+	PRK10771, thiQ, thiamine ABC transporter ATP-binding protein ThiQ	NA|255aa|up_3|CP024141.1_4079412_4080177_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|293aa|up_2|CP024141.1_4080262_4081141_-	PRK10572, PRK10572, arabinose operon transcriptional regulator AraC	NA|567aa|up_1|CP024141.1_4081479_4083180_+	PRK04123, PRK04123, ribulokinase; Provisional	NA|501aa|up_0|CP024141.1_4083190_4084693_+	PRK02929, PRK02929, L-arabinose isomerase; Provisional	NA|232aa|down_0|CP024141.1_4084892_4085588_+	PRK08193, araD, L-ribulose-5-phosphate 4-epimerase AraD	NA|784aa|down_1|CP024141.1_4085662_4088014_+	PRK05762, PRK05762, DNA polymerase II; Reviewed	NA|969aa|down_2|CP024141.1_4088178_4091085_+	PRK04914, PRK04914, RNA polymerase-associated protein RapA	NA|220aa|down_3|CP024141.1_4091096_4091756_+	PRK10158, PRK10158, bifunctional tRNA pseudouridine(32) synthase/23S rRNA pseudouridine(746) synthase RluA	NA|272aa|down_4|CP024141.1_4091872_4092688_-	PRK09430, djlA, co-chaperone DjlA	NA|785aa|down_5|CP024141.1_4092942_4095297_+	PRK03761, PRK03761, LPS assembly outer membrane complex protein LptD; Provisional	NA|429aa|down_6|CP024141.1_4095349_4096636_+	PRK10770, PRK10770, peptidyl-prolyl cis-trans isomerase SurA; Provisional	NA|330aa|down_7|CP024141.1_4096635_4097625_+	PRK00232, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase; Reviewed	NA|274aa|down_8|CP024141.1_4097621_4098443_+	PRK00274, ksgA, 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))-dimethyltransferase RsmA	NA|126aa|down_9|CP024141.1_4098445_4098823_+	PRK05461, apaG, CO2+/MG2+ efflux protein ApaG; Reviewed
GCA_002853805.1_ASM285380v1	CP024141	Escherichia coli strain 14EC029 chromosome, complete genome	12	4107741-4107873	3	PILER-CR	no		WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	Orphan	ATCACCAATATTGAAAA	17	0	0	NA	NA	NA	2	2	Orphan	WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	NA,NA|61aa|down_7|CP024141.1_4115789_4115972_+	NA|126aa|up_9|CP024141.1_4098445_4098823_+	PRK05461, apaG, CO2+/MG2+ efflux protein ApaG; Reviewed	NA|281aa|up_8|CP024141.1_4098829_4099672_+	TIGR00668, Bis5'-nucleosyl-tetraphosphatase_symmetrical, bis(5'-nucleosyl)-tetraphosphatase (symmetrical)	NA|160aa|up_7|CP024141.1_4099749_4100229_-	PRK10769, folA, type 3 dihydrofolate reductase	NA|621aa|up_6|CP024141.1_4100420_4102283_-	PRK03562, PRK03562, glutathione-regulated potassium-efflux system protein KefC; Provisional	NA|177aa|up_5|CP024141.1_4102275_4102806_-	PRK00871, PRK00871, glutathione-regulated potassium-efflux system oxidoreductase KefF	NA|444aa|up_4|CP024141.1_4102913_4104245_-	cd17316, MFS_SV2_like, Metazoan Synaptic vesicle glycoprotein 2 (SV2) and related small molecule transporters of the Major Facilitator Superfamily	NA|96aa|up_3|CP024141.1_4104302_4104590_-	PRK15449, PRK15449, ferredoxin-like protein FixX; Provisional	NA|429aa|up_2|CP024141.1_4104586_4105873_-	PRK10157, PRK10157, putative oxidoreductase FixC; Provisional	NA|314aa|up_1|CP024141.1_4105923_4106865_-	PRK03363, fixB, electron transfer flavoprotein subunit alpha/FixB family protein	NA|257aa|up_0|CP024141.1_4106879_4107650_-	PRK03359, PRK03359, putative electron transfer flavoprotein FixA; Reviewed	NA|505aa|down_0|CP024141.1_4108123_4109638_+	PRK03356, PRK03356, L-carnitine/gamma-butyrobetaine antiport BCCT transporter	NA|381aa|down_1|CP024141.1_4109668_4110811_+	PRK03354, PRK03354, crotonobetainyl-CoA dehydrogenase; Validated	NA|406aa|down_2|CP024141.1_4110939_4112157_+	PRK03525, PRK03525, L-carnitine CoA-transferase	NA|518aa|down_3|CP024141.1_4112230_4113784_+	PRK08008, caiC, putative crotonobetaine/carnitine-CoA ligase; Validated	NA|262aa|down_4|CP024141.1_4113892_4114678_+	PRK03580, PRK03580, crotonobetainyl-CoA hydratase	NA|197aa|down_5|CP024141.1_4114683_4115274_+	PRK13627, PRK13627, carnitine operon protein CaiE; Provisional	NA|132aa|down_6|CP024141.1_4115359_4115755_-	PRK11476, PRK11476, carnitine metabolism transcriptional regulator CaiF	NA|61aa|down_7|CP024141.1_4115789_4115972_+	NA	NA|1074aa|down_8|CP024141.1_4116015_4119237_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|383aa|down_9|CP024141.1_4119254_4120403_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit
GCA_002853805.1_ASM285380v1	CP024141	Escherichia coli strain 14EC029 chromosome, complete genome	13	4353233-4353382	12	CRISPRCasFinder	no		WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	Orphan	TGAACGCCTTATCCGACCTACACAGCACTGAACTCGTAGGCCTGATAAGACGCG	54	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,RT,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,DinG,c2c9_V-U4	NA,NA	NA|340aa|up_9|CP024141.1_4341425_4342445_+	cd05283, CAD1, Cinnamyl alcohol dehydrogenases (CAD)	NA|188aa|up_8|CP024141.1_4342448_4343012_-	PRK09825, idnK, gluconokinase	NA|344aa|up_7|CP024141.1_4343228_4344260_+	PRK09880, PRK09880, L-idonate 5-dehydrogenase; Provisional	NA|255aa|up_6|CP024141.1_4344283_4345048_+	PRK08085, PRK08085, gluconate 5-dehydrogenase; Provisional	NA|440aa|up_5|CP024141.1_4345110_4346430_+	TIGR00791, Gluconate_permease, gluconate transporter	NA|333aa|up_4|CP024141.1_4346496_4347495_+	cd01575, PBP1_GntR, ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators	NA|501aa|up_3|CP024141.1_4347572_4349075_+	pfam05872, DUF853, Bacterial protein of unknown function (DUF853)	NA|361aa|up_2|CP024141.1_4349235_4350318_-	PRK15071, PRK15071, lipopolysaccharide ABC transporter permease; Provisional	NA|367aa|up_1|CP024141.1_4350317_4351418_-	PRK15120, PRK15120, lipopolysaccharide ABC transporter permease LptF; Provisional	NA|504aa|up_0|CP024141.1_4351684_4353196_+	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|148aa|down_0|CP024141.1_4353549_4353993_+	PRK05728, PRK05728, DNA polymerase III subunit chi; Validated	NA|952aa|down_1|CP024141.1_4353992_4356848_+	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|399aa|down_2|CP024141.1_4356901_4358098_-	COG4269, COG4269, Predicted membrane protein [Function unknown]	NA|168aa|down_3|CP024141.1_4358290_4358794_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|139aa|down_4|CP024141.1_4358839_4359256_-	PRK11191, PRK11191, ribonuclease E inhibitor RraB	NA|338aa|down_5|CP024141.1_4359417_4360431_+	PRK03515, PRK03515, ornithine carbamoyltransferase subunit I; Provisional	NA|151aa|down_6|CP024141.1_4362305_4362758_-	COG2731, EbgC, Beta-galactosidase, beta subunit [Carbohydrate transport and metabolism]	NA|198aa|down_7|CP024141.1_4362902_4363496_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|238aa|down_8|CP024141.1_4363566_4364280_+	PRK12742, PRK12742, SDR family oxidoreductase	NA|132aa|down_9|CP024141.1_4364410_4364806_+	cd02198, YjgH_like, YjgH belongs to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function
