assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_008375335.1_ASM837533v1	NZ_CP041627	Escherichia coli strain ETEC6 chromosome, complete genome	1	358324-358470	1	CRISPRCasFinder	no		DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	Orphan	TCCGGCCTACGGATGGCGCGAGAATTTGTAGGCCTGATAAGACGCG	46	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	NA,NA	NA|418aa|up_9|NZ_CP041627.1_345295_346549_-	PRK09528, lacY, galactoside permease; Reviewed	NA|1025aa|up_8|NZ_CP041627.1_346600_349675_-	PRK09525, lacZ, beta-galactosidase	NA|361aa|up_7|NZ_CP041627.1_349797_350880_-	PRK09526, lacI, lac repressor; Reviewed	NA|316aa|up_6|NZ_CP041627.1_350956_351904_-	PRK09834, PRK09834, DNA-binding transcriptional regulator	NA|555aa|up_5|NZ_CP041627.1_351980_353645_+	PRK06183, mhpA, bifunctional 3-(3-hydroxy-phenyl)propionate/3-hydroxycinnamic acid hydroxylase	NA|315aa|up_4|NZ_CP041627.1_353646_354591_+	cd07365, MhpB_like, Subunit B of the Class III Extradiol ring-cleavage dioxygenase, 2,3-dihydroxyphenylpropionate 1,2-dioxygenase (MhpB), which catalyzes the oxidization and subsequent ring-opening of 2,3-dihydroxyphenylpropionate	NA|289aa|up_3|NZ_CP041627.1_354608_355475_+	TIGR03343, 2-hydroxy-6-oxo-6-phenylhexa-24-dienoate_hydrolase, 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase	NA|270aa|up_2|NZ_CP041627.1_355484_356294_+	PRK11342, mhpD, 2-keto-4-pentenoate hydratase; Provisional	NA|317aa|up_1|NZ_CP041627.1_356290_357241_+	PRK08300, PRK08300, acetaldehyde dehydrogenase; Validated	NA|338aa|up_0|NZ_CP041627.1_357237_358251_+	PRK08195, PRK08195, 4-hyroxy-2-oxovalerate/4-hydroxy-2-oxopentanoic acid aldolase,; Validated	NA|404aa|down_0|NZ_CP041627.1_358628_359840_+	PRK11551, PRK11551, putative 3-hydroxyphenylpropionic transporter MhpT; Provisional	NA|180aa|down_1|NZ_CP041627.1_359941_360481_+	COG3122, COG3122, Uncharacterized protein conserved in bacteria [Function unknown]	NA|278aa|down_2|NZ_CP041627.1_360706_361540_-	TIGR02821, S-formylglutathione_hydrolase, S-formylglutathione hydrolase	NA|370aa|down_3|NZ_CP041627.1_361632_362742_-	cd08300, alcohol_DH_class_III, class III alcohol dehydrogenases	NA|92aa|down_4|NZ_CP041627.1_362776_363052_-	PRK11352, PRK11352, formaldehyde-responsive transcriptional repressor FrmR	NA|258aa|down_5|NZ_CP041627.1_363240_364014_-	TIGR04390, hypothetical_protein, outer membrane protein, YaiO family	NA|194aa|down_6|NZ_CP041627.1_364015_364597_-	cd05636, LbH_G1P_TT_C_like, Putative glucose-1-phosphate thymidylyltransferase, C-terminal Left-handed parallel beta-Helix (LbH) domain: Proteins in this family show simlarity to glucose-1-phosphate adenylyltransferases in that they contain N-terminal catalytic domains that resemble a dinucleotide-binding Rossmann fold and C-terminal LbH fold domains	NA|399aa|down_7|NZ_CP041627.1_364574_365771_-	COG1215, COG1215, Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]	NA|224aa|down_8|NZ_CP041627.1_365780_366452_-	COG2120, COG2120, Uncharacterized proteins, LmbE homologs [Function unknown]	NA|321aa|down_9|NZ_CP041627.1_367067_368030_+	PRK11480, tauA, taurine transporter substrate binding subunit; Provisional
GCF_008375335.1_ASM837533v1	NZ_CP041627	Escherichia coli strain ETEC6 chromosome, complete genome	2	391367-391511	2	CRISPRCasFinder	no		DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	Orphan	ATGCCTGATGCGACGCTTGCCGCGTCTTATCAGGCCTACAAAA	43	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	NA|28aa|up_2|NZ_CP041627.1_389237_389321_+,NA	NA|372aa|up_9|NZ_CP041627.1_383993_385109_+	PRK10245, adrA, diguanylate cyclase AdrA; Provisional	NA|270aa|up_8|NZ_CP041627.1_385125_385935_-	PRK11880, PRK11880, pyrroline-5-carboxylate reductase; Reviewed	NA|155aa|up_7|NZ_CP041627.1_386054_386519_+	PRK00124, PRK00124, YaiI/YqxD family protein	NA|175aa|up_6|NZ_CP041627.1_386695_387220_+	PRK03731, aroL, shikimate kinase AroL	NA|64aa|up_5|NZ_CP041627.1_387269_387461_+	PRK10380, PRK10380, hypothetical protein; Provisional	NA|226aa|up_4|NZ_CP041627.1_387718_388396_+	PRK10481, PRK10481, hypothetical protein; Provisional	NA|95aa|up_3|NZ_CP041627.1_388467_388752_+	PRK10579, PRK10579, pyrimidine/purine nucleoside phosphorylase	NA|28aa|up_2|NZ_CP041627.1_389237_389321_+	NA	NA|304aa|up_1|NZ_CP041627.1_389398_390310_-	COG2974, RdgC, DNA recombination-dependent growth factor C [DNA replication, recombination, and repair]	NA|303aa|up_0|NZ_CP041627.1_390434_391343_+	PRK09557, PRK09557, fructokinase; Reviewed	NA|395aa|down_0|NZ_CP041627.1_391587_392772_-	PRK10091, PRK10091, MFS transport protein AraJ; Provisional	NA|1049aa|down_1|NZ_CP041627.1_392897_396044_-	PRK10246, PRK10246, exonuclease subunit SbcC; Provisional	NA|401aa|down_2|NZ_CP041627.1_396040_397243_-	PRK10966, PRK10966, exonuclease subunit SbcD; Provisional	NA|230aa|down_3|NZ_CP041627.1_397432_398122_+	PRK10161, PRK10161, phosphate response regulator transcription factor PhoB	NA|432aa|down_4|NZ_CP041627.1_398179_399475_+	PRK11006, phoR, phosphate regulon sensor histidine kinase PhoR	NA|440aa|down_5|NZ_CP041627.1_399881_401201_+	PRK15433, PRK15433, branched-chain amino acid transporter carrier protein BrnQ	NA|458aa|down_6|NZ_CP041627.1_401276_402650_+	PRK10580, proY, putative proline-specific permease; Provisional	NA|606aa|down_7|NZ_CP041627.1_402805_404623_+	PRK10785, PRK10785, maltodextrin glucosidase; Provisional	NA|194aa|down_8|NZ_CP041627.1_404627_405209_-	PRK10045, PRK10045, ACP phosphodiesterase	NA|357aa|down_9|NZ_CP041627.1_405301_406372_+	PRK00147, queA, S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional
GCF_008375335.1_ASM837533v1	NZ_CP041627	Escherichia coli strain ETEC6 chromosome, complete genome	3	534364-534460	3	CRISPRCasFinder	no		DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	Orphan	GCTTGACGCGTCTTATCAGGCCTACAA	27	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	NA,NA	NA|454aa|up_9|NZ_CP041627.1_522144_523506_+	PRK08044, PRK08044, allantoinase AllB	NA|434aa|up_8|NZ_CP041627.1_523562_524864_+	PRK11412, PRK11412, uracil/xanthine transporter	NA|382aa|up_7|NZ_CP041627.1_524885_526031_+	PRK09932, PRK09932, glycerate 3-kinase	NA|262aa|up_6|NZ_CP041627.1_526258_527044_-	COG3257, GlxB, Uncharacterized protein, possibly involved in glyoxylate utilization [General function prediction only]	NA|412aa|up_5|NZ_CP041627.1_527054_528290_-	TIGR03176, AllC, allantoate amidohydrolase	NA|350aa|up_4|NZ_CP041627.1_528311_529361_-	PRK15025, PRK15025, ureidoglycolate dehydrogenase; Provisional	NA|556aa|up_3|NZ_CP041627.1_529677_531345_+	PRK06091, PRK06091, membrane protein FdrA; Validated	NA|420aa|up_2|NZ_CP041627.1_531354_532614_+	pfam06545, DUF1116, Protein of unknown function (DUF1116)	NA|272aa|up_1|NZ_CP041627.1_532624_533440_+	pfam11392, DUF2877, Protein of unknown function (DUF2877)	NA|298aa|up_0|NZ_CP041627.1_533436_534330_+	PRK09411, PRK09411, carbamate kinase; Reviewed	NA|356aa|down_0|NZ_CP041627.1_534524_535592_-	PRK06019, PRK06019, phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed	NA|170aa|down_1|NZ_CP041627.1_535588_536098_-	COG0041, PurE, Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase [Nucleotide transport and metabolism]	NA|241aa|down_2|NZ_CP041627.1_536215_536938_-	PRK05340, PRK05340, UDP-2,3-diacylglucosamine hydrolase; Provisional	NA|165aa|down_3|NZ_CP041627.1_536940_537435_-	PRK10791, PRK10791, peptidylprolyl isomerase B	NA|462aa|down_4|NZ_CP041627.1_537608_538994_+	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|174aa|down_5|NZ_CP041627.1_539029_539551_-	COG1988, COG1988, Predicted membrane-bound metal-dependent hydrolases [General function prediction only]	NA|71aa|down_6|NZ_CP041627.1_539658_539871_-	PRK11507, PRK11507, ribosome-associated protein YbcJ	NA|289aa|down_7|NZ_CP041627.1_539872_540739_-	PRK10792, PRK10792, bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase FolD	NA|181aa|down_8|NZ_CP041627.1_541209_541752_+	PRK15194, PRK15194, type 1 fimbrial protein subunit FimA	NA|870aa|down_9|NZ_CP041627.1_542693_545303_+	PRK15198, PRK15198, outer membrane usher protein FimD
GCF_008375335.1_ASM837533v1	NZ_CP041627	Escherichia coli strain ETEC6 chromosome, complete genome	4	2257261-2257380	4	CRISPRCasFinder	no	DEDDh	DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	Unclear	ATGCTGCCAACTTACTGATTTAGTGTATGATGGTGTTTT	39	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	NA,NA	NA|300aa|up_9|NZ_CP041627.1_2247677_2248577_+	PRK00489, hisG, ATP phosphoribosyltransferase; Reviewed	NA|435aa|up_8|NZ_CP041627.1_2248582_2249887_+	PRK00877, hisD, bifunctional histidinal dehydrogenase/ histidinol dehydrogenase; Reviewed	NA|357aa|up_7|NZ_CP041627.1_2249883_2250954_+	PRK01688, PRK01688, histidinol-phosphate aminotransferase; Provisional	NA|356aa|up_6|NZ_CP041627.1_2250953_2252021_+	PRK05446, PRK05446, bifunctional histidinol-phosphatase/imidazoleglycerol-phosphate dehydratase HisB	NA|197aa|up_5|NZ_CP041627.1_2252020_2252611_+	PRK13170, hisH, imidazole glycerol phosphate synthase subunit HisH; Provisional	NA|246aa|up_4|NZ_CP041627.1_2252610_2253348_+	PRK00748, PRK00748, 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Validated	NA|259aa|up_3|NZ_CP041627.1_2253329_2254106_+	PRK02083, PRK02083, imidazole glycerol phosphate synthase subunit HisF; Provisional	NA|204aa|up_2|NZ_CP041627.1_2254099_2254711_+	PRK02759, PRK02759, bifunctional phosphoribosyl-AMP cyclohydrolase/phosphoribosyl-ATP diphosphatase HisIE	NA|326aa|up_1|NZ_CP041627.1_2254806_2255784_-	PRK15471, PRK15471, chain length determinant protein WzzB; Provisional	NA|389aa|up_0|NZ_CP041627.1_2255929_2257096_-	PRK15057, PRK15057, UDP-glucose 6-dehydrogenase; Provisional	NA|469aa|down_0|NZ_CP041627.1_2258202_2259609_-	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|457aa|down_1|NZ_CP041627.1_2259771_2261142_-	PRK15414, PRK15414, phosphomannomutase	NA|480aa|down_2|NZ_CP041627.1_2261224_2262664_-	TIGR01479, Mannose-1-phosphate_guanylyltransferase, mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase	NA|402aa|down_3|NZ_CP041627.1_2262653_2263859_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|336aa|down_4|NZ_CP041627.1_2263879_2264887_-	cd05247, UDP_G4E_1_SDR_e, UDP-glucose 4 epimerase, subgroup 1, extended (e) SDRs	NA|382aa|down_5|NZ_CP041627.1_2264903_2266049_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|344aa|down_6|NZ_CP041627.1_2266035_2267067_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|287aa|down_7|NZ_CP041627.1_2267063_2267924_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|448aa|down_8|NZ_CP041627.1_2267929_2269273_-	pfam14296, O-ag_pol_Wzy, O-antigen polysaccharide polymerase Wzy	NA|419aa|down_9|NZ_CP041627.1_2269272_2270529_-	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins
GCF_008375335.1_ASM837533v1	NZ_CP041627	Escherichia coli strain ETEC6 chromosome, complete genome	5	2275917-2276050	5	CRISPRCasFinder	no		DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	Orphan	TGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACA	38	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	NA,NA	NA|336aa|up_9|NZ_CP041627.1_2263879_2264887_-	cd05247, UDP_G4E_1_SDR_e, UDP-glucose 4 epimerase, subgroup 1, extended (e) SDRs	NA|382aa|up_8|NZ_CP041627.1_2264903_2266049_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|344aa|up_7|NZ_CP041627.1_2266035_2267067_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|287aa|up_6|NZ_CP041627.1_2267063_2267924_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|448aa|up_5|NZ_CP041627.1_2267929_2269273_-	pfam14296, O-ag_pol_Wzy, O-antigen polysaccharide polymerase Wzy	NA|419aa|up_4|NZ_CP041627.1_2269272_2270529_-	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|298aa|up_3|NZ_CP041627.1_2270885_2271779_-	PRK10122, PRK10122, UTP--glucose-1-phosphate uridylyltransferase GalF	NA|465aa|up_2|NZ_CP041627.1_2271953_2273348_-	PRK10123, wcaM, putative colanic acid biosynthesis protein; Provisional	NA|407aa|up_1|NZ_CP041627.1_2273358_2274579_-	TIGR04005, wcaL, colanic acid biosynthesis glycosyltransferase WcaL	NA|427aa|up_0|NZ_CP041627.1_2274575_2275856_-	TIGR04006, wcaK, colanic acid biosynthesis pyruvyl transferase WcaK	NA|493aa|down_0|NZ_CP041627.1_2276131_2277610_-	PRK10459, PRK10459, MOP flippase family protein	NA|465aa|down_1|NZ_CP041627.1_2277611_2279006_-	PRK10124, PRK10124, putative UDP-glucose lipid carrier transferase; Provisional	NA|457aa|down_2|NZ_CP041627.1_2279060_2280431_-	PRK15414, PRK15414, phosphomannomutase	NA|479aa|down_3|NZ_CP041627.1_2280711_2282148_-	PRK15460, cpsB, mannose-1-phosphate guanyltransferase; Provisional	NA|408aa|down_4|NZ_CP041627.1_2282150_2283374_-	TIGR04007, wcaI, colanic acid biosynthesis glycosyl transferase WcaI	NA|160aa|down_5|NZ_CP041627.1_2283370_2283850_-	PRK15434, PRK15434, GDP-mannose mannosyl hydrolase	NA|322aa|down_6|NZ_CP041627.1_2283852_2284818_-	cd05239, GDP_FS_SDR_e, GDP-fucose synthetase, extended (e) SDRs	NA|374aa|down_7|NZ_CP041627.1_2284820_2285942_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|183aa|down_8|NZ_CP041627.1_2285969_2286518_-	TIGR04008, WcaF, colanic acid biosynthesis acetyltransferase WcaF	NA|249aa|down_9|NZ_CP041627.1_2286533_2287280_-	PRK10063, PRK10063, colanic acid biosynthesis glycosyltransferase WcaE
GCF_008375335.1_ASM837533v1	NZ_CP041627	Escherichia coli strain ETEC6 chromosome, complete genome	6	2280576-2280710	6	CRISPRCasFinder	no		DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	Orphan	GGATAAGGCGTTCACGCCGCATCCGACAAACAGCGCCTGATGCGACG	47	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	NA,NA	NA|287aa|up_9|NZ_CP041627.1_2267063_2267924_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|448aa|up_8|NZ_CP041627.1_2267929_2269273_-	pfam14296, O-ag_pol_Wzy, O-antigen polysaccharide polymerase Wzy	NA|419aa|up_7|NZ_CP041627.1_2269272_2270529_-	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|298aa|up_6|NZ_CP041627.1_2270885_2271779_-	PRK10122, PRK10122, UTP--glucose-1-phosphate uridylyltransferase GalF	NA|465aa|up_5|NZ_CP041627.1_2271953_2273348_-	PRK10123, wcaM, putative colanic acid biosynthesis protein; Provisional	NA|407aa|up_4|NZ_CP041627.1_2273358_2274579_-	TIGR04005, wcaL, colanic acid biosynthesis glycosyltransferase WcaL	NA|427aa|up_3|NZ_CP041627.1_2274575_2275856_-	TIGR04006, wcaK, colanic acid biosynthesis pyruvyl transferase WcaK	NA|493aa|up_2|NZ_CP041627.1_2276131_2277610_-	PRK10459, PRK10459, MOP flippase family protein	NA|465aa|up_1|NZ_CP041627.1_2277611_2279006_-	PRK10124, PRK10124, putative UDP-glucose lipid carrier transferase; Provisional	NA|457aa|up_0|NZ_CP041627.1_2279060_2280431_-	PRK15414, PRK15414, phosphomannomutase	NA|479aa|down_0|NZ_CP041627.1_2280711_2282148_-	PRK15460, cpsB, mannose-1-phosphate guanyltransferase; Provisional	NA|408aa|down_1|NZ_CP041627.1_2282150_2283374_-	TIGR04007, wcaI, colanic acid biosynthesis glycosyl transferase WcaI	NA|160aa|down_2|NZ_CP041627.1_2283370_2283850_-	PRK15434, PRK15434, GDP-mannose mannosyl hydrolase	NA|322aa|down_3|NZ_CP041627.1_2283852_2284818_-	cd05239, GDP_FS_SDR_e, GDP-fucose synthetase, extended (e) SDRs	NA|374aa|down_4|NZ_CP041627.1_2284820_2285942_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|183aa|down_5|NZ_CP041627.1_2285969_2286518_-	TIGR04008, WcaF, colanic acid biosynthesis acetyltransferase WcaF	NA|249aa|down_6|NZ_CP041627.1_2286533_2287280_-	PRK10063, PRK10063, colanic acid biosynthesis glycosyltransferase WcaE	NA|406aa|down_7|NZ_CP041627.1_2287290_2288508_-	TIGR04010, WcaD, putative colanic acid polymerase WcaD	NA|406aa|down_8|NZ_CP041627.1_2288482_2289700_-	TIGR04015, WcaC, colanic acid biosynthesis glycosyl transferase WcaC	NA|163aa|down_9|NZ_CP041627.1_2289696_2290185_-	PRK10191, PRK10191, putative acyl transferase; Provisional
GCF_008375335.1_ASM837533v1	NZ_CP041627	Escherichia coli strain ETEC6 chromosome, complete genome	7	2541846-2541972	7	CRISPRCasFinder	no		DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	Orphan	TTTGTAGGCCTGATAAGACGCGCCAGCGTCGCATCAGGC	39	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	NA,NA	NA|395aa|up_9|NZ_CP041627.1_2520753_2521938_+	PRK05790, PRK05790, putative acyltransferase; Provisional	NA|259aa|up_8|NZ_CP041627.1_2522011_2522788_-	COG4676, COG4676, Uncharacterized protein conserved in bacteria [Function unknown]	NA|550aa|up_7|NZ_CP041627.1_2522792_2524442_-	COG5445, COG5445, Predicted secreted protein [Function unknown]	NA|1465aa|up_6|NZ_CP041627.1_2524442_2528837_-	COG2373, COG2373, Large extracellular alpha-helical protein [General function prediction only]	NA|208aa|up_5|NZ_CP041627.1_2528980_2529604_-	COG3234, COG3234, Uncharacterized protein conserved in bacteria [Function unknown]	NA|563aa|up_4|NZ_CP041627.1_2529600_2531289_-	COG4685, COG4685, Uncharacterized protein conserved in bacteria [Function unknown]	NA|876aa|up_3|NZ_CP041627.1_2531437_2534065_-	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|241aa|up_2|NZ_CP041627.1_2534211_2534934_+	PRK05134, PRK05134, bifunctional 2-polyprenyl-6-hydroxyphenol methylase/3-demethylubiquinol 3-O-methyltransferase UbiG	NA|1251aa|up_1|NZ_CP041627.1_2535061_2538814_-	PRK09752, PRK09752, AIDA-I family autotransporter YfaL	NA|762aa|up_0|NZ_CP041627.1_2539509_2541795_+	PRK09103, PRK09103, ribonucleoside-diphosphate reductase subunit alpha	NA|377aa|down_0|NZ_CP041627.1_2542028_2543159_+	PRK09101, nrdB, ribonucleotide-diphosphate reductase subunit beta; Reviewed	NA|85aa|down_1|NZ_CP041627.1_2543158_2543413_+	PRK10713, PRK10713, 2Fe-2S ferredoxin-like protein	NA|217aa|down_2|NZ_CP041627.1_2543466_2544117_-	PRK09902, PRK09902, lipopolysaccharide kinase InaA	NA|359aa|down_3|NZ_CP041627.1_2544579_2545656_-	PRK11143, glpQ, glycerophosphodiester phosphodiesterase; Provisional	NA|453aa|down_4|NZ_CP041627.1_2545660_2547019_-	PRK11273, glpT, glycerol-3-phosphate transporter	NA|543aa|down_5|NZ_CP041627.1_2547291_2548920_+	PRK11101, glpA, anaerobic glycerol-3-phosphate dehydrogenase subunit A	NA|420aa|down_6|NZ_CP041627.1_2548909_2550169_+	COG3075, GlpB, Anaerobic glycerol-3-phosphate dehydrogenase [Amino acid transport and metabolism]	NA|397aa|down_7|NZ_CP041627.1_2550165_2551356_+	TIGR03379, glycerol3P_GlpC, glycerol-3-phosphate dehydrogenase, anaerobic, C subunit	NA|300aa|down_8|NZ_CP041627.1_2551548_2552448_+	PRK09956, PRK09956, ISNCY family transposase	NA|62aa|down_9|NZ_CP041627.1_2552460_2552646_+	PRK09956, PRK09956, ISNCY family transposase
GCF_008375335.1_ASM837533v1	NZ_CP041627	Escherichia coli strain ETEC6 chromosome, complete genome	8	2970401-2970545	8	CRISPRCasFinder	no	csa3	DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	Type I-A	CACAATGCCTGATGCGACGCTGGAGCGTCTTATCATGCCTACAAA	45	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	NA,NA	NA|150aa|up_9|NZ_CP041627.1_2961391_2961841_+	pfam06610, AlaE, L-alanine exporter	NA|115aa|up_8|NZ_CP041627.1_2961877_2962222_-	PRK10556, PRK10556, hypothetical protein; Provisional	NA|110aa|up_7|NZ_CP041627.1_2962373_2962703_+	PRK10132, PRK10132, hypothetical protein; Provisional	NA|82aa|up_6|NZ_CP041627.1_2962950_2963196_+	PRK10329, PRK10329, glutaredoxin-like protein NrdH	NA|137aa|up_5|NZ_CP041627.1_2963192_2963603_+	PRK03600, nrdI, class Ib ribonucleoside-diphosphate reductase assembly flavoprotein NrdI	NA|715aa|up_4|NZ_CP041627.1_2963575_2965720_+	PRK08188, PRK08188, ribonucleotide-diphosphate reductase subunit alpha; Validated	NA|320aa|up_3|NZ_CP041627.1_2965729_2966689_+	TIGR04171, ribonucleotide-diphosphate_reductase_subunit_beta, ribonucleoside-diphosphate reductase, class 1b, beta subunit	NA|401aa|up_2|NZ_CP041627.1_2967042_2968245_+	PRK10070, PRK10070, proline/glycine betaine ABC transporter ATP-binding protein ProV	NA|355aa|up_1|NZ_CP041627.1_2968237_2969302_+	PRK10952, PRK10952, proline/glycine betaine ABC transporter permease ProW	NA|331aa|up_0|NZ_CP041627.1_2969359_2970352_+	PRK11119, proX, proline/glycine betaine ABC transporter substrate-binding protein ProX	NA|395aa|down_0|NZ_CP041627.1_2970643_2971828_+	cd17324, MFS_NepI_like, Purine ribonucleoside efflux pump NepI and similar transporters of the Major Facilitator Superfamily	NA|246aa|down_1|NZ_CP041627.1_2971951_2972689_+	COG1296, AzlC, Predicted branched-chain amino acid permease (azaleucine resistance) [Amino acid transport and metabolism]	NA|112aa|down_2|NZ_CP041627.1_2972678_2973014_+	PRK10408, PRK10408, L-valine transporter subunit YgaH	NA|177aa|down_3|NZ_CP041627.1_2973104_2973635_+	PRK10870, PRK10870, transcriptional repressor MprA; Provisional	NA|391aa|down_4|NZ_CP041627.1_2973761_2974934_+	PRK15136, PRK15136, multidrug efflux MFS transporter periplasmic adaptor subunit EmrA	NA|513aa|down_5|NZ_CP041627.1_2974950_2976489_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|172aa|down_6|NZ_CP041627.1_2976552_2977068_-	PRK02260, PRK02260, S-ribosylhomocysteine lyase	NA|519aa|down_7|NZ_CP041627.1_2977217_2978774_-	PRK02107, PRK02107, glutamate--cysteine ligase; Provisional	NA|143aa|down_8|NZ_CP041627.1_2978846_2979275_-	COG1238, COG1238, Predicted membrane protein [Function unknown]	NA|189aa|down_9|NZ_CP041627.1_2979271_2979838_-	PRK10725, PRK10725, fructose-1-phosphate/6-phosphogluconate phosphatase
GCF_008375335.1_ASM837533v1	NZ_CP041627	Escherichia coli strain ETEC6 chromosome, complete genome	9	3039700-3039973	1,9,1	CRT,CRISPRCasFinder,PILER-CR	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	Type I-E	CGGTTTATCCCCGCTGGCGCGGGGAACTC,CCCGGTTTATCCCCGCTGGCGCGGGGAACTCT,CCCGGTTTATCCCCGCTGGCGCGGGGAACTCT	29,32,32	0	0	NA	NA	I-E:I-E:I-E	4,3,2	4	TypeI-E	DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	NA,NA	NA|254aa|up_9|NZ_CP041627.1_3031573_3032335_-	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional	NA|350aa|up_8|NZ_CP041627.1_3032315_3033365_-	PRK00984, truD, tRNA pseudouridine synthase D; Reviewed	NA|160aa|up_7|NZ_CP041627.1_3033361_3033841_-	PRK00084, ispF, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; Reviewed	NA|237aa|up_6|NZ_CP041627.1_3033840_3034551_-	PRK00155, ispD, D-ribitol-5-phosphate cytidylyltransferase	NA|104aa|up_5|NZ_CP041627.1_3034569_3034881_-	PRK00888, ftsB, cell division protein FtsB; Reviewed	NA|108aa|up_4|NZ_CP041627.1_3035074_3035398_-	pfam12084, DUF3561, Protein of unknown function (DUF3561)	NA|202aa|up_3|NZ_CP041627.1_3035447_3036053_-	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|476aa|up_2|NZ_CP041627.1_3036052_3037480_-	PRK05124, cysN, sulfate adenylyltransferase subunit 1; Provisional	NA|303aa|up_1|NZ_CP041627.1_3037481_3038390_-	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|346aa|up_0|NZ_CP041627.1_3038641_3039679_+	PRK10199, PRK10199, alkaline phosphatase isozyme conversion aminopeptidase; Provisional	cas2|95aa|down_0|NZ_CP041627.1_3040077_3040362_-	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	cas1|306aa|down_1|NZ_CP041627.1_3040363_3041281_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|200aa|down_2|NZ_CP041627.1_3041296_3041896_-	TIGR01907, CRISPR_system_Cascade_subunit_CasE, CRISPR-associated protein Cas6/Cse3/CasE, subtype I-E/ECOLI	cas5|225aa|down_3|NZ_CP041627.1_3041882_3042557_-	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas7|364aa|down_4|NZ_CP041627.1_3042559_3043651_-	TIGR01869, CRISPR_system_Cascade_subunit_CasC, CRISPR-associated protein Cas7/Cse4/CasC, subtype I-E/ECOLI	cse2gr11|161aa|down_5|NZ_CP041627.1_3043663_3044146_-	cd09670, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas8e|451aa|down_6|NZ_CP041627.1_3044294_3045647_-	PRK09693, PRK09693, Cascade antiviral complex protein; Validated	cas3|889aa|down_7|NZ_CP041627.1_3046838_3049505_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|245aa|down_8|NZ_CP041627.1_3049863_3050598_-	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase	NA|571aa|down_9|NZ_CP041627.1_3050671_3052384_-	PRK13504, PRK13504, NADPH-dependent assimilatory sulfite reductase hemoprotein subunit
GCF_008375335.1_ASM837533v1	NZ_CP041627	Escherichia coli strain ETEC6 chromosome, complete genome	10	3069782-3070889	10,2,2,3,11	CRISPRCasFinder,CRT,PILER-CR,CRT,CRISPRCasFinder	no		DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	Orphan	CGGTTTATCCCCGCTGGCGCGGGGAACAC,CGGTTTATCCCCGCTGGCGCGGGGAACAC,GGTTTATCCCCGCTGGCGCGGGGAACAC,CGGTTTATCCCCGCTGGCGCGGGGAACAC,CGGTTTATCCCCGCTGGCGCGGGGAACAC	29,29,28,29,29	0	0	NA	NA	I-E:I-E:I-E:I-E:I-E	16,17,18,17,16	18	Orphan	DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	NA,NA	NA|287aa|up_9|NZ_CP041627.1_3057203_3058064_-	COG2025, FixB, Electron transfer flavoprotein, alpha subunit [Energy production and conversion]	NA|260aa|up_8|NZ_CP041627.1_3058060_3058840_-	COG2086, FixA, Electron transfer flavoprotein, beta subunit [Energy production and conversion]	NA|470aa|up_7|NZ_CP041627.1_3058817_3060227_-	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|485aa|up_6|NZ_CP041627.1_3060248_3061703_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|262aa|up_5|NZ_CP041627.1_3061772_3062558_-	cd05347, Ga5DH-like_SDR_c, gluconate 5-dehydrogenase (Ga5DH)-like, classical (c) SDRs	NA|426aa|up_4|NZ_CP041627.1_3062876_3064154_+	cd06174, MFS, Major Facilitator Superfamily	NA|468aa|up_3|NZ_CP041627.1_3064180_3065584_+	cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins	NA|524aa|up_2|NZ_CP041627.1_3066700_3068272_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|116aa|up_1|NZ_CP041627.1_3068291_3068639_-	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|226aa|up_0|NZ_CP041627.1_3068638_3069316_-	pfam01527, HTH_Tnp_1, Transposase	NA|224aa|down_0|NZ_CP041627.1_3071228_3071900_-	TIGR04322, organic_radical_activating_enzyme, putative 7-cyano-7-deazaguanosine (preQ0) biosynthesis protein QueE	NA|238aa|down_1|NZ_CP041627.1_3072194_3072908_+	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|433aa|down_2|NZ_CP041627.1_3073902_3075201_-	PRK00077, eno, enolase; Provisional	NA|546aa|down_3|NZ_CP041627.1_3075288_3076926_-	PRK05380, pyrG, CTP synthetase; Validated	NA|264aa|down_4|NZ_CP041627.1_3077153_3077945_-	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|112aa|down_5|NZ_CP041627.1_3078015_3078351_-	PRK09907, PRK09907, endoribonuclease MazF	NA|83aa|down_6|NZ_CP041627.1_3078350_3078599_-	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE	NA|745aa|down_7|NZ_CP041627.1_3078676_3080911_-	PRK10872, relA, (p)ppGpp synthetase I/GTP pyrophosphokinase; Provisional	NA|434aa|down_8|NZ_CP041627.1_3080958_3082260_-	PRK13168, rumA, 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD	NA|919aa|down_9|NZ_CP041627.1_3082316_3085073_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional
GCF_008375335.1_ASM837533v1	NZ_CP041627	Escherichia coli strain ETEC6 chromosome, complete genome	11	3246173-3246292	12	CRISPRCasFinder	no		DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	Orphan	ATGCTGCCAACTTACTGATTTAGTGTATGATGGTGTTTT	39	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,c2c9_V-U4,RT,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK,WYL	NA|166aa|up_0|NZ_CP041627.1_3245549_3246047_-,NA|128aa|down_2|NZ_CP041627.1_3250079_3250463_-,NA|340aa|down_7|NZ_CP041627.1_3254951_3255971_-	NA|240aa|up_9|NZ_CP041627.1_3232784_3233504_-	PRK10626, PRK10626, hypothetical protein; Provisional	NA|109aa|up_8|NZ_CP041627.1_3233687_3234014_-	PRK11702, PRK11702, hypothetical protein; Provisional	NA|240aa|up_7|NZ_CP041627.1_3234013_3234733_-	PRK00121, trmB, tRNA (guanine-N(7)-)-methyltransferase; Reviewed	NA|351aa|up_6|NZ_CP041627.1_3234893_3235946_+	PRK10880, PRK10880, adenine DNA glycosylase	NA|92aa|up_5|NZ_CP041627.1_3235973_3236249_+	PRK05408, PRK05408, oxidative damage protection protein; Provisional	NA|360aa|up_4|NZ_CP041627.1_3236313_3237393_+	PRK11671, mltC, membrane-bound lytic murein transglycosylase MltC	NA|419aa|up_3|NZ_CP041627.1_3237594_3238851_+	TIGR00889, Putative_nucleoside_transporter_YegT, nucleoside transporter	NA|712aa|up_2|NZ_CP041627.1_3238899_3241035_-	PRK13578, PRK13578, ornithine decarboxylase; Provisional	NA|236aa|up_1|NZ_CP041627.1_3241427_3242135_+	COG1811, COG1811, Uncharacterized membrane protein, possible Na+ channel or pump [General function prediction only]	NA|166aa|up_0|NZ_CP041627.1_3245549_3246047_-	NA	NA|667aa|down_0|NZ_CP041627.1_3247012_3249013_-	cd04186, GT_2_like_c, Subfamily of Glycosyltransferase Family GT2 of unknown function	NA|302aa|down_1|NZ_CP041627.1_3249116_3250022_-	pfam01697, Glyco_transf_92, Glycosyltransferase family 92	NA|128aa|down_2|NZ_CP041627.1_3250079_3250463_-	NA	NA|515aa|down_3|NZ_CP041627.1_3251195_3252740_-	pfam03102, NeuB, NeuB family	NA|231aa|down_4|NZ_CP041627.1_3252739_3253432_-	cd01637, IMPase_like, Inositol-monophosphatase-like domains	NA|279aa|down_5|NZ_CP041627.1_3253409_3254246_-	cd03349, LbH_XAT, Xenobiotic acyltransferase (XAT): The XAT class of hexapeptide acyltransferases is composed of a large number of microbial enzymes that catalyze the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others	NA|224aa|down_6|NZ_CP041627.1_3254235_3254907_-	cd02513, CMP-NeuAc_Synthase, CMP-NeuAc_Synthase activates N-acetylneuraminic acid by adding CMP moiety	NA|340aa|down_7|NZ_CP041627.1_3254951_3255971_-	NA	NA|805aa|down_8|NZ_CP041627.1_3256058_3258473_-	cd04184, GT2_RfbC_Mx_like, Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis	NA|911aa|down_9|NZ_CP041627.1_3258478_3261211_-	pfam13578, Methyltransf_24, Methyltransferase domain
