assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_012974525.1_ASM1297452v1	NZ_CP052877	Escherichia coli strain C21 chromosome, complete genome	1	596167-596565	1	PILER-CR	no		cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	Orphan	AGGGGCAGAAAGATGAATGACTGTCCACGACACTATACCCAAAAGAAAGCGGCTTATCG	59	0	0	NA	NA	NA	3	3	Orphan	cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	NA,NA|180aa|down_1|NZ_CP052877.1_598798_599338_-	NA|158aa|up_9|NZ_CP052877.1_585649_586123_-	PRK11425, PRK11425, PTS N-acetylgalactosamine transporter subunit IIB	NA|427aa|up_8|NZ_CP052877.1_586145_587426_-	PRK15458, PRK15458, tagatose 6-phosphate aldolase subunit KbaZ; Provisional	NA|270aa|up_7|NZ_CP052877.1_587674_588484_+	PRK09802, PRK09802, DeoR family transcriptional regulator	NA|155aa|up_6|NZ_CP052877.1_588538_589003_-	pfam11663, Toxin_YhaV, Toxin with endonuclease activity, of toxin-antitoxin system	NA|112aa|up_5|NZ_CP052877.1_589002_589338_-	PRK09974, PRK09974, type II toxin-antitoxin system PrlF family antitoxin	NA|524aa|up_4|NZ_CP052877.1_589486_591058_-	TIGR03248, galactar-dH20, galactarate dehydratase	NA|445aa|up_3|NZ_CP052877.1_591432_592767_+	TIGR00893, Probable_glucarate_transporter, D-galactonate transporter	NA|257aa|up_2|NZ_CP052877.1_592782_593553_+	PRK10558, PRK10558, alpha-dehydro-beta-deoxy-D-glucarate aldolase; Provisional	NA|297aa|up_1|NZ_CP052877.1_593582_594473_+	PRK11559, garR, tartronate semialdehyde reductase; Provisional	NA|382aa|up_0|NZ_CP052877.1_594569_595715_+	PRK10342, PRK10342, glycerate kinase I; Provisional	NA|135aa|down_0|NZ_CP052877.1_598372_598777_-	PRK09716, PRK09716, YhaC family protein	NA|180aa|down_1|NZ_CP052877.1_598798_599338_-	NA	NA|115aa|down_2|NZ_CP052877.1_599593_599938_-	PRK11424, PRK11424, DNA-binding transcriptional activator TdcR; Provisional	NA|313aa|down_3|NZ_CP052877.1_600126_601065_+	PRK10341, PRK10341, transcriptional regulator TdcA	NA|330aa|down_4|NZ_CP052877.1_601163_602153_+	PRK08638, PRK08638, bifunctional threonine ammonia-lyase/L-serine ammonia-lyase TdcB	NA|444aa|down_5|NZ_CP052877.1_602174_603506_+	PRK13629, PRK13629, threonine/serine transporter TdcC; Provisional	NA|403aa|down_6|NZ_CP052877.1_603531_604740_+	PRK12379, PRK12379, propionate kinase	NA|765aa|down_7|NZ_CP052877.1_604773_607068_+	cd01678, PFL1, Pyruvate formate lyase 1	NA|130aa|down_8|NZ_CP052877.1_607081_607471_+	PRK11401, PRK11401, enamine/imine deaminase	NA|455aa|down_9|NZ_CP052877.1_607542_608907_+	PRK15040, PRK15040, L-serine ammonia-lyase
GCF_012974525.1_ASM1297452v1	NZ_CP052877	Escherichia coli strain C21 chromosome, complete genome	2	1012810-1013387	1,1,2	CRISPRCasFinder,CRT,PILER-CR	no	cas3	cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	Unclear	GTGTTCCCCGCGCCAGCGGGGATAAACCG,GNGTTCCCCGCGCCAGCGGGGATAAACCG,GTTCCCCGCGCCAGCGGGGATAAACC	29,29,26	0	0	NA	NA	I-E:I-E:I-E	9,9,9	9	Unclear	cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	NA|47aa|up_1|NZ_CP052877.1_1011520_1011661_-,NA	NA|434aa|up_9|NZ_CP052877.1_1001443_1002745_+	PRK13168, rumA, 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD	NA|745aa|up_8|NZ_CP052877.1_1002792_1005027_+	PRK10872, relA, (p)ppGpp synthetase I/GTP pyrophosphokinase; Provisional	NA|83aa|up_7|NZ_CP052877.1_1005104_1005353_+	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE	NA|112aa|up_6|NZ_CP052877.1_1005352_1005688_+	PRK09907, PRK09907, endoribonuclease MazF	NA|264aa|up_5|NZ_CP052877.1_1005758_1006550_+	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|546aa|up_4|NZ_CP052877.1_1006777_1008415_+	PRK05380, pyrG, CTP synthetase; Validated	NA|433aa|up_3|NZ_CP052877.1_1008502_1009801_+	PRK00077, eno, enolase; Provisional	NA|250aa|up_2|NZ_CP052877.1_1010757_1011507_-	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|47aa|up_1|NZ_CP052877.1_1011520_1011661_-	NA	NA|224aa|up_0|NZ_CP052877.1_1011799_1012471_+	TIGR04322, organic_radical_activating_enzyme, putative 7-cyano-7-deazaguanosine (preQ0) biosynthesis protein QueE	NA|493aa|down_0|NZ_CP052877.1_1014024_1015503_-	cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins	NA|426aa|down_1|NZ_CP052877.1_1015529_1016807_-	cd06174, MFS, Major Facilitator Superfamily	NA|485aa|down_2|NZ_CP052877.1_1018289_1019744_+	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|470aa|down_3|NZ_CP052877.1_1019765_1021175_+	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|260aa|down_4|NZ_CP052877.1_1021152_1021932_+	COG2086, FixA, Electron transfer flavoprotein, beta subunit [Energy production and conversion]	NA|287aa|down_5|NZ_CP052877.1_1021928_1022789_+	COG2025, FixB, Electron transfer flavoprotein, alpha subunit [Energy production and conversion]	NA|192aa|down_6|NZ_CP052877.1_1022936_1023512_-	COG1954, GlpP, Glycerol-3-phosphate responsive antiterminator (mRNA-binding) [Transcription]	NA|87aa|down_7|NZ_CP052877.1_1023528_1023789_-	COG2440, FixX, Ferredoxin-like protein [Energy production and conversion]	NA|424aa|down_8|NZ_CP052877.1_1023779_1025051_-	PRK10015, PRK10015, oxidoreductase; Provisional	NA|121aa|down_9|NZ_CP052877.1_1025128_1025491_-	cd00470, PTPS, 6-pyruvoyl tetrahydropterin synthase (PTPS)
GCF_012974525.1_ASM1297452v1	NZ_CP052877	Escherichia coli strain C21 chromosome, complete genome	3	1039299-1039633	3,2,2	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas2	cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	Type I-E	GTGTTCCCCGCGCCAGCGGGGATAAACCG,GTGTTCCCCGCGCCAGCGGGGATAAACCG,GTGTTCCCCGCGCCAGCGGGGATAAACCG	29,29,29	0	0	NA	NA	I-E:I-E:I-E	4,5,5	5	TypeI-E	cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	NA,NA	NA|571aa|up_9|NZ_CP052877.1_1027608_1029321_+	PRK13504, PRK13504, NADPH-dependent assimilatory sulfite reductase hemoprotein subunit	NA|245aa|up_8|NZ_CP052877.1_1029395_1030130_+	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase	NA|51aa|up_7|NZ_CP052877.1_1030394_1030547_+	pfam01848, HOK_GEF, Hok/gef family	cas3|868aa|up_6|NZ_CP052877.1_1030740_1033344_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas8e|521aa|up_5|NZ_CP052877.1_1033441_1035004_+	TIGR02547, CRISPR_system_Cascade_subunit_CasA, CRISPR type I-E/ECOLI-associated protein CasA/Cse1	cse2gr11|179aa|up_4|NZ_CP052877.1_1035000_1035537_+	TIGR02548, CRISPR_system_Cascade_subunit_CasB, CRISPR type I-E/ECOLI-associated protein CasB/Cse2	cas7|352aa|up_3|NZ_CP052877.1_1035548_1036604_+	TIGR01869, CRISPR_system_Cascade_subunit_CasC, CRISPR-associated protein Cas7/Cse4/CasC, subtype I-E/ECOLI	cas5|249aa|up_2|NZ_CP052877.1_1036614_1037361_+	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|217aa|up_1|NZ_CP052877.1_1037342_1037993_+	TIGR01907, CRISPR_system_Cascade_subunit_CasE, CRISPR-associated protein Cas6/Cse3/CasE, subtype I-E/ECOLI	cas2|98aa|up_0|NZ_CP052877.1_1038908_1039202_+	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	NA|346aa|down_0|NZ_CP052877.1_1039714_1040752_-	PRK10199, PRK10199, alkaline phosphatase isozyme conversion aminopeptidase; Provisional	NA|303aa|down_1|NZ_CP052877.1_1041003_1041912_+	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|476aa|down_2|NZ_CP052877.1_1041913_1043341_+	PRK05124, cysN, sulfate adenylyltransferase subunit 1; Provisional	NA|202aa|down_3|NZ_CP052877.1_1043340_1043946_+	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|108aa|down_4|NZ_CP052877.1_1043995_1044319_+	pfam12084, DUF3561, Protein of unknown function (DUF3561)	NA|104aa|down_5|NZ_CP052877.1_1044512_1044824_+	PRK00888, ftsB, cell division protein FtsB; Reviewed	NA|237aa|down_6|NZ_CP052877.1_1044842_1045553_+	PRK00155, ispD, D-ribitol-5-phosphate cytidylyltransferase	NA|160aa|down_7|NZ_CP052877.1_1045552_1046032_+	PRK00084, ispF, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; Reviewed	NA|350aa|down_8|NZ_CP052877.1_1046028_1047078_+	PRK00984, truD, tRNA pseudouridine synthase D; Reviewed	NA|254aa|down_9|NZ_CP052877.1_1047058_1047820_+	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional
GCF_012974525.1_ASM1297452v1	NZ_CP052877	Escherichia coli strain C21 chromosome, complete genome	4	1563894-1564011	3	CRISPRCasFinder	no		cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	Orphan	CCGAGCCGTAGGCCGGATAAGGCGTTCACGC	31	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	NA|105aa|up_3|NZ_CP052877.1_1558974_1559289_-,NA	NA|62aa|up_9|NZ_CP052877.1_1550690_1550876_-	PRK09956, PRK09956, ISNCY family transposase	NA|300aa|up_8|NZ_CP052877.1_1550888_1551788_-	PRK09956, PRK09956, ISNCY family transposase	NA|397aa|up_7|NZ_CP052877.1_1551980_1553171_-	TIGR03379, glycerol3P_GlpC, glycerol-3-phosphate dehydrogenase, anaerobic, C subunit	NA|420aa|up_6|NZ_CP052877.1_1553167_1554427_-	COG3075, GlpB, Anaerobic glycerol-3-phosphate dehydrogenase [Amino acid transport and metabolism]	NA|543aa|up_5|NZ_CP052877.1_1554416_1556045_-	PRK11101, glpA, anaerobic glycerol-3-phosphate dehydrogenase subunit A	NA|359aa|up_4|NZ_CP052877.1_1557679_1558756_+	PRK11143, glpQ, glycerophosphodiester phosphodiesterase; Provisional	NA|105aa|up_3|NZ_CP052877.1_1558974_1559289_-	NA	NA|217aa|up_2|NZ_CP052877.1_1561708_1562359_+	PRK09902, PRK09902, lipopolysaccharide kinase InaA	NA|85aa|up_1|NZ_CP052877.1_1562412_1562667_-	PRK10713, PRK10713, 2Fe-2S ferredoxin-like protein	NA|377aa|up_0|NZ_CP052877.1_1562666_1563797_-	PRK09101, nrdB, ribonucleotide-diphosphate reductase subunit beta; Reviewed	NA|762aa|down_0|NZ_CP052877.1_1564030_1566316_-	PRK09103, PRK09103, ribonucleoside-diphosphate reductase subunit alpha	NA|1251aa|down_1|NZ_CP052877.1_1567011_1570764_+	PRK09752, PRK09752, AIDA-I family autotransporter YfaL	NA|241aa|down_2|NZ_CP052877.1_1570891_1571614_-	PRK05134, PRK05134, bifunctional 2-polyprenyl-6-hydroxyphenol methylase/3-demethylubiquinol 3-O-methyltransferase UbiG	NA|876aa|down_3|NZ_CP052877.1_1571760_1574388_+	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|563aa|down_4|NZ_CP052877.1_1574536_1576225_+	COG4685, COG4685, Uncharacterized protein conserved in bacteria [Function unknown]	NA|208aa|down_5|NZ_CP052877.1_1576221_1576845_+	COG3234, COG3234, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1465aa|down_6|NZ_CP052877.1_1576988_1581383_+	COG2373, COG2373, Large extracellular alpha-helical protein [General function prediction only]	NA|550aa|down_7|NZ_CP052877.1_1581383_1583033_+	COG5445, COG5445, Predicted secreted protein [Function unknown]	NA|259aa|down_8|NZ_CP052877.1_1583037_1583814_+	COG4676, COG4676, Uncharacterized protein conserved in bacteria [Function unknown]	NA|395aa|down_9|NZ_CP052877.1_1583887_1585072_-	PRK05790, PRK05790, putative acyltransferase; Provisional
GCF_012974525.1_ASM1297452v1	NZ_CP052877	Escherichia coli strain C21 chromosome, complete genome	5	2176689-2176812	4	CRISPRCasFinder	no	DEDDh	cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	Unclear	CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA	43	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	NA,NA|30aa|down_7|NZ_CP052877.1_2185708_2185798_+	NA|79aa|up_9|NZ_CP052877.1_2164818_2165055_-	PRK15396, PRK15396, major outer membrane lipoprotein	NA|471aa|up_8|NZ_CP052877.1_2165365_2166778_-	PRK09206, PRK09206, pyruvate kinase PykF	NA|70aa|up_7|NZ_CP052877.1_2167334_2167544_+	PRK10292, PRK10292, fumarate hydratase FumD	NA|701aa|up_6|NZ_CP052877.1_2169423_2171526_+	PRK09849, PRK09849, putative oxidoreductase; Provisional	NA|213aa|up_5|NZ_CP052877.1_2171538_2172177_+	PRK09947, PRK09947, YdhW family putative oxidoreductase system protein	NA|223aa|up_4|NZ_CP052877.1_2172240_2172909_+	TIGR03149, cyt_nit_nrfC, cytochrome c nitrite reductase, Fe-S protein	NA|262aa|up_3|NZ_CP052877.1_2172905_2173691_+	PRK15006, PRK15006, thiosulfate reductase cytochrome B subunit; Provisional	NA|271aa|up_2|NZ_CP052877.1_2173694_2174507_+	PRK09946, PRK09946, hypothetical protein; Provisional	NA|535aa|up_1|NZ_CP052877.1_2174518_2176123_-	PRK09897, PRK09897, FAD-NAD(P)-binding protein	NA|102aa|up_0|NZ_CP052877.1_2176248_2176554_-	PRK11118, PRK11118, putative monooxygenase; Provisional	NA|419aa|down_0|NZ_CP052877.1_2177126_2178383_+	PRK09945, PRK09945, hypothetical protein; Provisional	NA|458aa|down_1|NZ_CP052877.1_2178423_2179797_-	PRK01766, PRK01766, multidrug efflux protein; Reviewed	NA|214aa|down_2|NZ_CP052877.1_2180011_2180653_+	PRK13020, PRK13020, riboflavin synthase subunit alpha; Provisional	NA|383aa|down_3|NZ_CP052877.1_2180692_2181841_-	PRK11705, PRK11705, cyclopropane fatty acyl phospholipid synthase	NA|404aa|down_4|NZ_CP052877.1_2182131_2183343_-	PRK11043, PRK11043, Bcr/CflA family multidrug efflux MFS transporter	NA|311aa|down_5|NZ_CP052877.1_2183455_2184388_+	PRK11074, PRK11074, putative DNA-binding transcriptional regulator; Provisional	NA|342aa|down_6|NZ_CP052877.1_2184384_2185410_-	PRK10703, PRK10703, HTH-type transcriptional repressor PurR	NA|30aa|down_7|NZ_CP052877.1_2185708_2185798_+	NA	NA|390aa|down_8|NZ_CP052877.1_2185963_2187133_+	COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport and metabolism]	NA|194aa|down_9|NZ_CP052877.1_2187278_2187860_-	PRK10543, PRK10543, superoxide dismutase [Fe]
GCF_012974525.1_ASM1297452v1	NZ_CP052877	Escherichia coli strain C21 chromosome, complete genome	6	2859479-2859570	5	CRISPRCasFinder	no		cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	Orphan	CCACCTTTTTTACCTGCTTCAGATGC	26	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	NA|70aa|up_9|NZ_CP052877.1_2848729_2848939_-,NA	NA|70aa|up_9|NZ_CP052877.1_2848729_2848939_-	NA	NA|1321aa|up_8|NZ_CP052877.1_2848993_2852956_+	PRK11809, putA, trifunctional transcriptional regulator/proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase; Reviewed	NA|213aa|up_7|NZ_CP052877.1_2852995_2853634_-	PRK15008, PRK15008, HTH-type transcriptional regulator RutR; Provisional	NA|364aa|up_6|NZ_CP052877.1_2853921_2855013_+	TIGR03612, RutA, pyrimidine utilization protein A	NA|231aa|up_5|NZ_CP052877.1_2855012_2855705_+	TIGR03614, RutB, pyrimidine utilization protein B	NA|129aa|up_4|NZ_CP052877.1_2855716_2856103_+	TIGR03610, RutC, pyrimidine utilization protein C	NA|267aa|up_3|NZ_CP052877.1_2856110_2856911_+	TIGR03611, RutD, pyrimidine utilization protein D	NA|197aa|up_2|NZ_CP052877.1_2856920_2857511_+	PRK05365, PRK05365, malonic semialdehyde reductase; Provisional	NA|165aa|up_1|NZ_CP052877.1_2857521_2858016_+	TIGR03615, flavoprotein_oxidoreductase, pyrimidine utilization flavin reductase protein F	NA|443aa|up_0|NZ_CP052877.1_2858036_2859365_+	TIGR03616, Putative_pyrimidine_permease_RutG, pyrimidine utilization transport protein G	NA|199aa|down_0|NZ_CP052877.1_2859993_2860590_+	PRK03767, PRK03767, NAD(P)H:quinone oxidoreductase; Provisional	NA|76aa|down_1|NZ_CP052877.1_2860610_2860838_+	PRK10174, PRK10174, hypothetical protein; Provisional	NA|414aa|down_2|NZ_CP052877.1_2860875_2862117_-	PRK10173, PRK10173, glucose-1-phosphatase/inositol phosphatase; Provisional	NA|420aa|down_3|NZ_CP052877.1_2862405_2863665_-	PRK09784, PRK09784, YccE family protein	NA|307aa|down_4|NZ_CP052877.1_2863925_2864846_+	PRK10266, PRK10266, curved DNA-binding protein	NA|102aa|down_5|NZ_CP052877.1_2864845_2865151_+	PRK10265, PRK10265, chaperone modulator CbpM	NA|200aa|down_6|NZ_CP052877.1_2865243_2865843_-	PRK04976, torD, chaperone protein TorD; Validated	NA|849aa|down_7|NZ_CP052877.1_2865839_2868386_-	PRK15102, PRK15102, trimethylamine-N-oxide reductase TorA	NA|391aa|down_8|NZ_CP052877.1_2868385_2869558_-	PRK15032, PRK15032, pentaheme c-type cytochrome TorC	NA|231aa|down_9|NZ_CP052877.1_2869687_2870380_+	PRK10766, PRK10766, two-component system response regulator TorR
GCF_012974525.1_ASM1297452v1	NZ_CP052877	Escherichia coli strain C21 chromosome, complete genome	7	3165813-3165952	6	CRISPRCasFinder	no		cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	Orphan	GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCG	47	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	NA,NA	NA|330aa|up_9|NZ_CP052877.1_3153711_3154701_-	PRK00164, moaA, GTP 3',8-cyclase MoaA	NA|303aa|up_8|NZ_CP052877.1_3155097_3156006_+	TIGR01826, Putative_gluconeogenesis_factor, conserved hypothetical protein, cofD-related	NA|674aa|up_7|NZ_CP052877.1_3156197_3158219_-	PRK05298, PRK05298, excinuclease ABC subunit UvrB	NA|226aa|up_6|NZ_CP052877.1_3158797_3159475_-	PRK00090, bioD, ATP-dependent dethiobiotin synthetase BioD	NA|252aa|up_5|NZ_CP052877.1_3159467_3160223_-	PRK10258, PRK10258, biotin biosynthesis protein BioC; Provisional	NA|385aa|up_4|NZ_CP052877.1_3160209_3161364_-	PRK05958, PRK05958, 8-amino-7-oxononanoate synthase; Reviewed	NA|347aa|up_3|NZ_CP052877.1_3161360_3162401_-	PRK15108, PRK15108, biotin synthase; Provisional	NA|430aa|up_2|NZ_CP052877.1_3162487_3163777_+	PRK07986, PRK07986, adenosylmethionine--8-amino-7-oxononanoate transaminase; Validated	NA|159aa|up_1|NZ_CP052877.1_3163835_3164312_+	PRK10257, PRK10257, putative kinase inhibitor protein; Provisional	NA|428aa|up_0|NZ_CP052877.1_3164463_3165747_+	PRK10531, PRK10531, putative acyl-CoA thioester hydrolase	NA|754aa|down_0|NZ_CP052877.1_3165980_3168242_-	PRK11413, PRK11413, putative hydratase; Provisional	NA|478aa|down_1|NZ_CP052877.1_3168424_3169858_-	pfam00939, Na_sulph_symp, Sodium:sulfate symporter transmembrane region	NA|351aa|down_2|NZ_CP052877.1_3169933_3170986_-	NF033377, OMA_tautomer, 4-oxalomesaconate tautomerase	NA|318aa|down_3|NZ_CP052877.1_3171169_3172123_+	cd08440, PBP2_LTTR_like_4, TThe C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold	NA|332aa|down_4|NZ_CP052877.1_3172163_3173159_-	PRK11028, PRK11028, 6-phosphogluconolactonase; Provisional	NA|273aa|down_5|NZ_CP052877.1_3173313_3174132_+	PRK10530, PRK10530, pyridoxal phosphate (PLP) phosphatase; Provisional	NA|353aa|down_6|NZ_CP052877.1_3174132_3175191_-	PRK11144, modC, molybdenum ABC transporter ATP-binding protein ModC	NA|230aa|down_7|NZ_CP052877.1_3175193_3175883_-	PRK09421, modB, molybdate ABC transporter permease subunit	NA|258aa|down_8|NZ_CP052877.1_3175882_3176656_-	PRK10677, modA, molybdate transporter periplasmic protein; Provisional	NA|50aa|down_9|NZ_CP052877.1_3176821_3176971_-	pfam10766, AcrZ, Multidrug efflux pump-associated protein AcrZ
GCF_012974525.1_ASM1297452v1	NZ_CP052877	Escherichia coli strain C21 chromosome, complete genome	8	3408488-3408584	7	CRISPRCasFinder	no		cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	Orphan	TTGTAGGCCTGATAAGATGCGTCAAGC	27	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	NA,NA	NA|231aa|up_9|NZ_CP052877.1_3400273_3400966_-	PRK15195, PRK15195, molecular chaperone FimC	NA|181aa|up_8|NZ_CP052877.1_3401185_3401728_-	PRK15194, PRK15194, type 1 fimbrial protein subunit FimA	NA|289aa|up_7|NZ_CP052877.1_3402208_3403075_+	PRK10792, PRK10792, bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase FolD	NA|71aa|up_6|NZ_CP052877.1_3403076_3403289_+	PRK11507, PRK11507, ribosome-associated protein YbcJ	NA|174aa|up_5|NZ_CP052877.1_3403396_3403918_+	COG1988, COG1988, Predicted membrane-bound metal-dependent hydrolases [General function prediction only]	NA|462aa|up_4|NZ_CP052877.1_3403953_3405339_-	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|165aa|up_3|NZ_CP052877.1_3405512_3406007_+	PRK10791, PRK10791, peptidylprolyl isomerase B	NA|241aa|up_2|NZ_CP052877.1_3406009_3406732_+	PRK05340, PRK05340, UDP-2,3-diacylglucosamine hydrolase; Provisional	NA|170aa|up_1|NZ_CP052877.1_3406849_3407359_+	COG0041, PurE, Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase [Nucleotide transport and metabolism]	NA|356aa|up_0|NZ_CP052877.1_3407355_3408423_+	PRK06019, PRK06019, phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed	NA|298aa|down_0|NZ_CP052877.1_3408627_3409521_-	PRK09411, PRK09411, carbamate kinase; Reviewed	NA|272aa|down_1|NZ_CP052877.1_3409517_3410333_-	pfam11392, DUF2877, Protein of unknown function (DUF2877)	NA|420aa|down_2|NZ_CP052877.1_3410343_3411603_-	pfam06545, DUF1116, Protein of unknown function (DUF1116)	NA|556aa|down_3|NZ_CP052877.1_3411612_3413280_-	PRK06091, PRK06091, membrane protein FdrA; Validated	NA|350aa|down_4|NZ_CP052877.1_3413596_3414646_+	PRK15025, PRK15025, ureidoglycolate dehydrogenase; Provisional	NA|412aa|down_5|NZ_CP052877.1_3414667_3415903_+	TIGR03176, AllC, allantoate amidohydrolase	NA|262aa|down_6|NZ_CP052877.1_3415913_3416699_+	COG3257, GlxB, Uncharacterized protein, possibly involved in glyoxylate utilization [General function prediction only]	NA|382aa|down_7|NZ_CP052877.1_3416827_3417973_-	PRK09932, PRK09932, glycerate 3-kinase	NA|434aa|down_8|NZ_CP052877.1_3417994_3419296_-	PRK11412, PRK11412, uracil/xanthine transporter	NA|454aa|down_9|NZ_CP052877.1_3419352_3420714_-	PRK08044, PRK08044, allantoinase AllB
GCF_012974525.1_ASM1297452v1	NZ_CP052877	Escherichia coli strain C21 chromosome, complete genome	9	3713348-3713441	8	CRISPRCasFinder	no		cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	Orphan	GGGGACTGATTTGTGCGCGTTGTTACATCG	30	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	NA|326aa|up_5|NZ_CP052877.1_3707662_3708640_-,NA|310aa|down_9|NZ_CP052877.1_3719732_3720662_+	NA|131aa|up_9|NZ_CP052877.1_3703427_3703820_-	pfam02561, FliS, Flagellar protein FliS	NA|439aa|up_8|NZ_CP052877.1_3703842_3705159_-	PRK08032, fliD, flagellar capping protein; Reviewed	NA|305aa|up_7|NZ_CP052877.1_3705365_3706280_-	NF033376, flg_lateral_LafA, lateral flagellin LafA	NA|281aa|up_6|NZ_CP052877.1_3706765_3707608_+	COG3710, CadC, DNA-binding winged-HTH domains [Transcription]	NA|326aa|up_5|NZ_CP052877.1_3707662_3708640_-	NA	NA|310aa|up_4|NZ_CP052877.1_3708656_3709586_-	PRK07192, flgL, flagellar hook-associated protein FlgL; Reviewed	NA|459aa|up_3|NZ_CP052877.1_3709600_3710977_-	PRK07191, flgK, flagellar hook-associated protein FlgK; Validated	NA|100aa|up_2|NZ_CP052877.1_3711165_3711465_-	PRK12708, flgJ, peptidoglycan hydrolase; Reviewed	NA|367aa|up_1|NZ_CP052877.1_3711464_3712565_-	PRK05303, flgI, flagellar basal body P-ring protein FlgI	NA|222aa|up_0|NZ_CP052877.1_3712579_3713245_-	PRK12407, flgH, flagellar basal body L-ring protein FlgH	NA|262aa|down_0|NZ_CP052877.1_3713516_3714302_-	PRK12693, flgG, flagellar basal body rod protein FlgG; Provisional	NA|246aa|down_1|NZ_CP052877.1_3714480_3715218_-	PRK12640, flgF, flagellar basal body rod protein FlgF; Reviewed	NA|401aa|down_2|NZ_CP052877.1_3715217_3716420_-	PRK06803, flgE, flagellar basal body protein FlaE	NA|238aa|down_3|NZ_CP052877.1_3716504_3717218_-	PRK09619, flgD, flagellar hook assembly protein FlgD	NA|144aa|down_4|NZ_CP052877.1_3717217_3717649_-	PRK06802, flgC, flagellar basal body rod protein FlgC; Reviewed	NA|112aa|down_5|NZ_CP052877.1_3717651_3717987_-	PRK12685, flgB, flagellar basal body rod protein FlgB; Reviewed	NA|246aa|down_6|NZ_CP052877.1_3718068_3718806_+	PRK06804, flgA, flagellar basal body P-ring formation protein FlgA	NA|93aa|down_7|NZ_CP052877.1_3718886_3719165_+	TIGR03824, FlgM_jcvi, flagellar biosynthesis anti-sigma factor FlgM	NA|143aa|down_8|NZ_CP052877.1_3719177_3719606_+	pfam05130, FlgN, FlgN protein	NA|310aa|down_9|NZ_CP052877.1_3719732_3720662_+	NA
GCF_012974525.1_ASM1297452v1	NZ_CP052879	Escherichia coli strain C21 plasmid pC21-2, complete sequence	1	61104-61223	1	CRISPRCasFinder	no			Orphan	TGCGTACCCATCCACCTTTCAGTGCGTACCCATCCACCTTTCA	43	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas2,DEDDh,c2c9_V-U4,DinG	NA|60aa|up_8|NZ_CP052879.1_54046_54226_+,NA|118aa|up_3|NZ_CP052879.1_58208_58562_-,NA|149aa|up_2|NZ_CP052879.1_58698_59145_-,NA	NA|235aa|up_9|NZ_CP052879.1_52276_52981_+	COG3316, COG3316, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|60aa|up_8|NZ_CP052879.1_54046_54226_+	NA	NA|215aa|up_7|NZ_CP052879.1_54341_54986_-	cd03767, SR_Res_par, Serine recombinase (SR) family, Partitioning (par)-Resolvase subfamily, catalytic domain; Serine recombinases catalyze site-specific recombination of DNA molecules by a concerted, four-strand cleavage and rejoining mechanism which involves a transient phosphoserine linkage between DNA and the enzyme	NA|207aa|up_6|NZ_CP052879.1_55280_55901_+	cd02042, ParAB_family, partition proteins ParAB family	NA|77aa|up_5|NZ_CP052879.1_55952_56183_+	pfam09274, ParG, ParG	NA|355aa|up_4|NZ_CP052879.1_56606_57671_+	pfam13614, AAA_31, AAA domain	NA|118aa|up_3|NZ_CP052879.1_58208_58562_-	NA	NA|149aa|up_2|NZ_CP052879.1_58698_59145_-	NA	NA|281aa|up_1|NZ_CP052879.1_59148_59991_-	pfam01051, Rep_3, Initiator Replication protein	NA|280aa|up_0|NZ_CP052879.1_60011_60851_-	pfam01051, Rep_3, Initiator Replication protein	NA|84aa|down_0|NZ_CP052879.1_62085_62337_+	PRK02854, PRK02854, primosomal protein DnaT	NA|94aa|down_1|NZ_CP052879.1_62326_62608_+	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
