assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_008033315.1_ASM803331v1	NZ_CP042865	Escherichia coli strain ATCC BAA-196 chromosome, complete genome	1	653758-653898	1	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4	Orphan	CACGCCGCATCCGCCAGTGGCGCGGTGCAGATGCCGGATGC	41	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4,RT,csf2gr7	NA,NA|357aa|down_1|NZ_CP042865.1_656352_657423_-	NA|184aa|up_9|NZ_CP042865.1_643479_644031_+	pfam10688, Imp-YgjV, Bacterial inner membrane protein	NA|415aa|up_8|NZ_CP042865.1_644035_645280_-	PRK13628, PRK13628, serine/threonine transporter SstT; Provisional	NA|322aa|up_7|NZ_CP042865.1_645678_646644_-	TIGR03718, R_switched_Alx, integral membrane protein, TerC family	NA|329aa|up_6|NZ_CP042865.1_646926_647913_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|231aa|up_5|NZ_CP042865.1_647991_648684_-	COG2949, SanA, Uncharacterized membrane protein [Function unknown]	NA|168aa|up_4|NZ_CP042865.1_648760_649264_-	COG1451, COG1451, Predicted metal-dependent hydrolase [General function prediction only]	NA|379aa|up_3|NZ_CP042865.1_649348_650485_+	PRK15001, PRK15001, 23S rRNA (guanine(1835)-N(2))-methyltransferase RlmG	NA|105aa|up_2|NZ_CP042865.1_650768_651083_+	COG4680, COG4680, Uncharacterized protein conserved in bacteria [Function unknown]	NA|139aa|up_1|NZ_CP042865.1_651079_651496_+	COG5499, COG5499, Predicted transcription regulator containing HTH domain [Transcription]	NA|673aa|up_0|NZ_CP042865.1_651540_653559_-	cd02930, DCR_FMN, 2,4-dienoyl-CoA reductase (DCR) FMN-binding domain	NA|784aa|down_0|NZ_CP042865.1_653984_656336_-	PRK10137, PRK10137, alpha-glucosidase; Provisional	NA|357aa|down_1|NZ_CP042865.1_656352_657423_-	NA	NA|478aa|down_2|NZ_CP042865.1_657556_658990_-	PRK15238, PRK15238, inner membrane transporter YjeM; Provisional	NA|150aa|down_3|NZ_CP042865.1_659052_659502_-	PRK10202, ebgC, beta-galactosidase subunit beta	NA|1031aa|down_4|NZ_CP042865.1_659498_662591_-	PRK10340, ebgA, cryptic beta-D-galactosidase subunit alpha; Reviewed	NA|328aa|down_5|NZ_CP042865.1_662774_663758_-	PRK10339, PRK10339, DNA-binding transcriptional repressor EbgR; Provisional	NA|111aa|down_6|NZ_CP042865.1_663976_664309_+	PRK10089, PRK10089, chaperone CsaA	NA|460aa|down_7|NZ_CP042865.1_664350_665730_-	PRK11522, PRK11522, putrescine--2-oxoglutarate aminotransferase; Provisional	NA|507aa|down_8|NZ_CP042865.1_666147_667668_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|208aa|down_9|NZ_CP042865.1_667821_668445_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]
GCF_008033315.1_ASM803331v1	NZ_CP042865	Escherichia coli strain ATCC BAA-196 chromosome, complete genome	2	980811-981204	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4	Unclear	GTGTTCCCCGCGCCAGCGGGGATAAACC,GTGTTCCCCGCGCCAGCGGGGATAAACC,GTGTTCCCCGCGCCAGCGGGGATAAACC	28,28,28	0	0	NA	NA	I-E:I-E:I-E	6,6,6	6	Unclear	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4,RT,csf2gr7	NA|47aa|up_1|NZ_CP042865.1_979520_979661_-,NA	NA|434aa|up_9|NZ_CP042865.1_970217_971519_+	PRK13168, rumA, 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD	NA|745aa|up_8|NZ_CP042865.1_971566_973801_+	PRK10872, relA, (p)ppGpp synthetase I/GTP pyrophosphokinase; Provisional	NA|83aa|up_7|NZ_CP042865.1_973878_974127_+	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE	NA|112aa|up_6|NZ_CP042865.1_974126_974462_+	PRK09907, PRK09907, endoribonuclease MazF	NA|264aa|up_5|NZ_CP042865.1_974532_975324_+	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|546aa|up_4|NZ_CP042865.1_975551_977189_+	PRK05380, pyrG, CTP synthetase; Validated	NA|433aa|up_3|NZ_CP042865.1_977276_978575_+	PRK00077, eno, enolase; Provisional	NA|291aa|up_2|NZ_CP042865.1_978634_979507_-	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|47aa|up_1|NZ_CP042865.1_979520_979661_-	NA	NA|224aa|up_0|NZ_CP042865.1_979799_980471_+	TIGR04322, organic_radical_activating_enzyme, putative 7-cyano-7-deazaguanosine (preQ0) biosynthesis protein QueE	NA|493aa|down_0|NZ_CP042865.1_981843_983322_-	cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins	NA|426aa|down_1|NZ_CP042865.1_983348_984626_-	cd06174, MFS, Major Facilitator Superfamily	NA|262aa|down_2|NZ_CP042865.1_984944_985730_+	cd05347, Ga5DH-like_SDR_c, gluconate 5-dehydrogenase (Ga5DH)-like, classical (c) SDRs	NA|485aa|down_3|NZ_CP042865.1_985799_987254_+	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|470aa|down_4|NZ_CP042865.1_987275_988685_+	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|260aa|down_5|NZ_CP042865.1_988662_989442_+	COG2086, FixA, Electron transfer flavoprotein, beta subunit [Energy production and conversion]	NA|287aa|down_6|NZ_CP042865.1_989438_990299_+	COG2025, FixB, Electron transfer flavoprotein, alpha subunit [Energy production and conversion]	NA|192aa|down_7|NZ_CP042865.1_990446_991022_-	COG1954, GlpP, Glycerol-3-phosphate responsive antiterminator (mRNA-binding) [Transcription]	NA|87aa|down_8|NZ_CP042865.1_991038_991299_-	COG2440, FixX, Ferredoxin-like protein [Energy production and conversion]	NA|424aa|down_9|NZ_CP042865.1_991289_992561_-	PRK10015, PRK10015, oxidoreductase; Provisional
GCF_008033315.1_ASM803331v1	NZ_CP042865	Escherichia coli strain ATCC BAA-196 chromosome, complete genome	3	1006755-1007517	2,3,2	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4	Type I-E	GAGTTCCCCGCGCCAGCGGGGATAAACCG,GAGTTCCCCGCGCCAGCGGGGATAAACCG,GAGTTCCCCGCGCCAGCGGGGATAAACCG	29,29,29	0	0	NA	NA	I-E:I-E:I-E	8,12,12	12	TypeI-E	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4,RT,csf2gr7	NA,NA	NA|571aa|up_9|NZ_CP042865.1_995118_996831_+	PRK13504, PRK13504, NADPH-dependent assimilatory sulfite reductase hemoprotein subunit	NA|245aa|up_8|NZ_CP042865.1_996905_997640_+	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase	cas3|889aa|up_7|NZ_CP042865.1_997998_1000665_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|503aa|up_6|NZ_CP042865.1_1001079_1002588_+	PRK09693, PRK09693, Cascade antiviral complex protein; Validated	cse2gr11|161aa|up_5|NZ_CP042865.1_1002580_1003063_+	cd09670, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas7|364aa|up_4|NZ_CP042865.1_1003075_1004167_+	TIGR01869, CRISPR_system_Cascade_subunit_CasC, CRISPR-associated protein Cas7/Cse4/CasC, subtype I-E/ECOLI	cas5|225aa|up_3|NZ_CP042865.1_1004169_1004844_+	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas6e|200aa|up_2|NZ_CP042865.1_1004830_1005430_+	TIGR01907, CRISPR_system_Cascade_subunit_CasE, CRISPR-associated protein Cas6/Cse3/CasE, subtype I-E/ECOLI	cas1|306aa|up_1|NZ_CP042865.1_1005445_1006363_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|95aa|up_0|NZ_CP042865.1_1006364_1006649_+	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	NA|346aa|down_0|NZ_CP042865.1_1007599_1008637_-	PRK10199, PRK10199, alkaline phosphatase isozyme conversion aminopeptidase; Provisional	NA|303aa|down_1|NZ_CP042865.1_1008888_1009797_+	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|476aa|down_2|NZ_CP042865.1_1009798_1011226_+	PRK05124, cysN, sulfate adenylyltransferase subunit 1; Provisional	NA|202aa|down_3|NZ_CP042865.1_1011225_1011831_+	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|108aa|down_4|NZ_CP042865.1_1011880_1012204_+	pfam12084, DUF3561, Protein of unknown function (DUF3561)	NA|104aa|down_5|NZ_CP042865.1_1012397_1012709_+	PRK00888, ftsB, cell division protein FtsB; Reviewed	NA|237aa|down_6|NZ_CP042865.1_1012727_1013438_+	PRK00155, ispD, D-ribitol-5-phosphate cytidylyltransferase	NA|160aa|down_7|NZ_CP042865.1_1013437_1013917_+	PRK00084, ispF, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; Reviewed	NA|350aa|down_8|NZ_CP042865.1_1013913_1014963_+	PRK00984, truD, tRNA pseudouridine synthase D; Reviewed	NA|254aa|down_9|NZ_CP042865.1_1014943_1015705_+	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional
GCF_008033315.1_ASM803331v1	NZ_CP042865	Escherichia coli strain ATCC BAA-196 chromosome, complete genome	4	1537930-1538047	4	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4	Orphan	CCGAGCCGTAGGCCGGATAAGGCGTTCACGC	31	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4,RT,csf2gr7	NA,NA	NA|62aa|up_9|NZ_CP042865.1_1527215_1527401_-	PRK09956, PRK09956, ISNCY family transposase	NA|300aa|up_8|NZ_CP042865.1_1527413_1528313_-	PRK09956, PRK09956, ISNCY family transposase	NA|397aa|up_7|NZ_CP042865.1_1528505_1529696_-	TIGR03379, glycerol3P_GlpC, glycerol-3-phosphate dehydrogenase, anaerobic, C subunit	NA|420aa|up_6|NZ_CP042865.1_1529692_1530952_-	COG3075, GlpB, Anaerobic glycerol-3-phosphate dehydrogenase [Amino acid transport and metabolism]	NA|543aa|up_5|NZ_CP042865.1_1530941_1532570_-	PRK11101, glpA, anaerobic glycerol-3-phosphate dehydrogenase subunit A	NA|453aa|up_4|NZ_CP042865.1_1532842_1534201_+	PRK11273, glpT, glycerol-3-phosphate transporter	NA|359aa|up_3|NZ_CP042865.1_1534205_1535282_+	PRK11143, glpQ, glycerophosphodiester phosphodiesterase; Provisional	NA|217aa|up_2|NZ_CP042865.1_1535744_1536395_+	PRK09902, PRK09902, lipopolysaccharide kinase InaA	NA|85aa|up_1|NZ_CP042865.1_1536448_1536703_-	PRK10713, PRK10713, 2Fe-2S ferredoxin-like protein	NA|377aa|up_0|NZ_CP042865.1_1536702_1537833_-	PRK09101, nrdB, ribonucleotide-diphosphate reductase subunit beta; Reviewed	NA|762aa|down_0|NZ_CP042865.1_1538066_1540352_-	PRK09103, PRK09103, ribonucleoside-diphosphate reductase subunit alpha	NA|1251aa|down_1|NZ_CP042865.1_1541047_1544800_+	PRK09752, PRK09752, AIDA-I family autotransporter YfaL	NA|241aa|down_2|NZ_CP042865.1_1544927_1545650_-	PRK05134, PRK05134, bifunctional 2-polyprenyl-6-hydroxyphenol methylase/3-demethylubiquinol 3-O-methyltransferase UbiG	NA|876aa|down_3|NZ_CP042865.1_1545796_1548424_+	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|563aa|down_4|NZ_CP042865.1_1548572_1550261_+	COG4685, COG4685, Uncharacterized protein conserved in bacteria [Function unknown]	NA|208aa|down_5|NZ_CP042865.1_1550257_1550881_+	COG3234, COG3234, Uncharacterized protein conserved in bacteria [Function unknown]	NA|550aa|down_6|NZ_CP042865.1_1555419_1557069_+	COG5445, COG5445, Predicted secreted protein [Function unknown]	NA|259aa|down_7|NZ_CP042865.1_1557073_1557850_+	COG4676, COG4676, Uncharacterized protein conserved in bacteria [Function unknown]	NA|395aa|down_8|NZ_CP042865.1_1557923_1559108_-	PRK05790, PRK05790, putative acyltransferase; Provisional	NA|441aa|down_9|NZ_CP042865.1_1559138_1560461_-	pfam02667, SCFA_trans, Short chain fatty acid transporter
GCF_008033315.1_ASM803331v1	NZ_CP042865	Escherichia coli strain ATCC BAA-196 chromosome, complete genome	5	2140396-2140519	5	CRISPRCasFinder	no	DEDDh	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4	Unclear	CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA	43	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4,RT,csf2gr7	NA,NA|30aa|down_7|NZ_CP042865.1_2149415_2149505_+	NA|471aa|up_9|NZ_CP042865.1_2129850_2131263_-	PRK09206, PRK09206, pyruvate kinase PykF	NA|70aa|up_8|NZ_CP042865.1_2131819_2132029_+	PRK10292, PRK10292, fumarate hydratase FumD	NA|209aa|up_7|NZ_CP042865.1_2132483_2133110_+	PRK09898, PRK09898, ferredoxin-like protein	NA|701aa|up_6|NZ_CP042865.1_2133130_2135233_+	PRK09849, PRK09849, putative oxidoreductase; Provisional	NA|213aa|up_5|NZ_CP042865.1_2135245_2135884_+	PRK09947, PRK09947, YdhW family putative oxidoreductase system protein	NA|223aa|up_4|NZ_CP042865.1_2135947_2136616_+	TIGR03149, cyt_nit_nrfC, cytochrome c nitrite reductase, Fe-S protein	NA|262aa|up_3|NZ_CP042865.1_2136612_2137398_+	PRK15006, PRK15006, thiosulfate reductase cytochrome B subunit; Provisional	NA|271aa|up_2|NZ_CP042865.1_2137401_2138214_+	PRK09946, PRK09946, hypothetical protein; Provisional	NA|535aa|up_1|NZ_CP042865.1_2138225_2139830_-	PRK09897, PRK09897, FAD-NAD(P)-binding protein	NA|102aa|up_0|NZ_CP042865.1_2139955_2140261_-	PRK11118, PRK11118, putative monooxygenase; Provisional	NA|419aa|down_0|NZ_CP042865.1_2140833_2142090_+	PRK09945, PRK09945, hypothetical protein; Provisional	NA|458aa|down_1|NZ_CP042865.1_2142130_2143504_-	PRK01766, PRK01766, multidrug efflux protein; Reviewed	NA|214aa|down_2|NZ_CP042865.1_2143718_2144360_+	PRK13020, PRK13020, riboflavin synthase subunit alpha; Provisional	NA|383aa|down_3|NZ_CP042865.1_2144399_2145548_-	PRK11705, PRK11705, cyclopropane fatty acyl phospholipid synthase	NA|404aa|down_4|NZ_CP042865.1_2145838_2147050_-	PRK11043, PRK11043, Bcr/CflA family multidrug efflux MFS transporter	NA|311aa|down_5|NZ_CP042865.1_2147162_2148095_+	PRK11074, PRK11074, putative DNA-binding transcriptional regulator; Provisional	NA|342aa|down_6|NZ_CP042865.1_2148091_2149117_-	PRK10703, PRK10703, HTH-type transcriptional repressor PurR	NA|30aa|down_7|NZ_CP042865.1_2149415_2149505_+	NA	NA|390aa|down_8|NZ_CP042865.1_2149670_2150840_+	COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport and metabolism]	NA|194aa|down_9|NZ_CP042865.1_2151001_2151583_-	PRK10543, PRK10543, superoxide dismutase [Fe]
GCF_008033315.1_ASM803331v1	NZ_CP042865	Escherichia coli strain ATCC BAA-196 chromosome, complete genome	6	2819146-2819237	6	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4	Orphan	CCACCTTTTTTACCTGCTTCAGATGC	26	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4,RT,csf2gr7	NA|70aa|up_9|NZ_CP042865.1_2808222_2808432_-,NA	NA|70aa|up_9|NZ_CP042865.1_2808222_2808432_-	NA	NA|1321aa|up_8|NZ_CP042865.1_2808486_2812449_+	PRK11809, putA, trifunctional transcriptional regulator/proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase; Reviewed	NA|213aa|up_7|NZ_CP042865.1_2812488_2813127_-	PRK15008, PRK15008, HTH-type transcriptional regulator RutR; Provisional	NA|364aa|up_6|NZ_CP042865.1_2813414_2814506_+	TIGR03612, RutA, pyrimidine utilization protein A	NA|231aa|up_5|NZ_CP042865.1_2814505_2815198_+	TIGR03614, RutB, pyrimidine utilization protein B	NA|129aa|up_4|NZ_CP042865.1_2815209_2815596_+	TIGR03610, RutC, pyrimidine utilization protein C	NA|267aa|up_3|NZ_CP042865.1_2815603_2816404_+	TIGR03611, RutD, pyrimidine utilization protein D	NA|197aa|up_2|NZ_CP042865.1_2816413_2817004_+	PRK05365, PRK05365, malonic semialdehyde reductase; Provisional	NA|165aa|up_1|NZ_CP042865.1_2817014_2817509_+	TIGR03615, flavoprotein_oxidoreductase, pyrimidine utilization flavin reductase protein F	NA|443aa|up_0|NZ_CP042865.1_2817529_2818858_+	TIGR03616, Putative_pyrimidine_permease_RutG, pyrimidine utilization transport protein G	NA|199aa|down_0|NZ_CP042865.1_2819660_2820257_+	PRK03767, PRK03767, NAD(P)H:quinone oxidoreductase; Provisional	NA|76aa|down_1|NZ_CP042865.1_2820277_2820505_+	PRK10174, PRK10174, hypothetical protein; Provisional	NA|414aa|down_2|NZ_CP042865.1_2820542_2821784_-	PRK10173, PRK10173, glucose-1-phosphatase/inositol phosphatase; Provisional	NA|419aa|down_3|NZ_CP042865.1_2822076_2823333_-	PRK09784, PRK09784, YccE family protein	NA|307aa|down_4|NZ_CP042865.1_2823593_2824514_+	PRK10266, PRK10266, curved DNA-binding protein	NA|102aa|down_5|NZ_CP042865.1_2824513_2824819_+	PRK10265, PRK10265, chaperone modulator CbpM	NA|200aa|down_6|NZ_CP042865.1_2824970_2825570_-	PRK04976, torD, chaperone protein TorD; Validated	NA|849aa|down_7|NZ_CP042865.1_2825566_2828113_-	PRK15102, PRK15102, trimethylamine-N-oxide reductase TorA	NA|391aa|down_8|NZ_CP042865.1_2828112_2829285_-	PRK15032, PRK15032, pentaheme c-type cytochrome TorC	NA|231aa|down_9|NZ_CP042865.1_2829414_2830107_+	PRK10766, PRK10766, two-component system response regulator TorR
GCF_008033315.1_ASM803331v1	NZ_CP042865	Escherichia coli strain ATCC BAA-196 chromosome, complete genome	7	3131276-3131420	7	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4	Orphan	GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC	52	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4,RT,csf2gr7	NA|94aa|up_4|NZ_CP042865.1_3127948_3128230_+,NA|56aa|up_3|NZ_CP042865.1_3128318_3128486_+,NA	NA|227aa|up_9|NZ_CP042865.1_3125575_3126256_+	pfam09588, YqaJ, YqaJ-like viral recombinase domain	NA|61aa|up_8|NZ_CP042865.1_3126252_3126435_+	pfam07026, DUF1317, Protein of unknown function (DUF1317)	NA|64aa|up_7|NZ_CP042865.1_3126407_3126599_+	pfam07131, DUF1382, Protein of unknown function (DUF1382)	NA|74aa|up_6|NZ_CP042865.1_3126990_3127212_+	PHA00080, PHA00080, DksA-like zinc finger domain containing protein	NA|183aa|up_5|NZ_CP042865.1_3127208_3127757_+	pfam13935, Ead_Ea22, Ead/Ea22-like protein	NA|94aa|up_4|NZ_CP042865.1_3127948_3128230_+	NA	NA|56aa|up_3|NZ_CP042865.1_3128318_3128486_+	NA	NA|73aa|up_2|NZ_CP042865.1_3128525_3128744_+	pfam07825, Exc, Excisionase-like protein	NA|357aa|up_1|NZ_CP042865.1_3128721_3129792_+	cd00800, INT_Lambda_C, C-terminal catalytic domain of Lambda integrase, a tyrosine-based site-specific recombinase	NA|428aa|up_0|NZ_CP042865.1_3129926_3131210_+	PRK10531, PRK10531, putative acyl-CoA thioester hydrolase	NA|754aa|down_0|NZ_CP042865.1_3131443_3133705_-	PRK11413, PRK11413, putative hydratase; Provisional	NA|478aa|down_1|NZ_CP042865.1_3133887_3135321_-	pfam00939, Na_sulph_symp, Sodium:sulfate symporter transmembrane region	NA|351aa|down_2|NZ_CP042865.1_3135396_3136449_-	NF033377, OMA_tautomer, 4-oxalomesaconate tautomerase	NA|318aa|down_3|NZ_CP042865.1_3136632_3137586_+	cd08440, PBP2_LTTR_like_4, TThe C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold	NA|332aa|down_4|NZ_CP042865.1_3137626_3138622_-	PRK11028, PRK11028, 6-phosphogluconolactonase; Provisional	NA|273aa|down_5|NZ_CP042865.1_3138776_3139595_+	PRK10530, PRK10530, pyridoxal phosphate (PLP) phosphatase; Provisional	NA|353aa|down_6|NZ_CP042865.1_3139595_3140654_-	PRK11144, modC, molybdenum ABC transporter ATP-binding protein ModC	NA|230aa|down_7|NZ_CP042865.1_3140656_3141346_-	PRK09421, modB, molybdate ABC transporter permease subunit	NA|258aa|down_8|NZ_CP042865.1_3141345_3142119_-	PRK10677, modA, molybdate transporter periplasmic protein; Provisional	NA|50aa|down_9|NZ_CP042865.1_3142285_3142435_-	pfam10766, AcrZ, Multidrug efflux pump-associated protein AcrZ
GCF_008033315.1_ASM803331v1	NZ_CP042865	Escherichia coli strain ATCC BAA-196 chromosome, complete genome	8	3385746-3385842	8	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4	Orphan	TTGTAGGCCTGATAAGATGCGTCAAGC	27	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4,RT,csf2gr7	NA,NA	NA|231aa|up_9|NZ_CP042865.1_3377541_3378234_-	PRK15195, PRK15195, molecular chaperone FimC	NA|181aa|up_8|NZ_CP042865.1_3378453_3378996_-	PRK15194, PRK15194, type 1 fimbrial protein subunit FimA	NA|289aa|up_7|NZ_CP042865.1_3379466_3380333_+	PRK10792, PRK10792, bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase FolD	NA|71aa|up_6|NZ_CP042865.1_3380334_3380547_+	PRK11507, PRK11507, ribosome-associated protein YbcJ	NA|174aa|up_5|NZ_CP042865.1_3380654_3381176_+	COG1988, COG1988, Predicted membrane-bound metal-dependent hydrolases [General function prediction only]	NA|462aa|up_4|NZ_CP042865.1_3381211_3382597_-	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|165aa|up_3|NZ_CP042865.1_3382770_3383265_+	PRK10791, PRK10791, peptidylprolyl isomerase B	NA|241aa|up_2|NZ_CP042865.1_3383267_3383990_+	PRK05340, PRK05340, UDP-2,3-diacylglucosamine hydrolase; Provisional	NA|170aa|up_1|NZ_CP042865.1_3384107_3384617_+	COG0041, PurE, Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase [Nucleotide transport and metabolism]	NA|356aa|up_0|NZ_CP042865.1_3384613_3385681_+	PRK06019, PRK06019, phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed	NA|298aa|down_0|NZ_CP042865.1_3385875_3386769_-	PRK09411, PRK09411, carbamate kinase; Reviewed	NA|272aa|down_1|NZ_CP042865.1_3386765_3387581_-	pfam11392, DUF2877, Protein of unknown function (DUF2877)	NA|420aa|down_2|NZ_CP042865.1_3387591_3388851_-	pfam06545, DUF1116, Protein of unknown function (DUF1116)	NA|556aa|down_3|NZ_CP042865.1_3388860_3390528_-	PRK06091, PRK06091, membrane protein FdrA; Validated	NA|350aa|down_4|NZ_CP042865.1_3390844_3391894_+	PRK15025, PRK15025, ureidoglycolate dehydrogenase; Provisional	NA|412aa|down_5|NZ_CP042865.1_3391915_3393151_+	TIGR03176, AllC, allantoate amidohydrolase	NA|262aa|down_6|NZ_CP042865.1_3393161_3393947_+	TIGR03214, ura-cupin, putative allantoin catabolism protein	NA|382aa|down_7|NZ_CP042865.1_3394174_3395320_-	PRK09932, PRK09932, glycerate 3-kinase	NA|434aa|down_8|NZ_CP042865.1_3395341_3396643_-	PRK11412, PRK11412, uracil/xanthine transporter	NA|454aa|down_9|NZ_CP042865.1_3396699_3398061_-	PRK08044, PRK08044, allantoinase AllB
GCF_008033315.1_ASM803331v1	NZ_CP042865	Escherichia coli strain ATCC BAA-196 chromosome, complete genome	9	3525988-3526132	9	CRISPRCasFinder	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4	Orphan	TTTTGCAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAT	43	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4,RT,csf2gr7	NA,NA|28aa|down_2|NZ_CP042865.1_3528177_3528261_-	NA|357aa|up_9|NZ_CP042865.1_3511126_3512197_-	PRK00147, queA, S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional	NA|194aa|up_8|NZ_CP042865.1_3512289_3512871_+	PRK10045, PRK10045, ACP phosphodiesterase	NA|606aa|up_7|NZ_CP042865.1_3512875_3514693_-	PRK10785, PRK10785, maltodextrin glucosidase; Provisional	NA|458aa|up_6|NZ_CP042865.1_3514848_3516222_-	PRK10580, proY, putative proline-specific permease; Provisional	NA|440aa|up_5|NZ_CP042865.1_3516297_3517617_-	PRK15433, PRK15433, branched-chain amino acid transporter carrier protein BrnQ	NA|432aa|up_4|NZ_CP042865.1_3518023_3519319_-	PRK11006, phoR, phosphate regulon sensor histidine kinase PhoR	NA|230aa|up_3|NZ_CP042865.1_3519376_3520066_-	PRK10161, PRK10161, phosphate response regulator transcription factor PhoB	NA|401aa|up_2|NZ_CP042865.1_3520255_3521458_+	PRK10966, PRK10966, exonuclease subunit SbcD; Provisional	NA|1049aa|up_1|NZ_CP042865.1_3521454_3524601_+	PRK10246, PRK10246, exonuclease subunit SbcC; Provisional	NA|395aa|up_0|NZ_CP042865.1_3524726_3525911_+	PRK10091, PRK10091, MFS transport protein AraJ; Provisional	NA|303aa|down_0|NZ_CP042865.1_3526155_3527064_-	PRK09557, PRK09557, fructokinase; Reviewed	NA|304aa|down_1|NZ_CP042865.1_3527188_3528100_+	COG2974, RdgC, DNA recombination-dependent growth factor C [DNA replication, recombination, and repair]	NA|28aa|down_2|NZ_CP042865.1_3528177_3528261_-	NA	NA|95aa|down_3|NZ_CP042865.1_3528746_3529031_-	PRK10579, PRK10579, pyrimidine/purine nucleoside phosphorylase	NA|226aa|down_4|NZ_CP042865.1_3529102_3529780_-	PRK10481, PRK10481, hypothetical protein; Provisional	NA|64aa|down_5|NZ_CP042865.1_3530037_3530229_-	PRK10380, PRK10380, hypothetical protein; Provisional	NA|175aa|down_6|NZ_CP042865.1_3530278_3530803_-	PRK03731, aroL, shikimate kinase AroL	NA|153aa|down_7|NZ_CP042865.1_3530985_3531444_-	PRK00124, PRK00124, YaiI/YqxD family protein	NA|270aa|down_8|NZ_CP042865.1_3531563_3532373_+	PRK11880, PRK11880, pyrroline-5-carboxylate reductase; Reviewed	NA|372aa|down_9|NZ_CP042865.1_3532389_3533505_-	PRK10245, adrA, diguanylate cyclase AdrA; Provisional
GCF_008033315.1_ASM803331v1	NZ_CP042865	Escherichia coli strain ATCC BAA-196 chromosome, complete genome	10	3880335-3880470	3	PILER-CR	no		cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4	Orphan	TGAATCACCAATATTGAAAA	20	0	0	NA	NA	NA	2	2	Orphan	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4,RT,csf2gr7	NA,NA	NA|126aa|up_9|NZ_CP042865.1_3871043_3871421_+	PRK05461, apaG, CO2+/MG2+ efflux protein ApaG; Reviewed	NA|281aa|up_8|NZ_CP042865.1_3871427_3872270_+	TIGR00668, Bis5'-nucleosyl-tetraphosphatase_symmetrical, bis(5'-nucleosyl)-tetraphosphatase (symmetrical)	NA|160aa|up_7|NZ_CP042865.1_3872347_3872827_-	PRK10769, folA, type 3 dihydrofolate reductase	NA|621aa|up_6|NZ_CP042865.1_3873018_3874881_-	PRK03562, PRK03562, glutathione-regulated potassium-efflux system protein KefC; Provisional	NA|177aa|up_5|NZ_CP042865.1_3874873_3875404_-	PRK00871, PRK00871, glutathione-regulated potassium-efflux system oxidoreductase KefF	NA|444aa|up_4|NZ_CP042865.1_3875511_3876843_-	cd17316, MFS_SV2_like, Metazoan Synaptic vesicle glycoprotein 2 (SV2) and related small molecule transporters of the Major Facilitator Superfamily	NA|96aa|up_3|NZ_CP042865.1_3876899_3877187_-	PRK15449, PRK15449, ferredoxin-like protein FixX; Provisional	NA|429aa|up_2|NZ_CP042865.1_3877183_3878470_-	PRK10157, PRK10157, putative oxidoreductase FixC; Provisional	NA|314aa|up_1|NZ_CP042865.1_3878520_3879462_-	PRK03363, fixB, electron transfer flavoprotein subunit alpha/FixB family protein	NA|257aa|up_0|NZ_CP042865.1_3879476_3880247_-	PRK03359, PRK03359, putative electron transfer flavoprotein FixA; Reviewed	NA|505aa|down_0|NZ_CP042865.1_3880718_3882233_+	PRK03356, PRK03356, L-carnitine/gamma-butyrobetaine antiport BCCT transporter	NA|381aa|down_1|NZ_CP042865.1_3882263_3883406_+	PRK03354, PRK03354, crotonobetainyl-CoA dehydrogenase; Validated	NA|406aa|down_2|NZ_CP042865.1_3883534_3884752_+	PRK03525, PRK03525, L-carnitine CoA-transferase	NA|518aa|down_3|NZ_CP042865.1_3884825_3886379_+	PRK08008, caiC, putative crotonobetaine/carnitine-CoA ligase; Validated	NA|262aa|down_4|NZ_CP042865.1_3886487_3887273_+	PRK03580, PRK03580, crotonobetainyl-CoA hydratase	NA|197aa|down_5|NZ_CP042865.1_3887278_3887869_+	PRK13627, PRK13627, carnitine operon protein CaiE; Provisional	NA|132aa|down_6|NZ_CP042865.1_3887954_3888350_-	PRK11476, PRK11476, carnitine metabolism transcriptional regulator CaiF	NA|1074aa|down_7|NZ_CP042865.1_3888611_3891833_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|383aa|down_8|NZ_CP042865.1_3891850_3892999_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|274aa|down_9|NZ_CP042865.1_3893454_3894276_-	COG0289, DapB, Dihydrodipicolinate reductase [Amino acid transport and metabolism]
GCF_008033315.1_ASM803331v1	NZ_CP042866	Escherichia coli strain ATCC BAA-196 plasmid unnamed1, complete sequence	1	137862-139106	1,1	CRISPRCasFinder,CRT	no	RT,csf2gr7,DinG	RT,csa3,csf2gr7,DinG	Type IV-A	CCGATAACCCCCGCATGCGGGGGGAATAC,CCGATAACCCCCGCANGCGGGGGGAATAC	29,29	0	0	NA	NA	NA:NA	20,20	20	TypeIV-A	cas3,csa3,PD-DExK,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,WYL,DEDDh,DinG,c2c9_V-U4,RT,csf2gr7	NA|126aa|up_9|NZ_CP042866.1_127421_127799_+,NA|325aa|up_8|NZ_CP042866.1_127996_128971_-,NA|87aa|up_5|NZ_CP042866.1_131555_131816_+,NA|236aa|down_1|NZ_CP042866.1_140207_140915_-,NA|264aa|down_4|NZ_CP042866.1_143354_144146_-	NA|126aa|up_9|NZ_CP042866.1_127421_127799_+	NA	NA|325aa|up_8|NZ_CP042866.1_127996_128971_-	NA	NA|53aa|up_7|NZ_CP042866.1_129573_129732_-	PRK09738, PRK09738, small toxic polypeptide; Provisional	NA|394aa|up_6|NZ_CP042866.1_129803_130985_+	pfam00665, rve, Integrase core domain	NA|87aa|up_5|NZ_CP042866.1_131555_131816_+	NA	NA|157aa|up_4|NZ_CP042866.1_131925_132396_-	pfam08808, RES, RES domain	NA|148aa|up_3|NZ_CP042866.1_132392_132836_-	TIGR02293, conserved_protein_of_unknown_function, putative toxin-antitoxin system antitoxin component, TIGR02293 family	NA|424aa|up_2|NZ_CP042866.1_132936_134208_-	PRK03609, umuC, translesion error-prone DNA polymerase V subunit UmuC	NA|141aa|up_1|NZ_CP042866.1_134207_134630_-	PRK10276, PRK10276, translesion error-prone DNA polymerase V autoproteolytic subunit	RT|496aa|up_0|NZ_CP042866.1_134715_136203_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	csf2gr7|344aa|down_0|NZ_CP042866.1_139171_140203_-	TIGR03115, cas7_csf2, CRISPR type IV/AFERR-associated protein Csf2	NA|236aa|down_1|NZ_CP042866.1_140207_140915_-	NA	DinG|625aa|down_2|NZ_CP042866.1_140908_142783_-	COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair]	NA|189aa|down_3|NZ_CP042866.1_142785_143352_-	cd09727, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	NA|264aa|down_4|NZ_CP042866.1_143354_144146_-	NA	NA|138aa|down_5|NZ_CP042866.1_144208_144622_-	pfam01637, ATPase_2, ATPase domain predominantly from Archaea	NA|220aa|down_6|NZ_CP042866.1_145742_146402_+	PRK13757, PRK13757, type A chloramphenicol O-acetyltransferase	NA|126aa|down_7|NZ_CP042866.1_146602_146980_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|335aa|down_8|NZ_CP042866.1_147290_148295_-	COG3547, COG3547, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|989aa|down_9|NZ_CP042866.1_148373_151340_-	pfam01526, DDE_Tnp_Tn3, Tn3 transposase DDE domain
