assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_003073815.1_ASM307381v1	NZ_CP029108	Escherichia coli strain AR437 chromosome, complete genome	1	28694-28838	1	CRISPRCasFinder	no		DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	Orphan	GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC	52	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	NA|94aa|up_6|NZ_CP029108.1_24079_24361_+,NA|201aa|up_4|NZ_CP029108.1_24891_25494_-,NA|56aa|up_3|NZ_CP029108.1_25736_25904_+,NA	NA|227aa|up_9|NZ_CP029108.1_23045_23726_+	pfam09588, YqaJ, YqaJ-like viral recombinase domain	NA|61aa|up_8|NZ_CP029108.1_23722_23905_+	pfam07026, DUF1317, Protein of unknown function (DUF1317)	NA|64aa|up_7|NZ_CP029108.1_23877_24069_+	pfam07131, DUF1382, Protein of unknown function (DUF1382)	NA|94aa|up_6|NZ_CP029108.1_24079_24361_+	NA	NA|74aa|up_5|NZ_CP029108.1_24459_24681_+	PHA00080, PHA00080, DksA-like zinc finger domain containing protein	NA|201aa|up_4|NZ_CP029108.1_24891_25494_-	NA	NA|56aa|up_3|NZ_CP029108.1_25736_25904_+	NA	NA|73aa|up_2|NZ_CP029108.1_25943_26162_+	pfam07825, Exc, Excisionase-like protein	NA|357aa|up_1|NZ_CP029108.1_26139_27210_+	cd00800, INT_Lambda_C, C-terminal catalytic domain of Lambda integrase, a tyrosine-based site-specific recombinase	NA|428aa|up_0|NZ_CP029108.1_27344_28628_+	PRK10531, PRK10531, putative acyl-CoA thioester hydrolase	NA|754aa|down_0|NZ_CP029108.1_28861_31123_-	PRK11413, PRK11413, putative hydratase; Provisional	NA|478aa|down_1|NZ_CP029108.1_31305_32739_-	pfam00939, Na_sulph_symp, Sodium:sulfate symporter transmembrane region	NA|351aa|down_2|NZ_CP029108.1_32814_33867_-	NF033377, OMA_tautomer, 4-oxalomesaconate tautomerase	NA|318aa|down_3|NZ_CP029108.1_34050_35004_+	cd08440, PBP2_LTTR_like_4, TThe C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold	NA|332aa|down_4|NZ_CP029108.1_35044_36040_-	PRK11028, PRK11028, 6-phosphogluconolactonase; Provisional	NA|273aa|down_5|NZ_CP029108.1_36194_37013_+	PRK10530, PRK10530, pyridoxal phosphate (PLP) phosphatase; Provisional	NA|353aa|down_6|NZ_CP029108.1_37013_38072_-	PRK11144, modC, molybdenum ABC transporter ATP-binding protein ModC	NA|230aa|down_7|NZ_CP029108.1_38074_38764_-	PRK09421, modB, molybdate ABC transporter permease subunit	NA|258aa|down_8|NZ_CP029108.1_38763_39537_-	PRK10677, modA, molybdate transporter periplasmic protein; Provisional	NA|50aa|down_9|NZ_CP029108.1_39703_39853_-	pfam10766, AcrZ, Multidrug efflux pump-associated protein AcrZ
GCF_003073815.1_ASM307381v1	NZ_CP029108	Escherichia coli strain AR437 chromosome, complete genome	2	550299-550452	2	CRISPRCasFinder	no		DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	Orphan	CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCG	53	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	NA,NA|150aa|down_8|NZ_CP029108.1_560029_560479_-	NA|352aa|up_9|NZ_CP029108.1_540124_541180_+	PRK10159, PRK10159, phosphoporin PhoE	NA|134aa|up_8|NZ_CP029108.1_541218_541620_-	PRK10984, PRK10984, sigma factor-binding protein Crl	NA|415aa|up_7|NZ_CP029108.1_541677_542922_-	PRK05077, frsA, esterase FrsA	NA|153aa|up_6|NZ_CP029108.1_543013_543472_-	PRK09177, PRK09177, xanthine-guanine phosphoribosyltransferase; Validated	NA|486aa|up_5|NZ_CP029108.1_543732_545190_+	PRK15026, PRK15026, aminoacyl-histidine dipeptidase; Provisional	NA|89aa|up_4|NZ_CP029108.1_545546_545813_-	PRK09588, PRK09588, hypothetical protein; Reviewed	NA|151aa|up_3|NZ_CP029108.1_546119_546572_-	PRK09831, PRK09831, GNAT family N-acetyltransferase	NA|352aa|up_2|NZ_CP029108.1_546568_547624_-	PRK02406, PRK02406, DNA polymerase IV; Validated	NA|262aa|up_1|NZ_CP029108.1_547694_548480_-	PRK06778, PRK06778, hypothetical protein; Validated	NA|580aa|up_0|NZ_CP029108.1_548424_550164_+	COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell motility and secretion / Intracellular trafficking and secretion]	NA|166aa|down_0|NZ_CP029108.1_550481_550979_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|253aa|down_1|NZ_CP029108.1_551154_551913_-	COG0791, Spr, Cell wall-associated hydrolases (invasion-associated proteins) [Cell envelope biogenesis, outer membrane]	NA|247aa|down_2|NZ_CP029108.1_552204_552945_+	COG3034, COG3034, Uncharacterized protein conserved in bacteria [Function unknown]	NA|256aa|down_3|NZ_CP029108.1_552915_553683_-	pfam13230, GATase_4, Glutamine amidotransferases class-II	NA|193aa|down_4|NZ_CP029108.1_553888_554467_-	PRK00414, gmhA, D-sedoheptulose 7-phosphate isomerase	NA|815aa|down_5|NZ_CP029108.1_554706_557151_+	PRK09463, fadE, acyl-CoA dehydrogenase; Reviewed	NA|158aa|down_6|NZ_CP029108.1_557193_557667_-	PRK09993, PRK09993, C-lysozyme inhibitor; Provisional	NA|257aa|down_7|NZ_CP029108.1_557820_558591_+	PRK10438, PRK10438, C-N hydrolase family amidase; Provisional	NA|150aa|down_8|NZ_CP029108.1_560029_560479_-	NA	NA|451aa|down_9|NZ_CP029108.1_565391_566744_+	COG3515, COG3515, Predicted component of the type VI protein secretion system [Intracellular trafficking, secretion, and    vesicular transport]
GCF_003073815.1_ASM307381v1	NZ_CP029108	Escherichia coli strain AR437 chromosome, complete genome	3	740363-740478	3	CRISPRCasFinder	no		DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	Orphan	AACGCCTGATGCGACGCTGACGCGTCTTATC	31	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	NA,NA	NA|393aa|up_9|NZ_CP029108.1_728378_729557_-	TIGR00899, Sugar_efflux_transporter_A, sugar efflux transporter	NA|44aa|up_8|NZ_CP029108.1_729658_729790_-	pfam15894, SgrT, Inhibitor of glucose uptake transporter SgrT	NA|552aa|up_7|NZ_CP029108.1_729878_731534_+	PRK13626, PRK13626, HTH-type transcriptional regulator SgrR	NA|328aa|up_6|NZ_CP029108.1_731697_732681_+	PRK11205, tbpA, thiamine transporter substrate binding subunit; Provisional	NA|537aa|up_5|NZ_CP029108.1_732656_734267_+	PRK09433, thiP, thiamine transporter membrane protein; Reviewed	NA|233aa|up_4|NZ_CP029108.1_734250_734949_+	PRK10771, thiQ, thiamine ABC transporter ATP-binding protein ThiQ	NA|255aa|up_3|NZ_CP029108.1_735062_735827_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|293aa|up_2|NZ_CP029108.1_735912_736791_-	PRK10572, PRK10572, arabinose operon transcriptional regulator AraC	NA|567aa|up_1|NZ_CP029108.1_737129_738830_+	PRK04123, PRK04123, ribulokinase; Provisional	NA|501aa|up_0|NZ_CP029108.1_738840_740343_+	PRK02929, PRK02929, L-arabinose isomerase; Provisional	NA|232aa|down_0|NZ_CP029108.1_740542_741238_+	PRK08193, araD, L-ribulose-5-phosphate 4-epimerase AraD	NA|784aa|down_1|NZ_CP029108.1_741312_743664_+	PRK05762, PRK05762, DNA polymerase II; Reviewed	NA|969aa|down_2|NZ_CP029108.1_743828_746735_+	PRK04914, PRK04914, RNA polymerase-associated protein RapA	NA|220aa|down_3|NZ_CP029108.1_746746_747406_+	PRK10158, PRK10158, bifunctional tRNA pseudouridine(32) synthase/23S rRNA pseudouridine(746) synthase RluA	NA|272aa|down_4|NZ_CP029108.1_747522_748338_-	PRK09430, djlA, co-chaperone DjlA	NA|785aa|down_5|NZ_CP029108.1_748592_750947_+	PRK03761, PRK03761, LPS assembly outer membrane complex protein LptD; Provisional	NA|429aa|down_6|NZ_CP029108.1_750999_752286_+	PRK10770, PRK10770, peptidyl-prolyl cis-trans isomerase SurA; Provisional	NA|330aa|down_7|NZ_CP029108.1_752285_753275_+	PRK00232, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase; Reviewed	NA|274aa|down_8|NZ_CP029108.1_753271_754093_+	PRK00274, ksgA, 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))-dimethyltransferase RsmA	NA|126aa|down_9|NZ_CP029108.1_754095_754473_+	PRK05461, apaG, CO2+/MG2+ efflux protein ApaG; Reviewed
GCF_003073815.1_ASM307381v1	NZ_CP029108	Escherichia coli strain AR437 chromosome, complete genome	4	763391-763523	1	PILER-CR	no		DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	Orphan	ATCACCAATATTGAAAA	17	0	0	NA	NA	NA	2	2	Orphan	DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	NA,NA	NA|126aa|up_9|NZ_CP029108.1_754095_754473_+	PRK05461, apaG, CO2+/MG2+ efflux protein ApaG; Reviewed	NA|281aa|up_8|NZ_CP029108.1_754479_755322_+	TIGR00668, Bis5'-nucleosyl-tetraphosphatase_symmetrical, bis(5'-nucleosyl)-tetraphosphatase (symmetrical)	NA|160aa|up_7|NZ_CP029108.1_755399_755879_-	PRK10769, folA, type 3 dihydrofolate reductase	NA|621aa|up_6|NZ_CP029108.1_756070_757933_-	PRK03562, PRK03562, glutathione-regulated potassium-efflux system protein KefC; Provisional	NA|177aa|up_5|NZ_CP029108.1_757925_758456_-	PRK00871, PRK00871, glutathione-regulated potassium-efflux system oxidoreductase KefF	NA|444aa|up_4|NZ_CP029108.1_758563_759895_-	cd17316, MFS_SV2_like, Metazoan Synaptic vesicle glycoprotein 2 (SV2) and related small molecule transporters of the Major Facilitator Superfamily	NA|96aa|up_3|NZ_CP029108.1_759952_760240_-	PRK15449, PRK15449, ferredoxin-like protein FixX; Provisional	NA|429aa|up_2|NZ_CP029108.1_760236_761523_-	PRK10157, PRK10157, putative oxidoreductase FixC; Provisional	NA|314aa|up_1|NZ_CP029108.1_761573_762515_-	PRK03363, fixB, electron transfer flavoprotein subunit alpha/FixB family protein	NA|257aa|up_0|NZ_CP029108.1_762529_763300_-	PRK03359, PRK03359, putative electron transfer flavoprotein FixA; Reviewed	NA|505aa|down_0|NZ_CP029108.1_763773_765288_+	PRK03356, PRK03356, L-carnitine/gamma-butyrobetaine antiport BCCT transporter	NA|381aa|down_1|NZ_CP029108.1_765318_766461_+	PRK03354, PRK03354, crotonobetainyl-CoA dehydrogenase; Validated	NA|406aa|down_2|NZ_CP029108.1_766589_767807_+	PRK03525, PRK03525, L-carnitine CoA-transferase	NA|518aa|down_3|NZ_CP029108.1_767880_769434_+	PRK08008, caiC, putative crotonobetaine/carnitine-CoA ligase; Validated	NA|262aa|down_4|NZ_CP029108.1_769542_770328_+	PRK03580, PRK03580, crotonobetainyl-CoA hydratase	NA|197aa|down_5|NZ_CP029108.1_770333_770924_+	PRK13627, PRK13627, carnitine operon protein CaiE; Provisional	NA|132aa|down_6|NZ_CP029108.1_771009_771405_-	PRK11476, PRK11476, carnitine metabolism transcriptional regulator CaiF	NA|1074aa|down_7|NZ_CP029108.1_771665_774887_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|383aa|down_8|NZ_CP029108.1_774904_776053_-	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit	NA|274aa|down_9|NZ_CP029108.1_776508_777330_-	COG0289, DapB, Dihydrodipicolinate reductase [Amino acid transport and metabolism]
GCF_003073815.1_ASM307381v1	NZ_CP029108	Escherichia coli strain AR437 chromosome, complete genome	5	2216933-2217072	4	CRISPRCasFinder	no		DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	Orphan	TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA	49	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	NA,NA	NA|336aa|up_9|NZ_CP029108.1_2209398_2210406_-	PRK10508, PRK10508, luciferase-like monooxygenase	NA|293aa|up_8|NZ_CP029108.1_2210611_2211490_-	PRK15447, PRK15447, putative protease; Provisional	NA|332aa|up_7|NZ_CP029108.1_2211498_2212494_-	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|175aa|up_6|NZ_CP029108.1_2212702_2213227_+	COG3154, COG3154, Putative lipid carrier protein [Lipid metabolism]	NA|168aa|up_5|NZ_CP029108.1_2213220_2213724_+	COG3153, COG3153, Predicted acetyltransferase [General function prediction only]	NA|101aa|up_4|NZ_CP029108.1_2213710_2214013_-	PRK00329, PRK00329, GIY-YIG nuclease superfamily protein; Validated	NA|148aa|up_3|NZ_CP029108.1_2214063_2214507_+	PRK03467, PRK03467, hypothetical protein; Provisional	NA|173aa|up_2|NZ_CP029108.1_2214486_2215005_-	cd03134, GATase1_PfpI_like, A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus	NA|212aa|up_1|NZ_CP029108.1_2215132_2215768_+	cd05250, CC3_like_SDR_a, CC3(TIP30)-like, atypical (a) SDRs	NA|347aa|up_0|NZ_CP029108.1_2215840_2216881_+	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|192aa|down_0|NZ_CP029108.1_2217085_2217661_-	PRK11023, PRK11023, divisome-associated lipoprotein YraP	NA|197aa|down_1|NZ_CP029108.1_2217670_2218261_-	PRK10886, PRK10886, DnaA initiator-associating protein DiaA; Provisional	NA|132aa|down_2|NZ_CP029108.1_2218280_2218676_-	TIGR00252, UPF0102_protein_HI_1656, TIGR00252 family protein	NA|679aa|down_3|NZ_CP029108.1_2218633_2220670_-	COG3107, LppC, Putative lipoprotein [General function prediction only]	NA|287aa|down_4|NZ_CP029108.1_2220734_2221595_+	PRK14994, PRK14994, SAM-dependent 16S ribosomal RNA C1402 ribose 2'-O-methyltransferase; Provisional	NA|364aa|down_5|NZ_CP029108.1_2221637_2222729_-	pfam00419, Fimbrial, Fimbrial protein	NA|232aa|down_6|NZ_CP029108.1_2226062_2226758_-	COG3121, FimC, P pilus assembly protein, chaperone PapD [Cell motility and secretion / Intracellular trafficking and secretion]	NA|195aa|down_7|NZ_CP029108.1_2226837_2227422_-	COG3539, FimA, P pilus assembly protein, pilin FimA [Cell motility and secretion / Intracellular trafficking and secretion]	NA|252aa|down_8|NZ_CP029108.1_2227822_2228578_-	PRK09762, PRK09762, galactosamine-6-phosphate isomerase; Provisional	NA|264aa|down_9|NZ_CP029108.1_2228578_2229370_-	PRK09855, PRK09855, PTS N-acetylgalactosamine transporter subunit IID
GCF_003073815.1_ASM307381v1	NZ_CP029108	Escherichia coli strain AR437 chromosome, complete genome	6	2261833-2261950	5	CRISPRCasFinder	no		DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	Orphan	TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGG	40	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	NA,NA|55aa|down_0|NZ_CP029108.1_2262023_2262188_-	NA|115aa|up_9|NZ_CP029108.1_2249554_2249899_-	PRK11424, PRK11424, DNA-binding transcriptional activator TdcR; Provisional	NA|313aa|up_8|NZ_CP029108.1_2250087_2251026_+	PRK10341, PRK10341, transcriptional regulator TdcA	NA|330aa|up_7|NZ_CP029108.1_2251124_2252114_+	PRK08638, PRK08638, bifunctional threonine ammonia-lyase/L-serine ammonia-lyase TdcB	NA|444aa|up_6|NZ_CP029108.1_2252135_2253467_+	PRK13629, PRK13629, threonine/serine transporter TdcC; Provisional	NA|403aa|up_5|NZ_CP029108.1_2253492_2254701_+	PRK12379, PRK12379, propionate kinase	NA|765aa|up_4|NZ_CP029108.1_2254734_2257029_+	cd01678, PFL1, Pyruvate formate lyase 1	NA|130aa|up_3|NZ_CP029108.1_2257042_2257432_+	PRK11401, PRK11401, enamine/imine deaminase	NA|455aa|up_2|NZ_CP029108.1_2257503_2258868_+	PRK15040, PRK15040, L-serine ammonia-lyase	NA|444aa|up_1|NZ_CP029108.1_2259142_2260474_+	TIGR00814, membrane_transport_protein_YhjV, serine transporter	NA|437aa|up_0|NZ_CP029108.1_2260501_2261812_+	COG3681, COG3681, L-cysteine desulfidase [Amino acid transport and metabolism]	NA|55aa|down_0|NZ_CP029108.1_2262023_2262188_-	NA	NA|234aa|down_1|NZ_CP029108.1_2262210_2262912_-	COG1741, COG1741, Pirin-related protein [General function prediction only]	NA|299aa|down_2|NZ_CP029108.1_2263016_2263913_+	cd08431, PBP2_HupR, The C-terminal substrate binding domain of LysR-type transcriptional regulator, HupR, which regulates expression of the heme uptake receptor HupA; contains the type 2 periplasmic binding fold	NA|119aa|down_3|NZ_CP029108.1_2263963_2264320_-	COG3152, COG3152, Predicted membrane protein [Function unknown]	NA|122aa|down_4|NZ_CP029108.1_2264561_2264927_-	COG3152, COG3152, Predicted membrane protein [Function unknown]	NA|329aa|down_5|NZ_CP029108.1_2265219_2266206_-	COG0435, ECM4, Predicted glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|161aa|down_6|NZ_CP029108.1_2266275_2266758_-	COG2259, COG2259, Predicted membrane protein [Function unknown]	NA|100aa|down_7|NZ_CP029108.1_2266853_2267153_-	pfam13997, YqjK, YqjK-like protein	NA|135aa|down_8|NZ_CP029108.1_2267142_2267547_-	COG5393, COG5393, Predicted membrane protein [Function unknown]	NA|102aa|down_9|NZ_CP029108.1_2267549_2267855_-	COG4575, ElaB, Uncharacterized conserved protein [Function unknown]
GCF_003073815.1_ASM307381v1	NZ_CP029108	Escherichia coli strain AR437 chromosome, complete genome	7	2635283-2635799	2,6,1	PILER-CR,CRISPRCasFinder,CRT	no		DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	Orphan	GAGTTCCCCGCGCCAGCGGGGATAAACCG,GAGTTCCCCGCGCCAGCGGGGATAAACCG,GAGTTCCCCGCGCCAGCGGGGATAAACCG	29,29,29	0	0	NA	NA	I-E:I-E:I-E	8,8,8	8	Orphan	DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	NA|47aa|up_1|NZ_CP029108.1_2633992_2634133_-,NA	NA|434aa|up_9|NZ_CP029108.1_2624689_2625991_+	PRK13168, rumA, 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD	NA|745aa|up_8|NZ_CP029108.1_2626038_2628273_+	PRK10872, relA, (p)ppGpp synthetase I/GTP pyrophosphokinase; Provisional	NA|83aa|up_7|NZ_CP029108.1_2628350_2628599_+	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE	NA|112aa|up_6|NZ_CP029108.1_2628598_2628934_+	PRK09907, PRK09907, endoribonuclease MazF	NA|264aa|up_5|NZ_CP029108.1_2629004_2629796_+	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|546aa|up_4|NZ_CP029108.1_2630023_2631661_+	PRK05380, pyrG, CTP synthetase; Validated	NA|433aa|up_3|NZ_CP029108.1_2631748_2633047_+	PRK00077, eno, enolase; Provisional	NA|291aa|up_2|NZ_CP029108.1_2633106_2633979_-	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|47aa|up_1|NZ_CP029108.1_2633992_2634133_-	NA	NA|224aa|up_0|NZ_CP029108.1_2634271_2634943_+	TIGR04322, organic_radical_activating_enzyme, putative 7-cyano-7-deazaguanosine (preQ0) biosynthesis protein QueE	NA|493aa|down_0|NZ_CP029108.1_2636436_2637915_-	cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins	NA|426aa|down_1|NZ_CP029108.1_2637941_2639219_-	cd06174, MFS, Major Facilitator Superfamily	NA|262aa|down_2|NZ_CP029108.1_2639537_2640323_+	cd05347, Ga5DH-like_SDR_c, gluconate 5-dehydrogenase (Ga5DH)-like, classical (c) SDRs	NA|485aa|down_3|NZ_CP029108.1_2640392_2641847_+	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|470aa|down_4|NZ_CP029108.1_2641868_2643278_+	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|260aa|down_5|NZ_CP029108.1_2643255_2644035_+	COG2086, FixA, Electron transfer flavoprotein, beta subunit [Energy production and conversion]	NA|287aa|down_6|NZ_CP029108.1_2644031_2644892_+	COG2025, FixB, Electron transfer flavoprotein, alpha subunit [Energy production and conversion]	NA|192aa|down_7|NZ_CP029108.1_2645039_2645615_-	COG1954, GlpP, Glycerol-3-phosphate responsive antiterminator (mRNA-binding) [Transcription]	NA|87aa|down_8|NZ_CP029108.1_2645631_2645892_-	COG2440, FixX, Ferredoxin-like protein [Energy production and conversion]	NA|424aa|down_9|NZ_CP029108.1_2645882_2647154_-	PRK10015, PRK10015, oxidoreductase; Provisional
GCF_003073815.1_ASM307381v1	NZ_CP029108	Escherichia coli strain AR437 chromosome, complete genome	8	2658184-2658762	7,3,2	CRISPRCasFinder,PILER-CR,CRT	no	cas5,cas6e,cas1,cas2	DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	Unclear	TGTGTTCCCCGCGCCAGCGGGGATAAACCG,GTGTTCCCCGCGCCAGCGGGGATAAACC,GTGTTCCCCGCGCCAGCGGGGATAAACCG	30,28,29	0	0	NA	NA	I-E:I-E:I-E	9,9,9	9	Unclear	DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	NA,NA	NA|424aa|up_9|NZ_CP029108.1_2645882_2647154_-	PRK10015, PRK10015, oxidoreductase; Provisional	NA|122aa|up_8|NZ_CP029108.1_2647231_2647597_-	cd00470, PTPS, 6-pyruvoyl tetrahydropterin synthase (PTPS)	NA|600aa|up_7|NZ_CP029108.1_2647912_2649712_+	PRK10953, cysJ, NADPH-dependent assimilatory sulfite reductase flavoprotein subunit	NA|571aa|up_6|NZ_CP029108.1_2649711_2651424_+	PRK13504, PRK13504, NADPH-dependent assimilatory sulfite reductase hemoprotein subunit	NA|245aa|up_5|NZ_CP029108.1_2651497_2652232_+	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase	NA|51aa|up_4|NZ_CP029108.1_2652496_2652649_+	pfam01848, HOK_GEF, Hok/gef family	cas5|249aa|up_3|NZ_CP029108.1_2655499_2656246_+	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|217aa|up_2|NZ_CP029108.1_2656227_2656878_+	TIGR01907, CRISPR_system_Cascade_subunit_CasE, CRISPR-associated protein Cas6/Cse3/CasE, subtype I-E/ECOLI	cas1|308aa|up_1|NZ_CP029108.1_2656874_2657798_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|98aa|up_0|NZ_CP029108.1_2657794_2658088_+	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	NA|346aa|down_0|NZ_CP029108.1_2658843_2659881_-	PRK10199, PRK10199, alkaline phosphatase isozyme conversion aminopeptidase; Provisional	NA|303aa|down_1|NZ_CP029108.1_2660132_2661041_+	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|476aa|down_2|NZ_CP029108.1_2661042_2662470_+	PRK05124, cysN, sulfate adenylyltransferase subunit 1; Provisional	NA|202aa|down_3|NZ_CP029108.1_2662469_2663075_+	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|108aa|down_4|NZ_CP029108.1_2663124_2663448_+	pfam12084, DUF3561, Protein of unknown function (DUF3561)	NA|104aa|down_5|NZ_CP029108.1_2663641_2663953_+	PRK00888, ftsB, cell division protein FtsB; Reviewed	NA|237aa|down_6|NZ_CP029108.1_2663971_2664682_+	PRK00155, ispD, D-ribitol-5-phosphate cytidylyltransferase	NA|160aa|down_7|NZ_CP029108.1_2664681_2665161_+	PRK00084, ispF, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; Reviewed	NA|350aa|down_8|NZ_CP029108.1_2665157_2666207_+	PRK00984, truD, tRNA pseudouridine synthase D; Reviewed	NA|254aa|down_9|NZ_CP029108.1_2666187_2666949_+	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional
GCF_003073815.1_ASM307381v1	NZ_CP029108	Escherichia coli strain AR437 chromosome, complete genome	9	3160085-3160202	8	CRISPRCasFinder	no		DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	Orphan	CCGAGCCGTAGGCCGGATAAGGCGTTCACGC	31	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	NA|105aa|up_3|NZ_CP029108.1_3155165_3155480_-,NA	NA|300aa|up_9|NZ_CP029108.1_3147078_3147978_-	PRK09956, PRK09956, ISNCY family transposase	NA|397aa|up_8|NZ_CP029108.1_3148170_3149361_-	TIGR03379, glycerol3P_GlpC, glycerol-3-phosphate dehydrogenase, anaerobic, C subunit	NA|420aa|up_7|NZ_CP029108.1_3149357_3150617_-	COG3075, GlpB, Anaerobic glycerol-3-phosphate dehydrogenase [Amino acid transport and metabolism]	NA|543aa|up_6|NZ_CP029108.1_3150606_3152235_-	PRK11101, glpA, anaerobic glycerol-3-phosphate dehydrogenase subunit A	NA|453aa|up_5|NZ_CP029108.1_3152507_3153866_+	PRK11273, glpT, glycerol-3-phosphate transporter	NA|359aa|up_4|NZ_CP029108.1_3153870_3154947_+	PRK11143, glpQ, glycerophosphodiester phosphodiesterase; Provisional	NA|105aa|up_3|NZ_CP029108.1_3155165_3155480_-	NA	NA|217aa|up_2|NZ_CP029108.1_3157899_3158550_+	PRK09902, PRK09902, lipopolysaccharide kinase InaA	NA|85aa|up_1|NZ_CP029108.1_3158603_3158858_-	PRK10713, PRK10713, 2Fe-2S ferredoxin-like protein	NA|377aa|up_0|NZ_CP029108.1_3158857_3159988_-	PRK09101, nrdB, ribonucleotide-diphosphate reductase subunit beta; Reviewed	NA|762aa|down_0|NZ_CP029108.1_3160221_3162507_-	PRK09103, PRK09103, ribonucleoside-diphosphate reductase subunit alpha	NA|1251aa|down_1|NZ_CP029108.1_3163202_3166955_+	PRK09752, PRK09752, AIDA-I family autotransporter YfaL	NA|241aa|down_2|NZ_CP029108.1_3167082_3167805_-	PRK05134, PRK05134, bifunctional 2-polyprenyl-6-hydroxyphenol methylase/3-demethylubiquinol 3-O-methyltransferase UbiG	NA|876aa|down_3|NZ_CP029108.1_3167951_3170579_+	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|563aa|down_4|NZ_CP029108.1_3170727_3172416_+	COG4685, COG4685, Uncharacterized protein conserved in bacteria [Function unknown]	NA|208aa|down_5|NZ_CP029108.1_3172412_3173036_+	COG3234, COG3234, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1465aa|down_6|NZ_CP029108.1_3173179_3177574_+	COG2373, COG2373, Large extracellular alpha-helical protein [General function prediction only]	NA|550aa|down_7|NZ_CP029108.1_3177574_3179224_+	COG5445, COG5445, Predicted secreted protein [Function unknown]	NA|259aa|down_8|NZ_CP029108.1_3179228_3180005_+	COG4676, COG4676, Uncharacterized protein conserved in bacteria [Function unknown]	NA|395aa|down_9|NZ_CP029108.1_3180078_3181263_-	PRK05790, PRK05790, putative acyltransferase; Provisional
GCF_003073815.1_ASM307381v1	NZ_CP029108	Escherichia coli strain AR437 chromosome, complete genome	10	3756772-3756895	9	CRISPRCasFinder	no	DEDDh	DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	Unclear	CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA	43	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	NA,NA|30aa|down_7|NZ_CP029108.1_3765791_3765881_+	NA|471aa|up_9|NZ_CP029108.1_3746226_3747639_-	PRK09206, PRK09206, pyruvate kinase PykF	NA|70aa|up_8|NZ_CP029108.1_3748194_3748404_+	PRK10292, PRK10292, fumarate hydratase FumD	NA|209aa|up_7|NZ_CP029108.1_3748859_3749486_+	PRK09898, PRK09898, ferredoxin-like protein	NA|701aa|up_6|NZ_CP029108.1_3749506_3751609_+	PRK09849, PRK09849, putative oxidoreductase; Provisional	NA|213aa|up_5|NZ_CP029108.1_3751621_3752260_+	PRK09947, PRK09947, YdhW family putative oxidoreductase system protein	NA|223aa|up_4|NZ_CP029108.1_3752323_3752992_+	TIGR03149, cyt_nit_nrfC, cytochrome c nitrite reductase, Fe-S protein	NA|262aa|up_3|NZ_CP029108.1_3752988_3753774_+	PRK15006, PRK15006, thiosulfate reductase cytochrome B subunit; Provisional	NA|271aa|up_2|NZ_CP029108.1_3753777_3754590_+	PRK09946, PRK09946, hypothetical protein; Provisional	NA|535aa|up_1|NZ_CP029108.1_3754601_3756206_-	PRK09897, PRK09897, FAD-NAD(P)-binding protein	NA|102aa|up_0|NZ_CP029108.1_3756331_3756637_-	PRK11118, PRK11118, putative monooxygenase; Provisional	NA|419aa|down_0|NZ_CP029108.1_3757209_3758466_+	PRK09945, PRK09945, hypothetical protein; Provisional	NA|458aa|down_1|NZ_CP029108.1_3758506_3759880_-	PRK01766, PRK01766, multidrug efflux protein; Reviewed	NA|214aa|down_2|NZ_CP029108.1_3760094_3760736_+	PRK13020, PRK13020, riboflavin synthase subunit alpha; Provisional	NA|383aa|down_3|NZ_CP029108.1_3760775_3761924_-	PRK11705, PRK11705, cyclopropane fatty acyl phospholipid synthase	NA|404aa|down_4|NZ_CP029108.1_3762214_3763426_-	PRK11043, PRK11043, Bcr/CflA family multidrug efflux MFS transporter	NA|311aa|down_5|NZ_CP029108.1_3763538_3764471_+	PRK11074, PRK11074, putative DNA-binding transcriptional regulator; Provisional	NA|342aa|down_6|NZ_CP029108.1_3764467_3765493_-	PRK10703, PRK10703, HTH-type transcriptional repressor PurR	NA|30aa|down_7|NZ_CP029108.1_3765791_3765881_+	NA	NA|390aa|down_8|NZ_CP029108.1_3766046_3767216_+	COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport and metabolism]	NA|194aa|down_9|NZ_CP029108.1_3767361_3767943_-	PRK10543, PRK10543, superoxide dismutase [Fe]
GCF_003073815.1_ASM307381v1	NZ_CP029108	Escherichia coli strain AR437 chromosome, complete genome	11	4411980-4412071	10	CRISPRCasFinder	no		DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	Orphan	CCACCTTTTTTACCTGCTTCAGATGC	26	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	NA,NA	NA|503aa|up_9|NZ_CP029108.1_4399564_4401073_-	PRK15419, PRK15419, sodium/proline symporter PutP	NA|1321aa|up_8|NZ_CP029108.1_4401494_4405457_+	PRK11809, putA, trifunctional transcriptional regulator/proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase; Reviewed	NA|213aa|up_7|NZ_CP029108.1_4405496_4406135_-	PRK15008, PRK15008, HTH-type transcriptional regulator RutR; Provisional	NA|364aa|up_6|NZ_CP029108.1_4406422_4407514_+	TIGR03612, RutA, pyrimidine utilization protein A	NA|231aa|up_5|NZ_CP029108.1_4407513_4408206_+	TIGR03614, RutB, pyrimidine utilization protein B	NA|129aa|up_4|NZ_CP029108.1_4408217_4408604_+	TIGR03610, RutC, pyrimidine utilization protein C	NA|267aa|up_3|NZ_CP029108.1_4408611_4409412_+	TIGR03611, RutD, pyrimidine utilization protein D	NA|197aa|up_2|NZ_CP029108.1_4409421_4410012_+	PRK05365, PRK05365, malonic semialdehyde reductase; Provisional	NA|165aa|up_1|NZ_CP029108.1_4410022_4410517_+	TIGR03615, flavoprotein_oxidoreductase, pyrimidine utilization flavin reductase protein F	NA|443aa|up_0|NZ_CP029108.1_4410537_4411866_+	TIGR03616, Putative_pyrimidine_permease_RutG, pyrimidine utilization transport protein G	NA|199aa|down_0|NZ_CP029108.1_4412494_4413091_+	PRK03767, PRK03767, NAD(P)H:quinone oxidoreductase; Provisional	NA|76aa|down_1|NZ_CP029108.1_4413111_4413339_+	PRK10174, PRK10174, hypothetical protein; Provisional	NA|414aa|down_2|NZ_CP029108.1_4413376_4414618_-	PRK10173, PRK10173, glucose-1-phosphatase/inositol phosphatase; Provisional	NA|420aa|down_3|NZ_CP029108.1_4414909_4416169_-	PRK09784, PRK09784, YccE family protein	NA|307aa|down_4|NZ_CP029108.1_4416428_4417349_+	PRK10266, PRK10266, curved DNA-binding protein	NA|102aa|down_5|NZ_CP029108.1_4417348_4417654_+	PRK10265, PRK10265, chaperone modulator CbpM	NA|200aa|down_6|NZ_CP029108.1_4417746_4418346_-	PRK04976, torD, chaperone protein TorD; Validated	NA|849aa|down_7|NZ_CP029108.1_4418342_4420889_-	PRK15102, PRK15102, trimethylamine-N-oxide reductase TorA	NA|391aa|down_8|NZ_CP029108.1_4420888_4422061_-	PRK15032, PRK15032, pentaheme c-type cytochrome TorC	NA|231aa|down_9|NZ_CP029108.1_4422190_4422883_+	PRK10766, PRK10766, two-component system response regulator TorR
GCF_003073815.1_ASM307381v1	NZ_CP029103	Escherichia coli strain AR437 plasmid unnamed1, complete sequence	1	82099-82172	1	CRISPRCasFinder	no			Orphan	GATTTCTTTCGCATTAGCCTCGC	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4,DinG	NA|145aa|up_3|NZ_CP029103.1_77436_77871_-,NA|279aa|up_2|NZ_CP029103.1_77949_78786_-,NA|119aa|up_0|NZ_CP029103.1_80215_80572_-,NA|294aa|down_0|NZ_CP029103.1_84393_85275_-,NA|204aa|down_1|NZ_CP029103.1_85289_85901_-,NA|189aa|down_2|NZ_CP029103.1_85911_86478_-,NA|74aa|down_5|NZ_CP029103.1_88233_88455_+	NA|201aa|up_9|NZ_CP029103.1_70188_70791_-	pfam18788, DarA_N, Defence against restriction A N-terminal	NA|148aa|up_8|NZ_CP029103.1_70777_71221_-	pfam16084, LydB, LydA-holin antagonist	NA|110aa|up_7|NZ_CP029103.1_71217_71547_-	pfam16083, Phage_holin_3_3, LydA holin phage, holin superfamily III	NA|94aa|up_6|NZ_CP029103.1_71614_71896_-	pfam16083, Phage_holin_3_3, LydA holin phage, holin superfamily III	NA|137aa|up_5|NZ_CP029103.1_72314_72725_-	pfam02413, Caudo_TAP, Caudovirales tail fibre assembly protein, lambda gpK	NA|1567aa|up_4|NZ_CP029103.1_72724_77425_-	pfam03406, Phage_fiber_2, Phage tail fibre repeat	NA|145aa|up_3|NZ_CP029103.1_77436_77871_-	NA	NA|279aa|up_2|NZ_CP029103.1_77949_78786_-	NA	NA|478aa|up_1|NZ_CP029103.1_78785_80219_-	PHA02553, 6, baseplate wedge subunit; Provisional	NA|119aa|up_0|NZ_CP029103.1_80215_80572_-	NA	NA|294aa|down_0|NZ_CP029103.1_84393_85275_-	NA	NA|204aa|down_1|NZ_CP029103.1_85289_85901_-	NA	NA|189aa|down_2|NZ_CP029103.1_85911_86478_-	NA	NA|180aa|down_3|NZ_CP029103.1_86558_87098_-	pfam17358, DUF5384, Family of unknown function (DUF5384)	NA|171aa|down_4|NZ_CP029103.1_87101_87614_-	pfam05818, TraT, Enterobacterial TraT complement resistance protein	NA|74aa|down_5|NZ_CP029103.1_88233_88455_+	NA	NA|345aa|down_6|NZ_CP029103.1_88451_89486_+	COG3645, COG3645, Uncharacterized phage-encoded protein [Function unknown]	NA|NA	NA	NA|NA	NA	NA|NA	NA
