assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002310635.1_ASM231063v1	NZ_CP023388	Escherichia coli strain 1105 chromosome, complete genome	1	1826124-1826237	1	CRISPRCasFinder	no		WYL,RT,cas3,PD-DExK,csa3,DEDDh,DinG,cas14j	Orphan	TGCCGGATGCGCTTCGCTTATCCGGCCTACAAA	33	0	0	NA	NA	NA	1	1	Orphan	WYL,RT,cas3,PD-DExK,csa3,DEDDh,DinG,cas14j	NA,NA	NA|66aa|up_9|NZ_CP023388.1_1815717_1815915_-	PRK09956, PRK09956, ISNCY family transposase	NA|304aa|up_8|NZ_CP023388.1_1815927_1816839_-	PRK09956, PRK09956, ISNCY family transposase	NA|397aa|up_7|NZ_CP023388.1_1817031_1818222_-	TIGR03379, glycerol3P_GlpC, glycerol-3-phosphate dehydrogenase, anaerobic, C subunit	NA|420aa|up_6|NZ_CP023388.1_1818218_1819478_-	COG3075, GlpB, Anaerobic glycerol-3-phosphate dehydrogenase [Amino acid transport and metabolism]	NA|543aa|up_5|NZ_CP023388.1_1819467_1821096_-	PRK11101, glpA, anaerobic glycerol-3-phosphate dehydrogenase subunit A	NA|453aa|up_4|NZ_CP023388.1_1821368_1822727_+	PRK11273, glpT, glycerol-3-phosphate transporter	NA|359aa|up_3|NZ_CP023388.1_1822731_1823808_+	PRK11143, glpQ, glycerophosphodiester phosphodiesterase; Provisional	NA|217aa|up_2|NZ_CP023388.1_1824010_1824661_+	PRK09902, PRK09902, lipopolysaccharide kinase InaA	NA|85aa|up_1|NZ_CP023388.1_1824714_1824969_-	PRK10713, PRK10713, 2Fe-2S ferredoxin-like protein	NA|377aa|up_0|NZ_CP023388.1_1824968_1826099_-	PRK09101, nrdB, ribonucleotide-diphosphate reductase subunit beta; Reviewed	NA|762aa|down_0|NZ_CP023388.1_1826288_1828574_-	PRK09103, PRK09103, ribonucleoside-diphosphate reductase subunit alpha	NA|1261aa|down_1|NZ_CP023388.1_1829255_1833038_+	PRK09752, PRK09752, AIDA-I family autotransporter YfaL	NA|241aa|down_2|NZ_CP023388.1_1833177_1833900_-	PRK05134, PRK05134, bifunctional 2-polyprenyl-6-hydroxyphenol methylase/3-demethylubiquinol 3-O-methyltransferase UbiG	NA|876aa|down_3|NZ_CP023388.1_1834046_1836674_+	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|563aa|down_4|NZ_CP023388.1_1836822_1838511_+	COG4685, COG4685, Uncharacterized protein conserved in bacteria [Function unknown]	NA|208aa|down_5|NZ_CP023388.1_1838507_1839131_+	COG3234, COG3234, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1465aa|down_6|NZ_CP023388.1_1839274_1843669_+	COG2373, COG2373, Large extracellular alpha-helical protein [General function prediction only]	NA|550aa|down_7|NZ_CP023388.1_1843669_1845319_+	COG5445, COG5445, Predicted secreted protein [Function unknown]	NA|259aa|down_8|NZ_CP023388.1_1845323_1846100_+	COG4676, COG4676, Uncharacterized protein conserved in bacteria [Function unknown]	NA|395aa|down_9|NZ_CP023388.1_1846173_1847358_-	PRK05790, PRK05790, putative acyltransferase; Provisional
GCF_002310635.1_ASM231063v1	NZ_CP023388	Escherichia coli strain 1105 chromosome, complete genome	2	2048788-2048924	2	CRISPRCasFinder	no		WYL,RT,cas3,PD-DExK,csa3,DEDDh,DinG,cas14j	Orphan	TCTTGTAGGCCTGATAAGACGCGCCAGCGTCGCATCAGGCA	41	0	0	NA	NA	NA	1	1	Orphan	WYL,RT,cas3,PD-DExK,csa3,DEDDh,DinG,cas14j	NA,NA	NA|249aa|up_9|NZ_CP023388.1_2037650_2038397_+	PRK10063, PRK10063, colanic acid biosynthesis glycosyltransferase WcaE	NA|183aa|up_8|NZ_CP023388.1_2038412_2038961_+	TIGR04008, WcaF, colanic acid biosynthesis acetyltransferase WcaF	NA|374aa|up_7|NZ_CP023388.1_2038986_2040108_+	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|322aa|up_6|NZ_CP023388.1_2040110_2041076_+	cd05239, GDP_FS_SDR_e, GDP-fucose synthetase, extended (e) SDRs	NA|160aa|up_5|NZ_CP023388.1_2041078_2041558_+	PRK15434, PRK15434, GDP-mannose mannosyl hydrolase	NA|408aa|up_4|NZ_CP023388.1_2041554_2042778_+	TIGR04007, wcaI, colanic acid biosynthesis glycosyl transferase WcaI	NA|479aa|up_3|NZ_CP023388.1_2042780_2044217_+	PRK15460, cpsB, mannose-1-phosphate guanyltransferase; Provisional	NA|457aa|up_2|NZ_CP023388.1_2044409_2045780_+	PRK15414, PRK15414, phosphomannomutase	NA|465aa|up_1|NZ_CP023388.1_2045834_2047229_+	PRK10124, PRK10124, putative UDP-glucose lipid carrier transferase; Provisional	NA|493aa|up_0|NZ_CP023388.1_2047230_2048709_+	PRK10459, PRK10459, MOP flippase family protein	NA|427aa|down_0|NZ_CP023388.1_2048984_2050265_+	TIGR04006, wcaK, colanic acid biosynthesis pyruvyl transferase WcaK	NA|407aa|down_1|NZ_CP023388.1_2050261_2051482_+	TIGR04005, wcaL, colanic acid biosynthesis glycosyltransferase WcaL	NA|465aa|down_2|NZ_CP023388.1_2051492_2052887_+	PRK10123, wcaM, putative colanic acid biosynthesis protein; Provisional	NA|332aa|down_3|NZ_CP023388.1_2053044_2054040_+	cd05238, Gne_like_SDR_e, Escherichia coli Gne (a nucleoside-diphosphate-sugar 4-epimerase)-like, extended (e) SDRs	NA|300aa|down_4|NZ_CP023388.1_2054281_2055181_+	PRK10122, PRK10122, UTP--glucose-1-phosphate uridylyltransferase GalF	NA|412aa|down_5|NZ_CP023388.1_2055491_2056727_+	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|264aa|down_6|NZ_CP023388.1_2056726_2057518_+	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|270aa|down_7|NZ_CP023388.1_2057526_2058336_+	cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold	NA|227aa|down_8|NZ_CP023388.1_2058338_2059019_+	cd06532, Glyco_transf_25, Glycosyltransferase family 25 [lipooligosaccharide (LOS) biosynthesis protein] is a family of glycosyltransferases involved in LOS biosynthesis	NA|361aa|down_9|NZ_CP023388.1_2059015_2060098_+	pfam14897, EpsG, EpsG family
GCF_002310635.1_ASM231063v1	NZ_CP023388	Escherichia coli strain 1105 chromosome, complete genome	3	2445567-2445690	3	CRISPRCasFinder	no	DEDDh	WYL,RT,cas3,PD-DExK,csa3,DEDDh,DinG,cas14j	Unclear	CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA	43	0	0	NA	NA	NA	1	1	Orphan	WYL,RT,cas3,PD-DExK,csa3,DEDDh,DinG,cas14j	NA,NA|30aa|down_7|NZ_CP023388.1_2454586_2454676_+	NA|471aa|up_9|NZ_CP023388.1_2435019_2436432_-	PRK09206, PRK09206, pyruvate kinase PykF	NA|70aa|up_8|NZ_CP023388.1_2436988_2437198_+	PRK10292, PRK10292, fumarate hydratase FumD	NA|209aa|up_7|NZ_CP023388.1_2437653_2438280_+	PRK09898, PRK09898, ferredoxin-like protein	NA|701aa|up_6|NZ_CP023388.1_2438300_2440403_+	PRK09849, PRK09849, putative oxidoreductase; Provisional	NA|216aa|up_5|NZ_CP023388.1_2440406_2441054_+	PRK09947, PRK09947, YdhW family putative oxidoreductase system protein	NA|223aa|up_4|NZ_CP023388.1_2441117_2441786_+	TIGR03149, cyt_nit_nrfC, cytochrome c nitrite reductase, Fe-S protein	NA|262aa|up_3|NZ_CP023388.1_2441782_2442568_+	PRK15006, PRK15006, thiosulfate reductase cytochrome B subunit; Provisional	NA|271aa|up_2|NZ_CP023388.1_2442571_2443384_+	PRK09946, PRK09946, hypothetical protein; Provisional	NA|537aa|up_1|NZ_CP023388.1_2443389_2445000_-	PRK09897, PRK09897, FAD-NAD(P)-binding protein	NA|102aa|up_0|NZ_CP023388.1_2445125_2445431_-	PRK11118, PRK11118, putative monooxygenase; Provisional	NA|419aa|down_0|NZ_CP023388.1_2446004_2447261_+	PRK09945, PRK09945, hypothetical protein; Provisional	NA|458aa|down_1|NZ_CP023388.1_2447301_2448675_-	PRK01766, PRK01766, multidrug efflux protein; Reviewed	NA|214aa|down_2|NZ_CP023388.1_2448889_2449531_+	PRK13020, PRK13020, riboflavin synthase subunit alpha; Provisional	NA|383aa|down_3|NZ_CP023388.1_2449570_2450719_-	PRK11705, PRK11705, cyclopropane fatty acyl phospholipid synthase	NA|404aa|down_4|NZ_CP023388.1_2451009_2452221_-	PRK11043, PRK11043, Bcr/CflA family multidrug efflux MFS transporter	NA|311aa|down_5|NZ_CP023388.1_2452333_2453266_+	PRK11074, PRK11074, putative DNA-binding transcriptional regulator; Provisional	NA|342aa|down_6|NZ_CP023388.1_2453262_2454288_-	PRK10703, PRK10703, HTH-type transcriptional repressor PurR	NA|30aa|down_7|NZ_CP023388.1_2454586_2454676_+	NA	NA|390aa|down_8|NZ_CP023388.1_2454841_2456011_+	COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport and metabolism]	NA|194aa|down_9|NZ_CP023388.1_2456156_2456738_-	PRK10543, PRK10543, superoxide dismutase [Fe]
GCF_002310635.1_ASM231063v1	NZ_CP023388	Escherichia coli strain 1105 chromosome, complete genome	4	3162638-3162729	4	CRISPRCasFinder	no		WYL,RT,cas3,PD-DExK,csa3,DEDDh,DinG,cas14j	Orphan	CCACCTTTTTTACCTGCTTCAGATGC	26	0	0	NA	NA	NA	1	1	Orphan	WYL,RT,cas3,PD-DExK,csa3,DEDDh,DinG,cas14j	NA|70aa|up_8|NZ_CP023388.1_3151887_3152097_-,NA	NA|503aa|up_9|NZ_CP023388.1_3150220_3151729_-	PRK15419, PRK15419, sodium/proline symporter PutP	NA|70aa|up_8|NZ_CP023388.1_3151887_3152097_-	NA	NA|1321aa|up_7|NZ_CP023388.1_3152151_3156114_+	PRK11809, putA, trifunctional transcriptional regulator/proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase; Reviewed	NA|213aa|up_6|NZ_CP023388.1_3156153_3156792_-	PRK15008, PRK15008, HTH-type transcriptional regulator RutR; Provisional	NA|364aa|up_5|NZ_CP023388.1_3157079_3158171_+	TIGR03612, RutA, pyrimidine utilization protein A	NA|231aa|up_4|NZ_CP023388.1_3158170_3158863_+	TIGR03614, RutB, pyrimidine utilization protein B	NA|129aa|up_3|NZ_CP023388.1_3158874_3159261_+	TIGR03610, RutC, pyrimidine utilization protein C	NA|267aa|up_2|NZ_CP023388.1_3159268_3160069_+	TIGR03611, RutD, pyrimidine utilization protein D	NA|165aa|up_1|NZ_CP023388.1_3160680_3161175_+	TIGR03615, flavoprotein_oxidoreductase, pyrimidine utilization flavin reductase protein F	NA|443aa|up_0|NZ_CP023388.1_3161195_3162524_+	TIGR03616, Putative_pyrimidine_permease_RutG, pyrimidine utilization transport protein G	NA|199aa|down_0|NZ_CP023388.1_3163152_3163749_+	PRK03767, PRK03767, NAD(P)H:quinone oxidoreductase; Provisional	NA|76aa|down_1|NZ_CP023388.1_3163769_3163997_+	PRK10174, PRK10174, hypothetical protein; Provisional	NA|414aa|down_2|NZ_CP023388.1_3164034_3165276_-	PRK10173, PRK10173, glucose-1-phosphatase/inositol phosphatase; Provisional	NA|307aa|down_3|NZ_CP023388.1_3165810_3166731_+	PRK10266, PRK10266, curved DNA-binding protein	NA|102aa|down_4|NZ_CP023388.1_3166730_3167036_+	PRK10265, PRK10265, chaperone modulator CbpM	NA|200aa|down_5|NZ_CP023388.1_3167290_3167890_-	PRK04976, torD, chaperone protein TorD; Validated	NA|849aa|down_6|NZ_CP023388.1_3167886_3170433_-	PRK15102, PRK15102, trimethylamine-N-oxide reductase TorA	NA|391aa|down_7|NZ_CP023388.1_3170432_3171605_-	PRK15032, PRK15032, pentaheme c-type cytochrome TorC	NA|231aa|down_8|NZ_CP023388.1_3171734_3172427_+	PRK10766, PRK10766, two-component system response regulator TorR	NA|343aa|down_9|NZ_CP023388.1_3172399_3173428_-	PRK10936, PRK10936, TMAO reductase system periplasmic protein TorT; Provisional
GCF_002310635.1_ASM231063v1	NZ_CP023388	Escherichia coli strain 1105 chromosome, complete genome	5	4361243-4361382	5	CRISPRCasFinder	no		WYL,RT,cas3,PD-DExK,csa3,DEDDh,DinG,cas14j	Orphan	TGTTATTGTCGGATGCGGCGTGAACGCCTTATCCGACCTACACA	44	0	0	NA	NA	NA	1	1	Orphan	WYL,RT,cas3,PD-DExK,csa3,DEDDh,DinG,cas14j	NA,NA	NA|340aa|up_9|NZ_CP023388.1_4349495_4350515_+	cd05283, CAD1, Cinnamyl alcohol dehydrogenases (CAD)	NA|188aa|up_8|NZ_CP023388.1_4350518_4351082_-	PRK09825, idnK, gluconokinase	NA|344aa|up_7|NZ_CP023388.1_4351298_4352330_+	PRK09880, PRK09880, L-idonate 5-dehydrogenase; Provisional	NA|255aa|up_6|NZ_CP023388.1_4352353_4353118_+	PRK08085, PRK08085, gluconate 5-dehydrogenase; Provisional	NA|440aa|up_5|NZ_CP023388.1_4353182_4354502_+	TIGR00791, Gluconate_permease, gluconate transporter	NA|333aa|up_4|NZ_CP023388.1_4354568_4355567_+	cd01575, PBP1_GntR, ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators	NA|501aa|up_3|NZ_CP023388.1_4355644_4357147_+	pfam05872, DUF853, Bacterial protein of unknown function (DUF853)	NA|361aa|up_2|NZ_CP023388.1_4357265_4358348_-	PRK15071, PRK15071, lipopolysaccharide ABC transporter permease; Provisional	NA|367aa|up_1|NZ_CP023388.1_4358347_4359448_-	PRK15120, PRK15120, lipopolysaccharide ABC transporter permease LptF; Provisional	NA|504aa|up_0|NZ_CP023388.1_4359714_4361226_+	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|148aa|down_0|NZ_CP023388.1_4361483_4361927_+	PRK05728, PRK05728, DNA polymerase III subunit chi; Validated	NA|952aa|down_1|NZ_CP023388.1_4361926_4364782_+	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|396aa|down_2|NZ_CP023388.1_4364828_4366016_-	COG4269, COG4269, Predicted membrane protein [Function unknown]	NA|168aa|down_3|NZ_CP023388.1_4366208_4366712_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|139aa|down_4|NZ_CP023388.1_4366889_4367306_-	PRK11191, PRK11191, ribonuclease E inhibitor RraB	NA|335aa|down_5|NZ_CP023388.1_4367467_4368472_+	PRK03515, PRK03515, ornithine carbamoyltransferase subunit I; Provisional	NA|151aa|down_6|NZ_CP023388.1_4368516_4368969_-	COG2731, EbgC, Beta-galactosidase, beta subunit [Carbohydrate transport and metabolism]	NA|407aa|down_7|NZ_CP023388.1_4369646_4370867_+	PRK01388, PRK01388, arginine deiminase; Provisional	NA|311aa|down_8|NZ_CP023388.1_4370877_4371810_+	PRK12354, PRK12354, carbamate kinase; Reviewed	NA|335aa|down_9|NZ_CP023388.1_4371833_4372838_+	PRK02102, PRK02102, ornithine carbamoyltransferase; Validated
GCF_002310635.1_ASM231063v1	NZ_CP023388	Escherichia coli strain 1105 chromosome, complete genome	6	4659164-4659292	6	CRISPRCasFinder	no		WYL,RT,cas3,PD-DExK,csa3,DEDDh,DinG,cas14j	Orphan	TGTAGGCCTGATAAGACGCGACAGCGGCGCATCAGGC	37	0	0	NA	NA	NA	1	1	Orphan	WYL,RT,cas3,PD-DExK,csa3,DEDDh,DinG,cas14j	NA,NA	NA|230aa|up_9|NZ_CP023388.1_4647457_4648147_+	COG0790, COG0790, FOG: TPR repeat, SEL1 subfamily [General function prediction only]	NA|438aa|up_8|NZ_CP023388.1_4648748_4650062_-	PRK11283, gltP, glutamate/aspartate:proton symporter; Provisional	NA|199aa|up_7|NZ_CP023388.1_4650404_4651001_-	PRK10370, PRK10370, formate-dependent nitrite reductase complex subunit NrfG; Provisional	NA|128aa|up_6|NZ_CP023388.1_4650997_4651381_-	PRK10144, PRK10144, formate-dependent nitrite reductase complex subunit NrfF; Provisional	NA|553aa|up_5|NZ_CP023388.1_4651373_4653032_-	PRK10369, PRK10369, heme lyase subunit NrfE; Provisional	NA|319aa|up_4|NZ_CP023388.1_4653111_4654068_-	TIGR03148, cyt_nit_nrfD, cytochrome c nitrite reductase, NrfD subunit	NA|224aa|up_3|NZ_CP023388.1_4654064_4654736_-	TIGR03149, cyt_nit_nrfC, cytochrome c nitrite reductase, Fe-S protein	NA|189aa|up_2|NZ_CP023388.1_4654732_4655299_-	PRK11659, PRK11659, cytochrome c nitrite reductase pentaheme subunit; Provisional	NA|479aa|up_1|NZ_CP023388.1_4655343_4656780_-	PRK11125, nrfA, ammonia-forming cytochrome c nitrite reductase	NA|653aa|up_0|NZ_CP023388.1_4657171_4659130_+	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|105aa|down_0|NZ_CP023388.1_4659421_4659736_+	COG3162, COG3162, Predicted membrane protein [Function unknown]	NA|550aa|down_1|NZ_CP023388.1_4659732_4661382_+	PRK09395, actP, cation/acetate symporter ActP	NA|230aa|down_2|NZ_CP023388.1_4661420_4662110_-	COG1346, LrgB, Putative effector of murein hydrolase [Cell envelope biogenesis, outer membrane]	NA|137aa|down_3|NZ_CP023388.1_4662102_4662513_-	COG1380, COG1380, Putative effector of murein hydrolase LrgA [General function prediction only]	NA|295aa|down_4|NZ_CP023388.1_4662616_4663501_+	cd08438, PBP2_CidR, The C-terminal substrate binding domain of LysR-like transcriptional regulator CidR, contains the type 2 periplasmic binding fold	NA|550aa|down_5|NZ_CP023388.1_4663536_4665186_-	TIGR00831, Putative_Na+/H+_exchanger_Rv2287/MT2345/Mb2309	NA|450aa|down_6|NZ_CP023388.1_4665336_4666686_-	COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide transport and    metabolism]	NA|155aa|down_7|NZ_CP023388.1_4667231_4667696_-	PRK15002, PRK15002, redox-sensitive transcriptional activator SoxR	NA|108aa|down_8|NZ_CP023388.1_4667781_4668105_+	PRK10219, PRK10219, superoxide response transcriptional regulator SoxS	NA|529aa|down_9|NZ_CP023388.1_4668107_4669694_-	COG4943, COG4943, Predicted signal transduction protein containing sensor and EAL domains [Signal transduction mechanisms]
