assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002812685.1_ASM281268v1	NZ_CP025036	Escherichia coli strain S17-20 chromosome, complete genome	1	151967-152106	1	CRISPRCasFinder	no		PD-DExK,csa3,RT,cas3,WYL,DEDDh,DinG,c2c9_V-U4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Orphan	TGCCTGATGCGACGCTTGCCGCGTCTTATCAGGCCTACAACGACACAAA	49	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,csa3,RT,cas3,WYL,DEDDh,DinG,c2c9_V-U4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA,NA	NA|252aa|up_9|NZ_CP025036.1_141237_141993_+	PRK09762, PRK09762, galactosamine-6-phosphate isomerase; Provisional	NA|195aa|up_8|NZ_CP025036.1_142393_142978_+	COG3539, FimA, P pilus assembly protein, pilin FimA [Cell motility and secretion / Intracellular trafficking and secretion]	NA|232aa|up_7|NZ_CP025036.1_143057_143753_+	COG3121, FimC, P pilus assembly protein, chaperone PapD [Cell motility and secretion / Intracellular trafficking and secretion]	NA|839aa|up_6|NZ_CP025036.1_143782_146299_+	COG3188, FimD, P pilus assembly protein, porin PapC [Cell motility and secretion / Intracellular trafficking and secretion]	NA|364aa|up_5|NZ_CP025036.1_146309_147401_+	pfam00419, Fimbrial, Fimbrial protein	NA|287aa|up_4|NZ_CP025036.1_147443_148304_-	PRK14994, PRK14994, SAM-dependent 16S ribosomal RNA C1402 ribose 2'-O-methyltransferase; Provisional	NA|679aa|up_3|NZ_CP025036.1_148368_150405_+	COG3107, LppC, Putative lipoprotein [General function prediction only]	NA|132aa|up_2|NZ_CP025036.1_150362_150758_+	TIGR00252, UPF0102_protein_HI_1656, TIGR00252 family protein	NA|197aa|up_1|NZ_CP025036.1_150777_151368_+	PRK10886, PRK10886, DnaA initiator-associating protein DiaA; Provisional	NA|192aa|up_0|NZ_CP025036.1_151377_151953_+	PRK11023, PRK11023, divisome-associated lipoprotein YraP	NA|347aa|down_0|NZ_CP025036.1_152157_153198_-	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|212aa|down_1|NZ_CP025036.1_153270_153906_-	cd05250, CC3_like_SDR_a, CC3(TIP30)-like, atypical (a) SDRs	NA|173aa|down_2|NZ_CP025036.1_154033_154552_+	cd03134, GATase1_PfpI_like, A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus	NA|148aa|down_3|NZ_CP025036.1_154531_154975_-	PRK03467, PRK03467, hypothetical protein; Provisional	NA|101aa|down_4|NZ_CP025036.1_155025_155328_+	PRK00329, PRK00329, GIY-YIG nuclease superfamily protein; Validated	NA|168aa|down_5|NZ_CP025036.1_155314_155818_-	COG3153, COG3153, Predicted acetyltransferase [General function prediction only]	NA|175aa|down_6|NZ_CP025036.1_155811_156336_-	COG3154, COG3154, Putative lipid carrier protein [Lipid metabolism]	NA|332aa|down_7|NZ_CP025036.1_156544_157540_+	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|293aa|down_8|NZ_CP025036.1_157548_158427_+	PRK15447, PRK15447, putative protease; Provisional	NA|336aa|down_9|NZ_CP025036.1_158632_159640_+	PRK10508, PRK10508, luciferase-like monooxygenase
GCF_002812685.1_ASM281268v1	NZ_CP025036	Escherichia coli strain S17-20 chromosome, complete genome	2	1421563-1421712	2	CRISPRCasFinder	no		PD-DExK,csa3,RT,cas3,WYL,DEDDh,DinG,c2c9_V-U4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Orphan	CGCGTCTTATCAGGCCTACGAGTTCGGTGCTGTGTAGGTCGGATAAGGCGTTCA	54	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,csa3,RT,cas3,WYL,DEDDh,DinG,c2c9_V-U4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA|528aa|up_7|NZ_CP025036.1_1412769_1414353_+,NA|36aa|up_6|NZ_CP025036.1_1414353_1414461_+,NA	NA|198aa|up_9|NZ_CP025036.1_1411456_1412050_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|151aa|up_8|NZ_CP025036.1_1412194_1412647_+	COG2731, EbgC, Beta-galactosidase, beta subunit [Carbohydrate transport and metabolism]	NA|528aa|up_7|NZ_CP025036.1_1412769_1414353_+	NA	NA|36aa|up_6|NZ_CP025036.1_1414353_1414461_+	NA	NA|338aa|up_5|NZ_CP025036.1_1414513_1415527_-	PRK03515, PRK03515, ornithine carbamoyltransferase subunit I; Provisional	NA|139aa|up_4|NZ_CP025036.1_1415688_1416105_+	PRK11191, PRK11191, ribonuclease E inhibitor RraB	NA|168aa|up_3|NZ_CP025036.1_1416150_1416654_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|399aa|up_2|NZ_CP025036.1_1416846_1418043_+	COG4269, COG4269, Predicted membrane protein [Function unknown]	NA|952aa|up_1|NZ_CP025036.1_1418096_1420952_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|148aa|up_0|NZ_CP025036.1_1420951_1421395_-	PRK05728, PRK05728, DNA polymerase III subunit chi; Validated	NA|504aa|down_0|NZ_CP025036.1_1421748_1423260_-	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|367aa|down_1|NZ_CP025036.1_1423526_1424627_+	PRK15120, PRK15120, lipopolysaccharide ABC transporter permease LptF; Provisional	NA|361aa|down_2|NZ_CP025036.1_1424626_1425709_+	PRK15071, PRK15071, lipopolysaccharide ABC transporter permease; Provisional	NA|501aa|down_3|NZ_CP025036.1_1425869_1427372_-	pfam05872, DUF853, Bacterial protein of unknown function (DUF853)	NA|333aa|down_4|NZ_CP025036.1_1427449_1428448_-	cd01575, PBP1_GntR, ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators	NA|440aa|down_5|NZ_CP025036.1_1428514_1429834_-	TIGR00791, Gluconate_permease, gluconate transporter	NA|255aa|down_6|NZ_CP025036.1_1429896_1430661_-	PRK08085, PRK08085, gluconate 5-dehydrogenase; Provisional	NA|344aa|down_7|NZ_CP025036.1_1430684_1431716_-	PRK09880, PRK09880, L-idonate 5-dehydrogenase; Provisional	NA|188aa|down_8|NZ_CP025036.1_1431932_1432496_+	PRK09825, idnK, gluconokinase	NA|340aa|down_9|NZ_CP025036.1_1432499_1433519_-	cd05283, CAD1, Cinnamyl alcohol dehydrogenases (CAD)
GCF_002812685.1_ASM281268v1	NZ_CP025036	Escherichia coli strain S17-20 chromosome, complete genome	3	1691989-1692104	3	CRISPRCasFinder	no		PD-DExK,csa3,RT,cas3,WYL,DEDDh,DinG,c2c9_V-U4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Orphan	GATAAGACGCGCCAGCGTCGCATCAGGCGTT	31	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,csa3,RT,cas3,WYL,DEDDh,DinG,c2c9_V-U4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA,NA	NA|126aa|up_9|NZ_CP025036.1_1677993_1678371_-	PRK05461, apaG, CO2+/MG2+ efflux protein ApaG; Reviewed	NA|274aa|up_8|NZ_CP025036.1_1678373_1679195_-	PRK00274, ksgA, 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))-dimethyltransferase RsmA	NA|330aa|up_7|NZ_CP025036.1_1679191_1680181_-	PRK00232, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase; Reviewed	NA|429aa|up_6|NZ_CP025036.1_1680180_1681467_-	PRK10770, PRK10770, peptidyl-prolyl cis-trans isomerase SurA; Provisional	NA|785aa|up_5|NZ_CP025036.1_1681519_1683874_-	PRK03761, PRK03761, LPS assembly outer membrane complex protein LptD; Provisional	NA|272aa|up_4|NZ_CP025036.1_1684128_1684944_+	PRK09430, djlA, co-chaperone DjlA	NA|220aa|up_3|NZ_CP025036.1_1685060_1685720_-	PRK10158, PRK10158, bifunctional tRNA pseudouridine(32) synthase/23S rRNA pseudouridine(746) synthase RluA	NA|969aa|up_2|NZ_CP025036.1_1685731_1688638_-	PRK04914, PRK04914, RNA polymerase-associated protein RapA	NA|784aa|up_1|NZ_CP025036.1_1688802_1691154_-	PRK05762, PRK05762, DNA polymerase II; Reviewed	NA|232aa|up_0|NZ_CP025036.1_1691228_1691924_-	PRK08193, araD, L-ribulose-5-phosphate 4-epimerase AraD	NA|501aa|down_0|NZ_CP025036.1_1692123_1693626_-	PRK02929, PRK02929, L-arabinose isomerase; Provisional	NA|567aa|down_1|NZ_CP025036.1_1693636_1695337_-	PRK04123, PRK04123, ribulokinase; Provisional	NA|293aa|down_2|NZ_CP025036.1_1695675_1696554_+	PRK10572, PRK10572, arabinose operon transcriptional regulator AraC	NA|255aa|down_3|NZ_CP025036.1_1696639_1697404_+	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|233aa|down_4|NZ_CP025036.1_1697517_1698216_-	PRK10771, thiQ, thiamine ABC transporter ATP-binding protein ThiQ	NA|537aa|down_5|NZ_CP025036.1_1698199_1699810_-	PRK09433, thiP, thiamine transporter membrane protein; Reviewed	NA|328aa|down_6|NZ_CP025036.1_1699785_1700769_-	PRK11205, tbpA, thiamine transporter substrate binding subunit; Provisional	NA|552aa|down_7|NZ_CP025036.1_1700932_1702588_-	PRK13626, PRK13626, HTH-type transcriptional regulator SgrR	NA|44aa|down_8|NZ_CP025036.1_1702676_1702808_+	pfam15894, SgrT, Inhibitor of glucose uptake transporter SgrT	NA|393aa|down_9|NZ_CP025036.1_1702909_1704088_+	TIGR00899, Sugar_efflux_transporter_A, sugar efflux transporter
GCF_002812685.1_ASM281268v1	NZ_CP025036	Escherichia coli strain S17-20 chromosome, complete genome	4	1901831-1901984	4	CRISPRCasFinder	no		PD-DExK,csa3,RT,cas3,WYL,DEDDh,DinG,c2c9_V-U4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Orphan	CGCGTCTTATCATGCCTACAAACCTGTGCCGGATCGGTAGGCCGGATAAGGCG	53	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,csa3,RT,cas3,WYL,DEDDh,DinG,c2c9_V-U4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA|150aa|up_8|NZ_CP025036.1_1891803_1892253_+,NA|63aa|down_4|NZ_CP025036.1_1906255_1906444_-	NA|1418aa|up_9|NZ_CP025036.1_1887538_1891792_+	COG3209, RhsA, Rhs family protein [Cell envelope biogenesis, outer membrane]	NA|150aa|up_8|NZ_CP025036.1_1891803_1892253_+	NA	NA|257aa|up_7|NZ_CP025036.1_1893691_1894462_-	PRK10438, PRK10438, C-N hydrolase family amidase; Provisional	NA|158aa|up_6|NZ_CP025036.1_1894615_1895089_+	PRK09993, PRK09993, C-lysozyme inhibitor; Provisional	NA|815aa|up_5|NZ_CP025036.1_1895131_1897576_-	PRK09463, fadE, acyl-CoA dehydrogenase; Reviewed	NA|193aa|up_4|NZ_CP025036.1_1897815_1898394_+	PRK00414, gmhA, D-sedoheptulose 7-phosphate isomerase	NA|256aa|up_3|NZ_CP025036.1_1898599_1899367_+	pfam13230, GATase_4, Glutamine amidotransferases class-II	NA|247aa|up_2|NZ_CP025036.1_1899337_1900078_-	COG3034, COG3034, Uncharacterized protein conserved in bacteria [Function unknown]	NA|253aa|up_1|NZ_CP025036.1_1900369_1901128_+	COG0791, Spr, Cell wall-associated hydrolases (invasion-associated proteins) [Cell envelope biogenesis, outer membrane]	NA|166aa|up_0|NZ_CP025036.1_1901303_1901801_+	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|580aa|down_0|NZ_CP025036.1_1902118_1903858_-	COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell motility and secretion / Intracellular trafficking and secretion]	NA|263aa|down_1|NZ_CP025036.1_1903799_1904588_+	PRK06778, PRK06778, hypothetical protein; Validated	NA|352aa|down_2|NZ_CP025036.1_1904658_1905714_+	PRK02406, PRK02406, DNA polymerase IV; Validated	NA|151aa|down_3|NZ_CP025036.1_1905710_1906163_+	PRK09831, PRK09831, GNAT family N-acetyltransferase	NA|63aa|down_4|NZ_CP025036.1_1906255_1906444_-	NA	NA|89aa|down_5|NZ_CP025036.1_1906469_1906736_+	PRK09588, PRK09588, hypothetical protein; Reviewed	NA|486aa|down_6|NZ_CP025036.1_1907092_1908550_-	PRK15026, PRK15026, aminoacyl-histidine dipeptidase; Provisional	NA|153aa|down_7|NZ_CP025036.1_1908810_1909269_+	PRK09177, PRK09177, xanthine-guanine phosphoribosyltransferase; Validated	NA|415aa|down_8|NZ_CP025036.1_1909360_1910605_+	PRK05077, frsA, esterase FrsA	NA|134aa|down_9|NZ_CP025036.1_1910662_1911064_+	PRK10984, PRK10984, sigma factor-binding protein Crl
GCF_002812685.1_ASM281268v1	NZ_CP025036	Escherichia coli strain S17-20 chromosome, complete genome	5	4798807-4799201	5,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas3	PD-DExK,csa3,RT,cas3,WYL,DEDDh,DinG,c2c9_V-U4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Unclear	CGGTTTATCCCCGCTGGCGCGGGGAACTC,CGGTTTATCCCCGCTGGCGCGGGGAACTC,GGTTTATCCCCGCTGGCGCGGGGAACTC	29,29,28	0	0	NA	NA	I-E:I-E:I-E	6,6,6	6	Unclear	PD-DExK,csa3,RT,cas3,WYL,DEDDh,DinG,c2c9_V-U4,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA,NA|47aa|down_1|NZ_CP025036.1_4800350_4800491_+	NA|424aa|up_9|NZ_CP025036.1_4787451_4788723_+	PRK10015, PRK10015, oxidoreductase; Provisional	NA|87aa|up_8|NZ_CP025036.1_4788713_4788974_+	COG2440, FixX, Ferredoxin-like protein [Energy production and conversion]	NA|192aa|up_7|NZ_CP025036.1_4788990_4789566_+	COG1954, GlpP, Glycerol-3-phosphate responsive antiterminator (mRNA-binding) [Transcription]	NA|287aa|up_6|NZ_CP025036.1_4789713_4790574_-	COG2025, FixB, Electron transfer flavoprotein, alpha subunit [Energy production and conversion]	NA|260aa|up_5|NZ_CP025036.1_4790570_4791350_-	COG2086, FixA, Electron transfer flavoprotein, beta subunit [Energy production and conversion]	NA|446aa|up_4|NZ_CP025036.1_4791327_4792665_-	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|485aa|up_3|NZ_CP025036.1_4792758_4794213_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|262aa|up_2|NZ_CP025036.1_4794282_4795068_-	cd05347, Ga5DH-like_SDR_c, gluconate 5-dehydrogenase (Ga5DH)-like, classical (c) SDRs	NA|426aa|up_1|NZ_CP025036.1_4795386_4796664_+	cd06174, MFS, Major Facilitator Superfamily	NA|493aa|up_0|NZ_CP025036.1_4796690_4798169_+	cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins	NA|224aa|down_0|NZ_CP025036.1_4799540_4800212_-	TIGR04322, organic_radical_activating_enzyme, putative 7-cyano-7-deazaguanosine (preQ0) biosynthesis protein QueE	NA|47aa|down_1|NZ_CP025036.1_4800350_4800491_+	NA	NA|291aa|down_2|NZ_CP025036.1_4800504_4801377_+	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|433aa|down_3|NZ_CP025036.1_4801436_4802735_-	PRK00077, eno, enolase; Provisional	NA|546aa|down_4|NZ_CP025036.1_4802822_4804460_-	PRK05380, pyrG, CTP synthetase; Validated	NA|264aa|down_5|NZ_CP025036.1_4804687_4805479_-	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|112aa|down_6|NZ_CP025036.1_4805549_4805885_-	PRK09907, PRK09907, endoribonuclease MazF	NA|83aa|down_7|NZ_CP025036.1_4805884_4806133_-	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE	NA|745aa|down_8|NZ_CP025036.1_4806210_4808445_-	PRK10872, relA, (p)ppGpp synthetase I/GTP pyrophosphokinase; Provisional	NA|434aa|down_9|NZ_CP025036.1_4808492_4809794_-	PRK13168, rumA, 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD
