assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001890325.1_ASM189032v1	NZ_CP018241	Escherichia coli strain 319 chromosome, complete genome	1	71506-71640	1	CRISPRCasFinder	no		DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Orphan	TGCCGGATGCGCGTTGCTTATCCGGCCTACAAAATCGCAGCGTGTAGGCC	50	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA,NA|190aa|down_7|NZ_CP018241.1_80681_81251_+	NA|274aa|up_9|NZ_CP018241.1_56579_57401_-	PRK00274, ksgA, 16S rRNA (adenine(1518)-N(6)/adenine(1519)-N(6))-dimethyltransferase RsmA	NA|330aa|up_8|NZ_CP018241.1_57397_58387_-	PRK00232, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase; Reviewed	NA|429aa|up_7|NZ_CP018241.1_58386_59673_-	PRK10770, PRK10770, peptidyl-prolyl cis-trans isomerase SurA; Provisional	NA|785aa|up_6|NZ_CP018241.1_59725_62080_-	PRK03761, PRK03761, LPS assembly outer membrane complex protein LptD; Provisional	NA|272aa|up_5|NZ_CP018241.1_62334_63150_+	PRK09430, djlA, co-chaperone DjlA	NA|256aa|up_4|NZ_CP018241.1_63444_64212_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|220aa|up_3|NZ_CP018241.1_64629_65289_-	PRK10158, PRK10158, bifunctional tRNA pseudouridine(32) synthase/23S rRNA pseudouridine(746) synthase RluA	NA|969aa|up_2|NZ_CP018241.1_65300_68207_-	PRK04914, PRK04914, RNA polymerase-associated protein RapA	NA|784aa|up_1|NZ_CP018241.1_68370_70722_-	PRK05762, PRK05762, DNA polymerase II; Reviewed	NA|232aa|up_0|NZ_CP018241.1_70796_71492_-	PRK08193, araD, L-ribulose-5-phosphate 4-epimerase AraD	NA|501aa|down_0|NZ_CP018241.1_71691_73194_-	PRK02929, PRK02929, L-arabinose isomerase; Provisional	NA|567aa|down_1|NZ_CP018241.1_73204_74905_-	PRK04123, PRK04123, ribulokinase; Provisional	NA|293aa|down_2|NZ_CP018241.1_75243_76122_+	PRK10572, PRK10572, arabinose operon transcriptional regulator AraC	NA|255aa|down_3|NZ_CP018241.1_76207_76972_+	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|233aa|down_4|NZ_CP018241.1_77058_77757_-	PRK10771, thiQ, thiamine ABC transporter ATP-binding protein ThiQ	NA|537aa|down_5|NZ_CP018241.1_77740_79351_-	PRK09433, thiP, thiamine transporter membrane protein; Reviewed	NA|328aa|down_6|NZ_CP018241.1_79326_80310_-	PRK11205, tbpA, thiamine transporter substrate binding subunit; Provisional	NA|190aa|down_7|NZ_CP018241.1_80681_81251_+	NA	NA|553aa|down_8|NZ_CP018241.1_81479_83138_-	PRK13626, PRK13626, HTH-type transcriptional regulator SgrR	NA|44aa|down_9|NZ_CP018241.1_83226_83358_+	pfam15894, SgrT, Inhibitor of glucose uptake transporter SgrT
GCF_001890325.1_ASM189032v1	NZ_CP018241	Escherichia coli strain 319 chromosome, complete genome	2	2902820-2902931	1	PILER-CR	no		DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Orphan	TTTTCTTACCTGATTCGGGTAA	22	1	1	2902889-2902914	NZ_CP018241.1_1610796-1610771	NA	2	2	Orphan	DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA|47aa|up_9|NZ_CP018241.1_2899030_2899171_-,NA|34aa|up_4|NZ_CP018241.1_2900431_2900533_-,NA|94aa|up_3|NZ_CP018241.1_2900623_2900905_-,NA|66aa|up_2|NZ_CP018241.1_2900948_2901146_-,NA|95aa|up_1|NZ_CP018241.1_2901373_2901658_-,NA|94aa|down_9|NZ_CP018241.1_2909665_2909947_+	NA|47aa|up_9|NZ_CP018241.1_2899030_2899171_-	NA	NA|121aa|up_8|NZ_CP018241.1_2899167_2899530_-	PRK09786, PRK09786, endodeoxyribonuclease RUS; Reviewed	NA|97aa|up_7|NZ_CP018241.1_2899526_2899817_-	pfam07102, DUF1364, Protein of unknown function (DUF1364)	NA|57aa|up_6|NZ_CP018241.1_2899809_2899980_-	PRK09689, PRK09689, prophage protein NinE; Provisional	NA|152aa|up_5|NZ_CP018241.1_2899979_2900435_-	PRK09741, PRK09741, hypothetical protein; Provisional	NA|34aa|up_4|NZ_CP018241.1_2900431_2900533_-	NA	NA|94aa|up_3|NZ_CP018241.1_2900623_2900905_-	NA	NA|66aa|up_2|NZ_CP018241.1_2900948_2901146_-	NA	NA|95aa|up_1|NZ_CP018241.1_2901373_2901658_-	NA	NA|234aa|up_0|NZ_CP018241.1_2901654_2902356_-	pfam06992, Phage_lambda_P, Replication protein P	NA|180aa|down_0|NZ_CP018241.1_2903368_2903908_-	pfam06254, YdaT_toxin, Putative bacterial toxin ydaT	NA|231aa|down_1|NZ_CP018241.1_2904271_2904964_+	COG2932, COG2932, Predicted transcriptional regulator [Transcription]	NA|536aa|down_2|NZ_CP018241.1_2905070_2906678_+	cd01406, SIR2-like, Sir2-like: Prokaryotic group of uncharacterized Sir2-like proteins which lack certain key catalytic residues and conserved zinc binding cysteines; and are members of the SIR2 superfamily of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation	NA|97aa|down_3|NZ_CP018241.1_2907181_2907472_+	PRK11354, kil, FtsZ inhibitor protein; Reviewed	NA|99aa|down_4|NZ_CP018241.1_2907547_2907844_+	pfam06064, Gam, Host-nuclease inhibitor protein Gam	NA|262aa|down_5|NZ_CP018241.1_2907849_2908635_+	TIGR01913, Uncharacterized_protein_UU154, phage recombination protein Bet	NA|226aa|down_6|NZ_CP018241.1_2908631_2909309_+	pfam09588, YqaJ, YqaJ-like viral recombinase domain	NA|61aa|down_7|NZ_CP018241.1_2909308_2909491_+	pfam07026, DUF1317, Protein of unknown function (DUF1317)	NA|64aa|down_8|NZ_CP018241.1_2909463_2909655_+	pfam07131, DUF1382, Protein of unknown function (DUF1382)	NA|94aa|down_9|NZ_CP018241.1_2909665_2909947_+	NA
GCF_001890325.1_ASM189032v1	NZ_CP018241	Escherichia coli strain 319 chromosome, complete genome	3	3592126-3592214	2	CRISPRCasFinder	no	cas3	DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Unclear	GGTTTATCCCCACTGGCGCGGGGAACTC	28	0	0	NA	NA	I-E	1	1	Unclear	DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA,NA	NA|600aa|up_9|NZ_CP018241.1_3578215_3580015_-	PRK10953, cysJ, NADPH-dependent assimilatory sulfite reductase flavoprotein subunit	NA|122aa|up_8|NZ_CP018241.1_3580330_3580696_+	cd00470, PTPS, 6-pyruvoyl tetrahydropterin synthase (PTPS)	NA|424aa|up_7|NZ_CP018241.1_3580773_3582045_+	PRK10015, PRK10015, oxidoreductase; Provisional	NA|87aa|up_6|NZ_CP018241.1_3582035_3582296_+	COG2440, FixX, Ferredoxin-like protein [Energy production and conversion]	NA|192aa|up_5|NZ_CP018241.1_3582312_3582888_+	COG1954, GlpP, Glycerol-3-phosphate responsive antiterminator (mRNA-binding) [Transcription]	NA|260aa|up_4|NZ_CP018241.1_3583892_3584672_-	COG2086, FixA, Electron transfer flavoprotein, beta subunit [Energy production and conversion]	NA|485aa|up_3|NZ_CP018241.1_3586079_3587534_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|262aa|up_2|NZ_CP018241.1_3587603_3588389_-	cd05347, Ga5DH-like_SDR_c, gluconate 5-dehydrogenase (Ga5DH)-like, classical (c) SDRs	NA|426aa|up_1|NZ_CP018241.1_3588707_3589985_+	cd06174, MFS, Major Facilitator Superfamily	NA|493aa|up_0|NZ_CP018241.1_3590011_3591490_+	cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins	NA|224aa|down_0|NZ_CP018241.1_3592553_3593225_-	TIGR04322, organic_radical_activating_enzyme, putative 7-cyano-7-deazaguanosine (preQ0) biosynthesis protein QueE	NA|207aa|down_1|NZ_CP018241.1_3593379_3594000_+	COG1704, LemA, Uncharacterized conserved protein [Function unknown]	NA|225aa|down_2|NZ_CP018241.1_3594013_3594688_+	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|383aa|down_3|NZ_CP018241.1_3594921_3596070_+	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|305aa|down_4|NZ_CP018241.1_3596066_3596981_+	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|433aa|down_5|NZ_CP018241.1_3597040_3598339_-	PRK00077, eno, enolase; Provisional	NA|546aa|down_6|NZ_CP018241.1_3598426_3600064_-	PRK05380, pyrG, CTP synthetase; Validated	NA|264aa|down_7|NZ_CP018241.1_3600291_3601083_-	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|112aa|down_8|NZ_CP018241.1_3601153_3601489_-	PRK09907, PRK09907, endoribonuclease MazF	NA|83aa|down_9|NZ_CP018241.1_3601488_3601737_-	PRK09798, PRK09798, MazF-MazE toxin-antitoxin system antitoxin MazE
GCF_001890325.1_ASM189032v1	NZ_CP018241	Escherichia coli strain 319 chromosome, complete genome	4	5316464-5316613	3	CRISPRCasFinder	no		DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	Orphan	CGCGTCTTATCAGGCCTACGAGTTCGGTGCTGTGTAGGTCGGATAAGGCGTTCA	54	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,PrimPol,cas3,c2c9_V-U4,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	NA,NA	NA|132aa|up_9|NZ_CP018241.1_5305006_5305402_-	cd02198, YjgH_like, YjgH belongs to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function	NA|238aa|up_8|NZ_CP018241.1_5305532_5306246_-	PRK12742, PRK12742, SDR family oxidoreductase	NA|198aa|up_7|NZ_CP018241.1_5306316_5306910_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|151aa|up_6|NZ_CP018241.1_5307054_5307507_+	COG2731, EbgC, Beta-galactosidase, beta subunit [Carbohydrate transport and metabolism]	NA|335aa|up_5|NZ_CP018241.1_5309422_5310427_-	PRK03515, PRK03515, ornithine carbamoyltransferase subunit I; Provisional	NA|139aa|up_4|NZ_CP018241.1_5310588_5311005_+	PRK11191, PRK11191, ribonuclease E inhibitor RraB	NA|168aa|up_3|NZ_CP018241.1_5311050_5311554_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|399aa|up_2|NZ_CP018241.1_5311746_5312943_+	COG4269, COG4269, Predicted membrane protein [Function unknown]	NA|952aa|up_1|NZ_CP018241.1_5312997_5315853_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|148aa|up_0|NZ_CP018241.1_5315852_5316296_-	PRK05728, PRK05728, DNA polymerase III subunit chi; Validated	NA|504aa|down_0|NZ_CP018241.1_5316649_5318161_-	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|367aa|down_1|NZ_CP018241.1_5318427_5319528_+	PRK15120, PRK15120, lipopolysaccharide ABC transporter permease LptF; Provisional	NA|361aa|down_2|NZ_CP018241.1_5319527_5320610_+	PRK15071, PRK15071, lipopolysaccharide ABC transporter permease; Provisional	NA|501aa|down_3|NZ_CP018241.1_5320770_5322273_-	pfam05872, DUF853, Bacterial protein of unknown function (DUF853)	NA|340aa|down_4|NZ_CP018241.1_5322402_5323422_-	cd05283, CAD1, Cinnamyl alcohol dehydrogenases (CAD)	NA|653aa|down_5|NZ_CP018241.1_5323792_5325751_+	cd01184, INT_C_like_1, Uncharacterized site-specific tyrosine recombinase, C-terminal catalytic domain	NA|126aa|down_6|NZ_CP018241.1_5327151_5327529_-	pfam13711, DUF4160, Domain of unknown function (DUF4160)	NA|111aa|down_7|NZ_CP018241.1_5327650_5327983_-	pfam07978, NIPSNAP, NIPSNAP	NA|156aa|down_8|NZ_CP018241.1_5328986_5329454_-	COG4333, COG4333, Uncharacterized protein conserved in bacteria [Function unknown]	NA|190aa|down_9|NZ_CP018241.1_5329646_5330216_+	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain
