assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002073375.2_ASM207337v2	NZ_CP020410	Corynebacterium diphtheriae strain FDAARGOS_197 chromosome, complete genome	1	357502-357601	1	CRISPRCasFinder	no		cas3,DEDDh,WYL,cas4,csa3,DinG,cas5,cas7,cse2gr11,cas8e,cas6e,cas1,cas2,cas9	Orphan	GGTGCGGCAGATGGTATCGACGATGGCATTTCCGTT	36	1	1	357538-357565	NZ_CP020410.2_1149266-1149239	NA	1	1	Orphan	cas3,DEDDh,WYL,cas4,csa3,DinG,cas5,cas7,cse2gr11,cas8e,cas6e,cas1,cas2,cas9	NA|138aa|up_9|NZ_CP020410.2_343132_343546_+,NA|170aa|up_4|NZ_CP020410.2_347183_347693_+,NA|227aa|up_3|NZ_CP020410.2_347743_348424_+,NA|277aa|up_0|NZ_CP020410.2_356102_356933_-,NA|181aa|down_4|NZ_CP020410.2_361830_362373_-,NA|94aa|down_7|NZ_CP020410.2_365302_365584_+	NA|138aa|up_9|NZ_CP020410.2_343132_343546_+	NA	NA|148aa|up_8|NZ_CP020410.2_343576_344020_+	PRK00182, tatB, Sec-independent protein translocase subunit TatB	NA|378aa|up_7|NZ_CP020410.2_344022_345156_-	pfam10609, ParA, NUBPL iron-transfer P-loop NTPase	NA|199aa|up_6|NZ_CP020410.2_345168_345765_-	COG4420, COG4420, Predicted membrane protein [Function unknown]	NA|430aa|up_5|NZ_CP020410.2_345768_347058_-	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|170aa|up_4|NZ_CP020410.2_347183_347693_+	NA	NA|227aa|up_3|NZ_CP020410.2_347743_348424_+	NA	NA|1238aa|up_2|NZ_CP020410.2_348478_352192_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|1247aa|up_1|NZ_CP020410.2_352354_356095_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|277aa|up_0|NZ_CP020410.2_356102_356933_-	NA	NA|271aa|down_0|NZ_CP020410.2_357716_358529_-	PRK12550, PRK12550, shikimate 5-dehydrogenase; Reviewed	NA|527aa|down_1|NZ_CP020410.2_358548_360129_+	COG2272, PnbA, Carboxylesterase type B [Lipid metabolism]	NA|270aa|down_2|NZ_CP020410.2_360150_360960_+	pfam11575, FhuF_C, FhuF 2Fe-2S C-terminal domain	NA|271aa|down_3|NZ_CP020410.2_361009_361822_+	PRK12298, obgE, GTPase CgtA; Reviewed	NA|181aa|down_4|NZ_CP020410.2_361830_362373_-	NA	NA|306aa|down_5|NZ_CP020410.2_362677_363595_+	cd09022, Aldose_epim_Ec_YihR, Aldose 1-epimerase, similar to Escherichia coli YihR	NA|553aa|down_6|NZ_CP020410.2_363631_365290_+	cd11478, SLC5sbd_u2, Uncharacterized bacterial solute carrier 5 subfamily; putative solute-binding domain	NA|94aa|down_7|NZ_CP020410.2_365302_365584_+	NA	NA|377aa|down_8|NZ_CP020410.2_365583_366714_+	cd00608, GalT, Galactose-1-phosphate uridyl transferase (GalT): This enzyme plays a key role in galactose metabolism by catalysing the transfer of a uridine 5'-phosphoryl group from UDP-galactose 1-phosphate	NA|410aa|down_9|NZ_CP020410.2_366697_367927_+	COG0153, GalK, Galactokinase [Carbohydrate transport and metabolism]
GCF_002073375.2_ASM207337v2	NZ_CP020410	Corynebacterium diphtheriae strain FDAARGOS_197 chromosome, complete genome	2	877268-877380	2	CRISPRCasFinder	no		cas3,DEDDh,WYL,cas4,csa3,DinG,cas5,cas7,cse2gr11,cas8e,cas6e,cas1,cas2,cas9	Orphan	GCTGGTTTAGGAGCCGCAGGCTT	23	0	0	NA	NA	NA	2	2	Orphan	cas3,DEDDh,WYL,cas4,csa3,DinG,cas5,cas7,cse2gr11,cas8e,cas6e,cas1,cas2,cas9	NA,NA	NA|756aa|up_9|NZ_CP020410.2_864531_866799_-	TIGR02696, polyribonucleotide_nucleotidyltransferase, guanosine pentaphosphate synthetase I/polynucleotide phosphorylase	NA|90aa|up_8|NZ_CP020410.2_866990_867260_-	PRK05626, rpsO, 30S ribosomal protein S15; Reviewed	NA|320aa|up_7|NZ_CP020410.2_867431_868391_-	cd02650, nuc_hydro_CaPnhB, NH_hydro_CaPnhB: A subgroup of nucleoside hydrolases similar to Corynebacterium ammoniagenes Purine/pyrimidine nucleoside hydrolase (pnhB)	NA|324aa|up_6|NZ_CP020410.2_868428_869400_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|301aa|up_5|NZ_CP020410.2_869422_870325_+	PRK03287, truB, tRNA pseudouridine synthase B; Provisional	NA|231aa|up_4|NZ_CP020410.2_870321_871014_-	COG2977, EntD, Phosphopantetheinyl transferase component of siderophore synthetase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|269aa|up_3|NZ_CP020410.2_871010_871817_-	COG1409, Icc, Predicted phosphohydrolases [General function prediction only]	NA|440aa|up_2|NZ_CP020410.2_871863_873183_-	cd13136, MATE_DinF_like, DinF and similar proteins, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|314aa|up_1|NZ_CP020410.2_873179_874121_-	COG0618, COG0618, Exopolyphosphatase-related proteins [General function prediction only]	NA|148aa|up_0|NZ_CP020410.2_874160_874604_-	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|111aa|down_0|NZ_CP020410.2_877715_878048_-	pfam04296, DUF448, Protein of unknown function (DUF448)	NA|333aa|down_1|NZ_CP020410.2_878303_879302_-	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|185aa|down_2|NZ_CP020410.2_879298_879853_-	PRK00092, PRK00092, ribosome maturation protein RimP; Reviewed	NA|303aa|down_3|NZ_CP020410.2_879904_880813_+	pfam14530, DUF4439, Domain of unknown function (DUF4439)	NA|586aa|down_4|NZ_CP020410.2_880891_882649_-	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|249aa|down_5|NZ_CP020410.2_882680_883427_+	PRK02101, PRK02101, peroxide stress protein YaaA	NA|275aa|down_6|NZ_CP020410.2_883446_884271_-	cd11642, SUMT, Uroporphyrin-III C-methyltransferase (also known as S-Adenosyl-L-methionine:uroporphyrinogen III methyltransferase, SUMT)	NA|449aa|down_7|NZ_CP020410.2_884269_885616_+	PRK00029, PRK00029, YdiU family protein	NA|375aa|down_8|NZ_CP020410.2_885619_886744_-	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|170aa|down_9|NZ_CP020410.2_886763_887273_-	pfam01035, DNA_binding_1, 6-O-methylguanine DNA methyltransferase, DNA binding domain
GCF_002073375.2_ASM207337v2	NZ_CP020410	Corynebacterium diphtheriae strain FDAARGOS_197 chromosome, complete genome	3	1203024-1203131	3	CRISPRCasFinder	no		cas3,DEDDh,WYL,cas4,csa3,DinG,cas5,cas7,cse2gr11,cas8e,cas6e,cas1,cas2,cas9	Orphan	ACAGCGCGACGACGCCCACGGGAG	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,WYL,cas4,csa3,DinG,cas5,cas7,cse2gr11,cas8e,cas6e,cas1,cas2,cas9	NA|84aa|up_3|NZ_CP020410.2_1200167_1200419_-,NA	NA|274aa|up_9|NZ_CP020410.2_1194951_1195773_-	pfam02645, DegV, Uncharacterized protein, DegV family COG1307	NA|241aa|up_8|NZ_CP020410.2_1195777_1196500_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|156aa|up_7|NZ_CP020410.2_1196506_1196974_-	pfam02410, RsfS, Ribosomal silencing factor during starvation	NA|229aa|up_6|NZ_CP020410.2_1196996_1197683_-	PRK00071, nadD, nicotinate-nucleotide adenylyltransferase	NA|431aa|up_5|NZ_CP020410.2_1197706_1198999_-	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|377aa|up_4|NZ_CP020410.2_1199016_1200147_-	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional	NA|84aa|up_3|NZ_CP020410.2_1200167_1200419_-	NA	NA|509aa|up_2|NZ_CP020410.2_1200422_1201949_-	PRK12296, obgE, GTPase CgtA; Reviewed	NA|89aa|up_1|NZ_CP020410.2_1202109_1202376_-	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|102aa|up_0|NZ_CP020410.2_1202416_1202722_-	PRK05573, rplU, 50S ribosomal protein L21; Validated	NA|137aa|down_0|NZ_CP020410.2_1206066_1206477_-	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	NA|138aa|down_1|NZ_CP020410.2_1206668_1207082_-	pfam14017, DUF4233, Protein of unknown function (DUF4233)	NA|499aa|down_2|NZ_CP020410.2_1207078_1208575_-	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|903aa|down_3|NZ_CP020410.2_1208571_1211280_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|327aa|down_4|NZ_CP020410.2_1211374_1212355_-	PRK05442, PRK05442, malate dehydrogenase; Provisional	NA|251aa|down_5|NZ_CP020410.2_1212837_1213590_+	pfam17938, TetR_C_29, Tetracyclin repressor-like, C-terminal domain	NA|431aa|down_6|NZ_CP020410.2_1213626_1214919_-	PRK05342, clpX, ATP-dependent Clp protease ATP-binding subunit ClpX	NA|842aa|down_7|NZ_CP020410.2_1215065_1217591_-	COG0147, TrpE, Anthranilate/para-aminobenzoate synthases component I [Amino acid transport and metabolism / Coenzyme metabolism]	NA|210aa|down_8|NZ_CP020410.2_1217685_1218315_-	PRK12553, PRK12553, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|200aa|down_9|NZ_CP020410.2_1218332_1218932_-	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed
GCF_002073375.2_ASM207337v2	NZ_CP020410	Corynebacterium diphtheriae strain FDAARGOS_197 chromosome, complete genome	4	1684657-1686273	1,4,1	PILER-CR,CRISPRCasFinder,CRT	no	cas5,cas7,cse2gr11,cas8e,cas6e,cas3,cas1,cas2	cas3,DEDDh,WYL,cas4,csa3,DinG,cas5,cas7,cse2gr11,cas8e,cas6e,cas1,cas2,cas9	Type I-E	GTCTTCTCCGCACACGCGGAGGTATTTC,GTCTTCTCCGCACACGCGGAGGTATTTCC,GTCTTCTCCGCACACGCGGAGGTATTTCC	28,29,29	0	0	NA	NA	I-C,I-E,II-B:I-C,I-E,II-B:I-C,I-E,II-B	26,26,26	26	TypeI-E	cas3,DEDDh,WYL,cas4,csa3,DinG,cas5,cas7,cse2gr11,cas8e,cas6e,cas1,cas2,cas9	NA,NA|57aa|down_0|NZ_CP020410.2_1686298_1686469_+,NA|226aa|down_2|NZ_CP020410.2_1687761_1688439_+,NA|92aa|down_3|NZ_CP020410.2_1688844_1689120_+,NA|50aa|down_4|NZ_CP020410.2_1689275_1689425_+,NA|47aa|down_5|NZ_CP020410.2_1689421_1689562_+	NA|92aa|up_9|NZ_CP020410.2_1673037_1673313_-	cd03214, ABC_Iron-Siderophores_B12_Hemin, ATP-binding component of iron-siderophores, vitamin B12 and hemin transporters and related proteins	NA|607aa|up_8|NZ_CP020410.2_1673782_1675603_+	COG5479, COG5479, Uncharacterized protein potentially involved in peptidoglycan biosynthesis [Cell envelope biogenesis, outer membrane]	cas5|243aa|up_7|NZ_CP020410.2_1675711_1676440_-	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|353aa|up_6|NZ_CP020410.2_1676432_1677491_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|195aa|up_5|NZ_CP020410.2_1677546_1678131_-	TIGR02548, CRISPR_system_Cascade_subunit_CasB, CRISPR type I-E/ECOLI-associated protein CasB/Cse2	cas8e|519aa|up_4|NZ_CP020410.2_1678123_1679680_-	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cas6e|229aa|up_3|NZ_CP020410.2_1680045_1680732_+	cd09727, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	cas3|877aa|up_2|NZ_CP020410.2_1680731_1683362_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas1|320aa|up_1|NZ_CP020410.2_1683363_1684323_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|105aa|up_0|NZ_CP020410.2_1684323_1684638_+	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	NA|57aa|down_0|NZ_CP020410.2_1686298_1686469_+	NA	NA|172aa|down_1|NZ_CP020410.2_1686527_1687043_+	PRK07772, PRK07772, single-stranded DNA-binding protein; Provisional	NA|226aa|down_2|NZ_CP020410.2_1687761_1688439_+	NA	NA|92aa|down_3|NZ_CP020410.2_1688844_1689120_+	NA	NA|50aa|down_4|NZ_CP020410.2_1689275_1689425_+	NA	NA|47aa|down_5|NZ_CP020410.2_1689421_1689562_+	NA	NA|270aa|down_6|NZ_CP020410.2_1689582_1690392_-	pfam17802, SpaA, Prealbumin-like fold domain	NA|349aa|down_7|NZ_CP020410.2_1690464_1691511_-	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|319aa|down_8|NZ_CP020410.2_1691494_1692451_-	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|556aa|down_9|NZ_CP020410.2_1692640_1694308_-	TIGR04226, Fimbrial_subunit_type_2, fimbrial isopeptide formation D2 domain
GCF_002073375.2_ASM207337v2	NZ_CP020410	Corynebacterium diphtheriae strain FDAARGOS_197 chromosome, complete genome	5	1906284-1906766	5,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas9,cas1,cas2	cas3,DEDDh,WYL,cas4,csa3,DinG,cas5,cas7,cse2gr11,cas8e,cas6e,cas1,cas2,cas9	Type II-B, Type II-B, or Type II-C?,Type II-A,Type II-C	GAAGTCTATCAGGGTTTTTGAGAACTGAACCCCAGT,GAAGTCTATCAGGGTTTTTGAGAACTGAACCCCAGT,GAAGTCTATCAGGGTTTTTGAGAACTGAACCCCAG	36,36,35	0	0	NA	NA	NA:NA:NA	6,7,3	7	TypeII-B,TypeII-B,orTypeII-C?,TypeII-A,TypeII-C	cas3,DEDDh,WYL,cas4,csa3,DinG,cas5,cas7,cse2gr11,cas8e,cas6e,cas1,cas2,cas9	NA,NA	NA|600aa|up_9|NZ_CP020410.2_1894836_1896636_+	PRK09284, PRK09284, thiamine biosynthesis protein ThiC; Provisional	NA|223aa|up_8|NZ_CP020410.2_1896619_1897288_+	PRK00043, thiE, thiamine phosphate synthase	NA|363aa|up_7|NZ_CP020410.2_1897284_1898373_+	TIGR02352, Glycine_oxidase, glycine oxidase ThiO	NA|67aa|up_6|NZ_CP020410.2_1898356_1898557_+	TIGR01683, thiamine_biosynthesis_protein_ThiS, thiamine biosynthesis protein ThiS	NA|262aa|up_5|NZ_CP020410.2_1898558_1899344_+	PRK00208, thiG, thiazole synthase; Reviewed	NA|337aa|up_4|NZ_CP020410.2_1899343_1900354_+	PRK05600, PRK05600, thiamine biosynthesis protein ThiF; Validated	NA|273aa|up_3|NZ_CP020410.2_1900350_1901169_+	PRK14713, PRK14713, bifunctional hydroxymethylpyrimidine kinase/phosphomethylpyrimidine kinase	cas9|1085aa|up_2|NZ_CP020410.2_1901747_1905002_+	pfam18470, Cas9_a, Cas9 alpha-helical lobe domain	cas1|305aa|up_1|NZ_CP020410.2_1905005_1905920_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|110aa|up_0|NZ_CP020410.2_1905903_1906233_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|78aa|down_0|NZ_CP020410.2_1910005_1910239_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|90aa|down_1|NZ_CP020410.2_1910577_1910847_-	PRK00159, PRK00159, putative septation inhibitor protein; Reviewed	NA|674aa|down_2|NZ_CP020410.2_1910963_1912985_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|502aa|down_3|NZ_CP020410.2_1912981_1914487_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|487aa|down_4|NZ_CP020410.2_1914499_1915960_-	COG0768, FtsI, Cell division protein FtsI/penicillin-binding protein 2 [Cell envelope biogenesis, outer membrane]	NA|450aa|down_5|NZ_CP020410.2_1915956_1917306_-	COG0772, FtsW, Bacterial cell division membrane protein [Cell division and chromosome partitioning]	NA|485aa|down_6|NZ_CP020410.2_1917306_1918761_-	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|163aa|down_7|NZ_CP020410.2_1918760_1919249_-	COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms]	NA|289aa|down_8|NZ_CP020410.2_1919262_1920129_-	pfam12401, DUF3662, Protein of unknown function (DUF2662)	NA|745aa|down_9|NZ_CP020410.2_1921082_1923317_-	cd07552, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to Archaeoglobus fulgidus CopB, a Cu(2+)-ATPase
