assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000241875.1_ASM24187v1	NC_016799	Corynebacterium diphtheriae 31A, complete sequence	1	39133-41022	1,1,1,2	CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas9,cas1,cas2	cas9,cas1,cas2,csa3,DEDDh,cas3,WYL,cas4,DinG	Type II-B,Type II-A, Type II-B, or Type II-C?,Type II-C	GAAGTCTATCAGGGTTTTTGAGAACTGAACCCCAGC,GAAGTCTATCAGGGTTTTTGAGAACTGAACCCCAGC,GAAGTCTATCAGGGTTTTTGAGAACTGAACCCCAGC,GAAGTCTATCAGGGTTTTTGAGAACTGAACCCCAGCAC	36,36,36,38	0	0	NA	NA	NA:NA:NA:NA	28,29,20,20	29	TypeII-B,TypeII-A,orTypeII-C?,TypeII-B,TypeII-C	cas9,cas1,cas2,csa3,DEDDh,cas3,WYL,cas4,DinG	NA,NA	NA|600aa|up_9|NC_016799.1_27320_29120_+	PRK09284, PRK09284, thiamine biosynthesis protein ThiC; Provisional	NA|223aa|up_8|NC_016799.1_29103_29772_+	PRK00043, thiE, thiamine phosphate synthase	NA|363aa|up_7|NC_016799.1_29768_30857_+	TIGR02352, Glycine_oxidase, glycine oxidase ThiO	NA|67aa|up_6|NC_016799.1_30840_31041_+	TIGR01683, thiamine_biosynthesis_protein_ThiS, thiamine biosynthesis protein ThiS	NA|262aa|up_5|NC_016799.1_31042_31828_+	PRK00208, thiG, thiazole synthase; Reviewed	NA|337aa|up_4|NC_016799.1_31827_32838_+	PRK05600, PRK05600, thiamine biosynthesis protein ThiF; Validated	NA|285aa|up_3|NC_016799.1_32834_33689_+	PRK14713, PRK14713, bifunctional hydroxymethylpyrimidine kinase/phosphomethylpyrimidine kinase	cas9|1085aa|up_2|NC_016799.1_34596_37851_+	pfam18470, Cas9_a, Cas9 alpha-helical lobe domain	cas1|305aa|up_1|NC_016799.1_37854_38769_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|110aa|up_0|NC_016799.1_38752_39082_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|90aa|down_0|NC_016799.1_42923_43193_-	PRK00159, PRK00159, putative septation inhibitor protein; Reviewed	NA|670aa|down_1|NC_016799.1_43309_45319_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|505aa|down_2|NC_016799.1_45315_46830_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|487aa|down_3|NC_016799.1_46842_48303_-	COG0768, FtsI, Cell division protein FtsI/penicillin-binding protein 2 [Cell envelope biogenesis, outer membrane]	NA|450aa|down_4|NC_016799.1_48299_49649_-	COG0772, FtsW, Bacterial cell division membrane protein [Cell division and chromosome partitioning]	NA|485aa|down_5|NC_016799.1_49649_51104_-	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|163aa|down_6|NC_016799.1_51103_51592_-	COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms]	NA|289aa|down_7|NC_016799.1_51605_52472_-	pfam12401, DUF3662, Protein of unknown function (DUF2662)	NA|745aa|down_8|NC_016799.1_53425_55660_-	cd07552, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to Archaeoglobus fulgidus CopB, a Cu(2+)-ATPase	NA|109aa|down_9|NC_016799.1_55711_56038_-	COG2608, CopZ, Copper chaperone [Inorganic ion transport and metabolism]
GCF_000241875.1_ASM24187v1	NC_016799	Corynebacterium diphtheriae 31A, complete sequence	2	1022577-1022644	2	CRISPRCasFinder	no		cas9,cas1,cas2,csa3,DEDDh,cas3,WYL,cas4,DinG	Orphan	GACCGCGGCGGATACCGTGGCGG	23	1	1	1022600-1022621	NC_016799.1_722193-722172	NA	1	1	Orphan	cas9,cas1,cas2,csa3,DEDDh,cas3,WYL,cas4,DinG	NA|181aa|up_7|NC_016799.1_1010804_1011347_-,NA|94aa|up_4|NC_016799.1_1014276_1014558_+,NA|159aa|down_1|NC_016799.1_1023373_1023850_-	NA|270aa|up_9|NC_016799.1_1009124_1009934_+	pfam11575, FhuF_C, FhuF 2Fe-2S C-terminal domain	NA|271aa|up_8|NC_016799.1_1009983_1010796_+	PRK12298, obgE, GTPase CgtA; Reviewed	NA|181aa|up_7|NC_016799.1_1010804_1011347_-	NA	NA|306aa|up_6|NC_016799.1_1011651_1012569_+	cd09022, Aldose_epim_Ec_YihR, Aldose 1-epimerase, similar to Escherichia coli YihR	NA|553aa|up_5|NC_016799.1_1012605_1014264_+	cd11478, SLC5sbd_u2, Uncharacterized bacterial solute carrier 5 subfamily; putative solute-binding domain	NA|94aa|up_4|NC_016799.1_1014276_1014558_+	NA	NA|377aa|up_3|NC_016799.1_1014557_1015688_+	cd00608, GalT, Galactose-1-phosphate uridyl transferase (GalT): This enzyme plays a key role in galactose metabolism by catalysing the transfer of a uridine 5'-phosphoryl group from UDP-galactose 1-phosphate	NA|410aa|up_2|NC_016799.1_1015671_1016901_+	COG0153, GalK, Galactokinase [Carbohydrate transport and metabolism]	NA|1044aa|up_1|NC_016799.1_1016954_1020086_-	cd09203, PLDc_N_DEXD_b1, N-terminal putative catalytic domain of uncharacterized prokaryotic and archeal HKD family nucleases fused to a DEAD/DEAH box helicase domain	NA|132aa|up_0|NC_016799.1_1020096_1020492_-	cd03425, MutT_pyrophosphohydrolase, The MutT pyrophosphohydrolase is a prototypical Nudix hydrolase that catalyzes the hydrolysis of nucleoside and deoxynucleoside triphosphates (NTPs and dNTPs) by substitution at a beta-phosphorus to yield a nucleotide monophosphate (NMP) and inorganic pyrophosphate (PPi)	NA|137aa|down_0|NC_016799.1_1022834_1023245_+	pfam08029, HisG_C, HisG, C-terminal domain	NA|159aa|down_1|NC_016799.1_1023373_1023850_-	NA	NA|197aa|down_2|NC_016799.1_1023898_1024489_-	pfam07510, DUF1524, Protein of unknown function (DUF1524)	NA|522aa|down_3|NC_016799.1_1024528_1026094_-	TIGR02121, Osmoregulated_proline_transporter, sodium/proline symporter	NA|1030aa|down_4|NC_016799.1_1026319_1029409_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|274aa|down_5|NC_016799.1_1029409_1030231_+	COG4279, COG4279, Uncharacterized conserved protein [Function unknown]	NA|374aa|down_6|NC_016799.1_1030262_1031384_+	COG0420, SbcD, DNA repair exonuclease [DNA replication, recombination, and repair]	NA|855aa|down_7|NC_016799.1_1031383_1033948_+	COG0419, SbcC, ATPase involved in DNA repair [DNA replication, recombination, and repair]	NA|177aa|down_8|NC_016799.1_1034024_1034555_-	COG2353, COG2353, Uncharacterized conserved protein [Function unknown]	NA|161aa|down_9|NC_016799.1_1034796_1035279_+	COG1846, MarR, Transcriptional regulators [Transcription]
GCF_000241875.1_ASM24187v1	NC_016799	Corynebacterium diphtheriae 31A, complete sequence	3	1539166-1539278	3	CRISPRCasFinder	no		cas9,cas1,cas2,csa3,DEDDh,cas3,WYL,cas4,DinG	Orphan	GCTGGTTTAGGAGCCGCAGGCTT	23	0	0	NA	NA	NA	2	2	Orphan	cas9,cas1,cas2,csa3,DEDDh,cas3,WYL,cas4,DinG	NA,NA	NA|756aa|up_9|NC_016799.1_1526427_1528695_-	TIGR02696, polyribonucleotide_nucleotidyltransferase, guanosine pentaphosphate synthetase I/polynucleotide phosphorylase	NA|90aa|up_8|NC_016799.1_1528887_1529157_-	PRK05626, rpsO, 30S ribosomal protein S15; Reviewed	NA|320aa|up_7|NC_016799.1_1529328_1530288_-	cd02650, nuc_hydro_CaPnhB, NH_hydro_CaPnhB: A subgroup of nucleoside hydrolases similar to Corynebacterium ammoniagenes Purine/pyrimidine nucleoside hydrolase (pnhB)	NA|324aa|up_6|NC_016799.1_1530325_1531297_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|301aa|up_5|NC_016799.1_1531319_1532222_+	PRK03287, truB, tRNA pseudouridine synthase B; Provisional	NA|231aa|up_4|NC_016799.1_1532218_1532911_-	COG2977, EntD, Phosphopantetheinyl transferase component of siderophore synthetase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|269aa|up_3|NC_016799.1_1532907_1533714_-	COG1409, Icc, Predicted phosphohydrolases [General function prediction only]	NA|440aa|up_2|NC_016799.1_1533760_1535080_-	cd13136, MATE_DinF_like, DinF and similar proteins, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|314aa|up_1|NC_016799.1_1535076_1536018_-	COG0618, COG0618, Exopolyphosphatase-related proteins [General function prediction only]	NA|148aa|up_0|NC_016799.1_1536058_1536502_-	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|111aa|down_0|NC_016799.1_1539613_1539946_-	pfam04296, DUF448, Protein of unknown function (DUF448)	NA|333aa|down_1|NC_016799.1_1540201_1541200_-	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|185aa|down_2|NC_016799.1_1541196_1541751_-	PRK00092, PRK00092, ribosome maturation protein RimP; Reviewed	NA|303aa|down_3|NC_016799.1_1541802_1542711_+	pfam14530, DUF4439, Domain of unknown function (DUF4439)	NA|586aa|down_4|NC_016799.1_1542790_1544548_-	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|249aa|down_5|NC_016799.1_1544579_1545326_+	PRK02101, PRK02101, peroxide stress protein YaaA	NA|275aa|down_6|NC_016799.1_1545345_1546170_-	cd11642, SUMT, Uroporphyrin-III C-methyltransferase (also known as S-Adenosyl-L-methionine:uroporphyrinogen III methyltransferase, SUMT)	NA|449aa|down_7|NC_016799.1_1546168_1547515_+	PRK00029, PRK00029, YdiU family protein	NA|375aa|down_8|NC_016799.1_1547518_1548643_-	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|170aa|down_9|NC_016799.1_1548662_1549172_-	pfam01035, DNA_binding_1, 6-O-methylguanine DNA methyltransferase, DNA binding domain
GCF_000241875.1_ASM24187v1	NC_016799	Corynebacterium diphtheriae 31A, complete sequence	4	1845873-1845980	4	CRISPRCasFinder	no		cas9,cas1,cas2,csa3,DEDDh,cas3,WYL,cas4,DinG	Orphan	ACAGCGCGACGACGCCCACGGGAG	24	0	0	NA	NA	NA	1	1	Orphan	cas9,cas1,cas2,csa3,DEDDh,cas3,WYL,cas4,DinG	NA|84aa|up_3|NC_016799.1_1843016_1843268_-,NA	NA|274aa|up_9|NC_016799.1_1837800_1838622_-	pfam02645, DegV, Uncharacterized protein, DegV family COG1307	NA|241aa|up_8|NC_016799.1_1838626_1839349_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|156aa|up_7|NC_016799.1_1839355_1839823_-	pfam02410, RsfS, Ribosomal silencing factor during starvation	NA|229aa|up_6|NC_016799.1_1839845_1840532_-	PRK00071, nadD, nicotinate-nucleotide adenylyltransferase	NA|431aa|up_5|NC_016799.1_1840555_1841848_-	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|377aa|up_4|NC_016799.1_1841865_1842996_-	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional	NA|84aa|up_3|NC_016799.1_1843016_1843268_-	NA	NA|509aa|up_2|NC_016799.1_1843271_1844798_-	PRK12296, obgE, GTPase CgtA; Reviewed	NA|89aa|up_1|NC_016799.1_1844958_1845225_-	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|102aa|up_0|NC_016799.1_1845265_1845571_-	PRK05573, rplU, 50S ribosomal protein L21; Validated	NA|137aa|down_0|NC_016799.1_1848914_1849325_-	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	NA|138aa|down_1|NC_016799.1_1849535_1849949_-	pfam14017, DUF4233, Protein of unknown function (DUF4233)	NA|499aa|down_2|NC_016799.1_1849945_1851442_-	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|903aa|down_3|NC_016799.1_1851438_1854147_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|327aa|down_4|NC_016799.1_1854241_1855222_-	PRK05442, PRK05442, malate dehydrogenase; Provisional	NA|251aa|down_5|NC_016799.1_1855704_1856457_+	pfam17938, TetR_C_29, Tetracyclin repressor-like, C-terminal domain	NA|431aa|down_6|NC_016799.1_1856492_1857785_-	PRK05342, clpX, ATP-dependent Clp protease ATP-binding subunit ClpX	NA|842aa|down_7|NC_016799.1_1857931_1860457_-	COG0147, TrpE, Anthranilate/para-aminobenzoate synthases component I [Amino acid transport and metabolism / Coenzyme metabolism]	NA|210aa|down_8|NC_016799.1_1860551_1861181_-	PRK12553, PRK12553, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|200aa|down_9|NC_016799.1_1861198_1861798_-	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed
