assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	1	39294-39372	1	CRISPRCasFinder	no		csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Orphan	TGTTTATAATGTGGATAATATTTAA	25	0	0	NA	NA	NA	1	1	Orphan	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA,NA|226aa|down_4|NZ_LR698984.1_56208_56886_+	NA|278aa|up_9|NZ_LR698984.1_16544_17378_+	TIGR04520, ECF_ATPase_1, energy-coupling factor transporter ATPase	NA|289aa|up_8|NZ_LR698984.1_17365_18232_+	PRK13637, cbiO, energy-coupling factor transporter ATPase	NA|268aa|up_7|NZ_LR698984.1_18225_19029_+	COG0619, CbiQ, ABC-type cobalt transport system, permease component CbiQ and related transporters [Inorganic ion transport and metabolism]	NA|244aa|up_6|NZ_LR698984.1_19060_19792_+	PRK00021, truA, tRNA pseudouridine(38-40) synthase TruA	NA|144aa|up_5|NZ_LR698984.1_19911_20343_+	PRK09216, rplM, 50S ribosomal protein L13; Reviewed	NA|131aa|up_4|NZ_LR698984.1_20371_20764_+	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|251aa|up_3|NZ_LR698984.1_21222_21975_+	TIGR02883, spore_cwlD, N-acetylmuramoyl-L-alanine amidase CwlD	NA|395aa|up_2|NZ_LR698984.1_34343_35528_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|784aa|up_1|NZ_LR698984.1_35900_38252_+	PRK07111, PRK07111, anaerobic ribonucleoside triphosphate reductase; Provisional	NA|180aa|up_0|NZ_LR698984.1_38273_38813_+	TIGR02491, Anaerobic_ribonucleoside-triphosphate_reductase, anaerobic ribonucleoside-triphosphate reductase activating protein	NA|280aa|down_0|NZ_LR698984.1_51894_52734_+	COG1624, COG1624, Uncharacterized conserved protein [Function unknown]	NA|394aa|down_1|NZ_LR698984.1_52730_53912_+	COG4856, COG4856, Uncharacterized protein conserved in bacteria [Function unknown]	NA|301aa|down_2|NZ_LR698984.1_54215_55118_+	PRK05805, PRK05805, phosphate butyryltransferase; Validated	NA|360aa|down_3|NZ_LR698984.1_55139_56219_+	PRK03011, PRK03011, butyrate kinase; Provisional	NA|226aa|down_4|NZ_LR698984.1_56208_56886_+	NA	NA|72aa|down_5|NZ_LR698984.1_56928_57144_+	COG2221, DsrA, Dissimilatory sulfite reductase (desulfoviridin), alpha and beta subunits [Energy production and conversion]	NA|360aa|down_6|NZ_LR698984.1_57172_58252_+	PRK07119, PRK07119, 2-ketoisovalerate ferredoxin reductase; Validated	NA|251aa|down_7|NZ_LR698984.1_58251_59004_+	cd03375, TPP_OGFOR, Thiamine pyrophosphate (TPP family), 2-oxoglutarate ferredoxin oxidoreductase (OGFOR) subfamily, TPP-binding module; OGFOR catalyzes the oxidative decarboxylation of 2-oxo-acids, with ferredoxin acting as an electron acceptor	NA|186aa|down_8|NZ_LR698984.1_59004_59562_+	pfam01558, POR, Pyruvate ferredoxin/flavodoxin oxidoreductase	NA|449aa|down_9|NZ_LR698984.1_59731_61078_+	PRK14316, glmM, phosphoglucosamine mutase; Provisional
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	2	616649-616791	2	CRISPRCasFinder	no		csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Orphan	TTTAATGAAGATGGTATTATGCA	23	0	0	NA	NA	NA	2	2	Orphan	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA|397aa|up_8|NZ_LR698984.1_597975_599166_+,NA|61aa|down_1|NZ_LR698984.1_617967_618150_+,NA|101aa|down_6|NZ_LR698984.1_628873_629176_-	NA|652aa|up_9|NZ_LR698984.1_596004_597960_+	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|397aa|up_8|NZ_LR698984.1_597975_599166_+	NA	NA|125aa|up_7|NZ_LR698984.1_600299_600674_+	COG1725, COG1725, Predicted transcriptional regulators [Transcription]	NA|286aa|up_6|NZ_LR698984.1_600675_601533_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|213aa|up_5|NZ_LR698984.1_601535_602174_+	pfam13346, ABC2_membrane_5, ABC-2 family transporter protein	NA|245aa|up_4|NZ_LR698984.1_603147_603882_+	cd07721, yflN-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis yflN; MBL-fold metallo hydrolase domain	NA|419aa|up_3|NZ_LR698984.1_605023_606280_+	cd03682, ClC_sycA_like, ClC sycA-like chloride channel proteins	NA|398aa|up_2|NZ_LR698984.1_606342_607536_+	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|127aa|up_1|NZ_LR698984.1_607742_608123_+	pfam03965, Penicillinase_R, Penicillinase repressor	NA|185aa|up_0|NZ_LR698984.1_609031_609586_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|167aa|down_0|NZ_LR698984.1_617138_617639_+	TIGR01593, Uncharacterized_protein_CPE0383, toxin secretion/phage lysis holin	NA|61aa|down_1|NZ_LR698984.1_617967_618150_+	NA	NA|2711aa|down_2|NZ_LR698984.1_618366_626499_+	pfam12920, TcdA_TcdB_pore, TcdA/TcdB pore forming domain	NA|187aa|down_3|NZ_LR698984.1_626833_627394_-	pfam12569, NARP1, NMDA receptor-regulated protein 1	NA|82aa|down_4|NZ_LR698984.1_627930_628176_+	pfam11731, Cdd1, Pathogenicity locus	NA|90aa|down_5|NZ_LR698984.1_628392_628662_-	pfam12675, DUF3795, Protein of unknown function (DUF3795)	NA|101aa|down_6|NZ_LR698984.1_628873_629176_-	NA	NA|246aa|down_7|NZ_LR698984.1_629695_630433_-	COG4200, COG4200, Uncharacterized protein conserved in bacteria [Function unknown]	NA|254aa|down_8|NZ_LR698984.1_630444_631206_-	pfam12730, ABC2_membrane_4, ABC-2 family transporter protein	NA|305aa|down_9|NZ_LR698984.1_631205_632120_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	3	899654-899731	3	CRISPRCasFinder	no	cas14j	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Unclear	AAAAAATAAGAAAGTTTATAAATA	24	0	0	NA	NA	NA	1	1	TypeV	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA,NA	NA|527aa|up_9|NZ_LR698984.1_884293_885874_+	COG0063, COG0063, Predicted sugar kinase [Carbohydrate transport and metabolism]	cas14j|373aa|up_8|NZ_LR698984.1_886619_887738_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|149aa|up_7|NZ_LR698984.1_888021_888468_+	COG1585, COG1585, Membrane protein implicated in regulation of membrane protease activity [Posttranslational modification, protein turnover, chaperones / Intracellular trafficking and secretion]	NA|348aa|up_6|NZ_LR698984.1_888488_889532_+	COG0330, HflC, Membrane protease subunits, stomatin/prohibitin homologs [Posttranslational modification, protein turnover, chaperones]	NA|387aa|up_5|NZ_LR698984.1_890302_891463_+	PRK05293, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|377aa|up_4|NZ_LR698984.1_891464_892595_+	TIGR02092, Glycogen_biosynthesis_protein_GlgD, glucose-1-phosphate adenylyltransferase, GlgD subunit	NA|481aa|up_3|NZ_LR698984.1_892623_894066_+	cd03791, GT5_Glycogen_synthase_DULL1-like, Glycogen synthase GlgA and similar proteins	NA|807aa|up_2|NZ_LR698984.1_894079_896500_+	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins	NA|622aa|up_1|NZ_LR698984.1_896504_898370_+	cd11338, AmyAc_CMD, Alpha amylase catalytic domain found in cyclomaltodextrinases and related proteins	NA|218aa|up_0|NZ_LR698984.1_898614_899268_+	COG1802, GntR, Transcriptional regulators [Transcription]	NA|492aa|down_0|NZ_LR698984.1_899905_901381_+	COG1982, LdcC, Arginine/lysine/ornithine decarboxylases [Amino acid transport and metabolism]	NA|138aa|down_1|NZ_LR698984.1_901467_901881_+	TIGR03330, SAM_DCase_Bsu, S-adenosylmethionine decarboxylase proenzyme, Bacillus form	NA|284aa|down_2|NZ_LR698984.1_901910_902762_+	PRK00811, PRK00811, polyamine aminopropyltransferase	NA|293aa|down_3|NZ_LR698984.1_902751_903630_+	cd11593, Agmatinase-like_2, Agmatinase and related proteins	NA|67aa|down_4|NZ_LR698984.1_903809_904010_-	COG1278, CspC, Cold shock proteins [Transcription]	NA|497aa|down_5|NZ_LR698984.1_904476_905967_+	TIGR04105, hypothetical_protein, [FeFe] hydrogenase, group B1/B3	NA|499aa|down_6|NZ_LR698984.1_906224_907721_+	TIGR04105, hypothetical_protein, [FeFe] hydrogenase, group B1/B3	NA|117aa|down_7|NZ_LR698984.1_907854_908205_+	cd07731, ComA-like_MBL-fold, Competence protein ComA, ComEC and related proteins; MBL-fold metallo hydrolase domain	cas14j|373aa|down_8|NZ_LR698984.1_908843_909962_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|183aa|down_9|NZ_LR698984.1_910104_910653_+	COG2333, ComEC, Predicted hydrolase (metallo-beta-lactamase superfamily) [General function prediction only]
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	4	1194845-1195798	1,4,1	PILER-CR,CRISPRCasFinder,CRT	no		csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Orphan	GTTTTATATTAACTAAGTGGTATGTAAAG,GTTTTATATTAACTAAGTGGTATGTAAAG,GTTTTATATTAACTAAGTGGTATGTAAAG	29,29,29	0	0	NA	NA	I-A:I-A:I-A	14,14,14	14	Orphan	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA,NA|52aa|down_0|NZ_LR698984.1_1197014_1197170_+,NA|84aa|down_1|NZ_LR698984.1_1197415_1197667_+,NA|128aa|down_2|NZ_LR698984.1_1198875_1199259_-,NA|115aa|down_3|NZ_LR698984.1_1199385_1199730_+,NA|72aa|down_4|NZ_LR698984.1_1199912_1200128_+,NA|106aa|down_5|NZ_LR698984.1_1201113_1201431_-,NA|190aa|down_6|NZ_LR698984.1_1201722_1202292_-,NA|65aa|down_8|NZ_LR698984.1_1204056_1204251_-	NA|360aa|up_9|NZ_LR698984.1_1182682_1183762_+	pfam02618, YceG, YceG-like family	NA|225aa|up_8|NZ_LR698984.1_1183933_1184608_+	COG4122, COG4122, Predicted O-methyltransferase [General function prediction only]	NA|416aa|up_7|NZ_LR698984.1_1184594_1185842_+	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|555aa|up_6|NZ_LR698984.1_1185913_1187578_+	COG0768, FtsI, Cell division protein FtsI/penicillin-binding protein 2 [Cell envelope biogenesis, outer membrane]	NA|95aa|up_5|NZ_LR698984.1_1187674_1187959_-	PRK05803, PRK05803, RNA polymerase sporulation sigma factor SigK	NA|506aa|up_4|NZ_LR698984.1_1187981_1189499_-	cd00338, Ser_Recombinase, Serine Recombinase family, catalytic domain; a DNA binding domain may be present either N- or C-terminal to the catalytic domain	NA|274aa|up_3|NZ_LR698984.1_1189623_1190445_-	sd00006, TPR, Tetratricopeptide repeat	NA|476aa|up_2|NZ_LR698984.1_1190635_1192063_+	NF033435, S-layer_Clost, S-layer protein SlpA	NA|224aa|up_1|NZ_LR698984.1_1192372_1193044_+	pfam13518, HTH_28, Helix-turn-helix domain	NA|300aa|up_0|NZ_LR698984.1_1193037_1193937_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|52aa|down_0|NZ_LR698984.1_1197014_1197170_+	NA	NA|84aa|down_1|NZ_LR698984.1_1197415_1197667_+	NA	NA|128aa|down_2|NZ_LR698984.1_1198875_1199259_-	NA	NA|115aa|down_3|NZ_LR698984.1_1199385_1199730_+	NA	NA|72aa|down_4|NZ_LR698984.1_1199912_1200128_+	NA	NA|106aa|down_5|NZ_LR698984.1_1201113_1201431_-	NA	NA|190aa|down_6|NZ_LR698984.1_1201722_1202292_-	NA	NA|170aa|down_7|NZ_LR698984.1_1203445_1203955_+	pfam04892, VanZ, VanZ like family	NA|65aa|down_8|NZ_LR698984.1_1204056_1204251_-	NA	NA|131aa|down_9|NZ_LR698984.1_1204765_1205158_-	PRK05803, PRK05803, RNA polymerase sporulation sigma factor SigK
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	5	1404187-1404939	2,5,2	PILER-CR,CRISPRCasFinder,CRT	no	DinG	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Type IV-A	GTTTTATATTAACTAAGTGGTATGTAAAT,GTTTTATATTAACTAAGTGGTATGTAAAT,GTTTTATATTAACTAAGTGGTATGTAAAT	29,29,29	0	0	NA	NA	I-A:I-A:I-A	10,11,10	11	Orphan	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA,NA|80aa|down_4|NZ_LR698984.1_1412071_1412311_+,NA|59aa|down_5|NZ_LR698984.1_1412498_1412675_-	NA|260aa|up_9|NZ_LR698984.1_1393149_1393929_+	pfam08282, Hydrolase_3, haloacid dehalogenase-like hydrolase	NA|195aa|up_8|NZ_LR698984.1_1394032_1394617_-	pfam00882, Zn_dep_PLPC, Zinc dependent phospholipase C	NA|477aa|up_7|NZ_LR698984.1_1394816_1396247_-	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]	NA|315aa|up_6|NZ_LR698984.1_1396562_1397507_+	TIGR00950, Uncharacterized_inner_membrane_transporter_YicL, Carboxylate/Amino Acid/Amine Transporter	NA|312aa|up_5|NZ_LR698984.1_1397777_1398713_-	COG1242, COG1242, Predicted Fe-S oxidoreductase [General function prediction only]	NA|198aa|up_4|NZ_LR698984.1_1398832_1399426_-	COG3247, HdeD, Uncharacterized conserved protein [Function unknown]	NA|143aa|up_3|NZ_LR698984.1_1399532_1399961_+	pfam04657, DMT_YdcZ, Putative inner membrane exporter, YdcZ	NA|223aa|up_2|NZ_LR698984.1_1400027_1400696_+	cd01994, Alpha_ANH_like_IV, This is a subfamily of Adenine nucleotide alpha hydrolases superfamily	NA|452aa|up_1|NZ_LR698984.1_1400813_1402169_-	cd13143, MATE_MepA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Streptococcus aureus MepA	NA|119aa|up_0|NZ_LR698984.1_1403299_1403656_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|578aa|down_0|NZ_LR698984.1_1405264_1406998_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|282aa|down_1|NZ_LR698984.1_1407237_1408083_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|745aa|down_2|NZ_LR698984.1_1408113_1410348_-	cd01948, EAL, EAL domain	NA|270aa|down_3|NZ_LR698984.1_1411117_1411927_+	cd00592, HTH_MerR-like, Helix-Turn-Helix DNA binding domain of MerR-like transcription regulators	NA|80aa|down_4|NZ_LR698984.1_1412071_1412311_+	NA	NA|59aa|down_5|NZ_LR698984.1_1412498_1412675_-	NA	NA|225aa|down_6|NZ_LR698984.1_1413179_1413854_-	COG1285, SapB, Uncharacterized membrane protein [Function unknown]	NA|185aa|down_7|NZ_LR698984.1_1414073_1414628_-	cd01014, nicotinamidase_related, Nicotinamidase_ related amidohydrolases	NA|122aa|down_8|NZ_LR698984.1_1414705_1415071_-	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|262aa|down_9|NZ_LR698984.1_1415535_1416321_+	COG3393, COG3393, Predicted acetyltransferase [General function prediction only]
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	6	1515559-1515852	3,6,3	PILER-CR,CRISPRCasFinder,CRT	no		csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Orphan	GTTTTATATTAACTATATGGAATGTAAAT,GTTTTATATTAACTATATGGAATGTAAAT,GTTTTATATTAACTATATGGAATGTAAAT	29,29,29	0	0	NA	NA	I-A:I-A:I-A	3,4,4	4	Orphan	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA|50aa|up_9|NZ_LR698984.1_1504383_1504533_+,NA|190aa|up_7|NZ_LR698984.1_1505447_1506017_+,NA|61aa|up_4|NZ_LR698984.1_1508966_1509149_-,NA|95aa|down_0|NZ_LR698984.1_1516019_1516304_-,NA|330aa|down_5|NZ_LR698984.1_1520753_1521743_+	NA|50aa|up_9|NZ_LR698984.1_1504383_1504533_+	NA	NA|300aa|up_8|NZ_LR698984.1_1504543_1505443_+	TIGR02163, Ferredoxin-type_protein_NapH_homolog, ferredoxin-type protein, NapH/MauN family	NA|190aa|up_7|NZ_LR698984.1_1505447_1506017_+	NA	NA|216aa|up_6|NZ_LR698984.1_1506183_1506831_-	pfam02589, LUD_dom, LUD domain	NA|305aa|up_5|NZ_LR698984.1_1507220_1508135_-	pfam11155, DUF2935, Domain of unknown function (DUF2935)	NA|61aa|up_4|NZ_LR698984.1_1508966_1509149_-	NA	NA|283aa|up_3|NZ_LR698984.1_1509559_1510408_-	PRK00380, panC, pantoate--beta-alanine ligase; Reviewed	NA|276aa|up_2|NZ_LR698984.1_1510426_1511254_-	PRK00311, panB, 3-methyl-2-oxobutanoate hydroxymethyltransferase; Reviewed	NA|308aa|up_1|NZ_LR698984.1_1511228_1512152_-	pfam10728, DUF2520, Domain of unknown function (DUF2520)	NA|698aa|up_0|NZ_LR698984.1_1512804_1514898_+	cd01948, EAL, EAL domain	NA|95aa|down_0|NZ_LR698984.1_1516019_1516304_-	NA	NA|273aa|down_1|NZ_LR698984.1_1516510_1517329_+	cd04782, HTH_BltR, Helix-Turn-Helix DNA binding domain of the BltR transcription regulator	NA|707aa|down_2|NZ_LR698984.1_1517461_1519582_-	COG0370, FeoB, Fe2+ transport system protein B [Inorganic ion transport and metabolism]	NA|76aa|down_3|NZ_LR698984.1_1519607_1519835_-	pfam04023, FeoA, FeoA domain	NA|147aa|down_4|NZ_LR698984.1_1520161_1520602_-	COG2153, ElaA, Predicted acyltransferase [General function prediction only]	NA|330aa|down_5|NZ_LR698984.1_1520753_1521743_+	NA	NA|403aa|down_6|NZ_LR698984.1_1522218_1523427_+	PRK13354, PRK13354, tyrosyl-tRNA synthetase; Provisional	NA|294aa|down_7|NZ_LR698984.1_1523487_1524369_-	cd10944, CE4_SmPgdA_like, Catalytic NodB homology domain of Streptococcus mutans polysaccharide deacetylase PgdA, Bacillus subtilis YheN, and similar proteins	NA|315aa|down_8|NZ_LR698984.1_1524754_1525699_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|182aa|down_9|NZ_LR698984.1_1525921_1526467_+	cd01046, Rubrerythrin_like, rubrerythrin-like, diiron-binding domain
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	7	1608236-1608592	4,7,4	PILER-CR,CRISPRCasFinder,CRT	no		csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Orphan	GTTTTAGATTAACTATATGGAATGTAAAT,GTTTTAGATTAACTATATGGAATGTAAAT,GTTTTAGATTAACTATATGGAATGTAAAT	29,29,29	0	0	NA	NA	I-A:I-A:I-A	4,5,5	5	Orphan	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA|116aa|up_9|NZ_LR698984.1_1601287_1601635_+,NA|269aa|up_7|NZ_LR698984.1_1602034_1602841_+,NA|146aa|up_6|NZ_LR698984.1_1603189_1603627_+,NA|59aa|up_5|NZ_LR698984.1_1603619_1603796_+,NA|276aa|up_2|NZ_LR698984.1_1605652_1606480_+,NA|54aa|up_0|NZ_LR698984.1_1607332_1607494_+,NA|63aa|down_0|NZ_LR698984.1_1608830_1609019_+	NA|116aa|up_9|NZ_LR698984.1_1601287_1601635_+	NA	NA|97aa|up_8|NZ_LR698984.1_1601634_1601925_+	pfam04883, HK97-gp10_like, Bacteriophage HK97-gp10, putative tail-component	NA|269aa|up_7|NZ_LR698984.1_1602034_1602841_+	NA	NA|146aa|up_6|NZ_LR698984.1_1603189_1603627_+	NA	NA|59aa|up_5|NZ_LR698984.1_1603619_1603796_+	NA	NA|437aa|up_4|NZ_LR698984.1_1603796_1605107_+	pfam04984, Phage_sheath_1, Phage tail sheath protein subtilisin-like domain	NA|157aa|up_3|NZ_LR698984.1_1605123_1605594_+	pfam09393, DUF2001, Phage tail tube protein	NA|276aa|up_2|NZ_LR698984.1_1605652_1606480_+	NA	NA|147aa|up_1|NZ_LR698984.1_1606551_1606992_+	pfam08890, Phage_TAC_5, Phage XkdN-like tail assembly chaperone protein, TAC	NA|54aa|up_0|NZ_LR698984.1_1607332_1607494_+	NA	NA|63aa|down_0|NZ_LR698984.1_1608830_1609019_+	NA	NA|289aa|down_1|NZ_LR698984.1_1609137_1610004_+	smart01040, Bro-N, BRO family, N-terminal domain	NA|176aa|down_2|NZ_LR698984.1_1610868_1611396_+	pfam11611, DUF4352, Domain of unknown function (DUF4352)	NA|256aa|down_3|NZ_LR698984.1_1611533_1612301_+	pfam14471, DUF4428, Domain of unknown function (DUF4428)	NA|791aa|down_4|NZ_LR698984.1_1612366_1614739_+	TIGR02675, Mu-like_prophage_FluMu_protein_gp42, tape measure domain	NA|230aa|down_5|NZ_LR698984.1_1614755_1615445_+	PRK11198, PRK11198, LysM domain/BON superfamily protein; Provisional	NA|631aa|down_6|NZ_LR698984.1_1615437_1617330_+	pfam00877, NLPC_P60, NlpC/P60 family	NA|87aa|down_7|NZ_LR698984.1_1617343_1617604_+	pfam10844, DUF2577, Protein of unknown function (DUF2577)	NA|140aa|down_8|NZ_LR698984.1_1617608_1618028_+	pfam10934, DUF2634, Protein of unknown function (DUF2634)	NA|350aa|down_9|NZ_LR698984.1_1618028_1619078_+	pfam04865, Baseplate_J, Baseplate J-like protein
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	8	1610457-1610747	8	CRISPRCasFinder	no		csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Orphan	GTTTTATATTAACTATATGGAATGTAAATC	30	0	0	NA	NA	I-A	4	4	Orphan	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA|269aa|up_9|NZ_LR698984.1_1602034_1602841_+,NA|146aa|up_8|NZ_LR698984.1_1603189_1603627_+,NA|59aa|up_7|NZ_LR698984.1_1603619_1603796_+,NA|276aa|up_4|NZ_LR698984.1_1605652_1606480_+,NA|54aa|up_2|NZ_LR698984.1_1607332_1607494_+,NA|63aa|up_1|NZ_LR698984.1_1608830_1609019_+,NA	NA|269aa|up_9|NZ_LR698984.1_1602034_1602841_+	NA	NA|146aa|up_8|NZ_LR698984.1_1603189_1603627_+	NA	NA|59aa|up_7|NZ_LR698984.1_1603619_1603796_+	NA	NA|437aa|up_6|NZ_LR698984.1_1603796_1605107_+	pfam04984, Phage_sheath_1, Phage tail sheath protein subtilisin-like domain	NA|157aa|up_5|NZ_LR698984.1_1605123_1605594_+	pfam09393, DUF2001, Phage tail tube protein	NA|276aa|up_4|NZ_LR698984.1_1605652_1606480_+	NA	NA|147aa|up_3|NZ_LR698984.1_1606551_1606992_+	pfam08890, Phage_TAC_5, Phage XkdN-like tail assembly chaperone protein, TAC	NA|54aa|up_2|NZ_LR698984.1_1607332_1607494_+	NA	NA|63aa|up_1|NZ_LR698984.1_1608830_1609019_+	NA	NA|289aa|up_0|NZ_LR698984.1_1609137_1610004_+	smart01040, Bro-N, BRO family, N-terminal domain	NA|176aa|down_0|NZ_LR698984.1_1610868_1611396_+	pfam11611, DUF4352, Domain of unknown function (DUF4352)	NA|256aa|down_1|NZ_LR698984.1_1611533_1612301_+	pfam14471, DUF4428, Domain of unknown function (DUF4428)	NA|791aa|down_2|NZ_LR698984.1_1612366_1614739_+	TIGR02675, Mu-like_prophage_FluMu_protein_gp42, tape measure domain	NA|230aa|down_3|NZ_LR698984.1_1614755_1615445_+	PRK11198, PRK11198, LysM domain/BON superfamily protein; Provisional	NA|631aa|down_4|NZ_LR698984.1_1615437_1617330_+	pfam00877, NLPC_P60, NlpC/P60 family	NA|87aa|down_5|NZ_LR698984.1_1617343_1617604_+	pfam10844, DUF2577, Protein of unknown function (DUF2577)	NA|140aa|down_6|NZ_LR698984.1_1617608_1618028_+	pfam10934, DUF2634, Protein of unknown function (DUF2634)	NA|350aa|down_7|NZ_LR698984.1_1618028_1619078_+	pfam04865, Baseplate_J, Baseplate J-like protein	NA|206aa|down_8|NZ_LR698984.1_1619070_1619688_+	pfam10076, DUF2313, Uncharacterized protein conserved in bacteria (DUF2313)	NA|342aa|down_9|NZ_LR698984.1_1619699_1620725_+	pfam12571, DUF3751, Phage tail-collar fibre protein
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	9	1748023-1749238	5,9,5	PILER-CR,CRISPRCasFinder,CRT	no		csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Orphan	GTTTTATATTAACTATGTGGTATGTAAAT,GTTTTATATTAACTATGTGGTATGTAAAT,GTTTTATATTAACTATGTGGTATGTAAAT	29,29,29	0	0	NA	NA	I-A:I-A:I-A	16,18,18	18	Orphan	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA,NA|96aa|down_0|NZ_LR698984.1_1749646_1749934_+,NA|59aa|down_5|NZ_LR698984.1_1756321_1756498_-,NA|341aa|down_6|NZ_LR698984.1_1757072_1758095_+,NA|258aa|down_8|NZ_LR698984.1_1759001_1759775_+	NA|200aa|up_9|NZ_LR698984.1_1730848_1731448_-	COG4110, COG4110, Uncharacterized protein involved in stress response [General function prediction only]	NA|266aa|up_8|NZ_LR698984.1_1731550_1732348_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|329aa|up_7|NZ_LR698984.1_1732657_1733644_-	TIGR00545, Probable_lipoate-protein_ligase_A, lipoyltransferase and lipoate-protein ligase	NA|577aa|up_6|NZ_LR698984.1_1734127_1735858_+	COG1757, NhaC, Na+/H+ antiporter [Energy production and conversion]	NA|825aa|up_5|NZ_LR698984.1_1737258_1739733_+	PRK00451, PRK00451, aminomethyl-transferring glycine dehydrogenase subunit GcvPA	NA|486aa|up_4|NZ_LR698984.1_1739732_1741190_+	PRK04366, PRK04366, aminomethyl-transferring glycine dehydrogenase subunit GcvPB	NA|786aa|up_3|NZ_LR698984.1_1741612_1743970_+	cd02609, P-type_ATPase, uncharacterized subfamily of P-type ATPase transporter, similar to uncharacterized Streptococcus pneumoniae exported protein 7, Exp7	NA|106aa|up_2|NZ_LR698984.1_1744055_1744373_-	pfam06865, DUF1255, Protein of unknown function (DUF1255)	NA|252aa|up_1|NZ_LR698984.1_1744710_1745466_+	pfam12395, DUF3658, Protein of unknown function	NA|424aa|up_0|NZ_LR698984.1_1746104_1747376_-	cd01303, GDEase, Guanine deaminase (GDEase)	NA|96aa|down_0|NZ_LR698984.1_1749646_1749934_+	NA	NA|184aa|down_1|NZ_LR698984.1_1750471_1751023_-	cd02209, cupin_XRE_C, XRE (Xenobiotic Response Element) family transcriptional regulators, C-terminal cupin domain	NA|321aa|down_2|NZ_LR698984.1_1751213_1752176_+	cd01561, CBS_like, CBS_like: This subgroup includes Cystathionine beta-synthase (CBS) and Cysteine synthase	NA|615aa|down_3|NZ_LR698984.1_1752470_1754315_+	cd08579, GDPD_memb_like, Glycerophosphodiester phosphodiesterase domain of uncharacterized bacterial glycerophosphodiester phosphodiesterases	NA|195aa|down_4|NZ_LR698984.1_1754636_1755221_+	cd03357, LbH_MAT_GAT, Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates	NA|59aa|down_5|NZ_LR698984.1_1756321_1756498_-	NA	NA|341aa|down_6|NZ_LR698984.1_1757072_1758095_+	NA	NA|296aa|down_7|NZ_LR698984.1_1758108_1758996_+	cd03264, ABC_drug_resistance_like, ABC-type multidrug transport system, ATPase component	NA|258aa|down_8|NZ_LR698984.1_1759001_1759775_+	NA	NA|242aa|down_9|NZ_LR698984.1_1759886_1760612_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	10	2098542-2099225	6,10,6	CRT,CRISPRCasFinder,PILER-CR	no		csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Orphan	TTTACATTCCATATAGTTAATATAAAAC,CTTTACATTCCATATAGTTAATATAAAAC,GTTTTATATTAACTATATGGAATGTAAAG	28,29,29	0	0	NA	NA	I-A:I-A:I-A	10,9,8	10	Orphan	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA|58aa|up_1|NZ_LR698984.1_2097816_2097990_+,NA|95aa|up_0|NZ_LR698984.1_2098146_2098431_-,NA|206aa|down_7|NZ_LR698984.1_2107844_2108462_-,NA|567aa|down_8|NZ_LR698984.1_2108631_2110332_-,NA|188aa|down_9|NZ_LR698984.1_2110728_2111292_-	NA|406aa|up_9|NZ_LR698984.1_2089256_2090474_-	cd02152, OAT, Ornithine acetyltransferase (OAT) family; also referred to as ArgJ	NA|345aa|up_8|NZ_LR698984.1_2090515_2091550_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|255aa|up_7|NZ_LR698984.1_2092239_2093004_-	pfam14268, YoaP, YoaP-like	NA|158aa|up_6|NZ_LR698984.1_2093306_2093780_+	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|186aa|up_5|NZ_LR698984.1_2094148_2094706_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|233aa|up_4|NZ_LR698984.1_2094963_2095662_+	TIGR03502, lipase_Pla1_cef, extracellular lipase, Pla-1/cef family	NA|156aa|up_3|NZ_LR698984.1_2096683_2097151_+	pfam11188, DUF2975, Protein of unknown function (DUF2975)	NA|72aa|up_2|NZ_LR698984.1_2097161_2097377_+	COG3655, COG3655, Predicted transcriptional regulator [Transcription]	NA|58aa|up_1|NZ_LR698984.1_2097816_2097990_+	NA	NA|95aa|up_0|NZ_LR698984.1_2098146_2098431_-	NA	NA|291aa|down_0|NZ_LR698984.1_2100055_2100928_-	COG1737, RpiR, Transcriptional regulators [Transcription]	NA|193aa|down_1|NZ_LR698984.1_2101321_2101900_-	pfam06962, rRNA_methylase, Putative rRNA methylase	NA|239aa|down_2|NZ_LR698984.1_2101957_2102674_-	COG1187, RsuA, 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases [Translation, ribosomal structure and biogenesis]	NA|362aa|down_3|NZ_LR698984.1_2102864_2103950_+	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|220aa|down_4|NZ_LR698984.1_2104104_2104764_-	pfam03154, Atrophin-1, Atrophin-1 family	NA|431aa|down_5|NZ_LR698984.1_2104980_2106273_-	cd06828, PLPDE_III_DapDC, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Diaminopimelate Decarboxylase	NA|398aa|down_6|NZ_LR698984.1_2106290_2107484_-	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|206aa|down_7|NZ_LR698984.1_2107844_2108462_-	NA	NA|567aa|down_8|NZ_LR698984.1_2108631_2110332_-	NA	NA|188aa|down_9|NZ_LR698984.1_2110728_2111292_-	NA
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	11	2559174-2559331	7	PILER-CR	no	cas3,cas5,cas7,cas6	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Unclear	AGGTTTAAGATTAACTAAGTGGATTTCA	28	0	0	NA	NA	NA	2	2	Unclear	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA,NA|492aa|down_3|NZ_LR698984.1_2563762_2565238_-	NA|448aa|up_9|NZ_LR698984.1_2551630_2552974_-	pfam06898, YqfD, Putative stage IV sporulation protein YqfD	NA|79aa|up_8|NZ_LR698984.1_2552986_2553223_-	pfam07873, YabP, YabP family	NA|188aa|up_7|NZ_LR698984.1_2553435_2553999_-	cd00552, RaiA, RaiA ("ribosome-associated inhibitor A", also known as Protein Y (PY), YfiA, and SpotY,  is a stress-response protein that binds the ribosomal subunit interface and arrests translation by interfering with aminoacyl-tRNA binding to the ribosomal A site	NA|166aa|up_6|NZ_LR698984.1_2554191_2554689_-	cd15904, TSPO_MBR, Translocator protein (TSPO)/peripheral-type benzodiazepine receptor (MBR) family	NA|148aa|up_5|NZ_LR698984.1_2554799_2555243_-	pfam09424, YqeY, Yqey-like protein	NA|60aa|up_4|NZ_LR698984.1_2555272_2555452_-	PRK00270, rpsU, 30S ribosomal protein S21; Reviewed	NA|117aa|up_3|NZ_LR698984.1_2555614_2555965_-	cd01276, PKCI_related, Protein Kinase C Interacting protein related (PKCI): PKCI and related proteins belong to the ubiquitous HIT family of hydrolases that act on alpha-phosphates of ribonucleotides	NA|433aa|up_2|NZ_LR698984.1_2556007_2557306_-	COG0621, MiaB, 2-methylthioadenine synthetase [Translation, ribosomal structure and biogenesis]	NA|253aa|up_1|NZ_LR698984.1_2557307_2558066_-	PRK11713, PRK11713, 16S ribosomal RNA methyltransferase RsmE; Provisional	NA|316aa|up_0|NZ_LR698984.1_2558084_2559032_-	pfam06325, PrmA, Ribosomal protein L11 methyltransferase (PrmA)	cas3|803aa|down_0|NZ_LR698984.1_2559472_2561881_-	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas5|269aa|down_1|NZ_LR698984.1_2561907_2562714_-	cd09658, Cas5_I-B, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|338aa|down_2|NZ_LR698984.1_2562729_2563743_-	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	NA|492aa|down_3|NZ_LR698984.1_2563762_2565238_-	NA	cas6|246aa|down_4|NZ_LR698984.1_2565242_2565980_-	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|370aa|down_5|NZ_LR698984.1_2566238_2567348_+	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC	NA|256aa|down_6|NZ_LR698984.1_2567377_2568145_+	COG2508, COG2508, Regulator of polyketide synthase expression [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|458aa|down_7|NZ_LR698984.1_2568332_2569706_-	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|312aa|down_8|NZ_LR698984.1_2569741_2570677_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|385aa|down_9|NZ_LR698984.1_2571642_2572797_-	PRK10767, PRK10767, chaperone protein DnaJ; Provisional
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	12	2871225-2871550	11	CRISPRCasFinder	no		csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Orphan	TTACCATCTATTTGTTGCCATCCTGT	26	0	0	NA	NA	NA	5	5	Orphan	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA,NA|107aa|down_2|NZ_LR698984.1_2874414_2874735_-,NA|664aa|down_3|NZ_LR698984.1_2874724_2876716_-,NA|458aa|down_7|NZ_LR698984.1_2887373_2888747_-	NA|218aa|up_9|NZ_LR698984.1_2856242_2856896_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|448aa|up_8|NZ_LR698984.1_2857399_2858743_+	COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide transport and    metabolism]	NA|455aa|up_7|NZ_LR698984.1_2858889_2860254_+	cd01298, ATZ_TRZ_like, TRZ/ATZ family contains enzymes from the atrazine degradation pathway and related hydrolases	NA|622aa|up_6|NZ_LR698984.1_2860431_2862297_-	smart00910, HIRAN, The HIRAN protein (HIP116, Rad5p N-terminal) is found in the N-terminal regions of the SWI2/SNF2 proteins typified by HIP116 and Rad5p	NA|406aa|up_5|NZ_LR698984.1_2862532_2863750_-	cd17396, MFS_YdiM_like, Inner membrane transport protein YdiM and similar proteins of the Major Facilitator Superfamily of transporters	NA|288aa|up_4|NZ_LR698984.1_2863949_2864813_-	PRK12548, PRK12548, shikimate dehydrogenase	NA|641aa|up_3|NZ_LR698984.1_2864894_2866817_-	TIGR03997, NADH:flavin_oxidoreductase, mycofactocin system FadH/OYE family oxidoreductase 2	NA|283aa|up_2|NZ_LR698984.1_2866895_2867744_-	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]	NA|294aa|up_1|NZ_LR698984.1_2868156_2869038_+	cd08434, PBP2_GltC_like, The substrate binding domain of LysR-type transcriptional regulator GltC, which activates gltA expression of glutamate synthase operon, contains type 2 periplasmic binding fold	NA|288aa|up_0|NZ_LR698984.1_2869245_2870109_-	cd01027, TOPRIM_RNase_M5_like, TOPRIM_ RNase M5_like: The topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain found in Ribonuclease M5: (RNase M5) and other small primase-like proteins from bacteria and archaea	NA|338aa|down_0|NZ_LR698984.1_2872309_2873323_-	PLN02240, PLN02240, UDP-glucose 4-epimerase	NA|290aa|down_1|NZ_LR698984.1_2873347_2874217_-	COG1210, GalU, UDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane]	NA|107aa|down_2|NZ_LR698984.1_2874414_2874735_-	NA	NA|664aa|down_3|NZ_LR698984.1_2874724_2876716_-	NA	NA|848aa|down_4|NZ_LR698984.1_2876837_2879381_-	PTZ00121, PTZ00121, MAEBL; Provisional	NA|226aa|down_5|NZ_LR698984.1_2880124_2880802_+	cd05826, Sortase_B, Sortase domain found in class B sortases	NA|313aa|down_6|NZ_LR698984.1_2886062_2887001_+	cd10948, CE4_BsPdaA_like, Catalytic NodB homology domain of Bacillus subtilis polysaccharide deacetylase PdaA, and its bacterial homologs	NA|458aa|down_7|NZ_LR698984.1_2887373_2888747_-	NA	NA|441aa|down_8|NZ_LR698984.1_2888933_2890256_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|235aa|down_9|NZ_LR698984.1_2890248_2890953_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	13	3035242-3035453	12	CRISPRCasFinder	no	cas14j	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Unclear	TCTTCTGATGGTGGTACTGGTGGATT	26	0	0	NA	NA	NA	3	3	TypeV	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA|459aa|up_7|NZ_LR698984.1_3024409_3025786_+,NA	NA|216aa|up_9|NZ_LR698984.1_3022308_3022956_-	COG3314, COG3314, Uncharacterized protein conserved in bacteria [Function unknown]	NA|391aa|up_8|NZ_LR698984.1_3022969_3024142_-	cd03885, M20_CPDG2, M20 Peptidase Glutamate carboxypeptidase, a periplasmic enzyme	NA|459aa|up_7|NZ_LR698984.1_3024409_3025786_+	NA	NA|278aa|up_6|NZ_LR698984.1_3026036_3026870_-	cd00592, HTH_MerR-like, Helix-Turn-Helix DNA binding domain of MerR-like transcription regulators	cas14j|373aa|up_5|NZ_LR698984.1_3027729_3028848_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|365aa|up_4|NZ_LR698984.1_3029700_3030795_-	cd02803, OYE_like_FMN_family, Old yellow enzyme (OYE)-like FMN binding domain	NA|149aa|up_3|NZ_LR698984.1_3030796_3031243_-	COG1846, MarR, Transcriptional regulators [Transcription]	NA|397aa|up_2|NZ_LR698984.1_3031548_3032739_+	PRK06836, PRK06836, pyridoxal phosphate-dependent aminotransferase	NA|321aa|up_1|NZ_LR698984.1_3032908_3033871_+	TIGR00950, Uncharacterized_inner_membrane_transporter_YicL, Carboxylate/Amino Acid/Amine Transporter	NA|221aa|up_0|NZ_LR698984.1_3033988_3034651_-	cd20183, M34_PPEP, Pro-Pro endopeptidase (PPEP) and similar proteins; belongs to peptidase family M34	NA|482aa|down_0|NZ_LR698984.1_3038227_3039673_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|134aa|down_1|NZ_LR698984.1_3039691_3040093_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|83aa|down_2|NZ_LR698984.1_3040343_3040592_-	pfam12645, HTH_16, Helix-turn-helix domain	NA|139aa|down_3|NZ_LR698984.1_3040598_3041015_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|293aa|down_4|NZ_LR698984.1_3041462_3042341_-	pfam01656, CbiA, CobQ/CobB/MinD/ParA nucleotide binding domain	NA|95aa|down_5|NZ_LR698984.1_3042443_3042728_-	PRK07220, PRK07220, DNA topoisomerase I; Validated	NA|246aa|down_6|NZ_LR698984.1_3042873_3043611_-	pfam00398, RrnaAD, Ribosomal RNA adenine dimethylase	NA|53aa|down_7|NZ_LR698984.1_3044128_3044287_-	pfam06414, Zeta_toxin, Zeta toxin	NA|91aa|down_8|NZ_LR698984.1_3044288_3044561_-	pfam08998, Epsilon_antitox, Bacterial epsilon antitoxin	NA|70aa|down_9|NZ_LR698984.1_3044577_3044787_-	pfam07764, Omega_Repress, Omega Transcriptional Repressor
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	14	3170688-3172301	13,7,8	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas3,cas5,cas7,cas8b2,cas6,WYL	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Unclear	ATTTACATACCACTTAGTTAATATAAAAC,ATTTACATACCACTTAGTTAATATAAAAC,GTTTTATATTAACTAAGTGGTATGTAAAT	29,29,29	0	0	NA	NA	I-A:I-A:I-A	24,24,21	24	Unclear	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA|67aa|up_7|NZ_LR698984.1_3163579_3163780_+,NA|260aa|up_1|NZ_LR698984.1_3168963_3169743_+,NA	NA|740aa|up_9|NZ_LR698984.1_3157926_3160146_-	cd01948, EAL, EAL domain	NA|885aa|up_8|NZ_LR698984.1_3160421_3163076_-	PRK13805, PRK13805, bifunctional acetaldehyde-CoA/alcohol dehydrogenase; Provisional	NA|67aa|up_7|NZ_LR698984.1_3163579_3163780_+	NA	NA|197aa|up_6|NZ_LR698984.1_3163987_3164578_-	PRK08305, spoVFB, dipicolinate synthase subunit B; Reviewed	NA|292aa|up_5|NZ_LR698984.1_3164580_3165456_-	PRK08306, PRK08306, dipicolinate synthase subunit DpsA	NA|464aa|up_4|NZ_LR698984.1_3165531_3166923_-	cd17633, AFD_YhfT-like, fatty acid-CoA ligase VraA	NA|379aa|up_3|NZ_LR698984.1_3166926_3168063_-	cd00751, thiolase, Thiolase are ubiquitous enzymes that catalyze the reversible thiolytic cleavage of 3-ketoacyl-CoA into acyl-CoA and acetyl-CoA, a 2-step reaction involving a covalent intermediate formed with a catalytic cysteine	NA|182aa|up_2|NZ_LR698984.1_3168094_3168640_-	pfam02632, BioY, BioY family	NA|260aa|up_1|NZ_LR698984.1_3168963_3169743_+	NA	NA|143aa|up_0|NZ_LR698984.1_3170198_3170627_-	PRK10562, PRK10562, putative acetyltransferase; Provisional	cas2|89aa|down_0|NZ_LR698984.1_3172484_3172751_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|327aa|down_1|NZ_LR698984.1_3172753_3173734_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas4|184aa|down_2|NZ_LR698984.1_3173736_3174288_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas3|774aa|down_3|NZ_LR698984.1_3174302_3176624_-	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas5|234aa|down_4|NZ_LR698984.1_3176634_3177336_-	cd09658, Cas5_I-B, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|305aa|down_5|NZ_LR698984.1_3177337_3178252_-	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8b2|626aa|down_6|NZ_LR698984.1_3178253_3180131_-	cd09665, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1	cas6|246aa|down_7|NZ_LR698984.1_3180136_3180874_-	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	WYL|306aa|down_8|NZ_LR698984.1_3180951_3181869_-	pfam13280, WYL, WYL domain	NA|639aa|down_9|NZ_LR698984.1_3182147_3184064_-	pfam02687, FtsX, FtsX-like permease family
GCF_902386365.1_UHGG_MGYG-HGUT-02369	NZ_LR698984	Clostridioides difficile isolate MGYG-HGUT-02369 chromosome 1	15	3417638-3418260	14,8,9	CRISPRCasFinder,CRT,PILER-CR	no	WYL,cas3,cas7,cas6,cas5	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	Unclear	ATTTATAACTAACTTAGTGTAATTTAAAC,ATTTATAACTAACTTAGTGTAATTTAAAC,ATTTATAACTAACTTAGTGTAATTTAAAC	29,29,29	0	0	NA	NA	NA:NA:NA	9,9,8	9	Unclear	csa3,cas14j,WYL,cas3,DEDDh,DinG,cas14k,cas5,cas7,cas6,cas2,cas1,cas4,cas8b2	NA|70aa|up_8|NZ_LR698984.1_3402192_3402402_-,NA|243aa|up_6|NZ_LR698984.1_3406845_3407574_-,NA|540aa|up_5|NZ_LR698984.1_3407591_3409211_-,NA|389aa|down_2|NZ_LR698984.1_3421599_3422766_-,NA|91aa|down_6|NZ_LR698984.1_3426049_3426322_-,NA|248aa|down_8|NZ_LR698984.1_3427918_3428662_-,NA|105aa|down_9|NZ_LR698984.1_3428683_3428998_-	NA|47aa|up_9|NZ_LR698984.1_3401656_3401797_-	COG1476, COG1476, Predicted transcriptional regulators [Transcription]	NA|70aa|up_8|NZ_LR698984.1_3402192_3402402_-	NA	NA|1452aa|up_7|NZ_LR698984.1_3402503_3406859_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|243aa|up_6|NZ_LR698984.1_3406845_3407574_-	NA	NA|540aa|up_5|NZ_LR698984.1_3407591_3409211_-	NA	NA|337aa|up_4|NZ_LR698984.1_3409210_3410221_-	pfam09983, DUF2220, Uncharacterized protein conserved in bacteria C-term(DUF2220)	NA|130aa|up_3|NZ_LR698984.1_3411121_3411511_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|53aa|up_2|NZ_LR698984.1_3411567_3411726_+	pfam13333, rve_2, Integrase core domain	NA|431aa|up_1|NZ_LR698984.1_3411952_3413245_-	pfam03235, DUF262, Protein of unknown function DUF262	NA|405aa|up_0|NZ_LR698984.1_3413244_3414459_-	pfam08867, FRG, FRG domain	cas3|662aa|down_0|NZ_LR698984.1_3418512_3420498_-	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas7|357aa|down_1|NZ_LR698984.1_3420508_3421579_-	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	NA|389aa|down_2|NZ_LR698984.1_3421599_3422766_-	NA	cas6|255aa|down_3|NZ_LR698984.1_3422767_3423532_-	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas5|216aa|down_4|NZ_LR698984.1_3423522_3424170_-	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	NA|376aa|down_5|NZ_LR698984.1_3424649_3425777_-	COG3183, COG3183, Predicted restriction endonuclease [Defense mechanisms]	NA|91aa|down_6|NZ_LR698984.1_3426049_3426322_-	NA	NA|445aa|down_7|NZ_LR698984.1_3426571_3427906_-	TIGR00665, DnaB, replicative DNA helicase	NA|248aa|down_8|NZ_LR698984.1_3427918_3428662_-	NA	NA|105aa|down_9|NZ_LR698984.1_3428683_3428998_-	NA
