assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_010731575.1_ASM1073157v1	NZ_AP022617	Mycolicibacterium monacense strain JCM 15658	1	29245-29331	1	CRISPRCasFinder	no	csa3	csa3,DEDDh,cas3,DinG,cas4,WYL,casR	Type I-A	CGCCGGTCGGGTGACGGCGGATGATGC	27	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,DinG,cas4,WYL,casR	NA,NA|463aa|down_7|NZ_AP022617.1_37727_39116_+	NA|90aa|up_9|NZ_AP022617.1_13183_13453_+	pfam14019, DUF4235, Protein of unknown function (DUF4235)	NA|351aa|up_8|NZ_AP022617.1_16482_17535_+	pfam03631, Virul_fac_BrkB, Virulence factor BrkB	NA|255aa|up_7|NZ_AP022617.1_17562_18327_+	PRK08284, PRK08284, precorrin 6A synthase; Provisional	NA|130aa|up_6|NZ_AP022617.1_18345_18735_+	COG1950, COG1950, Predicted membrane protein [Function unknown]	NA|250aa|up_5|NZ_AP022617.1_19686_20436_+	PRK08251, PRK08251, SDR family oxidoreductase	NA|555aa|up_4|NZ_AP022617.1_20485_22150_-	PRK00179, pgi, glucose-6-phosphate isomerase; Reviewed	NA|483aa|up_3|NZ_AP022617.1_22201_23650_+	cd07103, ALDH_F5_SSADH_GabD, Mitochondrial succinate-semialdehyde dehydrogenase and ALDH family members 5A1 and 5F1-like	NA|681aa|up_2|NZ_AP022617.1_23809_25852_+	COG1835, COG1835, Predicted acyltransferases [Lipid metabolism]	NA|99aa|up_1|NZ_AP022617.1_25904_26201_-	PRK07857, PRK07857, chorismate mutase	NA|785aa|up_0|NZ_AP022617.1_26399_28754_+	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|361aa|down_0|NZ_AP022617.1_29395_30478_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|388aa|down_1|NZ_AP022617.1_30740_31904_+	PRK00696, sucC, ADP-forming succinate--CoA ligase subunit beta	NA|301aa|down_2|NZ_AP022617.1_31914_32817_+	PRK05678, PRK05678, succinyl-CoA synthetase subunit alpha; Validated	NA|510aa|down_3|NZ_AP022617.1_32962_34492_+	PRK08257, PRK08257, acetyl-CoA acetyltransferase; Validated	NA|283aa|down_4|NZ_AP022617.1_34563_35412_-	TIGR03619, F420_Rv2161c, probable F420-dependent oxidoreductase, Rv2161c family	NA|377aa|down_5|NZ_AP022617.1_35512_36643_+	TIGR04021, LLM_DMSO2_sfnG, dimethyl sulfone monooxygenase SfnG	NA|294aa|down_6|NZ_AP022617.1_36788_37670_+	pfam17270, DUF5336, Family of unknown function (DUF5336)	NA|463aa|down_7|NZ_AP022617.1_37727_39116_+	NA	NA|210aa|down_8|NZ_AP022617.1_39135_39765_+	PRK05647, purN, phosphoribosylglycinamide formyltransferase; Reviewed	NA|528aa|down_9|NZ_AP022617.1_39761_41345_+	PRK00881, purH, bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase; Provisional
GCF_010731575.1_ASM1073157v1	NZ_AP022617	Mycolicibacterium monacense strain JCM 15658	2	993667-993743	2	CRISPRCasFinder	no		csa3,DEDDh,cas3,DinG,cas4,WYL,casR	Orphan	GCCGGCCAAGAAGGCCGCCAAGA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,DinG,cas4,WYL,casR	NA,NA|220aa|down_6|NZ_AP022617.1_1005966_1006626_+	NA|387aa|up_9|NZ_AP022617.1_982179_983340_+	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|162aa|up_8|NZ_AP022617.1_983336_983822_+	cd03451, FkbR2, FkbR2 is a Streptomyces hygroscopicus protein with a hot dog fold that belongs to a conserved family of proteins found in prokaryotes and archaea but not in eukaryotes	NA|273aa|up_7|NZ_AP022617.1_983818_984637_+	COG2301, CitE, Citrate lyase beta subunit [Carbohydrate transport and metabolism]	NA|357aa|up_6|NZ_AP022617.1_984742_985813_+	TIGR03181, PDH_E1_alph_x, pyruvate dehydrogenase E1 component, alpha subunit	NA|350aa|up_5|NZ_AP022617.1_985814_986864_+	COG0022, AcoB, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [Energy production and conversion]	NA|385aa|up_4|NZ_AP022617.1_986860_988015_+	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed	NA|251aa|up_3|NZ_AP022617.1_988016_988769_-	PRK05870, PRK05870, enoyl-CoA hydratase; Provisional	NA|148aa|up_2|NZ_AP022617.1_988909_989353_-	pfam12900, Pyridox_ox_2, Pyridoxamine 5'-phosphate oxidase	NA|294aa|up_1|NZ_AP022617.1_989521_990403_+	pfam00582, Usp, Universal stress protein family	NA|494aa|up_0|NZ_AP022617.1_990647_992129_+	TIGR02946, Putative_diacyglycerol_O-acyltransferase_Mb3115, acyltransferase, WS/DGAT/MGAT	NA|791aa|down_0|NZ_AP022617.1_993807_996180_+	PRK03355, PRK03355, glycerol-3-phosphate 1-O-acyltransferase	NA|666aa|down_1|NZ_AP022617.1_996206_998204_+	pfam09678, Caa3_CtaG, Cytochrome c oxidase caa3 assembly factor (Caa3_CtaG)	NA|169aa|down_2|NZ_AP022617.1_998349_998856_+	PRK05853, PRK05853, hypothetical protein; Validated	NA|558aa|down_3|NZ_AP022617.1_998932_1000606_+	PRK11819, PRK11819, putative ABC transporter ATP-binding protein; Reviewed	NA|1620aa|down_4|NZ_AP022617.1_1000700_1005560_+	pfam05088, Bac_GDH, Bacterial NAD-glutamate dehydrogenase	NA|138aa|down_5|NZ_AP022617.1_1005556_1005970_+	COG0824, FcbC, Predicted thioesterase [General function prediction only]	NA|220aa|down_6|NZ_AP022617.1_1005966_1006626_+	NA	NA|540aa|down_7|NZ_AP022617.1_1006622_1008242_-	cd11332, AmyAc_OligoGlu_TS, Alpha amylase catalytic domain found in oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), trehalose synthase (also called maltose alpha-D-glucosyltransferase), and related proteins	NA|132aa|down_8|NZ_AP022617.1_1008257_1008653_-	cd14771, TrHb2_Mt-trHbO-like_O, Truncated hemoglobins, group 2 (O); Mycobacterium tuberculosis hemoglobin O like	NA|223aa|down_9|NZ_AP022617.1_1008794_1009463_+	COG1403, McrA, Restriction endonuclease [Defense mechanisms]
GCF_010731575.1_ASM1073157v1	NZ_AP022617	Mycolicibacterium monacense strain JCM 15658	3	1629612-1629694	3	CRISPRCasFinder	no	csa3	csa3,DEDDh,cas3,DinG,cas4,WYL,casR	Type I-A	CGCCTCTGCGACACCGCCGTCGGTGG	26	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,DinG,cas4,WYL,casR	NA,NA	NA|195aa|up_9|NZ_AP022617.1_1618934_1619519_+	pfam16859, TetR_C_11, Bacterial transcriptional repressor C-terminal	NA|480aa|up_8|NZ_AP022617.1_1619643_1621083_+	PRK07899, rpsA, 30S ribosomal protein S1; Reviewed	NA|386aa|up_7|NZ_AP022617.1_1621160_1622318_+	PRK03333, coaE, dephospho-CoA kinase/protein folding accessory domain-containing protein; Provisional	NA|182aa|up_6|NZ_AP022617.1_1622322_1622868_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|138aa|up_5|NZ_AP022617.1_1622881_1623295_-	COG2306, COG2306, Predicted RNA-binding protein, associated with RNAses E/G family [General function prediction only]	NA|720aa|up_4|NZ_AP022617.1_1623442_1625602_+	PRK05298, PRK05298, excinuclease ABC subunit UvrB	NA|466aa|up_3|NZ_AP022617.1_1625646_1627044_+	cd17502, MFS_Azr1_MDR_like, Saccharomyces cerevisiae Azole resistance protein 1 (Azr1p), and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	csa3|120aa|up_2|NZ_AP022617.1_1627214_1627574_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|154aa|up_1|NZ_AP022617.1_1627673_1628135_+	cd07254, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|276aa|up_0|NZ_AP022617.1_1628262_1629090_+	pfam14023, DUF4239, Protein of unknown function (DUF4239)	NA|452aa|down_0|NZ_AP022617.1_1632252_1633608_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|285aa|down_1|NZ_AP022617.1_1633723_1634578_+	COG1946, TesB, Acyl-CoA thioesterase [Lipid metabolism]	NA|230aa|down_2|NZ_AP022617.1_1634553_1635243_-	TIGR03914, SPO1_DNA_polymerase-related_protein, uracil-DNA glycosylase family domain	NA|149aa|down_3|NZ_AP022617.1_1635545_1635992_+	pfam00582, Usp, Universal stress protein family	NA|434aa|down_4|NZ_AP022617.1_1636073_1637375_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|167aa|down_5|NZ_AP022617.1_1637378_1637879_-	pfam04248, NTP_transf_9, Domain of unknown function (DUF427)	NA|226aa|down_6|NZ_AP022617.1_1637868_1638546_-	cd06262, metallo-hydrolase-like_MBL-fold, mainly hydrolytic enzymes and related proteins which carry out various biological functions; MBL-fold metallohydrolase domain	NA|965aa|down_7|NZ_AP022617.1_1638634_1641529_+	PRK00349, uvrA, excinuclease ABC subunit UvrA	NA|494aa|down_8|NZ_AP022617.1_1641518_1643000_-	COG2072, TrkA, Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism]	NA|390aa|down_9|NZ_AP022617.1_1643052_1644222_-	pfam06224, HTH_42, Winged helix DNA-binding domain
