assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_006494815.1_ASM649481v1	NZ_CP041091	Nocardioides sp. KUDC 5002 chromosome, complete genome	1	233717-233812	1	CRISPRCasFinder	no		csa3,cas4,WYL,DEDDh,DinG,RT	Orphan	CCCGCCGAGGGGTCACCCCCGCCCCGCCGAG	31	1	1	233748-233781	NZ_CP041091.1_1759826-1759859	NA	1	1	Orphan	csa3,cas4,WYL,DEDDh,DinG,RT	NA|140aa|up_8|NZ_CP041091.1_216606_217026_+,NA|263aa|up_6|NZ_CP041091.1_221653_222442_+,NA|101aa|up_2|NZ_CP041091.1_227247_227550_+,NA|76aa|down_9|NZ_CP041091.1_247892_248120_+	NA|329aa|up_9|NZ_CP041091.1_215261_216248_-	PRK05442, PRK05442, malate dehydrogenase; Provisional	NA|140aa|up_8|NZ_CP041091.1_216606_217026_+	NA	NA|734aa|up_7|NZ_CP041091.1_218311_220513_+	pfam03971, IDH, Monomeric isocitrate dehydrogenase	NA|263aa|up_6|NZ_CP041091.1_221653_222442_+	NA	NA|1225aa|up_5|NZ_CP041091.1_222393_226068_+	TIGR04226, Fimbrial_subunit_type_2, fimbrial isopeptide formation D2 domain	NA|164aa|up_4|NZ_CP041091.1_226034_226526_+	TIGR04226, Fimbrial_subunit_type_2, fimbrial isopeptide formation D2 domain	NA|262aa|up_3|NZ_CP041091.1_226474_227260_+	pfam01345, DUF11, Domain of unknown function DUF11	NA|101aa|up_2|NZ_CP041091.1_227247_227550_+	NA	NA|180aa|up_1|NZ_CP041091.1_227465_228005_+	pfam01345, DUF11, Domain of unknown function DUF11	NA|404aa|up_0|NZ_CP041091.1_228791_230003_+	pfam01345, DUF11, Domain of unknown function DUF11	NA|459aa|down_0|NZ_CP041091.1_233918_235295_+	pfam00067, p450, Cytochrome P450	NA|226aa|down_1|NZ_CP041091.1_235327_236005_-	TIGR03384, betaine_BetI, transcriptional repressor BetI	NA|406aa|down_2|NZ_CP041091.1_236172_237390_+	cd01152, ACAD_fadE6_17_26, Putative acyl-CoA dehydrogenases similar to fadE6, fadE17, and fadE26	NA|333aa|down_3|NZ_CP041091.1_237403_238402_+	cd00567, ACAD, Acyl-CoA dehydrogenase	NA|311aa|down_4|NZ_CP041091.1_239962_240895_-	COG1946, TesB, Acyl-CoA thioesterase [Lipid metabolism]	NA|413aa|down_5|NZ_CP041091.1_243854_245093_+	PRK10091, PRK10091, MFS transport protein AraJ; Provisional	NA|363aa|down_6|NZ_CP041091.1_245160_246249_+	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	NA|173aa|down_7|NZ_CP041091.1_246248_246767_+	pfam13563, 2_5_RNA_ligase2, 2'-5' RNA ligase superfamily	NA|356aa|down_8|NZ_CP041091.1_246763_247831_+	pfam03631, Virul_fac_BrkB, Virulence factor BrkB	NA|76aa|down_9|NZ_CP041091.1_247892_248120_+	NA
GCF_006494815.1_ASM649481v1	NZ_CP041091	Nocardioides sp. KUDC 5002 chromosome, complete genome	2	908132-908216	2	CRISPRCasFinder	no		csa3,cas4,WYL,DEDDh,DinG,RT	Orphan	CGGTCAGCCGGTGAAACCCGCAC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas4,WYL,DEDDh,DinG,RT	NA|124aa|up_8|NZ_CP041091.1_898921_899293_+,NA|91aa|up_6|NZ_CP041091.1_901965_902238_+,NA|158aa|down_5|NZ_CP041091.1_916831_917305_+	NA|462aa|up_9|NZ_CP041091.1_897417_898803_-	PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed	NA|124aa|up_8|NZ_CP041091.1_898921_899293_+	NA	NA|366aa|up_7|NZ_CP041091.1_900871_901969_+	PRK00389, gcvT, glycine cleavage system aminomethyltransferase GcvT	NA|91aa|up_6|NZ_CP041091.1_901965_902238_+	NA	NA|278aa|up_5|NZ_CP041091.1_902254_903088_+	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|270aa|up_4|NZ_CP041091.1_903130_903940_+	PRK01827, thyA, thymidylate synthase; Reviewed	NA|174aa|up_3|NZ_CP041091.1_903936_904458_+	pfam00186, DHFR_1, Dihydrofolate reductase	NA|313aa|up_2|NZ_CP041091.1_904484_905423_+	TIGR03560, F420_Rv1855c, probable F420-dependent oxidoreductase, Rv1855c family	NA|312aa|up_1|NZ_CP041091.1_905494_906430_+	PRK03170, PRK03170, dihydrodipicolinate synthase; Provisional	NA|562aa|up_0|NZ_CP041091.1_906426_908112_+	COG0595, COG0595, mRNA degradation ribonucleases J1/J2 (metallo-beta-lactamase superfamily) [Translation, ribosomal structure and biogenesis; Replication, recombination and repair]	NA|817aa|down_0|NZ_CP041091.1_908471_910922_+	COG1674, FtsK, DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning]	NA|583aa|down_1|NZ_CP041091.1_910918_912667_+	pfam13413, HTH_25, Helix-turn-helix domain	NA|477aa|down_2|NZ_CP041091.1_912726_914157_+	TIGR01125, Ribosomal_protein_S12_methylthiotransferase_RimO, ribosomal protein S12 methylthiotransferase RimO	NA|209aa|down_3|NZ_CP041091.1_914153_914780_+	TIGR00560, pgsA, CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase	NA|624aa|down_4|NZ_CP041091.1_914975_916847_+	COG0737, UshA, 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases [Nucleotide transport and metabolism]	NA|158aa|down_5|NZ_CP041091.1_916831_917305_+	NA	NA|1038aa|down_6|NZ_CP041091.1_917436_920550_+	COG2374, COG2374, Predicted extracellular nuclease [General function prediction only]	NA|103aa|down_7|NZ_CP041091.1_920771_921080_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|290aa|down_8|NZ_CP041091.1_921109_921979_-	PRK03072, PRK03072, heat shock protein HtpX; Provisional	NA|276aa|down_9|NZ_CP041091.1_922035_922863_-	COG0266, Nei, Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]
GCF_006494815.1_ASM649481v1	NZ_CP041091	Nocardioides sp. KUDC 5002 chromosome, complete genome	3	961116-961282	1	PILER-CR	no		csa3,cas4,WYL,DEDDh,DinG,RT	Orphan	GCCGAGGTCGCGGCCGAGCGCGGCGCCGAGCTCGCCGGGC	40	0	0	NA	NA	NA	2	2	Orphan	csa3,cas4,WYL,DEDDh,DinG,RT	NA|235aa|up_9|NZ_CP041091.1_950980_951685_-,NA|256aa|up_8|NZ_CP041091.1_951843_952611_+,NA|271aa|down_2|NZ_CP041091.1_963971_964784_+,NA|67aa|down_3|NZ_CP041091.1_964970_965171_+,NA|68aa|down_4|NZ_CP041091.1_965394_965598_+,NA|94aa|down_6|NZ_CP041091.1_967338_967620_+,NA|78aa|down_7|NZ_CP041091.1_967687_967921_+,NA|275aa|down_8|NZ_CP041091.1_967917_968742_+,NA|89aa|down_9|NZ_CP041091.1_968825_969092_-	NA|235aa|up_9|NZ_CP041091.1_950980_951685_-	NA	NA|256aa|up_8|NZ_CP041091.1_951843_952611_+	NA	NA|145aa|up_7|NZ_CP041091.1_952607_953042_+	pfam11236, DUF3037, Protein of unknown function (DUF3037)	NA|248aa|up_6|NZ_CP041091.1_953094_953838_-	COG2138, COG2138, Sirohydrochlorin ferrochelatase [Inorganic ion transport and metabolism]	NA|249aa|up_5|NZ_CP041091.1_953860_954607_-	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase	NA|574aa|up_4|NZ_CP041091.1_954790_956512_-	COG0155, CysI, Sulfite reductase, beta subunit (hemoprotein) [Inorganic ion transport and metabolism]	NA|493aa|up_3|NZ_CP041091.1_956883_958362_+	PRK13580, PRK13580, glycine hydroxymethyltransferase	NA|286aa|up_2|NZ_CP041091.1_958541_959399_+	pfam03631, Virul_fac_BrkB, Virulence factor BrkB	NA|335aa|up_1|NZ_CP041091.1_959520_960525_+	pfam02254, TrkA_N, TrkA-N domain	NA|119aa|up_0|NZ_CP041091.1_960551_960908_-	TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin	NA|370aa|down_0|NZ_CP041091.1_961789_962899_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|76aa|down_1|NZ_CP041091.1_963589_963817_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|271aa|down_2|NZ_CP041091.1_963971_964784_+	NA	NA|67aa|down_3|NZ_CP041091.1_964970_965171_+	NA	NA|68aa|down_4|NZ_CP041091.1_965394_965598_+	NA	NA|570aa|down_5|NZ_CP041091.1_965594_967304_+	TIGR02224, Tyrosine_recombinase_XerC, tyrosine recombinase XerC	NA|94aa|down_6|NZ_CP041091.1_967338_967620_+	NA	NA|78aa|down_7|NZ_CP041091.1_967687_967921_+	NA	NA|275aa|down_8|NZ_CP041091.1_967917_968742_+	NA	NA|89aa|down_9|NZ_CP041091.1_968825_969092_-	NA
GCF_006494815.1_ASM649481v1	NZ_CP041091	Nocardioides sp. KUDC 5002 chromosome, complete genome	4	3949813-3949920	3	CRISPRCasFinder	no		csa3,cas4,WYL,DEDDh,DinG,RT	Orphan	GTGCGGGTTTCACCGGCTGACCGG	24	1	1	3949837-3949896	NZ_CP041091.1_3374401-3374460	NA	1	1	Orphan	csa3,cas4,WYL,DEDDh,DinG,RT	NA|390aa|up_7|NZ_CP041091.1_3939695_3940865_-,NA|81aa|down_8|NZ_CP041091.1_3959150_3959393_-	NA|359aa|up_9|NZ_CP041091.1_3937265_3938342_+	cd05240, UDP_G4E_3_SDR_e, UDP-glucose 4 epimerase (G4E), subgroup 3, extended (e) SDRs	NA|343aa|up_8|NZ_CP041091.1_3938581_3939610_+	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|390aa|up_7|NZ_CP041091.1_3939695_3940865_-	NA	NA|266aa|up_6|NZ_CP041091.1_3940998_3941796_-	TIGR02952, RNA_polymerase_sigma-70_factor_ECF_subfamily, RNA polymerase sigma-70 factor, TIGR02952 family	NA|295aa|up_5|NZ_CP041091.1_3941935_3942820_-	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|529aa|up_4|NZ_CP041091.1_3942911_3944498_+	cd05936, FC-FACS_FadD_like, Prokaryotic long-chain fatty acid CoA synthetases similar to Escherichia coli FadD	NA|102aa|up_3|NZ_CP041091.1_3944505_3944811_+	pfam05768, DUF836, Glutaredoxin-like domain (DUF836)	NA|250aa|up_2|NZ_CP041091.1_3944945_3945695_+	PRK05472, PRK05472, redox-sensing transcriptional repressor Rex; Provisional	NA|441aa|up_1|NZ_CP041091.1_3945691_3947014_+	PRK00045, hemA, glutamyl-tRNA reductase; Reviewed	NA|350aa|up_0|NZ_CP041091.1_3947010_3948060_+	PRK00072, hemC, porphobilinogen deaminase; Reviewed	NA|329aa|down_0|NZ_CP041091.1_3949950_3950937_+	PRK09283, PRK09283, porphobilinogen synthase	NA|418aa|down_1|NZ_CP041091.1_3951060_3952314_-	COG2951, MltB, Membrane-bound lytic murein transglycosylase B [Cell envelope biogenesis, outer membrane]	NA|452aa|down_2|NZ_CP041091.1_3952645_3954001_+	PRK00062, PRK00062, glutamate-1-semialdehyde 2,1-aminomutase	NA|243aa|down_3|NZ_CP041091.1_3954000_3954729_+	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|195aa|down_4|NZ_CP041091.1_3954725_3955310_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|255aa|down_5|NZ_CP041091.1_3955306_3956071_+	pfam02683, DsbD, Cytochrome C biogenesis protein transmembrane region	NA|561aa|down_6|NZ_CP041091.1_3956138_3957821_+	pfam05140, ResB, ResB-like family	NA|364aa|down_7|NZ_CP041091.1_3957817_3958909_+	TIGR03144, cytochrome_c_biogenesis_protein_chloroplast, cytochrome c-type biogenesis protein CcsB	NA|81aa|down_8|NZ_CP041091.1_3959150_3959393_-	NA	NA|87aa|down_9|NZ_CP041091.1_3959455_3959716_+	pfam14012, DUF4229, Protein of unknown function (DUF4229)
GCF_006494815.1_ASM649481v1	NZ_CP041091	Nocardioides sp. KUDC 5002 chromosome, complete genome	5	4030851-4030959	4	CRISPRCasFinder	no		csa3,cas4,WYL,DEDDh,DinG,RT	Orphan	TCACCGGCTGACCGGTGAAACCCGC	25	0	0	NA	NA	NA	1	1	Orphan	csa3,cas4,WYL,DEDDh,DinG,RT	NA|242aa|up_0|NZ_CP041091.1_4029850_4030576_-,NA	NA|256aa|up_9|NZ_CP041091.1_4018943_4019711_-	COG5473, COG5473, Predicted integral membrane protein [Function unknown]	NA|333aa|up_8|NZ_CP041091.1_4020373_4021372_+	TIGR03459, crt_membr, carotene biosynthesis associated membrane protein	NA|356aa|up_7|NZ_CP041091.1_4021402_4022470_-	cd03814, GT4-like, glycosyltransferase family 4 proteins	NA|301aa|up_6|NZ_CP041091.1_4025141_4026044_+	COG1741, COG1741, Pirin-related protein [General function prediction only]	NA|193aa|up_5|NZ_CP041091.1_4026097_4026676_+	pfam03352, Adenine_glyco, Methyladenine glycosylase	NA|130aa|up_4|NZ_CP041091.1_4026672_4027062_+	pfam03788, LrgA, LrgA family	NA|232aa|up_3|NZ_CP041091.1_4027088_4027784_+	pfam04172, LrgB, LrgB-like family	NA|471aa|up_2|NZ_CP041091.1_4027842_4029255_+	PRK07869, PRK07869, amidase; Provisional	NA|57aa|up_1|NZ_CP041091.1_4029600_4029771_+	PRK00504, rpmG, 50S ribosomal protein L33; Validated	NA|242aa|up_0|NZ_CP041091.1_4029850_4030576_-	NA	NA|135aa|down_0|NZ_CP041091.1_4031026_4031431_+	pfam13452, MaoC_dehydrat_N, N-terminal half of MaoC dehydratase	NA|133aa|down_1|NZ_CP041091.1_4031427_4031826_+	cd03453, SAV4209_like, SAV4209_like	NA|350aa|down_2|NZ_CP041091.1_4031845_4032895_+	PRK13903, murB, UDP-N-acetylmuramate dehydrogenase	NA|188aa|down_3|NZ_CP041091.1_4032956_4033520_-	pfam13828, DUF4190, Domain of unknown function (DUF4190)	NA|88aa|down_4|NZ_CP041091.1_4034822_4035086_+	PRK07597, secE, preprotein translocase subunit SecE; Reviewed	NA|265aa|down_5|NZ_CP041091.1_4035132_4035927_+	PRK05609, nusG, transcription antitermination protein NusG; Validated	NA|143aa|down_6|NZ_CP041091.1_4036138_4036567_+	PRK00140, rplK, 50S ribosomal protein L11; Validated	NA|240aa|down_7|NZ_CP041091.1_4036632_4037352_+	PRK05424, rplA, 50S ribosomal protein L1; Validated	NA|538aa|down_8|NZ_CP041091.1_4037445_4039059_-	pfam12897, Aminotran_MocR, Alanine-glyoxylate amino-transferase	NA|202aa|down_9|NZ_CP041091.1_4039057_4039663_+	PRK00099, rplJ, 50S ribosomal protein L10; Reviewed
GCF_006494815.1_ASM649481v1	NZ_CP041091	Nocardioides sp. KUDC 5002 chromosome, complete genome	6	4264240-4264316	5	CRISPRCasFinder	no		csa3,cas4,WYL,DEDDh,DinG,RT	Orphan	CGCACCGGCATCCGACGCCCCGG	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas4,WYL,DEDDh,DinG,RT	NA,NA	NA|375aa|up_9|NZ_CP041091.1_4252410_4253535_+	cd13565, PBP2_PstS, Substrate binding domain of ABC-type phosphate transporter, a member of the type 2 periplasmic-binding fold superfamily	NA|311aa|up_8|NZ_CP041091.1_4253674_4254607_+	TIGR02138, phosphate_transport_system_permease_protein_PstC, phosphate ABC transporter, permease protein PstC	NA|372aa|up_7|NZ_CP041091.1_4254610_4255726_+	COG0581, PstA, ABC-type phosphate transport system, permease component [Inorganic ion transport and metabolism]	NA|260aa|up_6|NZ_CP041091.1_4255770_4256550_+	PRK14241, PRK14241, phosphate transporter ATP-binding protein; Provisional	NA|219aa|up_5|NZ_CP041091.1_4256670_4257327_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|335aa|up_4|NZ_CP041091.1_4257425_4258430_-	pfam01384, PHO4, Phosphate transporter family	NA|206aa|up_3|NZ_CP041091.1_4258434_4259052_-	COG1392, COG1392, Phosphate transport regulator (distant homolog of PhoU) [Inorganic ion transport and metabolism]	NA|715aa|up_2|NZ_CP041091.1_4259232_4261377_-	PRK05443, PRK05443, polyphosphate kinase; Provisional	NA|345aa|up_1|NZ_CP041091.1_4261450_4262485_+	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|173aa|up_0|NZ_CP041091.1_4262713_4263232_-	pfam11180, DUF2968, Protein of unknown function (DUF2968)	NA|278aa|down_0|NZ_CP041091.1_4264549_4265383_-	TIGR03448, mycothiol_MshD, mycothiol synthase	NA|245aa|down_1|NZ_CP041091.1_4265431_4266166_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|91aa|down_2|NZ_CP041091.1_4266287_4266560_+	cd17040, Ubl_MoaD_like, ubiquitin-like (Ubl) domain found in a group of small sulfide carrier proteins	NA|326aa|down_3|NZ_CP041091.1_4266643_4267621_+	cd05260, GDP_MD_SDR_e, GDP-mannose 4,6 dehydratase, extended (e) SDRs	NA|283aa|down_4|NZ_CP041091.1_4268029_4268878_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|99aa|down_5|NZ_CP041091.1_4268880_4269177_+	pfam07210, DUF1416, Protein of unknown function (DUF1416)	NA|151aa|down_6|NZ_CP041091.1_4269240_4269693_+	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|141aa|down_7|NZ_CP041091.1_4269730_4270153_+	pfam14340, DUF4395, Domain of unknown function (DUF4395)	NA|237aa|down_8|NZ_CP041091.1_4270210_4270921_+	cd05830, Sortase_E, Sortase domain found in the class E family of sortases	NA|143aa|down_9|NZ_CP041091.1_4271012_4271441_-	PRK05273, PRK05273, D-tyrosyl-tRNA(Tyr) deacylase; Provisional
