assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	1	23814-23926	1	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	CGTGCAATTGCCGCGACCTGGCCAGCTCTCG	31	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA,NA	NA|248aa|up_9|NZ_CP040695.2_15798_16542_-	cd07742, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|247aa|up_8|NZ_CP040695.2_16730_17471_+	cd06662, SURF1, SURF1 superfamily	NA|117aa|up_7|NZ_CP040695.2_17491_17842_+	pfam12823, DUF3817, Domain of unknown function (DUF3817)	NA|278aa|up_6|NZ_CP040695.2_17848_18682_+	COG1752, RssA, Predicted esterase of the alpha-beta hydrolase superfamily [General function prediction only]	NA|340aa|up_5|NZ_CP040695.2_18678_19698_+	cd07990, LPLAT_LCLAT1-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: LCLAT1-like	NA|241aa|up_4|NZ_CP040695.2_19716_20439_-	PRK05035, PRK05035, electron transport complex protein RnfC; Provisional	NA|174aa|up_3|NZ_CP040695.2_20646_21168_+	COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones]	NA|320aa|up_2|NZ_CP040695.2_21178_22138_+	pfam01694, Rhomboid, Rhomboid family	NA|187aa|up_1|NZ_CP040695.2_22406_22967_-	pfam06781, CrgA, Cell division protein CrgA	NA|269aa|up_0|NZ_CP040695.2_22965_23772_+	pfam05949, DUF881, Bacterial protein of unknown function (DUF881)	NA|607aa|down_0|NZ_CP040695.2_24004_25825_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|529aa|down_1|NZ_CP040695.2_25821_27408_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|487aa|down_2|NZ_CP040695.2_27430_28891_-	COG0768, FtsI, Cell division protein FtsI/penicillin-binding protein 2 [Cell envelope biogenesis, outer membrane]	NA|470aa|down_3|NZ_CP040695.2_28910_30320_-	COG0772, FtsW, Bacterial cell division membrane protein [Cell division and chromosome partitioning]	NA|458aa|down_4|NZ_CP040695.2_30322_31696_-	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|157aa|down_5|NZ_CP040695.2_31749_32220_-	COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms]	NA|249aa|down_6|NZ_CP040695.2_32212_32959_-	pfam12401, DUF3662, Protein of unknown function (DUF2662)	NA|356aa|down_7|NZ_CP040695.2_33559_34627_+	cd05327, retinol-DH_like_SDR_c_like, retinol dehydrogenase (retinol-DH), Light dependent Protochlorophyllide (Pchlide) OxidoReductase (LPOR) and related proteins, classical (c) SDRs	NA|186aa|down_8|NZ_CP040695.2_34667_35225_+	cd02138, TdsD-like, nitroreductase similar to Burkholderia pseudomallei TdsD	NA|164aa|down_9|NZ_CP040695.2_35237_35729_+	cd00340, GSH_Peroxidase, Glutathione (GSH) peroxidase family; tetrameric selenoenzymes that catalyze the reduction of a variety of hydroperoxides including lipid peroxidases, using GSH as a specific electron donor substrate
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	2	89859-89983	2	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	GAGCCAACCTCGGCAATTGCGCGC	24	0	0	NA	NA	NA	2	2	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA|85aa|up_8|NZ_CP040695.2_82241_82496_+,NA|454aa|up_6|NZ_CP040695.2_83790_85152_-,NA|292aa|up_5|NZ_CP040695.2_85291_86167_-,NA|79aa|up_4|NZ_CP040695.2_86274_86511_-,NA|137aa|down_7|NZ_CP040695.2_99690_100101_-	NA|292aa|up_9|NZ_CP040695.2_81307_82183_-	COG0266, Nei, Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]	NA|85aa|up_8|NZ_CP040695.2_82241_82496_+	NA	NA|398aa|up_7|NZ_CP040695.2_82570_83764_+	pfam13191, AAA_16, AAA ATPase domain	NA|454aa|up_6|NZ_CP040695.2_83790_85152_-	NA	NA|292aa|up_5|NZ_CP040695.2_85291_86167_-	NA	NA|79aa|up_4|NZ_CP040695.2_86274_86511_-	NA	NA|299aa|up_3|NZ_CP040695.2_86507_87404_-	PRK09636, PRK09636, RNA polymerase sigma factor SigJ; Provisional	NA|260aa|up_2|NZ_CP040695.2_87536_88316_+	cd08582, GDPD_like_2, Glycerophosphodiester phosphodiesterase domain of uncharacterized bacterial glycerophosphodiester phosphodiesterases	NA|312aa|up_1|NZ_CP040695.2_88381_89317_+	PRK05590, PRK05590, hypothetical protein; Provisional	NA|138aa|up_0|NZ_CP040695.2_89313_89727_-	cd00038, CAP_ED, effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor	NA|403aa|down_0|NZ_CP040695.2_90033_91242_-	PRK01388, PRK01388, arginine deiminase; Provisional	NA|131aa|down_1|NZ_CP040695.2_91343_91736_+	pfam14584, DUF4446, Protein of unknown function (DUF4446)	NA|225aa|down_2|NZ_CP040695.2_91781_92456_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|847aa|down_3|NZ_CP040695.2_92452_94993_-	COG2205, KdpD, Osmosensitive K+ channel histidine kinase [Signal transduction mechanisms]	NA|207aa|down_4|NZ_CP040695.2_95001_95622_-	pfam02669, KdpC, K+-transporting ATPase, c chain	NA|705aa|down_5|NZ_CP040695.2_95630_97745_-	PRK01122, PRK01122, potassium-transporting ATPase subunit KdpB	NA|555aa|down_6|NZ_CP040695.2_97741_99406_-	pfam03814, KdpA, Potassium-transporting ATPase A subunit	NA|137aa|down_7|NZ_CP040695.2_99690_100101_-	NA	NA|184aa|down_8|NZ_CP040695.2_100137_100689_+	TIGR02983, putative_RNA_polymerase_ECF-subfamily_sigma_factor, RNA polymerase sigma-70 factor, sigma-E family	NA|291aa|down_9|NZ_CP040695.2_100685_101558_+	PRK14963, PRK14963, DNA polymerase III subunits gamma and tau; Provisional
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	3	318064-318150	3	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	ACCGGAGGTTGAGCAAGCGAGCGC	24	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA|113aa|up_9|NZ_CP040695.2_311586_311925_+,NA|97aa|up_8|NZ_CP040695.2_311921_312212_+,NA|59aa|up_7|NZ_CP040695.2_312225_312402_+,NA|179aa|up_5|NZ_CP040695.2_312795_313332_+,NA|456aa|down_0|NZ_CP040695.2_318187_319555_-,NA|311aa|down_1|NZ_CP040695.2_319551_320484_-,NA|153aa|down_2|NZ_CP040695.2_320516_320975_-	NA|113aa|up_9|NZ_CP040695.2_311586_311925_+	NA	NA|97aa|up_8|NZ_CP040695.2_311921_312212_+	NA	NA|59aa|up_7|NZ_CP040695.2_312225_312402_+	NA	NA|158aa|up_6|NZ_CP040695.2_312325_312799_+	NF033218, anchor_AmaP, alkaline shock response membrane anchor protein AmaP	NA|179aa|up_5|NZ_CP040695.2_312795_313332_+	NA	NA|194aa|up_4|NZ_CP040695.2_313328_313910_+	NF033218, anchor_AmaP, alkaline shock response membrane anchor protein AmaP	NA|58aa|up_3|NZ_CP040695.2_313966_314140_+	pfam05532, CsbD, CsbD-like	NA|458aa|up_2|NZ_CP040695.2_314206_315580_-	TIGR00387, Glycolate_oxidase_subunit_glcD	NA|287aa|up_1|NZ_CP040695.2_315670_316531_+	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|475aa|up_0|NZ_CP040695.2_316527_317952_+	pfam05762, VWA_CoxE, VWA domain containing CoxE-like protein	NA|456aa|down_0|NZ_CP040695.2_318187_319555_-	NA	NA|311aa|down_1|NZ_CP040695.2_319551_320484_-	NA	NA|153aa|down_2|NZ_CP040695.2_320516_320975_-	NA	NA|393aa|down_3|NZ_CP040695.2_321080_322259_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|152aa|down_4|NZ_CP040695.2_322588_323044_+	cd04684, Nudix_Hydrolase_25, Contains a crystal structure of the Nudix hydrolase from Enterococcus faecalis, which has an unknown function	NA|498aa|down_5|NZ_CP040695.2_323141_324635_+	PRK05326, PRK05326, potassium/proton antiporter	NA|215aa|down_6|NZ_CP040695.2_324650_325295_-	pfam13399, LytR_C, LytR cell envelope-related transcriptional attenuator	NA|101aa|down_7|NZ_CP040695.2_325183_325486_-	COG5450, COG5450, Transcription regulator of the Arc/MetJ class [Transcription]	NA|559aa|down_8|NZ_CP040695.2_325531_327208_+	pfam05872, DUF853, Bacterial protein of unknown function (DUF853)	NA|413aa|down_9|NZ_CP040695.2_327204_328443_-	pfam02720, DUF222, Domain of unknown function (DUF222)
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	4	418384-418539	4	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	GGCCAGCAGGGCTACGGCCAGCAG	24	1	1	418453-418473	NZ_CP040695.2_2974878-2974898	NA	3	3	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA,NA|204aa|down_4|NZ_CP040695.2_425841_426453_+,NA|61aa|down_9|NZ_CP040695.2_430511_430694_+	NA|75aa|up_9|NZ_CP040695.2_408514_408739_+	pfam14029, DUF4244, Protein of unknown function (DUF4244)	NA|136aa|up_8|NZ_CP040695.2_408735_409143_+	pfam07811, TadE, TadE-like protein	NA|128aa|up_7|NZ_CP040695.2_409139_409523_+	TIGR03816, tadE_like_DECH, helicase/secretion neighborhood TadE-like protein	NA|104aa|up_6|NZ_CP040695.2_409510_409822_-	cd06582, TM_PBP1_LivH_like, Transmembrane subunit (TM) of Escherichia coli LivH and related proteins	NA|780aa|up_5|NZ_CP040695.2_409831_412171_-	TIGR03817, DECH_helic, helicase/secretion neighborhood putative DEAH-box helicase	NA|113aa|up_4|NZ_CP040695.2_412420_412759_+	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|770aa|up_3|NZ_CP040695.2_412971_415281_+	PRK00733, hppA, membrane-bound proton-translocating pyrophosphatase; Validated	NA|489aa|up_2|NZ_CP040695.2_415393_416860_+	pfam05175, MTS, Methyltransferase small domain	NA|119aa|up_1|NZ_CP040695.2_416896_417253_-	pfam07045, DUF1330, Domain of unknown function (DUF1330)	NA|252aa|up_0|NZ_CP040695.2_417363_418119_+	pfam17765, MLTR_LBD, MmyB-like transcription regulator ligand binding domain	NA|919aa|down_0|NZ_CP040695.2_419542_422299_+	PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated	NA|217aa|down_1|NZ_CP040695.2_422309_422960_+	PRK00698, tmk, thymidylate kinase; Validated	NA|378aa|down_2|NZ_CP040695.2_423066_424200_+	PRK07940, PRK07940, DNA polymerase III subunit delta'; Validated	NA|525aa|down_3|NZ_CP040695.2_424213_425788_+	pfam08386, Abhydrolase_4, TAP-like protein	NA|204aa|down_4|NZ_CP040695.2_425841_426453_+	NA	NA|329aa|down_5|NZ_CP040695.2_426460_427447_-	PRK13685, PRK13685, hypothetical protein; Provisional	NA|310aa|down_6|NZ_CP040695.2_427443_428373_-	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|349aa|down_7|NZ_CP040695.2_428390_429437_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|308aa|down_8|NZ_CP040695.2_429501_430425_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|61aa|down_9|NZ_CP040695.2_430511_430694_+	NA
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	5	739274-739387	5	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	GACCCGTCGCTGATCACTGACGGGTCAG	28	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA,NA|66aa|down_0|NZ_CP040695.2_739390_739588_-	NA|224aa|up_9|NZ_CP040695.2_725854_726526_+	PRK05472, PRK05472, redox-sensing transcriptional repressor Rex; Provisional	NA|460aa|up_8|NZ_CP040695.2_726536_727916_+	PRK00045, hemA, glutamyl-tRNA reductase; Reviewed	NA|335aa|up_7|NZ_CP040695.2_727930_728935_+	PRK00072, hemC, porphobilinogen deaminase; Reviewed	NA|563aa|up_6|NZ_CP040695.2_728931_730620_+	COG0007, CysG, Uroporphyrinogen-III methylase [Coenzyme metabolism]	NA|336aa|up_5|NZ_CP040695.2_730766_731774_+	PRK09283, PRK09283, porphobilinogen synthase	NA|427aa|up_4|NZ_CP040695.2_733367_734648_+	PRK00062, PRK00062, glutamate-1-semialdehyde 2,1-aminomutase	NA|226aa|up_3|NZ_CP040695.2_734670_735348_+	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|189aa|up_2|NZ_CP040695.2_735347_735914_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|256aa|up_1|NZ_CP040695.2_735906_736674_+	pfam02683, DsbD, Cytochrome C biogenesis protein transmembrane region	NA|541aa|up_0|NZ_CP040695.2_736673_738296_+	pfam05140, ResB, ResB-like family	NA|66aa|down_0|NZ_CP040695.2_739390_739588_-	NA	NA|88aa|down_1|NZ_CP040695.2_739655_739919_+	pfam14012, DUF4229, Protein of unknown function (DUF4229)	NA|291aa|down_2|NZ_CP040695.2_740106_740979_-	PRK06080, PRK06080, 1,4-dihydroxy-2-naphthoate octaprenyltransferase; Validated	NA|330aa|down_3|NZ_CP040695.2_741073_742063_+	PRK07824, PRK07824, o-succinylbenzoate--CoA ligase	NA|317aa|down_4|NZ_CP040695.2_742080_743031_+	PRK02901, PRK02901, O-succinylbenzoate synthase; Provisional	NA|533aa|down_5|NZ_CP040695.2_743027_744626_+	PRK07449, PRK07449, 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate synthase; Validated	NA|388aa|down_6|NZ_CP040695.2_744660_745824_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|631aa|down_7|NZ_CP040695.2_745885_747778_+	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|278aa|down_8|NZ_CP040695.2_748106_748940_-	cd05269, TMR_SDR_a, triphenylmethane reductase (TMR)-like proteins, NMRa-like, atypical (a) SDRs	NA|202aa|down_9|NZ_CP040695.2_749058_749664_+	COG4430, COG4430, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	6	906186-906308	6	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	CCCTGCCCGTGCAATTGCCGCGAATGGCTCGG	32	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA|437aa|up_9|NZ_CP040695.2_893206_894517_-,NA|124aa|up_8|NZ_CP040695.2_894546_894918_-,NA|117aa|up_6|NZ_CP040695.2_896497_896848_+,NA|76aa|up_4|NZ_CP040695.2_898682_898910_+,NA|311aa|up_2|NZ_CP040695.2_899410_900343_-,NA	NA|437aa|up_9|NZ_CP040695.2_893206_894517_-	NA	NA|124aa|up_8|NZ_CP040695.2_894546_894918_-	NA	NA|484aa|up_7|NZ_CP040695.2_895049_896501_+	TIGR02946, Putative_diacyglycerol_O-acyltransferase_Mb3115, acyltransferase, WS/DGAT/MGAT	NA|117aa|up_6|NZ_CP040695.2_896497_896848_+	NA	NA|546aa|up_5|NZ_CP040695.2_896998_898636_+	cd07123, ALDH_F4-17_P5CDH, Delta(1)-pyrroline-5-carboxylate dehydrogenase, ALDH families 4 and 17	NA|76aa|up_4|NZ_CP040695.2_898682_898910_+	NA	NA|157aa|up_3|NZ_CP040695.2_898894_899365_-	pfam10698, DUF2505, Protein of unknown function (DUF2505)	NA|311aa|up_2|NZ_CP040695.2_899410_900343_-	NA	NA|276aa|up_1|NZ_CP040695.2_900417_901245_+	TIGR03036, trp_2_3_diox, tryptophan 2,3-dioxygenase	NA|1617aa|up_0|NZ_CP040695.2_901311_906162_+	pfam05088, Bac_GDH, Bacterial NAD-glutamate dehydrogenase	NA|598aa|down_0|NZ_CP040695.2_906447_908241_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|622aa|down_1|NZ_CP040695.2_908237_910103_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|903aa|down_2|NZ_CP040695.2_910206_912915_+	cd09603, M1_APN_like, Peptidase M1 family similar to aminopeptidase N catalytic domain	NA|300aa|down_3|NZ_CP040695.2_913044_913944_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|358aa|down_4|NZ_CP040695.2_913943_915017_+	COG0842, COG0842, ABC-type multidrug transport system, permease component [Defense mechanisms]	NA|510aa|down_5|NZ_CP040695.2_915052_916582_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|409aa|down_6|NZ_CP040695.2_916742_917969_+	cd00085, HNHc, HNH nucleases; HNH endonuclease signature which is found in viral, prokaryotic, and eukaryotic proteins	NA|321aa|down_7|NZ_CP040695.2_918028_918991_-	cd00789, KU_like, Ku-core domain, Ku-like subfamily; composed of prokaryotic homologs of the eukaryotic DNA binding protein Ku	NA|312aa|down_8|NZ_CP040695.2_919084_920020_+	PRK09632, PRK09632, ATP-dependent DNA ligase; Reviewed	NA|311aa|down_9|NZ_CP040695.2_920016_920949_+	PRK09632, PRK09632, ATP-dependent DNA ligase; Reviewed
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	7	1041891-1041983	7	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	GTCGCGCTGACGCGGATCCCGGTCGTCA	28	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA|129aa|up_7|NZ_CP040695.2_1030836_1031223_+,NA|190aa|down_0|NZ_CP040695.2_1042018_1042588_+	NA|139aa|up_9|NZ_CP040695.2_1029339_1029756_-	cd14771, TrHb2_Mt-trHbO-like_O, Truncated hemoglobins, group 2 (O); Mycobacterium tuberculosis hemoglobin O like	NA|339aa|up_8|NZ_CP040695.2_1029755_1030772_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|129aa|up_7|NZ_CP040695.2_1030836_1031223_+	NA	NA|127aa|up_6|NZ_CP040695.2_1031209_1031590_+	pfam17174, DUF5130, Domain of unknown function (DUF5130)	NA|856aa|up_5|NZ_CP040695.2_1031823_1034391_-	TIGR02412, Aminopeptidase_N, aminopeptidase N, Streptomyces lividans type	NA|575aa|up_4|NZ_CP040695.2_1034523_1036248_+	pfam05257, CHAP, CHAP domain	NA|209aa|up_3|NZ_CP040695.2_1036276_1036903_+	cd03022, DsbA_HCCA_Iso, DsbA family, 2-hydroxychromene-2-carboxylate (HCCA) isomerase subfamily; HCCA isomerase is a glutathione (GSH) dependent enzyme involved in the naphthalene catabolic pathway	NA|390aa|up_2|NZ_CP040695.2_1036987_1038157_+	smart00701, PGRP, Animal peptidoglycan recognition proteins homologous to Bacteriophage T3 lysozyme	NA|313aa|up_1|NZ_CP040695.2_1038176_1039115_+	COG3217, COG3217, Uncharacterized Fe-S protein [General function prediction only]	NA|607aa|up_0|NZ_CP040695.2_1039124_1040945_+	PRK12268, PRK12268, methionyl-tRNA synthetase; Reviewed	NA|190aa|down_0|NZ_CP040695.2_1042018_1042588_+	NA	NA|248aa|down_1|NZ_CP040695.2_1042729_1043473_+	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|168aa|down_2|NZ_CP040695.2_1043511_1044015_+	PRK05571, PRK05571, ribose-5-phosphate isomerase B; Provisional	NA|295aa|down_3|NZ_CP040695.2_1044007_1044892_+	COG0266, Nei, Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]	NA|319aa|down_4|NZ_CP040695.2_1045092_1046049_+	pfam07228, SpoIIE, Stage II sporulation protein E (SpoIIE)	NA|479aa|down_5|NZ_CP040695.2_1046358_1047795_+	PRK01490, tig, trigger factor; Provisional	NA|575aa|down_6|NZ_CP040695.2_1047884_1049609_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|702aa|down_7|NZ_CP040695.2_1049676_1051782_-	pfam01841, Transglut_core, Transglutaminase-like superfamily	NA|398aa|down_8|NZ_CP040695.2_1051778_1052972_-	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|330aa|down_9|NZ_CP040695.2_1052995_1053985_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	8	1171494-1171626	8	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	CGGTGGTTGAGTCGGCGTCCTCGGCCCGGGGGTTTCGTCGGCGCTCGC	48	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA,NA|144aa|down_3|NZ_CP040695.2_1175847_1176279_-,NA|214aa|down_8|NZ_CP040695.2_1179516_1180158_+	NA|180aa|up_9|NZ_CP040695.2_1161139_1161679_+	pfam01327, Pep_deformylase, Polypeptide deformylase	NA|211aa|up_8|NZ_CP040695.2_1161826_1162459_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|286aa|up_7|NZ_CP040695.2_1162613_1163471_+	pfam12811, BaxI_1, Bax inhibitor 1 like	NA|343aa|up_6|NZ_CP040695.2_1163572_1164601_-	cd01836, FeeA_FeeB_like, SGNH_hydrolase subfamily, FeeA, FeeB and similar esterases/lipases	NA|469aa|up_5|NZ_CP040695.2_1164645_1166052_+	TIGR01137, Cystathionine_beta-synthase, cystathionine beta-synthase	NA|214aa|up_4|NZ_CP040695.2_1166165_1166807_+	COG1174, OpuBB, ABC-type proline/glycine betaine transport systems, permease component [Amino acid transport and metabolism]	NA|396aa|up_3|NZ_CP040695.2_1166803_1167991_+	COG1125, OpuBA, ABC-type proline/glycine betaine transport systems, ATPase components [Amino acid transport and metabolism]	NA|236aa|up_2|NZ_CP040695.2_1168094_1168802_+	COG1174, OpuBB, ABC-type proline/glycine betaine transport systems, permease component [Amino acid transport and metabolism]	NA|331aa|up_1|NZ_CP040695.2_1168801_1169794_+	cd13611, PBP2_YehZ, Substrate-binding domain YehZ of an osmoregulated ABC-type transporter; the type 2 periplasmic-binding protein fold	NA|264aa|up_0|NZ_CP040695.2_1169845_1170637_+	TIGR02569, conserved_hypothetical_protein, TIGR02569 family protein	NA|129aa|down_0|NZ_CP040695.2_1171821_1172208_-	COG3012, COG3012, Uncharacterized protein conserved in bacteria [Function unknown]	NA|463aa|down_1|NZ_CP040695.2_1172308_1173697_+	COG1239, ChlI, Mg-chelatase subunit ChlI [Coenzyme metabolism]	NA|685aa|down_2|NZ_CP040695.2_1173689_1175744_+	COG4867, COG4867, Uncharacterized protein with a von Willebrand factor type A (vWA) domain [General function prediction only]	NA|144aa|down_3|NZ_CP040695.2_1175847_1176279_-	NA	NA|294aa|down_4|NZ_CP040695.2_1176339_1177221_-	cd01144, BtuF, Cobalamin binding protein BtuF	NA|260aa|down_5|NZ_CP040695.2_1177256_1178036_+	PRK10621, PRK10621, hypothetical protein; Provisional	NA|213aa|down_6|NZ_CP040695.2_1178032_1178671_+	COG1739, COG1739, Uncharacterized conserved protein [Function unknown]	NA|267aa|down_7|NZ_CP040695.2_1178675_1179476_+	pfam18029, Glyoxalase_6, Glyoxalase-like domain	NA|214aa|down_8|NZ_CP040695.2_1179516_1180158_+	NA	NA|160aa|down_9|NZ_CP040695.2_1180297_1180777_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	9	1587927-1588040	9	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	TGACCCGTCGCTGATCACTGACGGGTCG	28	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA,NA|83aa|down_2|NZ_CP040695.2_1590626_1590875_+,NA|76aa|down_4|NZ_CP040695.2_1591411_1591639_+,NA|254aa|down_5|NZ_CP040695.2_1591949_1592711_-	NA|1099aa|up_9|NZ_CP040695.2_1570013_1573310_+	cd06244, M14-like, Peptidase M14-like domain; uncharacterized subgroup	NA|223aa|up_8|NZ_CP040695.2_1573364_1574033_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|314aa|up_7|NZ_CP040695.2_1574029_1574971_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|187aa|up_6|NZ_CP040695.2_1575161_1575722_+	pfam01145, Band_7, SPFH domain / Band 7 family	NA|503aa|up_5|NZ_CP040695.2_1575708_1577217_+	PRK05326, PRK05326, potassium/proton antiporter	NA|1087aa|up_4|NZ_CP040695.2_1577281_1580542_+	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|1068aa|up_3|NZ_CP040695.2_1580610_1583814_+	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|399aa|up_2|NZ_CP040695.2_1583863_1585060_+	pfam13191, AAA_16, AAA ATPase domain	NA|357aa|up_1|NZ_CP040695.2_1585072_1586143_-	COG2072, TrkA, Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism]	NA|246aa|up_0|NZ_CP040695.2_1586171_1586909_-	PRK05819, deoD, DeoD-type purine-nucleoside phosphorylase	NA|90aa|down_0|NZ_CP040695.2_1588084_1588354_-	TIGR02200, conserved_hypothetical_protein, Glutaredoxin-like protein	NA|685aa|down_1|NZ_CP040695.2_1588434_1590489_+	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|83aa|down_2|NZ_CP040695.2_1590626_1590875_+	NA	NA|112aa|down_3|NZ_CP040695.2_1591079_1591415_+	pfam02467, Whib, Transcription factor WhiB	NA|76aa|down_4|NZ_CP040695.2_1591411_1591639_+	NA	NA|254aa|down_5|NZ_CP040695.2_1591949_1592711_-	NA	NA|265aa|down_6|NZ_CP040695.2_1593156_1593951_+	PRK09245, PRK09245, crotonase/enoyl-CoA hydratase family protein	NA|359aa|down_7|NZ_CP040695.2_1594286_1595363_+	PRK07945, PRK07945, PHP domain-containing protein	NA|178aa|down_8|NZ_CP040695.2_1595373_1595907_+	cd07344, M48_yhfN_like, Peptidase M48 YhfN-like, a novel minigluzincin	NA|200aa|down_9|NZ_CP040695.2_1595837_1596437_-	cd03674, Nudix_Hydrolase_1, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	10	1814034-1814122	10	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	GCTCGCTTGCTCAACCTCCGGTGGTGGTCGGC	32	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA,NA|338aa|down_0|NZ_CP040695.2_1815392_1816406_-,NA|220aa|down_1|NZ_CP040695.2_1816402_1817062_-,NA|419aa|down_4|NZ_CP040695.2_1819354_1820611_-,NA|346aa|down_5|NZ_CP040695.2_1820618_1821656_-	NA|386aa|up_9|NZ_CP040695.2_1801469_1802627_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|250aa|up_8|NZ_CP040695.2_1802674_1803424_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|314aa|up_7|NZ_CP040695.2_1803513_1804455_+	pfam11271, DUF3068, Protein of unknown function (DUF3068)	NA|322aa|up_6|NZ_CP040695.2_1804486_1805452_+	pfam07726, AAA_3, ATPase family associated with various cellular activities (AAA)	NA|406aa|up_5|NZ_CP040695.2_1805444_1806662_+	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|721aa|up_4|NZ_CP040695.2_1806676_1808839_+	pfam01841, Transglut_core, Transglutaminase-like superfamily	NA|287aa|up_3|NZ_CP040695.2_1808906_1809767_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|299aa|up_2|NZ_CP040695.2_1809827_1810724_+	cd05154, ACAD10_11_N-like, N-terminal domain of Acyl-CoA dehydrogenase (ACAD) 10 and 11, and similar proteins	NA|490aa|up_1|NZ_CP040695.2_1810837_1812307_+	PRK07899, rpsA, 30S ribosomal protein S1; Reviewed	NA|315aa|up_0|NZ_CP040695.2_1812692_1813637_+	PRK09393, ftrA, transcriptional activator FtrA; Provisional	NA|338aa|down_0|NZ_CP040695.2_1815392_1816406_-	NA	NA|220aa|down_1|NZ_CP040695.2_1816402_1817062_-	NA	NA|422aa|down_2|NZ_CP040695.2_1817058_1818324_-	PRK02186, PRK02186, argininosuccinate lyase; Provisional	NA|348aa|down_3|NZ_CP040695.2_1818320_1819364_-	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|419aa|down_4|NZ_CP040695.2_1819354_1820611_-	NA	NA|346aa|down_5|NZ_CP040695.2_1820618_1821656_-	NA	NA|233aa|down_6|NZ_CP040695.2_1821835_1822534_-	pfam13305, WHG, WHG domain	NA|321aa|down_7|NZ_CP040695.2_1822679_1823642_+	cd05229, SDR_a3, atypical (a) SDRs, subgroup 3	NA|195aa|down_8|NZ_CP040695.2_1823716_1824301_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|314aa|down_9|NZ_CP040695.2_1824300_1825242_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	11	2059212-2059301	11	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	GAGTTTCACCGCTTGAGCGGTGAAACTC	28	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA|306aa|up_2|NZ_CP040695.2_2055624_2056542_+,NA|79aa|down_8|NZ_CP040695.2_2068059_2068296_-	NA|253aa|up_9|NZ_CP040695.2_2049918_2050677_+	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]	NA|422aa|up_8|NZ_CP040695.2_2050718_2051984_+	TIGR01979, Probable_cysteine_desulfurase, cysteine desulfurases, SufSfamily	NA|153aa|up_7|NZ_CP040695.2_2051987_2052446_+	TIGR01994, Iron-sulfur_cluster_assembly_scaffold_protein_IscU, SUF system FeS assembly protein, NifU family	NA|129aa|up_6|NZ_CP040695.2_2052438_2052825_+	COG2151, PaaD, Predicted metal-sulfur cluster biosynthetic enzyme [General function prediction only]	NA|365aa|up_5|NZ_CP040695.2_2052843_2053938_-	PRK15313, PRK15313, intestinal colonization autotransporter adhesin MisL	NA|168aa|up_4|NZ_CP040695.2_2053934_2054438_-	TIGR02983, putative_RNA_polymerase_ECF-subfamily_sigma_factor, RNA polymerase sigma-70 factor, sigma-E family	NA|364aa|up_3|NZ_CP040695.2_2054522_2055614_+	COG0622, COG0622, Predicted phosphoesterase [General function prediction only]	NA|306aa|up_2|NZ_CP040695.2_2055624_2056542_+	NA	NA|310aa|up_1|NZ_CP040695.2_2056587_2057517_-	COG2321, COG2321, Predicted metalloprotease [General function prediction only]	NA|533aa|up_0|NZ_CP040695.2_2057601_2059200_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|351aa|down_0|NZ_CP040695.2_2059309_2060362_-	COG0809, QueA, S-adenosylmethionine:tRNA-ribosyltransferase-isomerase (queuine synthetase) [Translation, ribosomal structure and biogenesis]	NA|228aa|down_1|NZ_CP040695.2_2060364_2061048_-	cd05233, SDR_c, classical (c) SDRs	NA|273aa|down_2|NZ_CP040695.2_2061165_2061984_+	cd06558, crotonase-like, Crotonase/Enoyl-Coenzyme A (CoA) hydratase superfamily	NA|639aa|down_3|NZ_CP040695.2_2062030_2063947_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|499aa|down_4|NZ_CP040695.2_2063939_2065436_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|253aa|down_5|NZ_CP040695.2_2065453_2066212_+	cd05344, BKR_like_SDR_like, putative beta-ketoacyl acyl carrier protein [ACP] reductase (BKR)-like, SDR	NA|309aa|down_6|NZ_CP040695.2_2066097_2067024_-	cd06662, SURF1, SURF1 superfamily	NA|339aa|down_7|NZ_CP040695.2_2067060_2068077_+	PRK00164, moaA, GTP 3',8-cyclase MoaA	NA|79aa|down_8|NZ_CP040695.2_2068059_2068296_-	NA	NA|109aa|down_9|NZ_CP040695.2_2068296_2068623_-	pfam11298, DUF3099, Protein of unknown function (DUF3099)
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	12	2725862-2725966	12	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	CCCAGCCACCGGACGTCGAGCAGCG	25	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA|82aa|up_9|NZ_CP040695.2_2716502_2716748_-,NA|86aa|up_5|NZ_CP040695.2_2719033_2719291_-,NA|111aa|up_2|NZ_CP040695.2_2721965_2722298_-,NA|285aa|down_1|NZ_CP040695.2_2726678_2727533_-,NA|110aa|down_3|NZ_CP040695.2_2728508_2728838_-,NA|102aa|down_6|NZ_CP040695.2_2730354_2730660_-,NA|331aa|down_9|NZ_CP040695.2_2732952_2733945_+	NA|82aa|up_9|NZ_CP040695.2_2716502_2716748_-	NA	NA|160aa|up_8|NZ_CP040695.2_2716790_2717270_-	pfam00186, DHFR_1, Dihydrofolate reductase	NA|269aa|up_7|NZ_CP040695.2_2717269_2718076_-	PRK01827, thyA, thymidylate synthase; Reviewed	NA|277aa|up_6|NZ_CP040695.2_2718104_2718935_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|86aa|up_5|NZ_CP040695.2_2719033_2719291_-	NA	NA|367aa|up_4|NZ_CP040695.2_2719287_2720388_-	PRK00389, gcvT, glycine cleavage system aminomethyltransferase GcvT	NA|506aa|up_3|NZ_CP040695.2_2720451_2721969_+	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|111aa|up_2|NZ_CP040695.2_2721965_2722298_-	NA	NA|469aa|up_1|NZ_CP040695.2_2722454_2723861_+	PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed	NA|632aa|up_0|NZ_CP040695.2_2723887_2725783_+	TIGR02927, putative_dihydrolipoamide_acyltransferase, 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase	NA|153aa|down_0|NZ_CP040695.2_2726149_2726608_-	pfam05099, TerB, Tellurite resistance protein TerB	NA|285aa|down_1|NZ_CP040695.2_2726678_2727533_-	NA	NA|303aa|down_2|NZ_CP040695.2_2727619_2728528_+	cd05242, SDR_a8, atypical (a) SDRs, subgroup 8	NA|110aa|down_3|NZ_CP040695.2_2728508_2728838_-	NA	NA|240aa|down_4|NZ_CP040695.2_2729339_2730059_+	PRK14345, PRK14345, lipoyl(octanoyl) transferase LipB	NA|94aa|down_5|NZ_CP040695.2_2730076_2730358_-	pfam11829, DUF3349, Protein of unknown function (DUF3349)	NA|102aa|down_6|NZ_CP040695.2_2730354_2730660_-	NA	NA|401aa|down_7|NZ_CP040695.2_2730674_2731877_-	pfam01384, PHO4, Phosphate transporter family	NA|333aa|down_8|NZ_CP040695.2_2731957_2732956_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|331aa|down_9|NZ_CP040695.2_2732952_2733945_+	NA
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	13	3310558-3310633	13	CRISPRCasFinder	no	DinG	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Type IV-A	CGGCGAGCGCCGACGAAACCCCGG	24	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA|255aa|up_9|NZ_CP040695.2_3301786_3302551_-,NA|60aa|up_6|NZ_CP040695.2_3303917_3304097_-,NA|223aa|up_3|NZ_CP040695.2_3306250_3306919_+,NA|434aa|up_2|NZ_CP040695.2_3306937_3308239_-,NA|124aa|down_9|NZ_CP040695.2_3321043_3321415_+	NA|255aa|up_9|NZ_CP040695.2_3301786_3302551_-	NA	NA|222aa|up_8|NZ_CP040695.2_3302582_3303248_-	pfam10263, SprT-like, SprT-like family	NA|215aa|up_7|NZ_CP040695.2_3303258_3303903_-	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|60aa|up_6|NZ_CP040695.2_3303917_3304097_-	NA	NA|208aa|up_5|NZ_CP040695.2_3304115_3304739_+	PRK10030, PRK10030, YiiX family permuted papain-like enzyme	NA|108aa|up_4|NZ_CP040695.2_3304777_3305101_-	TIGR02890, conserved_hypothetical_protein, regulatory protein, yteA family	NA|223aa|up_3|NZ_CP040695.2_3306250_3306919_+	NA	NA|434aa|up_2|NZ_CP040695.2_3306937_3308239_-	NA	NA|187aa|up_1|NZ_CP040695.2_3308436_3308997_+	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|515aa|up_0|NZ_CP040695.2_3308975_3310520_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|199aa|down_0|NZ_CP040695.2_3310660_3311257_-	cd14498, DSP, dual-specificity phosphatase domain	NA|128aa|down_1|NZ_CP040695.2_3311435_3311819_+	cd00221, Vsr, Very Short Patch Repair (Vsr) Endonuclease	NA|340aa|down_2|NZ_CP040695.2_3311815_3312835_-	pfam03747, ADP_ribosyl_GH, ADP-ribosylglycohydrolase	NA|249aa|down_3|NZ_CP040695.2_3312913_3313660_+	COG4111, COG4111, Uncharacterized conserved protein [General function prediction only]	NA|502aa|down_4|NZ_CP040695.2_3313656_3315162_+	cd01406, SIR2-like, Sir2-like: Prokaryotic group of uncharacterized Sir2-like proteins which lack certain key catalytic residues and conserved zinc binding cysteines; and are members of the SIR2 superfamily of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation	NA|979aa|down_5|NZ_CP040695.2_3315259_3318196_-	PRK06556, PRK06556, vitamin B12-dependent ribonucleotide reductase; Validated	NA|161aa|down_6|NZ_CP040695.2_3318366_3318849_-	PRK00464, nrdR, transcriptional repressor NrdR	NA|165aa|down_7|NZ_CP040695.2_3319090_3319585_-	cd10028, UDG-F2_TDG_MUG, Uracil DNA glycosylase family 2, includes thymine DNA glycosylase, mismatch-specific uracil DNA glycosylase and similar proteins	NA|431aa|down_8|NZ_CP040695.2_3319671_3320964_+	COG1972, NupC, Nucleoside permease [Nucleotide transport and metabolism]	NA|124aa|down_9|NZ_CP040695.2_3321043_3321415_+	NA
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	14	4102880-4103286	14	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	GACCGGCTCCGGCGACCTCCACATCG	26	0	0	NA	NA	NA	7	7	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA|94aa|up_2|NZ_CP040695.2_4099729_4100011_-,NA|60aa|down_0|NZ_CP040695.2_4103517_4103697_+,NA|262aa|down_5|NZ_CP040695.2_4107269_4108055_+	NA|266aa|up_9|NZ_CP040695.2_4088507_4089305_-	TIGR01581, Molybdenum_transport_system_permease_protein_modB	NA|251aa|up_8|NZ_CP040695.2_4089331_4090084_-	cd13538, PBP2_ModA_like_1, Substrate binding domain of putative molybdate-binding protein;the type 2 periplasmic binding protein fold	NA|132aa|up_7|NZ_CP040695.2_4090121_4090517_-	TIGR00638, Probable_molybdenum-pterin-binding_protein, molybdenum-pterin binding domain	NA|250aa|up_6|NZ_CP040695.2_4090615_4091365_+	pfam03649, UPF0014, Uncharacterized protein family (UPF0014)	NA|564aa|up_5|NZ_CP040695.2_4091572_4093264_-	PRK05414, PRK05414, urocanate hydratase; Provisional	NA|523aa|up_4|NZ_CP040695.2_4093260_4094829_-	PRK09367, PRK09367, histidine ammonia-lyase; Provisional	NA|214aa|up_3|NZ_CP040695.2_4099069_4099711_+	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|94aa|up_2|NZ_CP040695.2_4099729_4100011_-	NA	NA|577aa|up_1|NZ_CP040695.2_4100024_4101755_-	cd11478, SLC5sbd_u2, Uncharacterized bacterial solute carrier 5 subfamily; putative solute-binding domain	NA|192aa|up_0|NZ_CP040695.2_4101934_4102510_+	COG4226, HicB, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|60aa|down_0|NZ_CP040695.2_4103517_4103697_+	NA	NA|314aa|down_1|NZ_CP040695.2_4103709_4104651_-	cd06193, siderophore_interacting, Siderophore interacting proteins share the domain structure of the ferredoxin reductase like family	NA|192aa|down_2|NZ_CP040695.2_4104817_4105393_-	PRK00416, dcd, deoxycytidine triphosphate deaminase; Reviewed	NA|260aa|down_3|NZ_CP040695.2_4105443_4106223_+	cd01832, SGNH_hydrolase_like_1, Members of the SGNH-hydrolase superfamily, a diverse family of lipases and esterases	NA|219aa|down_4|NZ_CP040695.2_4106519_4107176_+	pfam09365, DUF2461, Conserved hypothetical protein (DUF2461)	NA|262aa|down_5|NZ_CP040695.2_4107269_4108055_+	NA	NA|153aa|down_6|NZ_CP040695.2_4108130_4108589_-	COG1959, COG1959, Predicted transcriptional regulator [Transcription]	NA|625aa|down_7|NZ_CP040695.2_4108832_4110707_+	cd01153, ACAD_fadE5, Putative acyl-CoA dehydrogenases similar to fadE5	NA|293aa|down_8|NZ_CP040695.2_4110814_4111693_-	PRK03635, PRK03635, ArgP/LysG family DNA-binding transcriptional regulator	NA|200aa|down_9|NZ_CP040695.2_4111763_4112363_+	COG1279, COG1279, Lysine efflux permease [General function prediction only]
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	15	4173177-4173284	15	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	CGGCCCGCCGGGATATTCACCGCTG	25	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA|174aa|up_5|NZ_CP040695.2_4163817_4164339_-,NA	NA|319aa|up_9|NZ_CP040695.2_4159653_4160610_-	COG0435, ECM4, Predicted glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|344aa|up_8|NZ_CP040695.2_4160663_4161695_-	pfam12870, DUF4878, Domain of unknown function (DUF4878)	NA|335aa|up_7|NZ_CP040695.2_4161789_4162794_-	pfam15887, Peptidase_Mx, Putative zinc-binding metallo-peptidase	NA|207aa|up_6|NZ_CP040695.2_4162790_4163411_-	pfam12870, DUF4878, Domain of unknown function (DUF4878)	NA|174aa|up_5|NZ_CP040695.2_4163817_4164339_-	NA	NA|294aa|up_4|NZ_CP040695.2_4164374_4165256_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|840aa|up_3|NZ_CP040695.2_4165252_4167772_-	pfam14403, CP_ATPgrasp_2, Circularly permuted ATP-grasp type 2	NA|1099aa|up_2|NZ_CP040695.2_4167799_4171096_-	pfam09899, DUF2126, Putative amidoligase enzyme (DUF2126)	NA|194aa|up_1|NZ_CP040695.2_4171208_4171790_+	pfam12840, HTH_20, Helix-turn-helix domain	NA|423aa|up_0|NZ_CP040695.2_4171786_4173055_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|356aa|down_0|NZ_CP040695.2_4173341_4174409_-	PRK03321, PRK03321, putative aminotransferase; Provisional	NA|190aa|down_1|NZ_CP040695.2_4174419_4174989_-	COG0262, FolA, Dihydrofolate reductase [Coenzyme metabolism]	NA|188aa|down_2|NZ_CP040695.2_4174981_4175545_-	pfam04892, VanZ, VanZ like family	NA|234aa|down_3|NZ_CP040695.2_4175555_4176257_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|467aa|down_4|NZ_CP040695.2_4176310_4177711_+	COG3670, COG3670, Lignostilbene-alpha,beta-dioxygenase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|301aa|down_5|NZ_CP040695.2_4177714_4178617_-	COG1834, COG1834, N-Dimethylarginine dimethylaminohydrolase [Amino acid transport and metabolism]	NA|233aa|down_6|NZ_CP040695.2_4178728_4179427_+	COG1802, GntR, Transcriptional regulators [Transcription]	NA|575aa|down_7|NZ_CP040695.2_4179521_4181246_+	PRK09124, PRK09124, ubiquinone-dependent pyruvate dehydrogenase	NA|134aa|down_8|NZ_CP040695.2_4181279_4181681_+	pfam04020, Phage_holin_4_2, Mycobacterial 4 TMS phage holin, superfamily IV	NA|172aa|down_9|NZ_CP040695.2_4181677_4182193_+	cd16343, LMWPTP, Low molecular weight protein tyrosine phosphatase
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	16	4226129-4226214	16	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	GGATTTCCCCGGTCGACCGGGGAA	24	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA,NA|186aa|down_3|NZ_CP040695.2_4229357_4229915_+	NA|649aa|up_9|NZ_CP040695.2_4215502_4217449_-	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|531aa|up_8|NZ_CP040695.2_4217454_4219047_-	COG4799, COG4799, Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism]	NA|154aa|up_7|NZ_CP040695.2_4219043_4219505_-	cd07812, SRPBCC, START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC (SRPBCC) ligand-binding domain superfamily	NA|564aa|up_6|NZ_CP040695.2_4219581_4221273_-	pfam07287, AtuA, Acyclic terpene utilisation family protein AtuA	NA|267aa|up_5|NZ_CP040695.2_4221340_4222141_-	TIGR03084, conserved_hypothetical_protein, TIGR03084 family protein	NA|188aa|up_4|NZ_CP040695.2_4222137_4222701_-	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|287aa|up_3|NZ_CP040695.2_4222797_4223658_-	TIGR01444, 2-O-methyltransferase_NoeI, methyltransferase, FkbM family	NA|143aa|up_2|NZ_CP040695.2_4223667_4224096_-	COG4270, COG4270, Predicted membrane protein [Function unknown]	NA|360aa|up_1|NZ_CP040695.2_4224092_4225172_-	pfam04892, VanZ, VanZ like family	NA|304aa|up_0|NZ_CP040695.2_4225182_4226094_-	cd09083, EEP-1, Exonuclease-Endonuclease-Phosphatase domain; uncharacterized family 1	NA|232aa|down_0|NZ_CP040695.2_4226248_4226944_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|364aa|down_1|NZ_CP040695.2_4226943_4228035_-	cd05154, ACAD10_11_N-like, N-terminal domain of Acyl-CoA dehydrogenase (ACAD) 10 and 11, and similar proteins	NA|422aa|down_2|NZ_CP040695.2_4228031_4229297_-	cd01155, ACAD_FadE2, Acyl-CoA dehydrogenases similar to fadE2	NA|186aa|down_3|NZ_CP040695.2_4229357_4229915_+	NA	NA|503aa|down_4|NZ_CP040695.2_4230666_4232175_-	cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2	NA|454aa|down_5|NZ_CP040695.2_4232188_4233550_-	PRK06062, PRK06062, hypothetical protein; Provisional	NA|385aa|down_6|NZ_CP040695.2_4233680_4234835_-	pfam09084, NMT1, NMT1/THI5 like	NA|263aa|down_7|NZ_CP040695.2_4234900_4235689_-	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]	NA|308aa|down_8|NZ_CP040695.2_4235685_4236609_-	cd03293, ABC_NrtD_SsuB_transporters, ATP-binding cassette domain of the nitrate and sulfonate transporters	NA|313aa|down_9|NZ_CP040695.2_4236605_4237544_-	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	17	4350462-4350608	17	CRISPRCasFinder	no		WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Orphan	GGGGTTTCGTCGGCGCTCGCTGGCGCTCGCTTGCTCAACCTCCGGGG	47	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA|709aa|up_7|NZ_CP040695.2_4341476_4343603_-,NA|349aa|up_6|NZ_CP040695.2_4343671_4344718_-,NA|107aa|up_4|NZ_CP040695.2_4345209_4345530_-,NA|211aa|down_9|NZ_CP040695.2_4375946_4376579_-	NA|260aa|up_9|NZ_CP040695.2_4340260_4341040_-	COG1414, IclR, Transcriptional regulator [Transcription]	NA|107aa|up_8|NZ_CP040695.2_4341125_4341446_+	COG1359, COG1359, Uncharacterized conserved protein [Function unknown]	NA|709aa|up_7|NZ_CP040695.2_4341476_4343603_-	NA	NA|349aa|up_6|NZ_CP040695.2_4343671_4344718_-	NA	NA|130aa|up_5|NZ_CP040695.2_4344714_4345104_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|107aa|up_4|NZ_CP040695.2_4345209_4345530_-	NA	NA|223aa|up_3|NZ_CP040695.2_4345526_4346195_-	COG1296, AzlC, Predicted branched-chain amino acid permease (azaleucine resistance) [Amino acid transport and metabolism]	NA|387aa|up_2|NZ_CP040695.2_4346276_4347437_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|466aa|up_1|NZ_CP040695.2_4347521_4348919_-	TIGR01312, Xylulose_kinase, D-xylulose kinase	NA|412aa|up_0|NZ_CP040695.2_4349085_4350321_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|2207aa|down_0|NZ_CP040695.2_4350736_4357357_-	pfam06283, ThuA, Trehalose utilisation	NA|352aa|down_1|NZ_CP040695.2_4357463_4358519_-	smart00089, PKD, Repeats in polycystic kidney disease 1 (PKD1) and other proteins	NA|381aa|down_2|NZ_CP040695.2_4366022_4367165_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|362aa|down_3|NZ_CP040695.2_4367309_4368395_-	cd06311, PBP1_ABC_sugar_binding-like, periplasmic sugar-binding domain of uncharacterized ABC-type transport systems	NA|345aa|down_4|NZ_CP040695.2_4368506_4369541_-	COG1172, AraH, Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components [Carbohydrate transport and metabolism]	NA|336aa|down_5|NZ_CP040695.2_4371397_4372405_-	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]	NA|454aa|down_6|NZ_CP040695.2_4372484_4373846_-	pfam01663, Phosphodiest, Type I phosphodiesterase / nucleotide pyrophosphatase	NA|405aa|down_7|NZ_CP040695.2_4373875_4375090_-	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]	NA|287aa|down_8|NZ_CP040695.2_4375086_4375947_-	COG1099, COG1099, Predicted metal-dependent hydrolases with the TIM-barrel fold [General function prediction only]	NA|211aa|down_9|NZ_CP040695.2_4375946_4376579_-	NA
GCF_005954645.2_ASM595464v2	NZ_CP040695	Nocardioides sp. S-1144 chromosome, complete genome	18	4391468-4391559	18	CRISPRCasFinder	no	csa3	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	Type I-A	GGGAGTTTCACCGCCTCAGCGGTGAAACTC	30	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,cas3,csa3,cas4,RT,PD-DExK,DinG	NA|71aa|up_9|NZ_CP040695.2_4382725_4382938_-,NA	NA|71aa|up_9|NZ_CP040695.2_4382725_4382938_-	NA	NA|534aa|up_8|NZ_CP040695.2_4383012_4384614_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|119aa|up_7|NZ_CP040695.2_4384610_4384967_+	pfam06689, zf-C4_ClpX, ClpX C4-type zinc finger	NA|159aa|up_6|NZ_CP040695.2_4385010_4385487_+	cd07825, SRPBCC_7, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|159aa|up_5|NZ_CP040695.2_4385535_4386012_-	cd07254, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	csa3|126aa|up_4|NZ_CP040695.2_4386092_4386470_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|384aa|up_3|NZ_CP040695.2_4386466_4387618_+	TIGR00832, Uncharacterized_transporter_slr0944, arsenical-resistance protein	NA|138aa|up_2|NZ_CP040695.2_4387614_4388028_+	cd16345, LMWP_ArsC, Arsenate reductase of the LMWP family	NA|473aa|up_1|NZ_CP040695.2_4388080_4389499_-	TIGR00665, DnaB, replicative DNA helicase	NA|476aa|up_0|NZ_CP040695.2_4389937_4391365_+	cd13136, MATE_DinF_like, DinF and similar proteins, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|149aa|down_0|NZ_CP040695.2_4391656_4392103_-	PRK00137, rplI, 50S ribosomal protein L9; Reviewed	NA|79aa|down_1|NZ_CP040695.2_4392130_4392367_-	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|201aa|down_2|NZ_CP040695.2_4392432_4393035_-	PRK07772, PRK07772, single-stranded DNA-binding protein; Provisional	NA|102aa|down_3|NZ_CP040695.2_4393174_4393480_-	PRK00453, rpsF, 30S ribosomal protein S6; Reviewed	NA|747aa|down_4|NZ_CP040695.2_4393653_4395894_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|265aa|down_5|NZ_CP040695.2_4395892_4396687_+	PRK01060, PRK01060, endonuclease IV; Provisional	NA|396aa|down_6|NZ_CP040695.2_4396787_4397975_+	COG2348, COG2348, Peptidoglycan interpeptide bridge formation enzyme [Cell wall/membrane/envelope biogenesis]	NA|365aa|down_7|NZ_CP040695.2_4397979_4399074_+	pfam01168, Ala_racemase_N, Alanine racemase, N-terminal domain	NA|479aa|down_8|NZ_CP040695.2_4399034_4400471_-	COG5650, COG5650, Predicted integral membrane protein [Function unknown]	NA|771aa|down_9|NZ_CP040695.2_4400467_4402780_-	COG0744, MrcB, Membrane carboxypeptidase (penicillin-binding protein) [Cell envelope biogenesis, outer membrane]
