assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002263495.1_ASM226349v1	NZ_CP022753	Nocardiopsis gilva YIM 90087 chromosome, complete genome	1	1085495-1085601	1	CRISPRCasFinder	no		csa3,cas3,WYL,DEDDh,Cas14u_CAS-V,DinG,casR,c2c9_V-U4,cas4,cas8e,cse2gr11,cas7,cas5,cas6e	Orphan	AACACGCTGTGTTGGCCTAGGGGG	24	1	1	1085519-1085577	NZ_CP022753.1_1085613-1085671	NA	1	1	Orphan	csa3,cas3,WYL,DEDDh,Cas14u_CAS-V,DinG,casR,c2c9_V-U4,cas4,cas8e,cse2gr11,cas7,cas5,cas6e	NA,NA|62aa|down_0|NZ_CP022753.1_1085634_1085820_-	NA|412aa|up_9|NZ_CP022753.1_1073458_1074694_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|183aa|up_8|NZ_CP022753.1_1074737_1075286_+	TIGR02258, UPF0097_protein_AF_2157, 2'-5' RNA ligase	NA|242aa|up_7|NZ_CP022753.1_1075539_1076265_-	COG3246, COG3246, Uncharacterized conserved protein [Function unknown]	NA|342aa|up_6|NZ_CP022753.1_1076398_1077424_+	cd09971, SdiA-regulated, SdiA-regulated	NA|332aa|up_5|NZ_CP022753.1_1077498_1078494_-	cd08602, GDPD_ScGlpQ1_like, Glycerophosphodiester phosphodiesterase domain of Streptomycin coelicolor (GlpQ1) and similar proteins	NA|427aa|up_4|NZ_CP022753.1_1078601_1079882_-	TIGR01366, Putative_phosphoserine_aminotransferase_PSAT	NA|369aa|up_3|NZ_CP022753.1_1080011_1081118_-	PRK12350, PRK12350, citrate synthase 2; Provisional	NA|222aa|up_2|NZ_CP022753.1_1081262_1081928_+	TIGR00558, Pyridoxine/pyridoxamine_5'-phosphate_oxidase, pyridoxamine-phosphate oxidase	NA|237aa|up_1|NZ_CP022753.1_1082216_1082927_+	COG1321, TroR, Mn-dependent transcriptional regulator [Transcription]	NA|755aa|up_0|NZ_CP022753.1_1083123_1085388_+	pfam07228, SpoIIE, Stage II sporulation protein E (SpoIIE)	NA|62aa|down_0|NZ_CP022753.1_1085634_1085820_-	NA	NA|381aa|down_1|NZ_CP022753.1_1085858_1087001_-	cd01160, LCAD, Long chain acyl-CoA dehydrogenase	NA|680aa|down_2|NZ_CP022753.1_1087274_1089314_+	cd17502, MFS_Azr1_MDR_like, Saccharomyces cerevisiae Azole resistance protein 1 (Azr1p), and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|381aa|down_3|NZ_CP022753.1_1089429_1090572_+	PRK07801, PRK07801, acetyl-CoA C-acetyltransferase	NA|407aa|down_4|NZ_CP022753.1_1090694_1091915_-	COG3214, COG3214, Uncharacterized protein conserved in bacteria [Function unknown]	NA|646aa|down_5|NZ_CP022753.1_1092089_1094027_-	cd01161, VLCAD, Very long chain acyl-CoA dehydrogenase	NA|475aa|down_6|NZ_CP022753.1_1094304_1095729_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|288aa|down_7|NZ_CP022753.1_1095812_1096676_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|268aa|down_8|NZ_CP022753.1_1097018_1097822_+	cd09086, ExoIII-like_AP-endo, Escherichia coli exonuclease III (ExoIII) and Neisseria meningitides NExo-like subfamily of the ExoIII family purinic/apyrimidinic (AP) endonucleases	NA|287aa|down_9|NZ_CP022753.1_1098069_1098930_+	cd01174, ribokinase, Ribokinase catalyses the phosphorylation of ribose to ribose-5-phosphate using ATP
GCF_002263495.1_ASM226349v1	NZ_CP022753	Nocardiopsis gilva YIM 90087 chromosome, complete genome	2	1734334-1734423	2	CRISPRCasFinder	no		csa3,cas3,WYL,DEDDh,Cas14u_CAS-V,DinG,casR,c2c9_V-U4,cas4,cas8e,cse2gr11,cas7,cas5,cas6e	Orphan	GTAAGCCCCGCATGCGCGGGGGTGGACC	28	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,WYL,DEDDh,Cas14u_CAS-V,DinG,casR,c2c9_V-U4,cas4,cas8e,cse2gr11,cas7,cas5,cas6e	NA|85aa|up_9|NZ_CP022753.1_1724228_1724483_-,NA|137aa|down_6|NZ_CP022753.1_1741402_1741813_+	NA|85aa|up_9|NZ_CP022753.1_1724228_1724483_-	NA	NA|373aa|up_8|NZ_CP022753.1_1724582_1725701_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|193aa|up_7|NZ_CP022753.1_1725715_1726294_+	pfam13023, HD_3, HD domain	NA|289aa|up_6|NZ_CP022753.1_1727698_1728565_-	PRK08317, PRK08317, hypothetical protein; Provisional	NA|215aa|up_5|NZ_CP022753.1_1728959_1729604_+	TIGR02947, putative_RNA_polymerase_sigma_factor, RNA polymerase sigma-70 factor, TIGR02947 family	NA|97aa|up_4|NZ_CP022753.1_1729600_1729891_+	TIGR03988, antisig_RsrA, mycothiol system anti-sigma-R factor	NA|442aa|up_3|NZ_CP022753.1_1730875_1732201_+	COG2206, COG2206, c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms]	NA|437aa|up_2|NZ_CP022753.1_1732197_1733508_+	COG3437, COG3437, Response regulator containing a CheY-like receiver domain and an HD-GYP domain [Transcription / Signal transduction mechanisms]	NA|71aa|up_1|NZ_CP022753.1_1733592_1733805_+	cd06850, biotinyl_domain, The biotinyl-domain or biotin carboxyl carrier protein (BCCP) domain is present in all biotin-dependent enzymes, such as acetyl-CoA carboxylase, pyruvate carboxylase, propionyl-CoA carboxylase, methylcrotonyl-CoA carboxylase, geranyl-CoA carboxylase, oxaloacetate decarboxylase, methylmalonyl-CoA decarboxylase, transcarboxylase and urea amidolyase	NA|126aa|up_0|NZ_CP022753.1_1733900_1734278_+	COG5496, COG5496, Predicted thioesterase [General function prediction only]	NA|202aa|down_0|NZ_CP022753.1_1734647_1735253_-	pfam11611, DUF4352, Domain of unknown function (DUF4352)	NA|554aa|down_1|NZ_CP022753.1_1735462_1737124_-	PRK05858, PRK05858, acetolactate synthase	NA|248aa|down_2|NZ_CP022753.1_1737264_1738008_-	COG2188, PhnF, Transcriptional regulators [Transcription]	NA|521aa|down_3|NZ_CP022753.1_1738104_1739667_+	pfam12282, H_kinase_N, Signal transduction histidine kinase	NA|86aa|down_4|NZ_CP022753.1_1739831_1740089_-	pfam02467, Whib, Transcription factor WhiB	NA|312aa|down_5|NZ_CP022753.1_1740427_1741363_-	COG1597, LCB5, Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only]	NA|137aa|down_6|NZ_CP022753.1_1741402_1741813_+	NA	NA|244aa|down_7|NZ_CP022753.1_1742008_1742740_-	TIGR02980, SigBFG, RNA polymerase sigma-70 factor, sigma-B/F/G subfamily	NA|142aa|down_8|NZ_CP022753.1_1742808_1743234_-	PRK04069, PRK04069, serine-protein kinase RsbW; Provisional	NA|162aa|down_9|NZ_CP022753.1_1743383_1743869_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family
GCF_002263495.1_ASM226349v1	NZ_CP022753	Nocardiopsis gilva YIM 90087 chromosome, complete genome	3	1908999-1909154	3	CRISPRCasFinder	no		csa3,cas3,WYL,DEDDh,Cas14u_CAS-V,DinG,casR,c2c9_V-U4,cas4,cas8e,cse2gr11,cas7,cas5,cas6e	Orphan	CGGGGCCTACCGCGTCCCCCAGGCGTAGGTTTGCTTGCGCAGCTTCAGATAGAG	54	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,WYL,DEDDh,Cas14u_CAS-V,DinG,casR,c2c9_V-U4,cas4,cas8e,cse2gr11,cas7,cas5,cas6e	NA|84aa|up_5|NZ_CP022753.1_1902245_1902497_-,NA|83aa|down_3|NZ_CP022753.1_1914842_1915091_-,NA|155aa|down_7|NZ_CP022753.1_1919827_1920292_+	NA|303aa|up_9|NZ_CP022753.1_1898833_1899742_+	PRK05299, rpsB, 30S ribosomal protein S2; Provisional	NA|279aa|up_8|NZ_CP022753.1_1899867_1900704_+	PRK09377, tsf, elongation factor Ts; Provisional	NA|256aa|up_7|NZ_CP022753.1_1900840_1901608_+	PRK00358, pyrH, uridylate kinase; Provisional	NA|186aa|up_6|NZ_CP022753.1_1901677_1902235_+	PRK00083, frr, ribosome recycling factor; Reviewed	NA|84aa|up_5|NZ_CP022753.1_1902245_1902497_-	NA	NA|288aa|up_4|NZ_CP022753.1_1902495_1903359_+	pfam01148, CTP_transf_1, Cytidylyltransferase family	NA|410aa|up_3|NZ_CP022753.1_1903432_1904662_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|362aa|up_2|NZ_CP022753.1_1906085_1907171_-	pfam05076, SUFU, Suppressor of fused protein (SUFU)	NA|367aa|up_1|NZ_CP022753.1_1907357_1908458_+	PRK14459, PRK14459, ribosomal RNA large subunit methyltransferase N; Provisional	NA|158aa|up_0|NZ_CP022753.1_1908508_1908982_+	COG1238, COG1238, Predicted membrane protein [Function unknown]	NA|539aa|down_0|NZ_CP022753.1_1909733_1911350_-	pfam07905, PucR, Purine catabolism regulatory protein-like family	NA|433aa|down_1|NZ_CP022753.1_1911591_1912890_+	cd00610, OAT_like, Acetyl ornithine aminotransferase family	NA|500aa|down_2|NZ_CP022753.1_1912951_1914451_+	cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2	NA|83aa|down_3|NZ_CP022753.1_1914842_1915091_-	NA	NA|267aa|down_4|NZ_CP022753.1_1915308_1916109_-	cd07581, nitrilase_3, Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases)	NA|447aa|down_5|NZ_CP022753.1_1916269_1917610_-	PRK06058, PRK06058, 4-aminobutyrate--2-oxoglutarate transaminase	NA|489aa|down_6|NZ_CP022753.1_1917731_1919198_+	pfam07905, PucR, Purine catabolism regulatory protein-like family	NA|155aa|down_7|NZ_CP022753.1_1919827_1920292_+	NA	NA|1042aa|down_8|NZ_CP022753.1_1920602_1923728_+	COG1112, COG1112, Superfamily I DNA and RNA helicases and helicase subunits [DNA replication, recombination, and repair]	NA|294aa|down_9|NZ_CP022753.1_1923928_1924810_+	COG0266, Nei, Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]
GCF_002263495.1_ASM226349v1	NZ_CP022753	Nocardiopsis gilva YIM 90087 chromosome, complete genome	4	2493142-2493246	4	CRISPRCasFinder	no	csa3	csa3,cas3,WYL,DEDDh,Cas14u_CAS-V,DinG,casR,c2c9_V-U4,cas4,cas8e,cse2gr11,cas7,cas5,cas6e	Type I-A	GGACCATCCCCGCGGTAGCGGGGA	24	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,WYL,DEDDh,Cas14u_CAS-V,DinG,casR,c2c9_V-U4,cas4,cas8e,cse2gr11,cas7,cas5,cas6e	NA|157aa|up_9|NZ_CP022753.1_2483138_2483609_-,NA|69aa|down_3|NZ_CP022753.1_2497105_2497312_+,NA|95aa|down_7|NZ_CP022753.1_2500056_2500341_+,NA|193aa|down_9|NZ_CP022753.1_2501588_2502167_+	NA|157aa|up_9|NZ_CP022753.1_2483138_2483609_-	NA	NA|475aa|up_8|NZ_CP022753.1_2483746_2485171_-	TIGR00653, Glutamine_synthetase, glutamine synthetase, type I	NA|129aa|up_7|NZ_CP022753.1_2485536_2485923_+	pfam06271, RDD, RDD family	NA|244aa|up_6|NZ_CP022753.1_2486132_2486864_-	pfam13829, DUF4191, Domain of unknown function (DUF4191)	NA|321aa|up_5|NZ_CP022753.1_2487029_2487992_-	PRK05481, PRK05481, lipoyl synthase; Provisional	NA|245aa|up_4|NZ_CP022753.1_2488050_2488785_-	PRK14345, PRK14345, lipoyl(octanoyl) transferase LipB	NA|298aa|up_3|NZ_CP022753.1_2488972_2489866_-	cd05242, SDR_a8, atypical (a) SDRs, subgroup 8	NA|331aa|up_2|NZ_CP022753.1_2490229_2491222_-	TIGR02927, putative_dihydrolipoamide_acyltransferase, 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase	NA|166aa|up_1|NZ_CP022753.1_2491152_2491650_-	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed	NA|459aa|up_0|NZ_CP022753.1_2491726_2493103_-	PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed	NA|501aa|down_0|NZ_CP022753.1_2493255_2494758_-	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|250aa|down_1|NZ_CP022753.1_2495160_2495910_-	PRK00235, cobS, cobalamin synthase; Reviewed	NA|320aa|down_2|NZ_CP022753.1_2495954_2496914_-	pfam02283, CobU, Cobinamide kinase / cobinamide phosphate guanyltransferase	NA|69aa|down_3|NZ_CP022753.1_2497105_2497312_+	NA	NA|337aa|down_4|NZ_CP022753.1_2497431_2498442_-	cd19074, Aldo_ket_red_shaker-like, Shaker potassium channel beta subunit family and similar proteins	NA|200aa|down_5|NZ_CP022753.1_2498521_2499121_-	pfam11241, DUF3043, Protein of unknown function (DUF3043)	NA|258aa|down_6|NZ_CP022753.1_2499286_2500060_+	COG1842, PspA, Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]	NA|95aa|down_7|NZ_CP022753.1_2500056_2500341_+	NA	NA|306aa|down_8|NZ_CP022753.1_2500592_2501510_+	PRK02391, PRK02391, heat shock protein HtpX; Provisional	NA|193aa|down_9|NZ_CP022753.1_2501588_2502167_+	NA
GCF_002263495.1_ASM226349v1	NZ_CP022753	Nocardiopsis gilva YIM 90087 chromosome, complete genome	5	4471679-4471833	5	CRISPRCasFinder	no		csa3,cas3,WYL,DEDDh,Cas14u_CAS-V,DinG,casR,c2c9_V-U4,cas4,cas8e,cse2gr11,cas7,cas5,cas6e	Orphan	AGGCCCATGCGCACCGCCGGTGCAGCAGGCGAGGAACCCGAAAGCGGAAAA	51	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,WYL,DEDDh,Cas14u_CAS-V,DinG,casR,c2c9_V-U4,cas4,cas8e,cse2gr11,cas7,cas5,cas6e	NA,NA	NA|982aa|up_9|NZ_CP022753.1_4454520_4457466_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|156aa|up_8|NZ_CP022753.1_4457649_4458117_+	TIGR03618, Rv1155_F420, PPOX class probable F420-dependent enzyme	NA|425aa|up_7|NZ_CP022753.1_4458159_4459434_-	cd06548, GH18_chitinase, The GH18 (glycosyl hydrolases, family 18) type II chitinases hydrolyze chitin, an abundant polymer of N-acetylglucosamine and have been identified in bacteria, fungi, insects, plants, viruses, and protozoan parasites	NA|385aa|up_6|NZ_CP022753.1_4459696_4460851_-	PRK05901, PRK05901, RNA polymerase sigma factor; Provisional	NA|457aa|up_5|NZ_CP022753.1_4462954_4464325_+	cd14750, PBP2_TMBP, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose; possesses type 2 periplasmic binding fold	NA|321aa|up_4|NZ_CP022753.1_4464341_4465304_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|281aa|up_3|NZ_CP022753.1_4465300_4466143_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|251aa|up_2|NZ_CP022753.1_4466431_4467184_-	COG1272, COG1272, Predicted membrane protein, hemolysin III homolog [General function prediction only]	NA|893aa|up_1|NZ_CP022753.1_4467607_4470286_+	PRK09279, PRK09279, pyruvate phosphate dikinase; Provisional	NA|364aa|up_0|NZ_CP022753.1_4470546_4471638_+	cd06259, YdcF-like, YdcF-like	NA|416aa|down_0|NZ_CP022753.1_4471968_4473216_+	PRK03007, PRK03007, deoxyguanosinetriphosphate triphosphohydrolase-like protein; Provisional	NA|456aa|down_1|NZ_CP022753.1_4473347_4474715_-	PRK04173, PRK04173, glycyl-tRNA synthetase; Provisional	NA|100aa|down_2|NZ_CP022753.1_4474923_4475223_-	COG2329, COG2329, Uncharacterized enzyme involved in biosynthesis of extracellular polysaccharides [General function prediction only]	NA|1244aa|down_3|NZ_CP022753.1_4475358_4479090_-	pfam04464, Glyphos_transf, CDP-Glycerol:Poly(glycerophosphate) glycerophosphotransferase	NA|268aa|down_4|NZ_CP022753.1_4479865_4480669_+	COG1121, ZnuC, ABC-type Mn/Zn transport systems, ATPase component [Inorganic ion transport and metabolism]	NA|300aa|down_5|NZ_CP022753.1_4480676_4481576_+	COG1108, ZnuB, ABC-type Mn2+/Zn2+ transport systems, permease components [Inorganic ion transport and metabolism]	NA|131aa|down_6|NZ_CP022753.1_4481572_4481965_+	cd07153, Fur_like, Ferric uptake regulator(Fur) and related metalloregulatory proteins; typically iron-dependent, DNA-binding repressors and activators	NA|267aa|down_7|NZ_CP022753.1_4482117_4482918_-	cd01832, SGNH_hydrolase_like_1, Members of the SGNH-hydrolase superfamily, a diverse family of lipases and esterases	NA|287aa|down_8|NZ_CP022753.1_4483137_4483998_-	PRK14829, PRK14829, undecaprenyl pyrophosphate synthase; Provisional	NA|244aa|down_9|NZ_CP022753.1_4483994_4484726_-	PRK00085, recO, DNA repair protein RecO; Reviewed
