assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000269985.1_ASM26998v1	NC_016109	Kitasatospora setae KM-6054, complete genome	1	591400-591479	1	CRISPRCasFinder	no		cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	Orphan	GAGTTGGAGGAGACCAACCGGGGCGT	26	0	0	NA	NA	NA	1	1	Orphan	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	NA,NA|90aa|down_6|NC_016109.1_599628_599898_+,NA|121aa|down_9|NC_016109.1_602558_602921_-	NA|401aa|up_9|NC_016109.1_582424_583627_-	COG5012, COG5012, Predicted cobalamin binding protein [General function prediction only]	NA|538aa|up_8|NC_016109.1_583623_585237_-	pfam07228, SpoIIE, Stage II sporulation protein E (SpoIIE)	NA|124aa|up_7|NC_016109.1_585503_585875_+	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|138aa|up_6|NC_016109.1_585949_586363_+	cd16936, HATPase_RsbW-like, Histidine kinase-like ATPase domain of RsbW, an anti sigma-B factor and serine-protein kinase involved in regulating sigma-B during stress in Bacilli, and related domains	NA|142aa|up_5|NC_016109.1_586610_587036_-	pfam04686, SsgA, Streptomyces sporulation and cell division protein, SsgA	NA|158aa|up_4|NC_016109.1_587298_587772_+	cd01043, DPS, DPS protein, ferritin-like diiron-binding domain	NA|280aa|up_3|NC_016109.1_587974_588814_+	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|126aa|up_2|NC_016109.1_588810_589188_+	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|139aa|up_1|NC_016109.1_589184_589601_+	cd16934, HATPase_RsbT-like, Histidine kinase-like ATPase domain of the anti sigma-B factor Bacillus subtilis serine/threonine-protein kinase RsbT, and related domains	NA|415aa|up_0|NC_016109.1_589597_590842_+	cd16934, HATPase_RsbT-like, Histidine kinase-like ATPase domain of the anti sigma-B factor Bacillus subtilis serine/threonine-protein kinase RsbT, and related domains	NA|546aa|down_0|NC_016109.1_592646_594284_+	pfam07228, SpoIIE, Stage II sporulation protein E (SpoIIE)	NA|119aa|down_1|NC_016109.1_594396_594753_+	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|260aa|down_2|NC_016109.1_594759_595539_-	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|337aa|down_3|NC_016109.1_595535_596546_-	COG1295, Rbn, Ribonuclease BN family enzyme [Replication, recombination, and repair]	NA|559aa|down_4|NC_016109.1_596545_598222_-	COG1597, LCB5, Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only]	NA|406aa|down_5|NC_016109.1_598338_599556_+	PRK01642, cls, cardiolipin synthetase; Reviewed	NA|90aa|down_6|NC_016109.1_599628_599898_+	NA	NA|274aa|down_7|NC_016109.1_599937_600759_+	pfam06724, DUF1206, Domain of Unknown Function (DUF1206)	NA|492aa|down_8|NC_016109.1_600784_602260_-	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|121aa|down_9|NC_016109.1_602558_602921_-	NA
GCF_000269985.1_ASM26998v1	NC_016109	Kitasatospora setae KM-6054, complete genome	2	613078-613838	1,2,1	CRT,CRISPRCasFinder,PILER-CR	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	Type I-E	GGAGCAACCCCCGCGAGCGCGGGGCCGAG,GAGCAACCCCCGCGAGCGCGGGGCCGAG,GAGCAACCCCCGCGAGCGCGGGGCCGAG	29,28,28	2	2	613290-613321|613290-613322	NC_016109.1_3733827-3733796|NC_016109.1_3733827-3733795	NA:NA:NA	12,12,11	12	TypeI-E	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	NA|121aa|up_9|NC_016109.1_602558_602921_-,NA|142aa|up_8|NC_016109.1_603292_603718_+,NA|85aa|up_5|NC_016109.1_607643_607898_+,NA|157aa|up_4|NC_016109.1_608016_608487_+,NA|119aa|up_3|NC_016109.1_608490_608847_+,NA|90aa|up_2|NC_016109.1_611030_611300_+,NA|133aa|down_8|NC_016109.1_627874_628273_-	NA|121aa|up_9|NC_016109.1_602558_602921_-	NA	NA|142aa|up_8|NC_016109.1_603292_603718_+	NA	NA|398aa|up_7|NC_016109.1_603828_605022_+	COG0031, CysK, Cysteine synthase [Amino acid transport and metabolism]	NA|425aa|up_6|NC_016109.1_605018_606293_+	cd17329, MFS_MdtH_MDR_like, Multidrug resistance protein MdtH and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|85aa|up_5|NC_016109.1_607643_607898_+	NA	NA|157aa|up_4|NC_016109.1_608016_608487_+	NA	NA|119aa|up_3|NC_016109.1_608490_608847_+	NA	NA|90aa|up_2|NC_016109.1_611030_611300_+	NA	NA|100aa|up_1|NC_016109.1_611837_612137_+	COG2963, COG2963, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|291aa|up_0|NC_016109.1_612133_613006_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas2|93aa|down_0|NC_016109.1_613847_614126_-	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	cas1|345aa|down_1|NC_016109.1_614122_615157_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|256aa|down_2|NC_016109.1_615160_615928_-	pfam08798, CRISPR_assoc, CRISPR associated protein	cas5|278aa|down_3|NC_016109.1_615936_616770_-	pfam09704, Cas_Cas5d, CRISPR-associated protein (Cas_Cas5)	cas7|383aa|down_4|NC_016109.1_616769_617918_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|201aa|down_5|NC_016109.1_617983_618586_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas8e|571aa|down_6|NC_016109.1_618575_620288_-	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	NA|217aa|down_7|NC_016109.1_626393_627044_+	COG1637, COG1637, Predicted nuclease of the RecB family [DNA replication, recombination, and repair]	NA|133aa|down_8|NC_016109.1_627874_628273_-	NA	NA|160aa|down_9|NC_016109.1_629248_629728_+	cd01276, PKCI_related, Protein Kinase C Interacting protein related (PKCI): PKCI and related proteins belong to the ubiquitous HIT family of hydrolases that act on alpha-phosphates of ribonucleotides
GCF_000269985.1_ASM26998v1	NC_016109	Kitasatospora setae KM-6054, complete genome	3	623530-625875	2,3,2	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	Type I-E	GAGCAACCCCCGCGAGCGCGGGGCCGAG,CTCGGCCCCGCGCTCGCGGGGGTTGCTC,CTCGGCCCCGCGCTCGCGGGGGTTGCTC	28,28,28	1	1	624046-624078	NC_016109.1_3709700-3709668	NA:NA:NA	37,38,38	38	TypeI-E	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	NA|90aa|up_9|NC_016109.1_611030_611300_+,NA|133aa|down_1|NC_016109.1_627874_628273_-	NA|90aa|up_9|NC_016109.1_611030_611300_+	NA	NA|100aa|up_8|NC_016109.1_611837_612137_+	COG2963, COG2963, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|291aa|up_7|NC_016109.1_612133_613006_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas2|93aa|up_6|NC_016109.1_613847_614126_-	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	cas1|345aa|up_5|NC_016109.1_614122_615157_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|256aa|up_4|NC_016109.1_615160_615928_-	pfam08798, CRISPR_assoc, CRISPR associated protein	cas5|278aa|up_3|NC_016109.1_615936_616770_-	pfam09704, Cas_Cas5d, CRISPR-associated protein (Cas_Cas5)	cas7|383aa|up_2|NC_016109.1_616769_617918_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|201aa|up_1|NC_016109.1_617983_618586_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas8e|571aa|up_0|NC_016109.1_618575_620288_-	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	NA|217aa|down_0|NC_016109.1_626393_627044_+	COG1637, COG1637, Predicted nuclease of the RecB family [DNA replication, recombination, and repair]	NA|133aa|down_1|NC_016109.1_627874_628273_-	NA	NA|160aa|down_2|NC_016109.1_629248_629728_+	cd01276, PKCI_related, Protein Kinase C Interacting protein related (PKCI): PKCI and related proteins belong to the ubiquitous HIT family of hydrolases that act on alpha-phosphates of ribonucleotides	NA|365aa|down_3|NC_016109.1_629740_630835_-	PRK00726, murG, undecaprenyldiphospho-muramoylpentapeptide beta-N- acetylglucosaminyltransferase; Provisional	NA|550aa|down_4|NC_016109.1_631238_632888_-	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|376aa|down_5|NC_016109.1_632905_634033_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|249aa|down_6|NC_016109.1_634029_634776_-	pfam08889, WbqC, WbqC-like protein family	NA|213aa|down_7|NC_016109.1_634772_635411_-	COG2120, COG2120, Uncharacterized proteins, LmbE homologs [Function unknown]	NA|187aa|down_8|NC_016109.1_635454_636015_-	PRK00416, dcd, deoxycytidine triphosphate deaminase; Reviewed	NA|185aa|down_9|NC_016109.1_636011_636566_-	PRK00416, dcd, deoxycytidine triphosphate deaminase; Reviewed
GCF_000269985.1_ASM26998v1	NC_016109	Kitasatospora setae KM-6054, complete genome	4	1931784-1931936	4	CRISPRCasFinder	no		cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	Orphan	CGAAGAGGCCGACCCGACCAAGGGCCGCCTGGTCAAGGTCCCGCCGCC	48	1	1	1931832-1931888	NC_016109.1_1932246-1932302	NA	1	1	Orphan	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	NA|109aa|up_4|NC_016109.1_1926427_1926754_+,NA|133aa|up_2|NC_016109.1_1928058_1928457_-,NA|109aa|down_2|NC_016109.1_1933562_1933889_-	NA|336aa|up_9|NC_016109.1_1919936_1920944_-	cd09019, galactose_mutarotase_like, galactose mutarotase_like	NA|986aa|up_8|NC_016109.1_1921146_1924104_-	pfam13191, AAA_16, AAA ATPase domain	NA|327aa|up_7|NC_016109.1_1924441_1925422_+	cd00687, Terpene_cyclase_nonplant_C1, Non-plant Terpene Cyclases, Class 1	NA|122aa|up_6|NC_016109.1_1925452_1925818_-	pfam09851, SHOCT, Short C-terminal domain	NA|143aa|up_5|NC_016109.1_1925837_1926266_-	pfam06897, DUF1269, Protein of unknown function (DUF1269)	NA|109aa|up_4|NC_016109.1_1926427_1926754_+	NA	NA|401aa|up_3|NC_016109.1_1926799_1928002_-	pfam00144, Beta-lactamase, Beta-lactamase	NA|133aa|up_2|NC_016109.1_1928058_1928457_-	NA	NA|184aa|up_1|NC_016109.1_1928545_1929097_+	cd04697, Nudix_Hydrolase_38, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|255aa|up_0|NC_016109.1_1929216_1929981_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|143aa|down_0|NC_016109.1_1932613_1933042_+	pfam14078, DUF4259, Domain of unknown function (DUF4259)	NA|136aa|down_1|NC_016109.1_1933082_1933490_+	pfam14078, DUF4259, Domain of unknown function (DUF4259)	NA|109aa|down_2|NC_016109.1_1933562_1933889_-	NA	NA|605aa|down_3|NC_016109.1_1934284_1936099_+	cd06548, GH18_chitinase, The GH18 (glycosyl hydrolases, family 18) type II chitinases hydrolyze chitin, an abundant polymer of N-acetylglucosamine and have been identified in bacteria, fungi, insects, plants, viruses, and protozoan parasites	NA|221aa|down_4|NC_016109.1_1936108_1936771_-	COG0421, SpeE, Spermidine synthase [Amino acid transport and metabolism]	NA|286aa|down_5|NC_016109.1_1936927_1937785_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|215aa|down_6|NC_016109.1_1937781_1938426_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|304aa|down_7|NC_016109.1_1938440_1939352_-	cd13690, PBP2_GluB, Substrate binding domain of ABC glutamate transporter; the type 2 periplasmic binding protein fold	NA|243aa|down_8|NC_016109.1_1939363_1940092_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|104aa|down_9|NC_016109.1_1940172_1940484_-	pfam06305, LapA_dom, Lipopolysaccharide assembly protein A domain
GCF_000269985.1_ASM26998v1	NC_016109	Kitasatospora setae KM-6054, complete genome	5	2749054-2749177	5	CRISPRCasFinder	no	PD-DExK	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	Unclear	TACGGCTACCCGCCCCCGCAGCAGGGCTACGGCTACCCGCCGCAAC	46	0	0	NA	NA	NA	1	1	Orphan	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	PD-DExK|315aa|up_9|NC_016109.1_2733320_2734265_+,NA|291aa|up_8|NC_016109.1_2734531_2735404_+,NA	PD-DExK|315aa|up_9|NC_016109.1_2733320_2734265_+	NA	NA|291aa|up_8|NC_016109.1_2734531_2735404_+	NA	NA|1410aa|up_7|NC_016109.1_2735451_2739681_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|205aa|up_6|NC_016109.1_2739727_2740342_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|180aa|up_5|NC_016109.1_2740251_2740791_-	pfam13592, HTH_33, Winged helix-turn helix	NA|60aa|up_4|NC_016109.1_2740926_2741106_+	PRK15347, PRK15347, two component system sensor kinase	NA|422aa|up_3|NC_016109.1_2741429_2742695_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|72aa|up_2|NC_016109.1_2742721_2742937_+	TIGR02605, CxxC_CxxC_SSSS, putative regulatory protein, FmdB family	NA|273aa|up_1|NC_016109.1_2743052_2743871_-	cd07520, HAD_like, uncharacterized family of the haloacid dehalogenase-like (HAD) hydrolase superfamily	NA|391aa|up_0|NC_016109.1_2746712_2747885_-	pfam15617, C-C_Bond_Lyase, C-C_Bond_Lyase of the TIM-Barrel fold	NA|248aa|down_0|NC_016109.1_2749363_2750107_-	COG4110, COG4110, Uncharacterized protein involved in stress response [General function prediction only]	NA|385aa|down_1|NC_016109.1_2750323_2751478_+	pfam10935, DUF2637, Protein of unknown function (DUF2637)	NA|361aa|down_2|NC_016109.1_2751558_2752641_-	PRK14013, PRK14013, hypothetical protein; Provisional	NA|192aa|down_3|NC_016109.1_2752701_2753277_-	pfam02342, TerD, TerD domain	NA|802aa|down_4|NC_016109.1_2753668_2756074_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|192aa|down_5|NC_016109.1_2756138_2756714_-	pfam02342, TerD, TerD domain	NA|153aa|down_6|NC_016109.1_2756865_2757324_-	cd03018, PRX_AhpE_like, Peroxiredoxin (PRX) family, AhpE-like subfamily; composed of proteins similar to Mycobacterium tuberculosis AhpE	NA|146aa|down_7|NC_016109.1_2757550_2757988_-	pfam11253, DUF3052, Protein of unknown function (DUF3052)	NA|917aa|down_8|NC_016109.1_2758455_2761206_+	PRK09405, aceE, pyruvate dehydrogenase subunit E1; Reviewed	NA|533aa|down_9|NC_016109.1_2761361_2762960_-	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily
GCF_000269985.1_ASM26998v1	NC_016109	Kitasatospora setae KM-6054, complete genome	6	3238386-3238517	6	CRISPRCasFinder	no		cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	Orphan	GGTCAGCTCTCGCGCTGCTTGCGCCACCGGATGCCGGCCTCGATG	45	0	0	NA	NA	NA	1	1	Orphan	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	NA|63aa|up_0|NC_016109.1_3238119_3238308_+,NA	NA|94aa|up_9|NC_016109.1_3226211_3226493_-	cd14435, SPO1_TF1_like, Bacteriophage SPO1-encoded TF1 binds and bends DNA	NA|492aa|up_8|NC_016109.1_3226922_3228398_-	COG0281, SfcA, Malic enzyme [Energy production and conversion]	NA|767aa|up_7|NC_016109.1_3228724_3231025_-	COG3973, COG3973, Superfamily I DNA and RNA helicases [General function prediction only]	NA|387aa|up_6|NC_016109.1_3231128_3232289_-	PRK07239, PRK07239, bifunctional uroporphyrinogen-III synthetase/response regulator domain protein; Validated	NA|457aa|up_5|NC_016109.1_3232463_3233834_-	COG2223, NarK, Nitrate/nitrite transporter [Inorganic ion transport and metabolism]	NA|161aa|up_4|NC_016109.1_3234528_3235011_-	pfam01668, SmpB, SmpB protein	NA|382aa|up_3|NC_016109.1_3235055_3236201_-	COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer membrane]	NA|302aa|up_2|NC_016109.1_3236252_3237158_-	COG2177, FtsX, Cell division protein [Cell division and chromosome partitioning]	NA|230aa|up_1|NC_016109.1_3237169_3237859_-	COG2884, FtsE, Predicted ATPase involved in cell division [Cell division and chromosome partitioning]	NA|63aa|up_0|NC_016109.1_3238119_3238308_+	NA	NA|417aa|down_0|NC_016109.1_3239612_3240863_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|627aa|down_1|NC_016109.1_3241015_3242896_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|910aa|down_2|NC_016109.1_3243249_3245979_+	TIGR03924, T7SS_EccC_a, type VII secretion protein EccCa	NA|1646aa|down_3|NC_016109.1_3246070_3251008_-	pfam05088, Bac_GDH, Bacterial NAD-glutamate dehydrogenase	NA|539aa|down_4|NC_016109.1_3251300_3252917_-	cd07123, ALDH_F4-17_P5CDH, Delta(1)-pyrroline-5-carboxylate dehydrogenase, ALDH families 4 and 17	NA|203aa|down_5|NC_016109.1_3253000_3253609_-	COG0546, Gph, Predicted phosphatases [General function prediction only]	NA|276aa|down_6|NC_016109.1_3253715_3254543_-	cd19131, AKR_AKR5C2, Escherichia coli 2,5-diketo-D-gluconic acid reductase A (DkgA/YqhE) and similar proteins	NA|393aa|down_7|NC_016109.1_3254576_3255755_-	COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport and metabolism]	NA|420aa|down_8|NC_016109.1_3255897_3257157_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|397aa|down_9|NC_016109.1_3257187_3258378_-	PRK02186, PRK02186, argininosuccinate lyase; Provisional
GCF_000269985.1_ASM26998v1	NC_016109	Kitasatospora setae KM-6054, complete genome	7	4788771-4788866	7	CRISPRCasFinder	no	RT	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	Unclear	TTCGACGGCCCGCAGGTCACGACC	24	0	0	NA	NA	NA	1	1	Orphan	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	NA|149aa|up_8|NC_016109.1_4775815_4776262_+,NA|472aa|up_5|NC_016109.1_4778532_4779948_-,NA|701aa|up_4|NC_016109.1_4780611_4782714_+,NA|192aa|down_1|NC_016109.1_4790943_4791519_+,NA|48aa|down_2|NC_016109.1_4791683_4791827_+,NA|123aa|down_3|NC_016109.1_4791829_4792198_+,NA|191aa|down_4|NC_016109.1_4792322_4792895_+,NA|360aa|down_9|NC_016109.1_4805673_4806753_+	NA|420aa|up_9|NC_016109.1_4774272_4775532_-	pfam13367, PrsW-protease, Protease prsW family	NA|149aa|up_8|NC_016109.1_4775815_4776262_+	NA	NA|251aa|up_7|NC_016109.1_4776281_4777034_-	PRK00121, trmB, tRNA (guanine-N(7)-)-methyltransferase; Reviewed	NA|411aa|up_6|NC_016109.1_4777217_4778450_-	PRK11728, PRK11728, L-2-hydroxyglutarate oxidase	NA|472aa|up_5|NC_016109.1_4778532_4779948_-	NA	NA|701aa|up_4|NC_016109.1_4780611_4782714_+	NA	NA|171aa|up_3|NC_016109.1_4782850_4783363_+	pfam00582, Usp, Universal stress protein family	NA|232aa|up_2|NC_016109.1_4783454_4784150_+	cd03024, DsbA_FrnE, DsbA family, FrnE subfamily; FrnE is a DsbA-like protein containing a CXXC motif	NA|717aa|up_1|NC_016109.1_4784225_4786376_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|265aa|up_0|NC_016109.1_4786695_4787490_-	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|606aa|down_0|NC_016109.1_4789129_4790947_+	pfam06259, Abhydrolase_8, Alpha/beta hydrolase	NA|192aa|down_1|NC_016109.1_4790943_4791519_+	NA	NA|48aa|down_2|NC_016109.1_4791683_4791827_+	NA	NA|123aa|down_3|NC_016109.1_4791829_4792198_+	NA	NA|191aa|down_4|NC_016109.1_4792322_4792895_+	NA	NA|446aa|down_5|NC_016109.1_4793111_4794449_+	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|417aa|down_6|NC_016109.1_4794734_4795985_+	pfam07228, SpoIIE, Stage II sporulation protein E (SpoIIE)	NA|307aa|down_7|NC_016109.1_4801907_4802828_+	TIGR03448, mycothiol_MshD, mycothiol synthase	NA|715aa|down_8|NC_016109.1_4803437_4805582_+	PRK05443, PRK05443, polyphosphate kinase; Provisional	NA|360aa|down_9|NC_016109.1_4805673_4806753_+	NA
GCF_000269985.1_ASM26998v1	NC_016109	Kitasatospora setae KM-6054, complete genome	8	7220753-7220841	8	CRISPRCasFinder	no		cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	Orphan	GTGGGTGGGGTGGCCGTGCAGGCGGGGGA	29	0	0	NA	NA	NA	1	1	Orphan	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	NA|296aa|up_6|NC_016109.1_7212112_7213000_+,NA|85aa|up_5|NC_016109.1_7213171_7213426_+,NA|132aa|up_2|NC_016109.1_7214518_7214914_+,NA|104aa|up_0|NC_016109.1_7219379_7219691_-,NA|136aa|down_0|NC_016109.1_7221043_7221451_-,NA|115aa|down_1|NC_016109.1_7221578_7221923_-,NA|155aa|down_2|NC_016109.1_7222084_7222549_-	NA|471aa|up_9|NC_016109.1_7208129_7209542_+	pfam09770, PAT1, Topoisomerase II-associated protein PAT1	NA|538aa|up_8|NC_016109.1_7209531_7211145_+	COG4962, CpaF, Flp pilus assembly protein, ATPase CpaF [Intracellular trafficking and secretion]	NA|324aa|up_7|NC_016109.1_7211144_7212116_+	COG4965, TadB, Flp pilus assembly protein TadB [Intracellular trafficking and secretion]	NA|296aa|up_6|NC_016109.1_7212112_7213000_+	NA	NA|85aa|up_5|NC_016109.1_7213171_7213426_+	NA	NA|121aa|up_4|NC_016109.1_7213566_7213929_+	pfam07811, TadE, TadE-like protein	NA|149aa|up_3|NC_016109.1_7213970_7214417_+	COG4961, TadG, Flp pilus assembly protein TadG [Intracellular trafficking and secretion]	NA|132aa|up_2|NC_016109.1_7214518_7214914_+	NA	NA|1450aa|up_1|NC_016109.1_7214913_7219263_+	smart01043, BTAD, Bacterial transcriptional activator domain	NA|104aa|up_0|NC_016109.1_7219379_7219691_-	NA	NA|136aa|down_0|NC_016109.1_7221043_7221451_-	NA	NA|115aa|down_1|NC_016109.1_7221578_7221923_-	NA	NA|155aa|down_2|NC_016109.1_7222084_7222549_-	NA	NA|165aa|down_3|NC_016109.1_7223230_7223725_-	pfam06983, 3-dmu-9_3-mt, 3-demethylubiquinone-9 3-methyltransferase	NA|267aa|down_4|NC_016109.1_7223892_7224693_+	pfam01909, NTP_transf_2, Nucleotidyltransferase domain	NA|188aa|down_5|NC_016109.1_7224763_7225327_-	pfam14024, DUF4240, Protein of unknown function (DUF4240)	NA|316aa|down_6|NC_016109.1_7225954_7226902_+	cd08244, MDR_enoyl_red, Possible enoyl reductase	NA|368aa|down_7|NC_016109.1_7226993_7228097_+	cd07062, Peptidase_S66_mccF_like, Microcin C7 self-immunity protein determines resistance to exogenous microcin C7	NA|426aa|down_8|NC_016109.1_7228090_7229368_-	cd17329, MFS_MdtH_MDR_like, Multidrug resistance protein MdtH and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|337aa|down_9|NC_016109.1_7229427_7230438_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors
GCF_000269985.1_ASM26998v1	NC_016109	Kitasatospora setae KM-6054, complete genome	9	7272977-7274053	9,3,3	CRISPRCasFinder,CRT,PILER-CR	no	cse2gr11,cas7,cas5,cas6e,cas1,cas2	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	Type I-E	GTCCTCCCCGCGCGAGCGGGGGTCTACCGG,GTCCTCCCCGCGCGAGCGGGGGTCTACCG,GTCCTCCCCGCGCGAGCGGGGGTCTACC	30,29,28	0	0	NA	NA	NA:NA:NA	17,17,15	17	TypeI-E	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,cas3,csa3,DEDDh,PD-DExK,RT,DinG,cas4	NA,NA	NA|356aa|up_9|NC_016109.1_7261311_7262379_-	pfam01636, APH, Phosphotransferase enzyme family	NA|940aa|up_8|NC_016109.1_7262375_7265195_-	COG0561, Cof, Predicted hydrolases of the HAD superfamily [General function prediction only]	NA|215aa|up_7|NC_016109.1_7265605_7266250_-	PRK00393, ribA, GTP cyclohydrolase II RibA	NA|525aa|up_6|NC_016109.1_7266773_7268348_-	pfam12770, CHAT, CHAT domain	cse2gr11|94aa|up_5|NC_016109.1_7268614_7268896_+	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas7|389aa|up_4|NC_016109.1_7268970_7270137_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|271aa|up_3|NC_016109.1_7270133_7270946_+	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas6e|219aa|up_2|NC_016109.1_7270942_7271599_+	pfam08798, CRISPR_assoc, CRISPR associated protein	cas1|308aa|up_1|NC_016109.1_7271673_7272597_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|107aa|up_0|NC_016109.1_7272634_7272955_+	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	NA|308aa|down_0|NC_016109.1_7274747_7275671_+	cd05289, MDR_like_2, alcohol dehydrogenase and quinone reductase-like medium chain degydrogenases/reductases	NA|259aa|down_1|NC_016109.1_7275667_7276444_+	cd07247, SgaA_N_like, N-terminal domain of Streptomyces griseus SgaA and similar domains	NA|522aa|down_2|NC_016109.1_7276475_7278041_-	COG0815, Lnt, Apolipoprotein N-acyltransferase [Cell envelope biogenesis, outer membrane]	NA|1203aa|down_3|NC_016109.1_7278215_7281824_-	PRK05989, cobN, cobaltochelatase subunit CobN; Reviewed	NA|442aa|down_4|NC_016109.1_7282476_7283802_+	TIGR02435, Precorrin-3B_synthase, precorrin-3B synthase	NA|209aa|down_5|NC_016109.1_7283798_7284425_+	PRK08285, cobH, precorrin-8X methylmutase; Reviewed	NA|524aa|down_6|NC_016109.1_7284421_7285993_+	cd11646, Precorrin_3B_C17_MT, Precorrin-3B C(17)-methyltransferase (also named CobJ or CbiH)	NA|247aa|down_7|NC_016109.1_7286012_7286753_-	PRK08057, PRK08057, cobalt-precorrin-6x reductase; Reviewed	NA|408aa|down_8|NC_016109.1_7286886_7288110_+	PRK00075, cbiD, cobalt-precorrin-6A synthase; Reviewed	NA|137aa|down_9|NC_016109.1_7288183_7288594_-	cd00448, YjgF_YER057c_UK114_family, YjgF, YER057c, and UK114 belong to a large family of proteins present in bacteria, archaea, and eukaryotes with no definitive function
