assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000400635.2_ASM40063v2	NC_021252	Amycolatopsis keratiniphila, complete sequence	1	48862-49263	1	CRT	no		csa3,RT,WYL,cas4,DEDDh,DinG,casR,cas3	Orphan	NNNGCCGNNGNNCTGGTCGTA	21	1	3	49228-49245|49228-49245|49228-49245	NC_021252.1_1147175-1147192|NC_021252.1_2115333-2115350|NC_021252.1_8715769-8715786	NA	9	9	Orphan	csa3,RT,WYL,cas4,DEDDh,DinG,casR,cas3	NA|215aa|up_7|NC_021252.1_38902_39547_+,NA|111aa|down_2|NC_021252.1_54969_55302_+	NA|215aa|up_9|NC_021252.1_37467_38112_+	PRK07765, PRK07765, aminodeoxychorismate/anthranilate synthase component II	NA|266aa|up_8|NC_021252.1_38117_38915_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|215aa|up_7|NC_021252.1_38902_39547_+	NA	NA|181aa|up_6|NC_021252.1_39546_40089_+	pfam13671, AAA_33, AAA domain	NA|653aa|up_5|NC_021252.1_40317_42276_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|436aa|up_4|NC_021252.1_42272_43580_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|488aa|up_3|NC_021252.1_43583_45047_-	COG0768, FtsI, Cell division protein FtsI/penicillin-binding protein 2 [Cell envelope biogenesis, outer membrane]	NA|493aa|up_2|NC_021252.1_45043_46522_-	COG0772, FtsW, Bacterial cell division membrane protein [Cell division and chromosome partitioning]	NA|466aa|up_1|NC_021252.1_46521_47919_-	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|156aa|up_0|NC_021252.1_47915_48383_-	COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms]	NA|125aa|down_0|NC_021252.1_50140_50515_-	COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1 [Posttranslational modification, protein turnover, chaperones]	NA|347aa|down_1|NC_021252.1_53919_54960_+	cd00614, CGS_like, CGS_like: Cystathionine gamma-synthase is a PLP dependent enzyme and catalyzes the committed step of methionine biosynthesis	NA|111aa|down_2|NC_021252.1_54969_55302_+	NA	NA|222aa|down_3|NC_021252.1_55751_56417_+	PRK11907, PRK11907, bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase	NA|201aa|down_4|NC_021252.1_56420_57023_+	cd05829, Sortase_F, Sortase domain found in the class F family of sortases	NA|335aa|down_5|NC_021252.1_57095_58100_-	COG1752, RssA, Predicted esterase of the alpha-beta hydrolase superfamily [General function prediction only]	NA|248aa|down_6|NC_021252.1_58172_58916_-	COG2186, FadR, Transcriptional regulators [Transcription]	NA|497aa|down_7|NC_021252.1_58976_60467_+	cd08494, PBP2_NikA_DppA_OppA_like_6, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|315aa|down_8|NC_021252.1_60463_61408_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|271aa|down_9|NC_021252.1_61407_62220_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]
GCF_000400635.2_ASM40063v2	NC_021252	Amycolatopsis keratiniphila, complete sequence	2	483438-483542	1	CRISPRCasFinder	no		csa3,RT,WYL,cas4,DEDDh,DinG,casR,cas3	Orphan	GGTCAGTTCGTGCGGCCCACGCCGCCGCAGGAGCAG	36	0	0	NA	NA	NA	1	1	Orphan	csa3,RT,WYL,cas4,DEDDh,DinG,casR,cas3	NA|306aa|up_4|NC_021252.1_477988_478906_-,NA|216aa|up_3|NC_021252.1_478902_479550_-,NA|34aa|down_4|NC_021252.1_487783_487885_+	NA|271aa|up_9|NC_021252.1_472859_473672_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|435aa|up_8|NC_021252.1_473693_474998_+	TIGR03449, mycothiol_MshA, D-inositol-3-phosphate glycosyltransferase	NA|172aa|up_7|NC_021252.1_474994_475510_+	pfam10722, YbjN, Putative bacterial sensory transduction regulator	NA|305aa|up_6|NC_021252.1_475578_476493_+	pfam14257, DUF4349, Domain of unknown function (DUF4349)	NA|250aa|up_5|NC_021252.1_477151_477901_+	PRK14120, gpmA, phosphoglyceromutase; Provisional	NA|306aa|up_4|NC_021252.1_477988_478906_-	NA	NA|216aa|up_3|NC_021252.1_478902_479550_-	NA	NA|411aa|up_2|NC_021252.1_479822_481055_+	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|230aa|up_1|NC_021252.1_481051_481741_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|330aa|up_0|NC_021252.1_481946_482936_+	COG0248, GppA, Exopolyphosphatase [Nucleotide transport and metabolism / Inorganic ion transport and metabolism]	NA|266aa|down_0|NC_021252.1_484605_485403_+	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]	NA|277aa|down_1|NC_021252.1_485544_486375_+	pfam13622, 4HBT_3, Thioesterase-like superfamily	NA|270aa|down_2|NC_021252.1_486399_487209_+	PRK11880, PRK11880, pyrroline-5-carboxylate reductase; Reviewed	NA|72aa|down_3|NC_021252.1_487422_487638_+	TIGR01764, Probable_excisionase, DNA binding domain, excisionase family	NA|34aa|down_4|NC_021252.1_487783_487885_+	NA	NA|350aa|down_5|NC_021252.1_488171_489221_+	cd05240, UDP_G4E_3_SDR_e, UDP-glucose 4 epimerase (G4E), subgroup 3, extended (e) SDRs	NA|364aa|down_6|NC_021252.1_489217_490309_+	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|301aa|down_7|NC_021252.1_490438_491341_-	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|364aa|down_8|NC_021252.1_491437_492529_-	PRK14971, PRK14971, DNA polymerase III subunit gamma/tau	NA|233aa|down_9|NC_021252.1_492638_493337_-	TIGR02952, RNA_polymerase_sigma-70_factor_ECF_subfamily, RNA polymerase sigma-70 factor, TIGR02952 family
GCF_000400635.2_ASM40063v2	NC_021252	Amycolatopsis keratiniphila, complete sequence	3	1238420-1238500	2	CRISPRCasFinder	no		csa3,RT,WYL,cas4,DEDDh,DinG,casR,cas3	Orphan	GGGAGCAAGGGACCTTTGCTACC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,RT,WYL,cas4,DEDDh,DinG,casR,cas3	NA,NA	NA|325aa|up_9|NC_021252.1_1223126_1224101_-	TIGR04247, nitrous_oxide_maturation_protein_NosD, nitrous oxide reductase family maturation protein NosD	NA|292aa|up_8|NC_021252.1_1224507_1225383_-	PRK00311, panB, 3-methyl-2-oxobutanoate hydroxymethyltransferase; Reviewed	NA|1106aa|up_7|NC_021252.1_1225627_1228945_+	PRK02983, lysS, bifunctional lysylphosphatidylglycerol synthetase/lysine--tRNA ligase LysX	NA|196aa|up_6|NC_021252.1_1229039_1229627_+	pfam04978, DUF664, Protein of unknown function (DUF664)	NA|179aa|up_5|NC_021252.1_1229639_1230176_-	COG0350, Ada, Methylated DNA-protein cysteine methyltransferase [DNA replication, recombination, and repair]	NA|209aa|up_4|NC_021252.1_1230266_1230893_+	pfam13468, Glyoxalase_3, Glyoxalase-like domain	NA|586aa|up_3|NC_021252.1_1230863_1232621_-	PRK13981, PRK13981, NAD synthetase; Provisional	NA|776aa|up_2|NC_021252.1_1232770_1235098_-	PRK12326, PRK12326, preprotein translocase subunit SecA; Reviewed	NA|458aa|up_1|NC_021252.1_1235445_1236819_+	pfam08386, Abhydrolase_4, TAP-like protein	NA|502aa|up_0|NC_021252.1_1236910_1238416_+	pfam08386, Abhydrolase_4, TAP-like protein	NA|422aa|down_0|NC_021252.1_1239252_1240518_+	cd17355, MFS_YcxA_like, MFS-type transporter YcxA and similar proteins of the Major Facilitator Superfamily of transporters	NA|448aa|down_1|NC_021252.1_1240586_1241930_+	COG0174, GlnA, Glutamine synthetase [Amino acid transport and metabolism]	NA|521aa|down_2|NC_021252.1_1241944_1243507_+	PRK08137, PRK08137, amidase; Provisional	NA|149aa|down_3|NC_021252.1_1249205_1249652_-	COG1522, Lrp, Transcriptional regulators [Transcription]	NA|151aa|down_4|NC_021252.1_1249759_1250212_+	cd01521, RHOD_PspE2, Member of the Rhodanese Homology Domain superfamily	NA|252aa|down_5|NC_021252.1_1250291_1251047_+	cd01741, GATase1_1, Subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain	NA|377aa|down_6|NC_021252.1_1251140_1252271_+	pfam01384, PHO4, Phosphate transporter family	NA|991aa|down_7|NC_021252.1_1252344_1255317_+	PRK14109, PRK14109, bifunctional [glutamine synthetase] adenylyltransferase/[glutamine synthetase]-adenylyl-L-tyrosine phosphorylase	NA|145aa|down_8|NC_021252.1_1255445_1255880_+	cd00586, 4HBT, 4-hydroxybenzoyl-CoA thioesterase (4HBT)	NA|475aa|down_9|NC_021252.1_1255971_1257396_-	TIGR00653, Glutamine_synthetase, glutamine synthetase, type I
GCF_000400635.2_ASM40063v2	NC_021252	Amycolatopsis keratiniphila, complete sequence	4	3015840-3015938	3	CRISPRCasFinder	no		csa3,RT,WYL,cas4,DEDDh,DinG,casR,cas3	Orphan	CGCATTTAGAGGACTAAACGCGT	23	0	0	NA	NA	NA	1	1	Orphan	csa3,RT,WYL,cas4,DEDDh,DinG,casR,cas3	NA,NA|390aa|down_1|NC_021252.1_3017142_3018312_+	NA|110aa|up_9|NC_021252.1_3005733_3006063_+	PRK02237, PRK02237, YnfA family protein	NA|267aa|up_8|NC_021252.1_3006081_3006882_+	TIGR02227, Inactive_signal_peptidase_IA	NA|655aa|up_7|NC_021252.1_3006805_3008770_-	COG0661, AarF, Predicted unusual protein kinase [General function prediction only]	NA|406aa|up_6|NC_021252.1_3008783_3010001_-	cd17324, MFS_NepI_like, Purine ribonucleoside efflux pump NepI and similar transporters of the Major Facilitator Superfamily	NA|190aa|up_5|NC_021252.1_3010090_3010660_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|410aa|up_4|NC_021252.1_3010669_3011899_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|218aa|up_3|NC_021252.1_3011895_3012549_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|609aa|up_2|NC_021252.1_3012557_3014384_-	PRK05506, PRK05506, bifunctional sulfate adenylyltransferase subunit 1/adenylylsulfate kinase protein; Provisional	NA|303aa|up_1|NC_021252.1_3014383_3015292_-	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|156aa|up_0|NC_021252.1_3015369_3015837_+	pfam13577, SnoaL_4, SnoaL-like domain	NA|296aa|down_0|NC_021252.1_3016167_3017055_-	COG1414, IclR, Transcriptional regulator [Transcription]	NA|390aa|down_1|NC_021252.1_3017142_3018312_+	NA	NA|261aa|down_2|NC_021252.1_3018308_3019091_+	PRK07890, PRK07890, short chain dehydrogenase; Provisional	NA|387aa|down_3|NC_021252.1_3019275_3020436_+	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|561aa|down_4|NC_021252.1_3020511_3022194_-	cd01948, EAL, EAL domain	NA|376aa|down_5|NC_021252.1_3022266_3023394_-	TIGR01790, Uncharacterized_carotenoid_cyclase_DR_0801, lycopene cyclase family protein	NA|521aa|down_6|NC_021252.1_3023437_3025000_+	TIGR01353, dGTP_triPase, deoxyguanosinetriphosphate triphosphohydrolase, putative	NA|957aa|down_7|NC_021252.1_3025213_3028084_+	cd05562, Peptidases_S53_like, Peptidase domain in the S53 family	NA|223aa|down_8|NC_021252.1_3028309_3028978_+	pfam18859, acVLRF1, Actinobacteria/chloroflexi VLRF1 release factor	NA|543aa|down_9|NC_021252.1_3029120_3030749_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]
GCF_000400635.2_ASM40063v2	NC_021252	Amycolatopsis keratiniphila, complete sequence	5	4509171-4509263	4	CRISPRCasFinder	no	csa3	csa3,RT,WYL,cas4,DEDDh,DinG,casR,cas3	Type I-A	TTCGGACCCACGATGCCGTCGGCC	24	0	0	NA	NA	NA	1	1	Orphan	csa3,RT,WYL,cas4,DEDDh,DinG,casR,cas3	NA|338aa|up_6|NC_021252.1_4502321_4503335_+,NA|74aa|up_5|NC_021252.1_4503331_4503553_+,NA|154aa|down_8|NC_021252.1_4520339_4520801_+	NA|280aa|up_9|NC_021252.1_4499220_4500060_-	PRK12554, PRK12554, undecaprenyl pyrophosphate phosphatase; Reviewed	NA|349aa|up_8|NC_021252.1_4500059_4501106_-	TIGR03559, F420_Rv3520c, probable F420-dependent oxidoreductase, Rv3520c family	NA|322aa|up_7|NC_021252.1_4501242_4502208_+	cd19087, AKR_AKR12A1_B1_C1, AKR12A, AKR12B,  AKR12C families of aldo-keto reductase (AKR)	NA|338aa|up_6|NC_021252.1_4502321_4503335_+	NA	NA|74aa|up_5|NC_021252.1_4503331_4503553_+	NA	NA|249aa|up_4|NC_021252.1_4503826_4504573_-	cd07814, SRPBCC_CalC_Aha1-like, Putative hydrophobic ligand-binding SRPBCC domain of Micromonospora echinospora CalC, human Aha1, and related proteins	csa3|197aa|up_3|NC_021252.1_4504573_4505164_-	pfam12840, HTH_20, Helix-turn-helix domain	NA|293aa|up_2|NC_021252.1_4505204_4506083_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|441aa|up_1|NC_021252.1_4506082_4507405_-	PRK07906, PRK07906, hypothetical protein; Provisional	NA|324aa|up_0|NC_021252.1_4507517_4508489_-	COG3409, COG3409, Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]	NA|565aa|down_0|NC_021252.1_4509578_4511273_+	pfam08924, DUF1906, Domain of unknown function (DUF1906)	NA|534aa|down_1|NC_021252.1_4511347_4512949_+	smart00191, Int_alpha, Integrin alpha (beta-propellor repeats)	NA|509aa|down_2|NC_021252.1_4512959_4514486_+	COG3540, PhoD, Phosphodiesterase/alkaline phosphatase D [Inorganic ion transport and metabolism]	NA|321aa|down_3|NC_021252.1_4514536_4515499_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|333aa|down_4|NC_021252.1_4515749_4516748_+	COG3409, COG3409, Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]	NA|390aa|down_5|NC_021252.1_4516744_4517914_+	pfam07335, Glyco_hydro_75, Fungal chitosanase of glycosyl hydrolase group 75	NA|219aa|down_6|NC_021252.1_4517950_4518607_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|368aa|down_7|NC_021252.1_4518612_4519716_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|154aa|down_8|NC_021252.1_4520339_4520801_+	NA	NA|411aa|down_9|NC_021252.1_4522052_4523285_-	pfam13336, AcetylCoA_hyd_C, Acetyl-CoA hydrolase/transferase C-terminal domain
GCF_000400635.2_ASM40063v2	NC_021252	Amycolatopsis keratiniphila, complete sequence	6	5398768-5398854	5	CRISPRCasFinder	no	WYL	csa3,RT,WYL,cas4,DEDDh,DinG,casR,cas3	Unclear	CGCCGCCTGATCACGCATGATCGC	24	0	0	NA	NA	NA	1	1	Orphan	csa3,RT,WYL,cas4,DEDDh,DinG,casR,cas3	NA|72aa|up_9|NC_021252.1_5388279_5388495_-,NA|90aa|up_0|NC_021252.1_5398433_5398703_+,NA	NA|72aa|up_9|NC_021252.1_5388279_5388495_-	NA	NA|269aa|up_8|NC_021252.1_5388635_5389442_+	cd05269, TMR_SDR_a, triphenylmethane reductase (TMR)-like proteins, NMRa-like, atypical (a) SDRs	NA|159aa|up_7|NC_021252.1_5389944_5390421_-	COG1959, COG1959, Predicted transcriptional regulator [Transcription]	NA|244aa|up_6|NC_021252.1_5390534_5391266_+	cd05243, SDR_a5, atypical (a) SDRs, subgroup 5	NA|571aa|up_5|NC_021252.1_5391319_5393032_-	COG0578, GlpA, Glycerol-3-phosphate dehydrogenase [Energy production and conversion]	NA|506aa|up_4|NC_021252.1_5393037_5394555_-	PRK00047, glpK, glycerol kinase GlpK	NA|274aa|up_3|NC_021252.1_5394566_5395388_-	COG0580, GlpF, Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) [Carbohydrate transport and metabolism]	NA|256aa|up_2|NC_021252.1_5395539_5396307_+	COG1414, IclR, Transcriptional regulator [Transcription]	NA|651aa|up_1|NC_021252.1_5396320_5398273_+	cd07789, FGGY_CsGK_like, Cellulomonas sp	NA|90aa|up_0|NC_021252.1_5398433_5398703_+	NA	NA|533aa|down_0|NC_021252.1_5398969_5400568_+	cd00839, MPP_PAPs, purple acid phosphatases of the metallophosphatase superfamily, metallophosphatase domain	NA|269aa|down_1|NC_021252.1_5400647_5401454_+	pfam13354, Beta-lactamase2, Beta-lactamase enzyme family	NA|212aa|down_2|NC_021252.1_5401410_5402046_-	cd04778, HTH_MerR-like_sg2, Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily	NA|288aa|down_3|NC_021252.1_5402390_5403254_+	pfam04185, Phosphoesterase, Phosphoesterase family	NA|420aa|down_4|NC_021252.1_5403320_5404580_+	cd17370, MFS_MJ1317_like, MJ1317 and similar transporters of the Major Facilitator Superfamily	NA|330aa|down_5|NC_021252.1_5404576_5405566_+	COG0823, TolB, Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]	NA|179aa|down_6|NC_021252.1_5405587_5406124_+	cd00431, cysteine_hydrolases, Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle	NA|167aa|down_7|NC_021252.1_5406136_5406637_+	pfam04892, VanZ, VanZ like family	NA|564aa|down_8|NC_021252.1_5406600_5408292_-	pfam01593, Amino_oxidase, Flavin containing amine oxidoreductase	NA|252aa|down_9|NC_021252.1_5408288_5409044_-	cd07576, R-amidase_like, Pseudomonas sp
