assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000284035.1_ASM28403v1	NC_016887	Nocardia cyriacigeorgica GUH-2, complete genome	1	2996733-2996838	1	CRISPRCasFinder	no		cas3,csa3,Cas14u_CAS-V,WYL,c2c9_V-U4,DinG,DEDDh,cas4	Orphan	TTCGGGGTCGAGTTCGGGGTCGGCG	25	1	1	2996758-2996813	NC_016887.1_1794265-1794210	NA	1	1	Orphan	cas3,csa3,Cas14u_CAS-V,WYL,c2c9_V-U4,DinG,DEDDh,cas4	NA|329aa|up_4|NC_016887.1_2991431_2992418_+,NA|102aa|down_1|NC_016887.1_2999718_3000024_+,NA|153aa|down_4|NC_016887.1_3002443_3002902_-	NA|415aa|up_9|NC_016887.1_2984041_2985286_+	pfam03583, LIP, Secretory lipase	NA|476aa|up_8|NC_016887.1_2985500_2986928_+	pfam03583, LIP, Secretory lipase	NA|355aa|up_7|NC_016887.1_2987071_2988136_+	PRK13693, PRK13693, (3R)-hydroxyacyl-ACP dehydratase subunit HadB; Provisional	NA|317aa|up_6|NC_016887.1_2988222_2989173_+	pfam13560, HTH_31, Helix-turn-helix domain	NA|517aa|up_5|NC_016887.1_2989379_2990930_-	COG4799, COG4799, Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism]	NA|329aa|up_4|NC_016887.1_2991431_2992418_+	NA	NA|317aa|up_3|NC_016887.1_2992464_2993415_+	cd01050, Acyl_ACP_Desat, Acyl ACP desaturase, ferritin-like diiron-binding domain	NA|350aa|up_2|NC_016887.1_2993411_2994461_+	COG0627, COG0627, Predicted esterase [General function prediction only]	NA|346aa|up_1|NC_016887.1_2994464_2995502_-	PRK13693, PRK13693, (3R)-hydroxyacyl-ACP dehydratase subunit HadB; Provisional	NA|211aa|up_0|NC_016887.1_2995545_2996178_-	cd06259, YdcF-like, YdcF-like	NA|414aa|down_0|NC_016887.1_2998441_2999683_+	pfam09995, DUF2236, Uncharacterized protein conserved in bacteria (DUF2236)	NA|102aa|down_1|NC_016887.1_2999718_3000024_+	NA	NA|418aa|down_2|NC_016887.1_3000037_3001291_-	cd01076, NAD_bind_1_Glu_DH, NAD(P) binding domain of glutamate dehydrogenase, subgroup 1	NA|318aa|down_3|NC_016887.1_3001486_3002440_+	cd08423, PBP2_LTTR_like_6, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold	NA|153aa|down_4|NC_016887.1_3002443_3002902_-	NA	NA|340aa|down_5|NC_016887.1_3003156_3004176_+	COG0627, COG0627, Predicted esterase [General function prediction only]	NA|159aa|down_6|NC_016887.1_3004349_3004826_+	cd07501, HAD_MDP-1_like, eukaryotic hypothetical phosphotyrosine phosphatase MDP-1 and related phosphatases, similar to Bacillus cereus phosphonoacetaldehyde hydrolase and Streptomyces FkbH	NA|532aa|down_7|NC_016887.1_3004900_3006496_-	PRK02106, PRK02106, choline dehydrogenase; Validated	NA|577aa|down_8|NC_016887.1_3006641_3008372_-	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|230aa|down_9|NC_016887.1_3008472_3009162_+	cd00592, HTH_MerR-like, Helix-Turn-Helix DNA binding domain of MerR-like transcription regulators
GCF_000284035.1_ASM28403v1	NC_016887	Nocardia cyriacigeorgica GUH-2, complete genome	2	3956326-3956660	2,1,1	CRISPRCasFinder,CRT,PILER-CR	no		cas3,csa3,Cas14u_CAS-V,WYL,c2c9_V-U4,DinG,DEDDh,cas4	Orphan	GGGCTCATCCCCGCGCGTGCGGGGAGCGCA,GGGCTCATCCCCGCGCGTGCGGGGAGCGC,GGGCTCATCCCCGCGCGTGCGGGGAGCGCA	30,29,30	0	0	NA	NA	I-E:I-E:I-E	5,5,4	5	Orphan	cas3,csa3,Cas14u_CAS-V,WYL,c2c9_V-U4,DinG,DEDDh,cas4	NA|99aa|up_9|NC_016887.1_3945151_3945448_-,NA|135aa|up_8|NC_016887.1_3945509_3945914_-,NA|214aa|up_7|NC_016887.1_3945910_3946552_-,NA|304aa|up_5|NC_016887.1_3947439_3948351_-,NA|170aa|up_2|NC_016887.1_3953242_3953752_-,NA|617aa|up_1|NC_016887.1_3953748_3955599_-,NA|96aa|up_0|NC_016887.1_3955598_3955886_-,NA	NA|99aa|up_9|NC_016887.1_3945151_3945448_-	NA	NA|135aa|up_8|NC_016887.1_3945509_3945914_-	NA	NA|214aa|up_7|NC_016887.1_3945910_3946552_-	NA	NA|292aa|up_6|NC_016887.1_3946548_3947424_-	cd02036, MinD, septum site-determining protein MinD	NA|304aa|up_5|NC_016887.1_3947439_3948351_-	NA	NA|563aa|up_4|NC_016887.1_3948408_3950097_-	pfam01083, Cutinase, Cutinase	NA|804aa|up_3|NC_016887.1_3950343_3952755_-	TIGR00691, PppGpp_synthetase	NA|170aa|up_2|NC_016887.1_3953242_3953752_-	NA	NA|617aa|up_1|NC_016887.1_3953748_3955599_-	NA	NA|96aa|up_0|NC_016887.1_3955598_3955886_-	NA	NA|488aa|down_0|NC_016887.1_3957121_3958585_+	COG1672, COG1672, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|191aa|down_1|NC_016887.1_3959552_3960125_-	PRK02304, PRK02304, adenine phosphoribosyltransferase; Provisional	NA|577aa|down_2|NC_016887.1_3960163_3961894_-	cd08501, PBP2_Lpqw, The substrate-binding domain of mycobacterial lipoprotein Lpqw contains type 2 periplasmic binding fold	NA|397aa|down_3|NC_016887.1_3961911_3963102_-	PRK13022, secF, protein translocase subunit SecF	NA|587aa|down_4|NC_016887.1_3963098_3964859_-	PRK05812, secD, preprotein translocase subunit SecD; Reviewed	NA|119aa|down_5|NC_016887.1_3964968_3965325_-	pfam02699, YajC, Preprotein translocase subunit	NA|497aa|down_6|NC_016887.1_3965509_3967000_-	COG2072, TrkA, Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism]	NA|222aa|down_7|NC_016887.1_3967036_3967702_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|193aa|down_8|NC_016887.1_3967763_3968342_+	TIGR03543, divI1A_rptt_fam, DivIVA domain repeat protein	NA|359aa|down_9|NC_016887.1_3968376_3969453_-	PRK00080, ruvB, Holliday junction branch migration DNA helicase RuvB
GCF_000284035.1_ASM28403v1	NC_016887	Nocardia cyriacigeorgica GUH-2, complete genome	3	3958679-3959470	3,2,2	CRISPRCasFinder,CRT,PILER-CR	no	csa3	cas3,csa3,Cas14u_CAS-V,WYL,c2c9_V-U4,DinG,DEDDh,cas4	Type I-A	GGGCTCATCCCCGCGCGTGCGGGGAGCAC,GGGCTCANTCCCCGCGCGTGCGGGGAGCAC,GGGCTCATCCCCGCGCGTGCGGGGAGCAC	29,30,29	1	1	3959412-3959441	NC_016887.1_5892033-5892004	I-E:NA:I-E	12,11,9	12	Orphan	cas3,csa3,Cas14u_CAS-V,WYL,c2c9_V-U4,DinG,DEDDh,cas4	NA|135aa|up_9|NC_016887.1_3945509_3945914_-,NA|214aa|up_8|NC_016887.1_3945910_3946552_-,NA|304aa|up_6|NC_016887.1_3947439_3948351_-,NA|170aa|up_3|NC_016887.1_3953242_3953752_-,NA|617aa|up_2|NC_016887.1_3953748_3955599_-,NA|96aa|up_1|NC_016887.1_3955598_3955886_-,NA	NA|135aa|up_9|NC_016887.1_3945509_3945914_-	NA	NA|214aa|up_8|NC_016887.1_3945910_3946552_-	NA	NA|292aa|up_7|NC_016887.1_3946548_3947424_-	cd02036, MinD, septum site-determining protein MinD	NA|304aa|up_6|NC_016887.1_3947439_3948351_-	NA	NA|563aa|up_5|NC_016887.1_3948408_3950097_-	pfam01083, Cutinase, Cutinase	NA|804aa|up_4|NC_016887.1_3950343_3952755_-	TIGR00691, PppGpp_synthetase	NA|170aa|up_3|NC_016887.1_3953242_3953752_-	NA	NA|617aa|up_2|NC_016887.1_3953748_3955599_-	NA	NA|96aa|up_1|NC_016887.1_3955598_3955886_-	NA	NA|488aa|up_0|NC_016887.1_3957121_3958585_+	COG1672, COG1672, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|191aa|down_0|NC_016887.1_3959552_3960125_-	PRK02304, PRK02304, adenine phosphoribosyltransferase; Provisional	NA|577aa|down_1|NC_016887.1_3960163_3961894_-	cd08501, PBP2_Lpqw, The substrate-binding domain of mycobacterial lipoprotein Lpqw contains type 2 periplasmic binding fold	NA|397aa|down_2|NC_016887.1_3961911_3963102_-	PRK13022, secF, protein translocase subunit SecF	NA|587aa|down_3|NC_016887.1_3963098_3964859_-	PRK05812, secD, preprotein translocase subunit SecD; Reviewed	NA|119aa|down_4|NC_016887.1_3964968_3965325_-	pfam02699, YajC, Preprotein translocase subunit	NA|497aa|down_5|NC_016887.1_3965509_3967000_-	COG2072, TrkA, Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism]	NA|222aa|down_6|NC_016887.1_3967036_3967702_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|193aa|down_7|NC_016887.1_3967763_3968342_+	TIGR03543, divI1A_rptt_fam, DivIVA domain repeat protein	NA|359aa|down_8|NC_016887.1_3968376_3969453_-	PRK00080, ruvB, Holliday junction branch migration DNA helicase RuvB	NA|202aa|down_9|NC_016887.1_3969459_3970065_-	PRK00116, ruvA, Holliday junction branch migration protein RuvA
GCF_000284035.1_ASM28403v1	NC_016887	Nocardia cyriacigeorgica GUH-2, complete genome	4	4036123-4036238	4	CRISPRCasFinder	no		cas3,csa3,Cas14u_CAS-V,WYL,c2c9_V-U4,DinG,DEDDh,cas4	Orphan	GCCAAGAAAGCCGCCGCCAAGAAGGC	26	1	1	4036194-4036212	NC_016887.1_4326928-4326910	NA	2	2	Orphan	cas3,csa3,Cas14u_CAS-V,WYL,c2c9_V-U4,DinG,DEDDh,cas4	NA|93aa|up_3|NC_016887.1_4033070_4033349_-,NA	NA|135aa|up_9|NC_016887.1_4029222_4029627_-	cd04488, RecG_wedge_OBF, RecG_wedge_OBF: A subfamily of OB folds corresponding to the OB fold found in the N-terminal (wedge) domain of Escherichia coli RecG	NA|232aa|up_8|NC_016887.1_4029751_4030447_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|267aa|up_7|NC_016887.1_4030529_4031330_-	pfam12502, DUF3710, Protein of unknown function (DUF3710)	NA|167aa|up_6|NC_016887.1_4031478_4031979_-	PRK00601, dut, dUTP diphosphatase	NA|172aa|up_5|NC_016887.1_4032053_4032569_+	pfam11292, DUF3093, Protein of unknown function (DUF3093)	NA|101aa|up_4|NC_016887.1_4032656_4032959_-	pfam13834, DUF4193, Domain of unknown function (DUF4193)	NA|93aa|up_3|NC_016887.1_4033070_4033349_-	NA	NA|226aa|up_2|NC_016887.1_4033332_4034010_+	pfam13399, LytR_C, LytR cell envelope-related transcriptional attenuator	NA|310aa|up_1|NC_016887.1_4034019_4034949_-	cd01639, IMPase, IMPase, inositol monophosphatase and related domains	NA|253aa|up_0|NC_016887.1_4035118_4035877_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|259aa|down_0|NC_016887.1_4037477_4038254_-	COG3300, COG3300, MHYT domain (predicted integral membrane sensor domain) [Signal transduction mechanisms]	NA|184aa|down_1|NC_016887.1_4038791_4039343_-	PRK05465, PRK05465, ethanolamine ammonia-lyase subunit EutC	NA|474aa|down_2|NC_016887.1_4039723_4041145_-	PRK15067, PRK15067, ethanolamine ammonia-lyase subunit EutB	NA|384aa|down_3|NC_016887.1_4041224_4042376_-	COG0475, KefB, Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]	NA|161aa|down_4|NC_016887.1_4042380_4042863_-	COG0490, COG0490, Putative regulatory, ligand-binding protein related to C-terminal domains of K+ channels [Inorganic ion transport and metabolism]	NA|405aa|down_5|NC_016887.1_4043087_4044302_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|216aa|down_6|NC_016887.1_4044298_4044946_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|120aa|down_7|NC_016887.1_4044960_4045320_-	pfam06108, DUF952, Protein of unknown function (DUF952)	NA|252aa|down_8|NC_016887.1_4045890_4046646_-	pfam02909, TetR_C, Tetracyclin repressor, C-terminal all-alpha domain	NA|586aa|down_9|NC_016887.1_4046776_4048534_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]
GCF_000284035.1_ASM28403v1	NC_016887	Nocardia cyriacigeorgica GUH-2, complete genome	5	6051233-6051339	5	CRISPRCasFinder	no	WYL	cas3,csa3,Cas14u_CAS-V,WYL,c2c9_V-U4,DinG,DEDDh,cas4	Unclear	GGCCGTGGTCCGCAGGGTGGGCCGCCGCC	29	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,Cas14u_CAS-V,WYL,c2c9_V-U4,DinG,DEDDh,cas4	NA,NA	NA|352aa|up_9|NC_016887.1_6040717_6041773_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|293aa|up_8|NC_016887.1_6041787_6042666_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|773aa|up_7|NC_016887.1_6042839_6045158_+	COG0744, MrcB, Membrane carboxypeptidase (penicillin-binding protein) [Cell envelope biogenesis, outer membrane]	NA|153aa|up_6|NC_016887.1_6045426_6045885_+	pfam14526, Cass2, Integron-associated effector binding protein	NA|142aa|up_5|NC_016887.1_6045895_6046321_+	smart00871, AraC_E_bind, Bacterial transcription activator, effector binding domain	NA|366aa|up_4|NC_016887.1_6046390_6047488_-	TIGR03450, mycothiol_INO1, inositol 1-phosphate synthase, Actinobacterial type	NA|186aa|up_3|NC_016887.1_6047480_6048038_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|270aa|up_2|NC_016887.1_6048263_6049073_-	pfam08044, DUF1707, Domain of unknown function (DUF1707)	NA|373aa|up_1|NC_016887.1_6049080_6050199_-	pfam08044, DUF1707, Domain of unknown function (DUF1707)	NA|140aa|up_0|NC_016887.1_6050439_6050859_+	pfam17249, DUF5318, Family of unknown function (DUF5318)	NA|826aa|down_0|NC_016887.1_6051493_6053971_+	COG0744, MrcB, Membrane carboxypeptidase (penicillin-binding protein) [Cell envelope biogenesis, outer membrane]	NA|574aa|down_1|NC_016887.1_6054000_6055722_+	COG5650, COG5650, Predicted integral membrane protein [Function unknown]	NA|517aa|down_2|NC_016887.1_6055787_6057338_-	cd06974, TerD_like, Uncharacterized proteins involved in stress response, similar to tellurium resistance terD	NA|450aa|down_3|NC_016887.1_6057359_6058709_-	pfam02342, TerD, TerD domain	NA|208aa|down_4|NC_016887.1_6058909_6059533_+	pfam01987, AIM24, Mitochondrial biogenesis AIM24	NA|233aa|down_5|NC_016887.1_6059538_6060237_+	pfam01987, AIM24, Mitochondrial biogenesis AIM24	NA|254aa|down_6|NC_016887.1_6060239_6061001_+	pfam01987, AIM24, Mitochondrial biogenesis AIM24	NA|364aa|down_7|NC_016887.1_6061053_6062145_-	cd02933, OYE_like_FMN, Old yellow enzyme (OYE)-like FMN binding domain	NA|121aa|down_8|NC_016887.1_6062207_6062570_+	cd01282, HTH_MerR-like_sg3, Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily	WYL|363aa|down_9|NC_016887.1_6062651_6063740_-	COG2378, COG2378, Predicted transcriptional regulator [Transcription]
GCF_000284035.1_ASM28403v1	NC_016887	Nocardia cyriacigeorgica GUH-2, complete genome	6	6124145-6124239	6	CRISPRCasFinder	no		cas3,csa3,Cas14u_CAS-V,WYL,c2c9_V-U4,DinG,DEDDh,cas4	Orphan	GCGGCTACCTCGCCTCCGCTCGG	23	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,Cas14u_CAS-V,WYL,c2c9_V-U4,DinG,DEDDh,cas4	NA|96aa|up_9|NC_016887.1_6113878_6114166_+,NA|277aa|up_8|NC_016887.1_6114427_6115258_+,NA|66aa|up_5|NC_016887.1_6118011_6118209_-,NA|68aa|up_2|NC_016887.1_6119803_6120007_+,NA|63aa|down_3|NC_016887.1_6128819_6129008_+	NA|96aa|up_9|NC_016887.1_6113878_6114166_+	NA	NA|277aa|up_8|NC_016887.1_6114427_6115258_+	NA	NA|646aa|up_7|NC_016887.1_6115273_6117211_+	PRK07803, sdhA, succinate dehydrogenase flavoprotein subunit; Reviewed	NA|249aa|up_6|NC_016887.1_6117213_6117960_+	PRK12386, PRK12386, fumarate reductase iron-sulfur subunit; Provisional	NA|66aa|up_5|NC_016887.1_6118011_6118209_-	NA	NA|77aa|up_4|NC_016887.1_6118276_6118507_-	pfam04149, DUF397, Domain of unknown function (DUF397)	NA|331aa|up_3|NC_016887.1_6118503_6119496_-	pfam13560, HTH_31, Helix-turn-helix domain	NA|68aa|up_2|NC_016887.1_6119803_6120007_+	NA	NA|316aa|up_1|NC_016887.1_6120039_6120987_-	COG0122, AlkA, 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair]	NA|932aa|up_0|NC_016887.1_6121334_6124130_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|627aa|down_0|NC_016887.1_6124523_6126404_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|331aa|down_1|NC_016887.1_6126631_6127624_-	TIGR02823, oxido_YhdH, putative quinone oxidoreductase, YhdH/YhfP family	NA|354aa|down_2|NC_016887.1_6127659_6128721_-	PRK03352, PRK03352, DNA polymerase IV; Validated	NA|63aa|down_3|NC_016887.1_6128819_6129008_+	NA	NA|107aa|down_4|NC_016887.1_6129004_6129325_+	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|147aa|down_5|NC_016887.1_6129353_6129794_+	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|824aa|down_6|NC_016887.1_6129935_6132407_-	COG0466, Lon, ATP-dependent Lon protease, bacterial type [Posttranslational modification, protein turnover, chaperones]	NA|333aa|down_7|NC_016887.1_6132680_6133679_+	pfam13354, Beta-lactamase2, Beta-lactamase enzyme family	NA|154aa|down_8|NC_016887.1_6133666_6134128_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|221aa|down_9|NC_016887.1_6134177_6134840_+	cd06259, YdcF-like, YdcF-like
