assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002356575.1_ASM235657v1	NZ_AP014835	Bacillus anthracis strain Shikan-NIID plasmid pXO2	1	34173-34493	1	PILER-CR	no			Orphan	AATTTACACATGTGTAAATGTGTAAATGTGTAAATGTGTAAA	42	0	0	NA	NA	NA	3	3	Orphan	cas3,cas14j,DEDDh,csa3,WYL,DinG,cas14k,c2c9_V-U4,RT,c2c4_V-U1	NA|923aa|up_9|NZ_AP014835.1_22362_25131_-,NA|110aa|up_8|NZ_AP014835.1_25256_25586_-,NA|79aa|up_7|NZ_AP014835.1_25636_25873_-,NA|63aa|up_5|NZ_AP014835.1_26842_27031_-,NA|177aa|up_4|NZ_AP014835.1_27020_27551_-,NA|61aa|up_2|NZ_AP014835.1_29235_29418_-,NA|255aa|up_1|NZ_AP014835.1_31115_31880_-,NA|129aa|down_1|NZ_AP014835.1_35386_35773_+,NA|78aa|down_2|NZ_AP014835.1_35971_36205_+,NA|66aa|down_4|NZ_AP014835.1_40270_40468_-,NA|110aa|down_5|NZ_AP014835.1_40486_40816_-	NA|923aa|up_9|NZ_AP014835.1_22362_25131_-	NA	NA|110aa|up_8|NZ_AP014835.1_25256_25586_-	NA	NA|79aa|up_7|NZ_AP014835.1_25636_25873_-	NA	NA|163aa|up_6|NZ_AP014835.1_26257_26746_-	pfam17631, DUF5512, Family of unknown function (DUF5512)	NA|63aa|up_5|NZ_AP014835.1_26842_27031_-	NA	NA|177aa|up_4|NZ_AP014835.1_27020_27551_-	NA	NA|101aa|up_3|NZ_AP014835.1_28057_28360_-	pfam17362, pXO2-34, Family of unknown function	NA|61aa|up_2|NZ_AP014835.1_29235_29418_-	NA	NA|255aa|up_1|NZ_AP014835.1_31115_31880_-	NA	NA|513aa|up_0|NZ_AP014835.1_32082_33621_-	pfam08708, PriCT_1, Primase C terminal 1 (PriCT-1)	NA|289aa|down_0|NZ_AP014835.1_34527_35394_+	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|129aa|down_1|NZ_AP014835.1_35386_35773_+	NA	NA|78aa|down_2|NZ_AP014835.1_35971_36205_+	NA	NA|532aa|down_3|NZ_AP014835.1_37413_39009_+	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|66aa|down_4|NZ_AP014835.1_40270_40468_-	NA	NA|110aa|down_5|NZ_AP014835.1_40486_40816_-	NA	NA|104aa|down_6|NZ_AP014835.1_41398_41710_-	PRK12856, PRK12856, hypothetical protein; Provisional	NA|222aa|down_7|NZ_AP014835.1_41758_42424_-	pfam02517, Abi, CAAX protease self-immunity	NA|155aa|down_8|NZ_AP014835.1_43134_43599_+	pfam04463, DUF523, Protein of unknown function (DUF523)	NA|197aa|down_9|NZ_AP014835.1_43975_44566_+	COG1309, AcrR, Transcriptional regulator [Transcription]
GCF_002356575.1_ASM235657v1	NZ_AP014833	Bacillus anthracis strain Shikan-NIID	1	2202612-2202713	1	CRISPRCasFinder	no	csa3	cas3,cas14j,DEDDh,csa3,WYL,DinG,cas14k,c2c9_V-U4	Type I-A	ACATTCTCAAAAGAAACCTAAATT	24	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,DEDDh,csa3,WYL,DinG,cas14k,c2c9_V-U4,RT,c2c4_V-U1	NA|218aa|up_0|NZ_AP014833.1_2201319_2201973_+,NA|77aa|down_5|NZ_AP014833.1_2207824_2208055_-,NA|210aa|down_9|NZ_AP014833.1_2214026_2214656_-	NA|116aa|up_9|NZ_AP014833.1_2192356_2192704_+	pfam14079, DUF4260, Domain of unknown function (DUF4260)	NA|176aa|up_8|NZ_AP014833.1_2192727_2193255_+	pfam04229, GrpB, GrpB protein	NA|200aa|up_7|NZ_AP014833.1_2193389_2193989_+	pfam04299, FMN_bind_2, Putative FMN-binding domain	NA|64aa|up_6|NZ_AP014833.1_2194062_2194254_-	TIGR04429, hypothetical_protein_bmyco0001_31490, Phr family secreted Rap phosphatase inhibitor	NA|489aa|up_5|NZ_AP014833.1_2195600_2197067_-	PRK00029, PRK00029, YdiU family protein	NA|333aa|up_4|NZ_AP014833.1_2197245_2198244_-	cd05289, MDR_like_2, alcohol dehydrogenase and quinone reductase-like medium chain degydrogenases/reductases	csa3|115aa|up_3|NZ_AP014833.1_2198262_2198607_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|540aa|up_2|NZ_AP014833.1_2199084_2200704_+	pfam01494, FAD_binding_3, FAD binding domain	NA|60aa|up_1|NZ_AP014833.1_2201035_2201215_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|218aa|up_0|NZ_AP014833.1_2201319_2201973_+	NA	NA|315aa|down_0|NZ_AP014833.1_2203276_2204221_+	cd08601, GDPD_SaGlpQ_like, Glycerophosphodiester phosphodiesterase domain of Staphylococcus aureus and similar proteins	NA|226aa|down_1|NZ_AP014833.1_2204270_2204948_-	cd06259, YdcF-like, YdcF-like	NA|170aa|down_2|NZ_AP014833.1_2205041_2205551_-	pfam13079, DUF3916, Protein of unknown function (DUF3916)	NA|284aa|down_3|NZ_AP014833.1_2205543_2206395_-	cd08442, PBP2_YofA_SoxR_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators, YofA and SoxR, contains the type 2 periplasmic binding fold	NA|415aa|down_4|NZ_AP014833.1_2206480_2207725_+	cd17475, MFS_MT3072_like, Mycobacterium tuberculosis uncharacterized MFS-type transporter MT3072 and similar transporters of the Major Facilitator Superfamily	NA|77aa|down_5|NZ_AP014833.1_2207824_2208055_-	NA	NA|378aa|down_6|NZ_AP014833.1_2208311_2209445_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|596aa|down_7|NZ_AP014833.1_2209480_2211268_-	cd09608, M3B_PepF, Peptidase family M3B, oligopeptidase F (PepF)	NA|514aa|down_8|NZ_AP014833.1_2212390_2213932_+	PRK09441, PRK09441, cytoplasmic alpha-amylase; Reviewed	NA|210aa|down_9|NZ_AP014833.1_2214026_2214656_-	NA
GCF_002356575.1_ASM235657v1	NZ_AP014833	Bacillus anthracis strain Shikan-NIID	2	2749219-2749416	2	CRISPRCasFinder	no		cas3,cas14j,DEDDh,csa3,WYL,DinG,cas14k,c2c9_V-U4	Orphan	AGGAATGAATATTCATTCCGAAA	23	0	0	NA	NA	NA	3	3	Orphan	cas3,cas14j,DEDDh,csa3,WYL,DinG,cas14k,c2c9_V-U4,RT,c2c4_V-U1	NA,NA	NA|178aa|up_9|NZ_AP014833.1_2734809_2735343_+	pfam13523, Acetyltransf_8, Acetyltransferase (GNAT) domain	NA|177aa|up_8|NZ_AP014833.1_2735359_2735890_+	cd01014, nicotinamidase_related, Nicotinamidase_ related amidohydrolases	NA|146aa|up_7|NZ_AP014833.1_2736038_2736476_-	pfam10710, DUF2512, Protein of unknown function (DUF2512)	NA|359aa|up_6|NZ_AP014833.1_2740967_2742044_+	PRK12595, PRK12595, bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase; Reviewed	NA|391aa|up_5|NZ_AP014833.1_2742328_2743501_+	PRK12463, PRK12463, chorismate synthase; Reviewed	NA|367aa|up_4|NZ_AP014833.1_2743519_2744620_+	PRK01533, PRK01533, histidinol-phosphate aminotransferase; Validated	NA|367aa|up_3|NZ_AP014833.1_2744612_2745713_+	PRK06545, PRK06545, prephenate dehydrogenase; Validated	NA|430aa|up_2|NZ_AP014833.1_2745729_2747019_+	PRK02427, PRK02427, 3-phosphoshikimate 1-carboxyvinyltransferase; Provisional	NA|386aa|up_1|NZ_AP014833.1_2747204_2748362_+	COG4552, Eis, Predicted acetyltransferase involved in intracellular survival and related acetyltransferases [General function prediction only]	NA|189aa|up_0|NZ_AP014833.1_2748631_2749198_+	pfam16295, TetR_C_10, Tetracycline repressor, C-terminal all-alpha domain	NA|808aa|down_0|NZ_AP014833.1_2749451_2751875_+	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|51aa|down_1|NZ_AP014833.1_2751875_2752028_+	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|191aa|down_2|NZ_AP014833.1_2751994_2752567_+	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|283aa|down_3|NZ_AP014833.1_2752852_2753701_+	PRK06761, PRK06761, hypothetical protein; Provisional	NA|542aa|down_4|NZ_AP014833.1_2754078_2755704_+	PRK15064, PRK15064, ABC transporter ATP-binding protein; Provisional	NA|658aa|down_5|NZ_AP014833.1_2756220_2758194_+	COG1368, MdoB, Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily [Cell envelope biogenesis, outer membrane]	NA|224aa|down_6|NZ_AP014833.1_2758253_2758925_-	TIGR03717, R_switched_YjbE, integral membrane protein, YjbE family	NA|221aa|down_7|NZ_AP014833.1_2759035_2759698_-	pfam12952, DUF3841, Domain of unknown function (DUF3841)	NA|276aa|down_8|NZ_AP014833.1_2759857_2760685_-	cd10944, CE4_SmPgdA_like, Catalytic NodB homology domain of Streptococcus mutans polysaccharide deacetylase PgdA, Bacillus subtilis YheN, and similar proteins	NA|87aa|down_9|NZ_AP014833.1_2760902_2761163_+	PRK10811, rne, ribonuclease E; Reviewed
GCF_002356575.1_ASM235657v1	NZ_AP014833	Bacillus anthracis strain Shikan-NIID	3	4292924-4293179	3	CRISPRCasFinder	no		cas3,cas14j,DEDDh,csa3,WYL,DinG,cas14k,c2c9_V-U4	Orphan	TTCTCATGCGGAAGGTAAACATACA	25	0	0	NA	NA	NA	3	3	Orphan	cas3,cas14j,DEDDh,csa3,WYL,DinG,cas14k,c2c9_V-U4,RT,c2c4_V-U1	NA,NA|120aa|down_3|NZ_AP014833.1_4296269_4296629_-,NA|58aa|down_7|NZ_AP014833.1_4299055_4299229_+	NA|246aa|up_9|NZ_AP014833.1_4282022_4282760_-	cd02538, G1P_TT_short, G1P_TT_short is the short form of glucose-1-phosphate thymidylyltransferase	NA|227aa|up_8|NZ_AP014833.1_4282774_4283455_-	pfam13712, Glyco_tranf_2_5, Glycosyltransferase like family	NA|229aa|up_7|NZ_AP014833.1_4283467_4284154_-	pfam13712, Glyco_tranf_2_5, Glycosyltransferase like family	NA|230aa|up_6|NZ_AP014833.1_4284150_4284840_-	pfam08242, Methyltransf_12, Methyltransferase domain	NA|371aa|up_5|NZ_AP014833.1_4284977_4286090_-	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|374aa|up_4|NZ_AP014833.1_4286255_4287377_+	pfam18573, BclA_C, BclA C-terminal domain	NA|282aa|up_3|NZ_AP014833.1_4287499_4288345_+	pfam05711, TylF, Macrocin-O-methyltransferase (TylF)	NA|256aa|up_2|NZ_AP014833.1_4288472_4289240_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|312aa|up_1|NZ_AP014833.1_4289246_4290182_+	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|247aa|up_0|NZ_AP014833.1_4291558_4292299_+	PRK13625, PRK13625, bis(5'-nucleosyl)-tetraphosphatase PrpE; Provisional	NA|298aa|down_0|NZ_AP014833.1_4293875_4294769_-	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|266aa|down_1|NZ_AP014833.1_4294784_4295582_-	PRK04885, ppnK, inorganic polyphosphate/ATP-NAD kinase; Provisional	NA|213aa|down_2|NZ_AP014833.1_4295600_4296239_-	COG2357, COG2357, PpGpp synthetase catalytic domain [General function prediction only]	NA|120aa|down_3|NZ_AP014833.1_4296269_4296629_-	NA	NA|193aa|down_4|NZ_AP014833.1_4296775_4297354_+	cd07762, CYTH-like_Pase_1, Uncharacterized subgroup 1 of the CYTH-like superfamily	NA|133aa|down_5|NZ_AP014833.1_4297534_4297933_+	cd14772, TrHb2_Bs-trHb-like_O, Truncated hemoglobins, group 2 (O); Bacillus subtilis TrHb like	NA|298aa|down_6|NZ_AP014833.1_4297932_4298826_+	pfam13743, Thioredoxin_5, Thioredoxin	NA|58aa|down_7|NZ_AP014833.1_4299055_4299229_+	NA	NA|609aa|down_8|NZ_AP014833.1_4299348_4301175_-	cd09608, M3B_PepF, Peptidase family M3B, oligopeptidase F (PepF)	NA|415aa|down_9|NZ_AP014833.1_4301225_4302470_-	COG4469, CoiA, Competence protein CoiA-like family, contains a predicted nuclease    domain [General function prediction only]
GCF_002356575.1_ASM235657v1	NZ_AP014833	Bacillus anthracis strain Shikan-NIID	4	4405591-4405684	4	CRISPRCasFinder	no		cas3,cas14j,DEDDh,csa3,WYL,DinG,cas14k,c2c9_V-U4	Orphan	TTTTACTATTTAACGTATTTAAACC	25	0	0	NA	NA	NA	1	1	Orphan	cas3,cas14j,DEDDh,csa3,WYL,DinG,cas14k,c2c9_V-U4,RT,c2c4_V-U1	NA|143aa|up_8|NZ_AP014833.1_4398070_4398499_-,NA|72aa|up_0|NZ_AP014833.1_4403444_4403660_-,NA|280aa|down_3|NZ_AP014833.1_4409945_4410785_+,NA|118aa|down_4|NZ_AP014833.1_4410781_4411135_-	NA|511aa|up_9|NZ_AP014833.1_4396376_4397909_-	PRK07656, PRK07656, long-chain-fatty-acid--CoA ligase; Validated	NA|143aa|up_8|NZ_AP014833.1_4398070_4398499_-	NA	NA|330aa|up_7|NZ_AP014833.1_4398753_4399743_-	TIGR00545, Probable_lipoate-protein_ligase_A, lipoyltransferase and lipoate-protein ligase	NA|245aa|up_6|NZ_AP014833.1_4399752_4400487_-	cd07716, RNaseZ_short-form-like_MBL-fold, uncharacterized bacterial subgroup of Ribonuclease Z, short form; MBL-fold metallo-hydrolase domain	NA|43aa|up_5|NZ_AP014833.1_4400666_4400795_+	pfam14149, YhfH, YhfH-like protein	NA|326aa|up_4|NZ_AP014833.1_4400870_4401848_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|168aa|up_3|NZ_AP014833.1_4402085_4402589_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|118aa|up_2|NZ_AP014833.1_4402722_4403076_+	pfam14470, bPH_3, Bacterial PH domain	NA|102aa|up_1|NZ_AP014833.1_4403102_4403408_-	pfam09860, DUF2087, Uncharacterized protein conserved in bacteria (DUF2087)	NA|72aa|up_0|NZ_AP014833.1_4403444_4403660_-	NA	NA|191aa|down_0|NZ_AP014833.1_4406627_4407200_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|633aa|down_1|NZ_AP014833.1_4407413_4409312_-	pfam05569, Peptidase_M56, BlaR1 peptidase M56	NA|133aa|down_2|NZ_AP014833.1_4409317_4409716_-	pfam03965, Penicillinase_R, Penicillinase repressor	NA|280aa|down_3|NZ_AP014833.1_4409945_4410785_+	NA	NA|118aa|down_4|NZ_AP014833.1_4410781_4411135_-	NA	NA|789aa|down_5|NZ_AP014833.1_4411286_4413653_-	COG2374, COG2374, Predicted extracellular nuclease [General function prediction only]	NA|451aa|down_6|NZ_AP014833.1_4413893_4415246_+	pfam13218, DUF4026, Protein of unknown function (DUF4026)	NA|474aa|down_7|NZ_AP014833.1_4415285_4416707_-	PRK11883, PRK11883, protoporphyrinogen oxidase; Reviewed	NA|312aa|down_8|NZ_AP014833.1_4416726_4417662_-	PRK12435, PRK12435, ferrochelatase; Provisional	NA|349aa|down_9|NZ_AP014833.1_4417676_4418723_-	PRK00115, hemE, uroporphyrinogen decarboxylase; Validated
