assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000875715.1_ASM87571v1	NZ_CP010854	Bacillus anthracis strain A1144 plasmid pXO2, complete sequence	1	34344-34664	1	PILER-CR	no			Orphan	AATTTACACATGTGTAAATGTGTAAATGTGTAAATGTGTAAA	42	0	0	NA	NA	NA	3	3	Orphan	cas3,csa3,WYL,c2c9_V-U4,cas14k,DinG,DEDDh,cas14j,RT,c2c4_V-U1	NA|923aa|up_9|NZ_CP010854.1_22533_25302_-,NA|110aa|up_8|NZ_CP010854.1_25427_25757_-,NA|79aa|up_7|NZ_CP010854.1_25807_26044_-,NA|63aa|up_5|NZ_CP010854.1_27013_27202_-,NA|177aa|up_4|NZ_CP010854.1_27191_27722_-,NA|61aa|up_2|NZ_CP010854.1_29406_29589_-,NA|255aa|up_1|NZ_CP010854.1_31286_32051_-,NA|129aa|down_1|NZ_CP010854.1_35557_35944_+,NA|78aa|down_2|NZ_CP010854.1_36142_36376_+,NA|110aa|down_4|NZ_CP010854.1_40657_40987_-	NA|923aa|up_9|NZ_CP010854.1_22533_25302_-	NA	NA|110aa|up_8|NZ_CP010854.1_25427_25757_-	NA	NA|79aa|up_7|NZ_CP010854.1_25807_26044_-	NA	NA|163aa|up_6|NZ_CP010854.1_26428_26917_-	pfam17631, DUF5512, Family of unknown function (DUF5512)	NA|63aa|up_5|NZ_CP010854.1_27013_27202_-	NA	NA|177aa|up_4|NZ_CP010854.1_27191_27722_-	NA	NA|101aa|up_3|NZ_CP010854.1_28228_28531_-	pfam17362, pXO2-34, Family of unknown function	NA|61aa|up_2|NZ_CP010854.1_29406_29589_-	NA	NA|255aa|up_1|NZ_CP010854.1_31286_32051_-	NA	NA|513aa|up_0|NZ_CP010854.1_32253_33792_-	pfam08708, PriCT_1, Primase C terminal 1 (PriCT-1)	NA|289aa|down_0|NZ_CP010854.1_34698_35565_+	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|129aa|down_1|NZ_CP010854.1_35557_35944_+	NA	NA|78aa|down_2|NZ_CP010854.1_36142_36376_+	NA	NA|532aa|down_3|NZ_CP010854.1_37584_39180_+	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|110aa|down_4|NZ_CP010854.1_40657_40987_-	NA	NA|104aa|down_5|NZ_CP010854.1_41569_41881_-	PRK12856, PRK12856, hypothetical protein; Provisional	NA|222aa|down_6|NZ_CP010854.1_41929_42595_-	pfam02517, Abi, CAAX protease self-immunity	NA|155aa|down_7|NZ_CP010854.1_43305_43770_+	pfam04463, DUF523, Protein of unknown function (DUF523)	NA|197aa|down_8|NZ_CP010854.1_44146_44737_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|145aa|down_9|NZ_CP010854.1_46712_47147_-	cd03385, PAP2_BcrC_like, PAP2_like proteins, BcrC_like subfamily
GCF_000875715.1_ASM87571v1	NZ_CP010852	Bacillus anthracis strain A1144, complete genome	1	1347670-1347763	1	CRISPRCasFinder	no		cas3,csa3,WYL,c2c9_V-U4,cas14k,DinG,DEDDh,cas14j	Orphan	GGTTTAAATACGTTAAATAGCAAAA	25	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL,c2c9_V-U4,cas14k,DinG,DEDDh,cas14j,RT,c2c4_V-U1	NA|118aa|up_4|NZ_CP010852.1_1342218_1342572_+,NA|280aa|up_3|NZ_CP010852.1_1342568_1343408_-,NA|72aa|down_0|NZ_CP010852.1_1349693_1349909_+,NA|143aa|down_8|NZ_CP010852.1_1354854_1355283_+	NA|349aa|up_9|NZ_CP010852.1_1334630_1335677_+	PRK00115, hemE, uroporphyrinogen decarboxylase; Validated	NA|312aa|up_8|NZ_CP010852.1_1335691_1336627_+	PRK12435, PRK12435, ferrochelatase; Provisional	NA|474aa|up_7|NZ_CP010852.1_1336646_1338068_+	PRK11883, PRK11883, protoporphyrinogen oxidase; Reviewed	NA|451aa|up_6|NZ_CP010852.1_1338107_1339460_-	pfam13218, DUF4026, Protein of unknown function (DUF4026)	NA|789aa|up_5|NZ_CP010852.1_1339700_1342067_+	COG2374, COG2374, Predicted extracellular nuclease [General function prediction only]	NA|118aa|up_4|NZ_CP010852.1_1342218_1342572_+	NA	NA|280aa|up_3|NZ_CP010852.1_1342568_1343408_-	NA	NA|133aa|up_2|NZ_CP010852.1_1343637_1344036_+	pfam03965, Penicillinase_R, Penicillinase repressor	NA|633aa|up_1|NZ_CP010852.1_1344041_1345940_+	pfam05569, Peptidase_M56, BlaR1 peptidase M56	NA|191aa|up_0|NZ_CP010852.1_1346153_1346726_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|72aa|down_0|NZ_CP010852.1_1349693_1349909_+	NA	NA|102aa|down_1|NZ_CP010852.1_1349945_1350251_+	pfam09860, DUF2087, Uncharacterized protein conserved in bacteria (DUF2087)	NA|118aa|down_2|NZ_CP010852.1_1350277_1350631_-	pfam14470, bPH_3, Bacterial PH domain	NA|168aa|down_3|NZ_CP010852.1_1350764_1351268_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|326aa|down_4|NZ_CP010852.1_1351505_1352483_+	COG1609, PurR, Transcriptional regulators [Transcription]	NA|43aa|down_5|NZ_CP010852.1_1352558_1352687_-	pfam14149, YhfH, YhfH-like protein	NA|245aa|down_6|NZ_CP010852.1_1352866_1353601_+	cd07716, RNaseZ_short-form-like_MBL-fold, uncharacterized bacterial subgroup of Ribonuclease Z, short form; MBL-fold metallo-hydrolase domain	NA|330aa|down_7|NZ_CP010852.1_1353610_1354600_+	TIGR00545, Probable_lipoate-protein_ligase_A, lipoyltransferase and lipoate-protein ligase	NA|143aa|down_8|NZ_CP010852.1_1354854_1355283_+	NA	NA|511aa|down_9|NZ_CP010852.1_1355444_1356977_+	PRK07656, PRK07656, long-chain-fatty-acid--CoA ligase; Validated
GCF_000875715.1_ASM87571v1	NZ_CP010852	Bacillus anthracis strain A1144, complete genome	2	1460177-1460285	2	CRISPRCasFinder	no		cas3,csa3,WYL,c2c9_V-U4,cas14k,DinG,DEDDh,cas14j	Orphan	TGTATGATTACCTTCCGCATGAGAA	25	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL,c2c9_V-U4,cas14k,DinG,DEDDh,cas14j,RT,c2c4_V-U1	NA|120aa|up_3|NZ_CP010852.1_1456726_1457086_+,NA	NA|515aa|up_9|NZ_CP010852.1_1449259_1450804_+	PRK01642, cls, cardiolipin synthetase; Reviewed	NA|415aa|up_8|NZ_CP010852.1_1450885_1452130_+	COG4469, CoiA, Competence protein CoiA-like family, contains a predicted nuclease    domain [General function prediction only]	NA|609aa|up_7|NZ_CP010852.1_1452180_1454007_+	cd09608, M3B_PepF, Peptidase family M3B, oligopeptidase F (PepF)	NA|298aa|up_6|NZ_CP010852.1_1454529_1455423_-	pfam13743, Thioredoxin_5, Thioredoxin	NA|133aa|up_5|NZ_CP010852.1_1455422_1455821_-	cd14772, TrHb2_Bs-trHb-like_O, Truncated hemoglobins, group 2 (O); Bacillus subtilis TrHb like	NA|193aa|up_4|NZ_CP010852.1_1456001_1456580_-	cd07762, CYTH-like_Pase_1, Uncharacterized subgroup 1 of the CYTH-like superfamily	NA|120aa|up_3|NZ_CP010852.1_1456726_1457086_+	NA	NA|213aa|up_2|NZ_CP010852.1_1457116_1457755_+	COG2357, COG2357, PpGpp synthetase catalytic domain [General function prediction only]	NA|266aa|up_1|NZ_CP010852.1_1457773_1458571_+	PRK04885, ppnK, inorganic polyphosphate/ATP-NAD kinase; Provisional	NA|298aa|up_0|NZ_CP010852.1_1458586_1459480_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|247aa|down_0|NZ_CP010852.1_1461056_1461797_-	PRK13625, PRK13625, bis(5'-nucleosyl)-tetraphosphatase PrpE; Provisional	NA|312aa|down_1|NZ_CP010852.1_1463173_1464109_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|256aa|down_2|NZ_CP010852.1_1464115_1464883_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|282aa|down_3|NZ_CP010852.1_1465010_1465856_-	pfam05711, TylF, Macrocin-O-methyltransferase (TylF)	NA|263aa|down_4|NZ_CP010852.1_1465978_1466767_-	pfam18573, BclA_C, BclA C-terminal domain	NA|386aa|down_5|NZ_CP010852.1_1466932_1468090_+	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|230aa|down_6|NZ_CP010852.1_1468183_1468873_+	pfam08242, Methyltransf_12, Methyltransferase domain	NA|229aa|down_7|NZ_CP010852.1_1468869_1469556_+	pfam13712, Glyco_tranf_2_5, Glycosyltransferase like family	NA|227aa|down_8|NZ_CP010852.1_1469568_1470249_+	pfam13712, Glyco_tranf_2_5, Glycosyltransferase like family	NA|246aa|down_9|NZ_CP010852.1_1470263_1471001_+	cd02538, G1P_TT_short, G1P_TT_short is the short form of glucose-1-phosphate thymidylyltransferase
GCF_000875715.1_ASM87571v1	NZ_CP010852	Bacillus anthracis strain A1144, complete genome	3	2974791-2974988	3	CRISPRCasFinder	no		cas3,csa3,WYL,c2c9_V-U4,cas14k,DinG,DEDDh,cas14j	Orphan	TTTCGGAATGAACATTCATTCCT	23	0	0	NA	NA	NA	3	3	Orphan	cas3,csa3,WYL,c2c9_V-U4,cas14k,DinG,DEDDh,cas14j,RT,c2c4_V-U1	NA,NA	NA|87aa|up_9|NZ_CP010852.1_2963043_2963304_-	PRK10811, rne, ribonuclease E; Reviewed	NA|276aa|up_8|NZ_CP010852.1_2963521_2964349_+	cd10944, CE4_SmPgdA_like, Catalytic NodB homology domain of Streptococcus mutans polysaccharide deacetylase PgdA, Bacillus subtilis YheN, and similar proteins	NA|221aa|up_7|NZ_CP010852.1_2964508_2965171_+	pfam12952, DUF3841, Domain of unknown function (DUF3841)	NA|224aa|up_6|NZ_CP010852.1_2965281_2965953_+	TIGR03717, R_switched_YjbE, integral membrane protein, YjbE family	NA|658aa|up_5|NZ_CP010852.1_2966012_2967986_-	COG1368, MdoB, Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily [Cell envelope biogenesis, outer membrane]	NA|542aa|up_4|NZ_CP010852.1_2968502_2970128_-	PRK15064, PRK15064, ABC transporter ATP-binding protein; Provisional	NA|283aa|up_3|NZ_CP010852.1_2970505_2971354_-	PRK06761, PRK06761, hypothetical protein; Provisional	NA|191aa|up_2|NZ_CP010852.1_2971639_2972212_-	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|51aa|up_1|NZ_CP010852.1_2972178_2972331_-	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|808aa|up_0|NZ_CP010852.1_2972331_2974755_-	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|189aa|down_0|NZ_CP010852.1_2975008_2975575_-	pfam16295, TetR_C_10, Tetracycline repressor, C-terminal all-alpha domain	NA|386aa|down_1|NZ_CP010852.1_2975844_2977002_-	COG4552, Eis, Predicted acetyltransferase involved in intracellular survival and related acetyltransferases [General function prediction only]	NA|430aa|down_2|NZ_CP010852.1_2977187_2978477_-	PRK02427, PRK02427, 3-phosphoshikimate 1-carboxyvinyltransferase; Provisional	NA|367aa|down_3|NZ_CP010852.1_2978493_2979594_-	PRK06545, PRK06545, prephenate dehydrogenase; Validated	NA|367aa|down_4|NZ_CP010852.1_2979586_2980687_-	PRK01533, PRK01533, histidinol-phosphate aminotransferase; Validated	NA|391aa|down_5|NZ_CP010852.1_2980705_2981878_-	PRK12463, PRK12463, chorismate synthase; Reviewed	NA|359aa|down_6|NZ_CP010852.1_2982162_2983239_-	PRK12595, PRK12595, bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase; Reviewed	NA|146aa|down_7|NZ_CP010852.1_2987731_2988169_+	pfam10710, DUF2512, Protein of unknown function (DUF2512)	NA|177aa|down_8|NZ_CP010852.1_2988317_2988848_-	cd01014, nicotinamidase_related, Nicotinamidase_ related amidohydrolases	NA|178aa|down_9|NZ_CP010852.1_2988864_2989398_-	pfam13523, Acetyltransf_8, Acetyltransferase (GNAT) domain
GCF_000875715.1_ASM87571v1	NZ_CP010852	Bacillus anthracis strain A1144, complete genome	4	3353084-3353218	4	CRISPRCasFinder	no		cas3,csa3,WYL,c2c9_V-U4,cas14k,DinG,DEDDh,cas14j	Orphan	TTTTGGATCATCCGTTTTTGGTTCTTC	27	0	0	NA	NA	NA	2	2	Orphan	cas3,csa3,WYL,c2c9_V-U4,cas14k,DinG,DEDDh,cas14j,RT,c2c4_V-U1	NA|118aa|up_9|NZ_CP010852.1_3344513_3344867_-,NA|86aa|up_6|NZ_CP010852.1_3346103_3346361_-,NA|64aa|up_5|NZ_CP010852.1_3346684_3346876_+,NA|119aa|down_3|NZ_CP010852.1_3357555_3357912_-,NA|104aa|down_9|NZ_CP010852.1_3364063_3364375_+	NA|118aa|up_9|NZ_CP010852.1_3344513_3344867_-	NA	NA|192aa|up_8|NZ_CP010852.1_3344863_3345439_-	pfam02525, Flavodoxin_2, Flavodoxin-like fold	NA|146aa|up_7|NZ_CP010852.1_3345462_3345900_-	COG4715, COG4715, Uncharacterized conserved protein [Function unknown]	NA|86aa|up_6|NZ_CP010852.1_3346103_3346361_-	NA	NA|64aa|up_5|NZ_CP010852.1_3346684_3346876_+	NA	NA|241aa|up_4|NZ_CP010852.1_3346908_3347631_-	pfam14256, YwiC, YwiC-like protein	NA|234aa|up_3|NZ_CP010852.1_3347814_3348516_-	pfam07155, ECF-ribofla_trS, ECF-type riboflavin transporter, S component	NA|554aa|up_2|NZ_CP010852.1_3348488_3350150_-	cd03226, ABC_cobalt_CbiO_domain2, Second domain of the ATP-binding cassette component of cobalt transport system	NA|293aa|up_1|NZ_CP010852.1_3350122_3351001_-	COG0619, CbiQ, ABC-type cobalt transport system, permease component CbiQ and related transporters [Inorganic ion transport and metabolism]	NA|279aa|up_0|NZ_CP010852.1_3351023_3351860_-	pfam14478, DUF4430, Domain of unknown function (DUF4430)	NA|160aa|down_0|NZ_CP010852.1_3354042_3354522_-	PRK09372, PRK09372, ribonuclease E inhibitor RraA	NA|477aa|down_1|NZ_CP010852.1_3354722_3356153_+	COG1288, COG1288, Predicted membrane protein [Function unknown]	NA|276aa|down_2|NZ_CP010852.1_3356300_3357128_-	COG2356, EndA, Endonuclease I [DNA replication, recombination, and repair]	NA|119aa|down_3|NZ_CP010852.1_3357555_3357912_-	NA	NA|343aa|down_4|NZ_CP010852.1_3358112_3359141_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|375aa|down_5|NZ_CP010852.1_3359151_3360276_-	pfam01381, HTH_3, Helix-turn-helix	NA|191aa|down_6|NZ_CP010852.1_3360616_3361189_-	PRK00220, PRK00220, glycerol-3-phosphate 1-O-acyltransferase PlsY	NA|159aa|down_7|NZ_CP010852.1_3361808_3362285_-	cd00002, YbaK_deacylase, This CD includes cysteinyl-tRNA(Pro) deacylases from Haemophilus influenzae and Escherichia coli and other related bacterial proteins	NA|426aa|down_8|NZ_CP010852.1_3362624_3363902_+	PRK12420, PRK12420, histidyl-tRNA synthetase; Provisional	NA|104aa|down_9|NZ_CP010852.1_3364063_3364375_+	NA
GCF_000875715.1_ASM87571v1	NZ_CP010852	Bacillus anthracis strain A1144, complete genome	5	3521444-3521546	5	CRISPRCasFinder	no	csa3	cas3,csa3,WYL,c2c9_V-U4,cas14k,DinG,DEDDh,cas14j	Type I-A	AAGTTTAGGTTTCTTTTGAGAATGT	25	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL,c2c9_V-U4,cas14k,DinG,DEDDh,cas14j,RT,c2c4_V-U1	NA|210aa|up_8|NZ_CP010852.1_3509500_3510130_+,NA|77aa|up_5|NZ_CP010852.1_3516102_3516333_+,NA|218aa|down_0|NZ_CP010852.1_3522184_3522838_-	NA|634aa|up_9|NZ_CP010852.1_3506986_3508888_-	PRK00409, PRK00409, recombination and DNA strand exchange inhibitor protein; Reviewed	NA|210aa|up_8|NZ_CP010852.1_3509500_3510130_+	NA	NA|514aa|up_7|NZ_CP010852.1_3510224_3511766_-	PRK09441, PRK09441, cytoplasmic alpha-amylase; Reviewed	NA|378aa|up_6|NZ_CP010852.1_3514712_3515846_-	pfam00144, Beta-lactamase, Beta-lactamase	NA|77aa|up_5|NZ_CP010852.1_3516102_3516333_+	NA	NA|415aa|up_4|NZ_CP010852.1_3516432_3517677_-	cd17475, MFS_MT3072_like, Mycobacterium tuberculosis uncharacterized MFS-type transporter MT3072 and similar transporters of the Major Facilitator Superfamily	NA|284aa|up_3|NZ_CP010852.1_3517762_3518614_+	cd08442, PBP2_YofA_SoxR_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators, YofA and SoxR, contains the type 2 periplasmic binding fold	NA|170aa|up_2|NZ_CP010852.1_3518606_3519116_+	pfam13079, DUF3916, Protein of unknown function (DUF3916)	NA|226aa|up_1|NZ_CP010852.1_3519209_3519887_+	cd06259, YdcF-like, YdcF-like	NA|315aa|up_0|NZ_CP010852.1_3519936_3520881_-	cd08601, GDPD_SaGlpQ_like, Glycerophosphodiester phosphodiesterase domain of Staphylococcus aureus and similar proteins	NA|218aa|down_0|NZ_CP010852.1_3522184_3522838_-	NA	NA|60aa|down_1|NZ_CP010852.1_3522942_3523122_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|540aa|down_2|NZ_CP010852.1_3523453_3525073_-	pfam01494, FAD_binding_3, FAD binding domain	csa3|115aa|down_3|NZ_CP010852.1_3525550_3525895_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|333aa|down_4|NZ_CP010852.1_3525913_3526912_+	cd05289, MDR_like_2, alcohol dehydrogenase and quinone reductase-like medium chain degydrogenases/reductases	NA|489aa|down_5|NZ_CP010852.1_3527090_3528557_+	PRK00029, PRK00029, YdiU family protein	NA|64aa|down_6|NZ_CP010852.1_3529903_3530095_+	TIGR04429, hypothetical_protein_bmyco0001_31490, Phr family secreted Rap phosphatase inhibitor	NA|200aa|down_7|NZ_CP010852.1_3530168_3530768_-	pfam04299, FMN_bind_2, Putative FMN-binding domain	NA|176aa|down_8|NZ_CP010852.1_3530902_3531430_-	pfam04229, GrpB, GrpB protein	NA|116aa|down_9|NZ_CP010852.1_3531453_3531801_-	pfam14079, DUF4260, Domain of unknown function (DUF4260)
