assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000177235.2_ASM17723v2	NC_014829	Bacillus cellulosilyticus DSM 2522, complete genome	1	472384-472571	1	CRISPRCasFinder	no		cas3,csa3,DinG,WYL,RT,DEDDh	Orphan	AATAAACGGAGAAATTCCGCTTATT	25	0	0	NA	NA	NA	3	3	Orphan	cas3,csa3,DinG,WYL,RT,DEDDh	NA|108aa|up_8|NC_014829.1_460978_461302_+,NA|161aa|down_6|NC_014829.1_481414_481897_-,NA|301aa|down_8|NC_014829.1_483279_484182_-	NA|425aa|up_9|NC_014829.1_458957_460232_+	PRK00885, PRK00885, phosphoribosylamine--glycine ligase; Provisional	NA|108aa|up_8|NC_014829.1_460978_461302_+	NA	NA|94aa|up_7|NC_014829.1_461391_461673_-	pfam11127, DUF2892, Protein of unknown function (DUF2892)	NA|583aa|up_6|NC_014829.1_461782_463531_+	COG1001, AdeC, Adenine deaminase [Nucleotide transport and metabolism]	NA|361aa|up_5|NC_014829.1_463568_464651_+	pfam11258, DUF3048, Protein of unknown function (DUF3048) N-terminal domain	NA|102aa|up_4|NC_014829.1_464710_465016_+	COG4496, COG4496, Uncharacterized protein conserved in bacteria [Function unknown]	NA|229aa|up_3|NC_014829.1_465780_466467_+	PRK04169, PRK04169, heptaprenylglyceryl phosphate synthase	NA|761aa|up_2|NC_014829.1_466536_468819_+	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|671aa|up_1|NC_014829.1_468831_470844_+	PRK07956, ligA, NAD-dependent DNA ligase LigA; Validated	NA|393aa|up_0|NC_014829.1_470884_472063_+	pfam07537, CamS, CamS sex pheromone cAM373 precursor	NA|97aa|down_0|NC_014829.1_473079_473370_+	PRK00034, gatC, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatC	NA|486aa|down_1|NC_014829.1_473384_474842_+	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|478aa|down_2|NC_014829.1_474855_476289_+	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB	NA|410aa|down_3|NC_014829.1_477021_478251_+	pfam00150, Cellulase, Cellulase (glycosyl hydrolase family 5)	NA|489aa|down_4|NC_014829.1_478805_480272_+	pfam00150, Cellulase, Cellulase (glycosyl hydrolase family 5)	NA|287aa|down_5|NC_014829.1_480331_481192_-	COG1834, COG1834, N-Dimethylarginine dimethylaminohydrolase [Amino acid transport and metabolism]	NA|161aa|down_6|NC_014829.1_481414_481897_-	NA	NA|305aa|down_7|NC_014829.1_482148_483063_+	PRK13337, PRK13337, putative lipid kinase; Reviewed	NA|301aa|down_8|NC_014829.1_483279_484182_-	NA	NA|312aa|down_9|NC_014829.1_484430_485366_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]
GCF_000177235.2_ASM17723v2	NC_014829	Bacillus cellulosilyticus DSM 2522, complete genome	2	568335-568522	2	CRISPRCasFinder	no		cas3,csa3,DinG,WYL,RT,DEDDh	Orphan	AATAAACGGAGAAATTCCGCTTATT	25	0	0	NA	NA	NA	3	3	Orphan	cas3,csa3,DinG,WYL,RT,DEDDh	NA|137aa|up_8|NC_014829.1_557955_558366_+,NA|161aa|up_3|NC_014829.1_563887_564370_+,NA	NA|120aa|up_9|NC_014829.1_557375_557735_+	pfam09685, DUF4870, Domain of unknown function (DUF4870)	NA|137aa|up_8|NC_014829.1_557955_558366_+	NA	NA|869aa|up_7|NC_014829.1_558639_561246_-	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|228aa|up_6|NC_014829.1_561242_561926_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|326aa|up_5|NC_014829.1_562025_563003_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|225aa|up_4|NC_014829.1_563028_563703_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|161aa|up_3|NC_014829.1_563887_564370_+	NA	NA|527aa|up_2|NC_014829.1_564538_566119_+	COG1164, COG1164, Oligoendopeptidase F [Amino acid transport and metabolism]	NA|247aa|up_1|NC_014829.1_566448_567189_-	cd01106, HTH_TipAL-Mta, Helix-Turn-Helix DNA binding domain of the transcription regulators TipAL, Mta, and SkgA	NA|167aa|up_0|NC_014829.1_567483_567984_-	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|725aa|down_0|NC_014829.1_568778_570953_-	PRK05443, PRK05443, polyphosphate kinase; Provisional	NA|1486aa|down_1|NC_014829.1_571313_575771_-	PRK09419, PRK09419, multifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase/5'-nucleotidase	NA|328aa|down_2|NC_014829.1_576111_577095_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|244aa|down_3|NC_014829.1_577237_577969_-	PRK07475, PRK07475, hypothetical protein; Provisional	NA|389aa|down_4|NC_014829.1_578158_579325_+	PRK04346, PRK04346, tryptophan synthase subunit beta; Validated	NA|136aa|down_5|NC_014829.1_579506_579914_+	pfam06094, GGACT, Gamma-glutamyl cyclotransferase, AIG2-like	NA|235aa|down_6|NC_014829.1_580188_580893_-	cd03024, DsbA_FrnE, DsbA family, FrnE subfamily; FrnE is a DsbA-like protein containing a CXXC motif	NA|233aa|down_7|NC_014829.1_581255_581954_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|349aa|down_8|NC_014829.1_581919_582966_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|954aa|down_9|NC_014829.1_582980_585842_+	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]
GCF_000177235.2_ASM17723v2	NC_014829	Bacillus cellulosilyticus DSM 2522, complete genome	3	1890531-1890717	3	CRISPRCasFinder	no		cas3,csa3,DinG,WYL,RT,DEDDh	Orphan	AATAAACGGAGAAATTCCGCTTAT	24	0	0	NA	NA	NA	3	3	Orphan	cas3,csa3,DinG,WYL,RT,DEDDh	NA|256aa|up_6|NC_014829.1_1885212_1885980_-,NA|136aa|up_1|NC_014829.1_1889273_1889681_+,NA|58aa|up_0|NC_014829.1_1890016_1890190_+,NA|106aa|down_8|NC_014829.1_1897149_1897467_+	NA|856aa|up_9|NC_014829.1_1880692_1883260_+	PRK08115, PRK08115, vitamin B12-dependent ribonucleotide reductase	NA|144aa|up_8|NC_014829.1_1883595_1884027_+	PRK03902, PRK03902, transcriptional regulator MntR	NA|298aa|up_7|NC_014829.1_1884289_1885183_+	cd07207, Pat_ExoU_VipD_like, ExoU and VipD-like proteins; homologus to patatin, cPLA2, and iPLA2	NA|256aa|up_6|NC_014829.1_1885212_1885980_-	NA	NA|125aa|up_5|NC_014829.1_1886019_1886394_-	cd18565, ABC_6TM_exporter_like, Six-transmembrane helical domain (TMD) of an uncharacterized ABC exporter, and similar proteins	NA|169aa|up_4|NC_014829.1_1886682_1887189_+	pfam11085, YqhR, Conserved membrane protein YqhR	NA|356aa|up_3|NC_014829.1_1887280_1888348_+	COG0006, PepP, Xaa-Pro aminopeptidase [Amino acid transport and metabolism]	NA|186aa|up_2|NC_014829.1_1888379_1888937_+	PRK00529, PRK00529, elongation factor P; Validated	NA|136aa|up_1|NC_014829.1_1889273_1889681_+	NA	NA|58aa|up_0|NC_014829.1_1890016_1890190_+	NA	NA|311aa|down_0|NC_014829.1_1891098_1892031_+	TIGR02858, Stage_III_sporulation_protein_AA, stage III sporulation protein AA	NA|171aa|down_1|NC_014829.1_1892027_1892540_+	PRK08307, PRK08307, stage III sporulation protein SpoAB; Provisional	NA|69aa|down_2|NC_014829.1_1892560_1892767_+	TIGR02848, Stage_III_sporulation_protein_AC, stage III sporulation protein AC	NA|130aa|down_3|NC_014829.1_1892782_1893172_+	TIGR02849, Stage_III_sporulation_protein_AD, stage III sporulation protein AD	NA|394aa|down_4|NC_014829.1_1893184_1894366_+	TIGR02829, Stage_III_sporulation_protein_AE, stage III sporulation protein AE	NA|218aa|down_5|NC_014829.1_1894393_1895047_+	pfam09581, Spore_III_AF, Stage III sporulation protein AF (Spore_III_AF)	NA|219aa|down_6|NC_014829.1_1895221_1895878_+	TIGR02830, Stage_III_sporulation_protein_AG, stage III sporulation protein AG	NA|198aa|down_7|NC_014829.1_1895881_1896475_+	pfam12685, SpoIIIAH, SpoIIIAH-like protein	NA|106aa|down_8|NC_014829.1_1897149_1897467_+	NA	NA|164aa|down_9|NC_014829.1_1897919_1898411_+	PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl carrier protein
GCF_000177235.2_ASM17723v2	NC_014829	Bacillus cellulosilyticus DSM 2522, complete genome	4	3586778-3586852	4	CRISPRCasFinder	no		cas3,csa3,DinG,WYL,RT,DEDDh	Orphan	GCTGCTCCATCTTACCTTCTGCT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DinG,WYL,RT,DEDDh	NA,NA	NA|203aa|up_9|NC_014829.1_3574297_3574906_-	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|380aa|up_8|NC_014829.1_3574902_3576042_-	PRK04149, sat, sulfate adenylyltransferase; Reviewed	NA|239aa|up_7|NC_014829.1_3576061_3576778_-	TIGR00434, Phosphoadenosine_phosphosulfate_reductase, phosophoadenylyl-sulfate reductase (thioredoxin)	NA|361aa|up_6|NC_014829.1_3576780_3577863_-	COG0306, PitA, Phosphate/sulphate permeases [Inorganic ion transport and metabolism]	NA|482aa|up_5|NC_014829.1_3578295_3579741_-	cd07149, ALDH_y4uC, Uncharacterized ALDH (y4uC) with similarity to Tortula ruralis aldehyde dehydrogenase ALDH21A1	NA|346aa|up_4|NC_014829.1_3579773_3580811_-	PRK06270, PRK06270, homoserine dehydrogenase; Provisional	NA|404aa|up_3|NC_014829.1_3580800_3582012_-	cd08021, M20_Acy1_YhaA-like, M20 Peptidase aminoacylase 1 subfamily, includes Bacillus subtilis YhaA and Staphylococcus aureus amidohydrolase, SACOL0085	NA|396aa|up_2|NC_014829.1_3582034_3583222_-	PRK08249, PRK08249, cystathionine gamma-synthase family protein	NA|420aa|up_1|NC_014829.1_3583307_3584567_-	TIGR02993, ectoine_eutD, ectoine utilization protein EutD	NA|560aa|up_0|NC_014829.1_3584731_3586411_-	pfam07905, PucR, Purine catabolism regulatory protein-like family	NA|205aa|down_0|NC_014829.1_3587281_3587896_-	cd01834, SGNH_hydrolase_like_2, SGNH_hydrolase subfamily	NA|299aa|down_1|NC_014829.1_3587918_3588815_-	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|417aa|down_2|NC_014829.1_3588973_3590224_-	TIGR03087, stp1, sugar transferase, PEP-CTERM/EpsH1 system associated	NA|211aa|down_3|NC_014829.1_3590588_3591221_-	COG1251, NirB, NAD(P)H-nitrite reductase [Energy production and conversion]	NA|342aa|down_4|NC_014829.1_3591345_3592371_-	cd19974, PBP1_LacI-like, ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors	NA|532aa|down_5|NC_014829.1_3592854_3594450_-	COG2985, COG2985, Predicted permease [General function prediction only]	NA|241aa|down_6|NC_014829.1_3594634_3595357_-	cd07716, RNaseZ_short-form-like_MBL-fold, uncharacterized bacterial subgroup of Ribonuclease Z, short form; MBL-fold metallo-hydrolase domain	NA|930aa|down_7|NC_014829.1_3595655_3598445_-	pfam03424, CBM_17_28, Carbohydrate binding domain (family 17/28)	NA|969aa|down_8|NC_014829.1_3598983_3601890_-	pfam03424, CBM_17_28, Carbohydrate binding domain (family 17/28)	NA|338aa|down_9|NC_014829.1_3603068_3604082_+	pfam07885, Ion_trans_2, Ion channel
GCF_000177235.2_ASM17723v2	NC_014829	Bacillus cellulosilyticus DSM 2522, complete genome	5	3835035-3835157	5	CRISPRCasFinder	no		cas3,csa3,DinG,WYL,RT,DEDDh	Orphan	TTCTCTTGTTTTCTCTCGGTTTGAAGGTGAGA	32	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DinG,WYL,RT,DEDDh	NA,NA	NA|212aa|up_9|NC_014829.1_3820653_3821289_-	PRK13288, PRK13288, pyrophosphatase PpaX; Provisional	NA|320aa|up_8|NC_014829.1_3822124_3823084_-	pfam07670, Gate, Nucleoside recognition	NA|293aa|up_7|NC_014829.1_3823177_3824056_-	PRK12437, PRK12437, prolipoprotein diacylglyceryl transferase; Reviewed	NA|313aa|up_6|NC_014829.1_3824170_3825109_-	PRK05428, PRK05428, HPr kinase/phosphorylase; Provisional	NA|119aa|up_5|NC_014829.1_3825439_3825796_-	COG1950, COG1950, Predicted membrane protein [Function unknown]	NA|66aa|up_4|NC_014829.1_3825795_3825993_-	COG1983, PspC, Putative stress-responsive transcriptional regulator [Transcription / Signal transduction mechanisms]	NA|368aa|up_3|NC_014829.1_3826072_3827176_-	COG3595, COG3595, Uncharacterized conserved protein [Function unknown]	NA|960aa|up_2|NC_014829.1_3827559_3830439_-	PRK00349, uvrA, excinuclease ABC subunit UvrA	NA|663aa|up_1|NC_014829.1_3830444_3832433_-	PRK05298, PRK05298, excinuclease ABC subunit UvrB	NA|609aa|up_0|NC_014829.1_3832859_3834686_-	cd00987, PDZ_serine_protease, PDZ domain of trypsin-like serine proteases, such as DegP/HtrA, which are oligomeric proteins involved in heat-shock response, chaperone function, and apoptosis	NA|490aa|down_0|NC_014829.1_3835267_3836737_-	COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer membrane]	NA|460aa|down_1|NC_014829.1_3837023_3838403_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|298aa|down_2|NC_014829.1_3838642_3839536_-	COG2177, FtsX, Cell division protein [Cell division and chromosome partitioning]	NA|229aa|down_3|NC_014829.1_3839525_3840212_-	COG2884, FtsE, Predicted ATPase involved in cell division [Cell division and chromosome partitioning]	NA|122aa|down_4|NC_014829.1_3840672_3841038_-	pfam13442, Cytochrome_CBB3, Cytochrome C oxidase, cbb3-type, subunit III	NA|295aa|down_5|NC_014829.1_3841101_3841986_-	COG1284, COG1284, Uncharacterized conserved protein [Function unknown]	NA|367aa|down_6|NC_014829.1_3842251_3843353_-	PRK00578, prfB, peptide chain release factor 2; Validated	NA|840aa|down_7|NC_014829.1_3843524_3846044_-	PRK12904, PRK12904, preprotein translocase subunit SecA; Reviewed	NA|278aa|down_8|NC_014829.1_3846343_3847177_+	COG3711, BglG, Transcriptional antiterminator [Transcription]	NA|702aa|down_9|NC_014829.1_3847356_3849462_+	TIGR02002, PTS_system_glucose-specific_IIABC_component, PTS system, glucose-specific IIBC component
GCF_000177235.2_ASM17723v2	NC_014829	Bacillus cellulosilyticus DSM 2522, complete genome	6	4017754-4017941	6	CRISPRCasFinder	no		cas3,csa3,DinG,WYL,RT,DEDDh	Orphan	AATAAGGGGAATTTCTCCGTTTATT	25	0	0	NA	NA	NA	3	3	Orphan	cas3,csa3,DinG,WYL,RT,DEDDh	NA|55aa|up_6|NC_014829.1_4010529_4010694_-,NA|96aa|up_2|NC_014829.1_4014100_4014388_+,NA|180aa|down_2|NC_014829.1_4024500_4025040_-,NA|195aa|down_4|NC_014829.1_4030417_4031002_-	NA|389aa|up_9|NC_014829.1_4007452_4008619_-	PRK15057, PRK15057, UDP-glucose 6-dehydrogenase; Provisional	NA|145aa|up_8|NC_014829.1_4008868_4009303_-	pfam14568, SUKH_6, SMI1-KNR4 cell-wall	NA|331aa|up_7|NC_014829.1_4009292_4010285_-	pfam12639, Colicin-DNase, DNase/tRNase domain of colicin-like bacteriocin	NA|55aa|up_6|NC_014829.1_4010529_4010694_-	NA	NA|449aa|up_5|NC_014829.1_4010711_4012058_-	COG2855, COG2855, Predicted membrane protein [Function unknown]	NA|53aa|up_4|NC_014829.1_4012137_4012296_-	pfam10055, DUF2292, Uncharacterized small protein (DUF2292)	NA|371aa|up_3|NC_014829.1_4012621_4013734_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|96aa|up_2|NC_014829.1_4014100_4014388_+	NA	NA|483aa|up_1|NC_014829.1_4014548_4015997_+	pfam13240, zinc_ribbon_2, zinc-ribbon domain	NA|381aa|up_0|NC_014829.1_4016011_4017154_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|432aa|down_0|NC_014829.1_4018251_4019547_-	pfam14903, WG_beta_rep, WG containing repeat	NA|1460aa|down_1|NC_014829.1_4019767_4024147_-	COG5498, ACF2, Predicted glycosyl hydrolase [Cell envelope biogenesis, outer membrane]	NA|180aa|down_2|NC_014829.1_4024500_4025040_-	NA	NA|1515aa|down_3|NC_014829.1_4025152_4029697_-	pfam06283, ThuA, Trehalose utilisation	NA|195aa|down_4|NC_014829.1_4030417_4031002_-	NA	NA|129aa|down_5|NC_014829.1_4031230_4031617_-	cd08352, VOC_Bs_YwkD_like, vicinal oxygen chelate (VOC) family protein  Bacillus subtilis YwkD and similar proteins	NA|261aa|down_6|NC_014829.1_4031761_4032544_+	cd12083, DD_cGKI, Dimerization/Docking domain of Cyclic GMP-dependent Protein Kinase I	NA|320aa|down_7|NC_014829.1_4032795_4033755_+	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|307aa|down_8|NC_014829.1_4034899_4035820_-	pfam08378, NERD, Nuclease-related domain	NA|324aa|down_9|NC_014829.1_4035996_4036968_+	pfam00762, Ferrochelatase, Ferrochelatase
