assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002109385.1_ASM210938v1	NZ_CP020814	Alkalihalobacillus krulwichiae strain AM31D chromosome, complete genome	1	2116239-2116339	1	CRISPRCasFinder	no	cas3	cas3,RT,PrimPol,csa3,DEDDh,DinG	Unclear	GGCCCGCAAGTAGCAGGAGCAATGTCGCCAGGACC	35	0	0	NA	NA	NA	1	1	Unclear	cas3,RT,PrimPol,csa3,DEDDh,DinG	NA|188aa|up_9|NZ_CP020814.1_2105403_2105967_-,NA|62aa|up_8|NZ_CP020814.1_2106085_2106271_+,NA|145aa|up_3|NZ_CP020814.1_2111091_2111526_+,NA|62aa|up_2|NZ_CP020814.1_2111691_2111877_-,NA|100aa|down_6|NZ_CP020814.1_2124189_2124489_+,NA|111aa|down_7|NZ_CP020814.1_2124625_2124958_+,NA|241aa|down_9|NZ_CP020814.1_2125631_2126354_+	NA|188aa|up_9|NZ_CP020814.1_2105403_2105967_-	NA	NA|62aa|up_8|NZ_CP020814.1_2106085_2106271_+	NA	NA|324aa|up_7|NZ_CP020814.1_2106712_2107684_+	COG3272, COG3272, Uncharacterized conserved protein [Function unknown]	NA|68aa|up_6|NZ_CP020814.1_2108495_2108699_+	COG1278, CspC, Cold shock proteins [Transcription]	NA|469aa|up_5|NZ_CP020814.1_2108849_2110256_+	TIGR03423, pbp2_mrdA, penicillin-binding protein 2	NA|226aa|up_4|NZ_CP020814.1_2110201_2110879_+	COG0768, FtsI, Cell division protein FtsI/penicillin-binding protein 2 [Cell envelope biogenesis, outer membrane]	NA|145aa|up_3|NZ_CP020814.1_2111091_2111526_+	NA	NA|62aa|up_2|NZ_CP020814.1_2111691_2111877_-	NA	cas3|753aa|up_1|NZ_CP020814.1_2112087_2114346_+	COG1205, COG1205, Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster [General function prediction only]	NA|419aa|up_0|NZ_CP020814.1_2114364_2115621_+	COG3359, COG3359, Predicted exonuclease [DNA replication, recombination, and repair]	NA|187aa|down_0|NZ_CP020814.1_2116494_2117055_+	PRK13660, PRK13660, hypothetical protein; Provisional	NA|104aa|down_1|NZ_CP020814.1_2117135_2117447_+	PRK14127, PRK14127, cell division regulator GpsB	NA|131aa|down_2|NZ_CP020814.1_2117458_2117851_+	PRK13907, rnhA, ribonuclease H; Provisional	NA|383aa|down_3|NZ_CP020814.1_2118610_2119759_+	COG0116, COG0116, Predicted N6-adenine-specific DNA methylase [DNA replication, recombination, and repair]	NA|85aa|down_4|NZ_CP020814.1_2121851_2122106_+	pfam10819, DUF2564, Protein of unknown function (DUF2564)	NA|103aa|down_5|NZ_CP020814.1_2122163_2122472_-	COG4841, COG4841, Uncharacterized protein conserved in bacteria [Function unknown]	NA|100aa|down_6|NZ_CP020814.1_2124189_2124489_+	NA	NA|111aa|down_7|NZ_CP020814.1_2124625_2124958_+	NA	NA|165aa|down_8|NZ_CP020814.1_2125124_2125619_+	pfam06695, Sm_multidrug_ex, Putative small multi-drug export protein	NA|241aa|down_9|NZ_CP020814.1_2125631_2126354_+	NA
GCF_002109385.1_ASM210938v1	NZ_CP020814	Alkalihalobacillus krulwichiae strain AM31D chromosome, complete genome	2	2880317-2880389	2	CRISPRCasFinder	no		cas3,RT,PrimPol,csa3,DEDDh,DinG	Orphan	TTAGATCGATCTAATTAATTGGTA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,PrimPol,csa3,DEDDh,DinG	NA,NA|68aa|down_5|NZ_CP020814.1_2886351_2886555_-	NA|359aa|up_9|NZ_CP020814.1_2869381_2870458_-	pfam04346, EutH, Ethanolamine utilisation protein, EutH	NA|36aa|up_8|NZ_CP020814.1_2870613_2870721_+	pfam09680, Tiny_TM_bacill, Protein of unknown function (Tiny_TM_bacill)	NA|234aa|up_7|NZ_CP020814.1_2870995_2871697_+	PRK05819, deoD, DeoD-type purine-nucleoside phosphorylase	NA|319aa|up_6|NZ_CP020814.1_2871790_2872747_-	cd07938, DRE_TIM_HMGL, 3-hydroxy-3-methylglutaryl-CoA lyase, catalytic TIM barrel domain	NA|402aa|up_5|NZ_CP020814.1_2872746_2873952_-	pfam02515, CoA_transf_3, CoA-transferase family III	NA|236aa|up_4|NZ_CP020814.1_2873973_2874681_-	COG2186, FadR, Transcriptional regulators [Transcription]	NA|262aa|up_3|NZ_CP020814.1_2874920_2875706_-	PRK02412, aroD, type I 3-dehydroquinate dehydratase	NA|586aa|up_2|NZ_CP020814.1_2876041_2877799_-	COG0028, IlvB, Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] [Amino acid transport and metabolism / Coenzyme metabolism]	NA|332aa|up_1|NZ_CP020814.1_2877800_2878796_-	cd08234, threonine_DH_like, L-threonine dehydrogenase	NA|478aa|up_0|NZ_CP020814.1_2878857_2880291_-	cd07149, ALDH_y4uC, Uncharacterized ALDH (y4uC) with similarity to Tortula ruralis aldehyde dehydrogenase ALDH21A1	NA|183aa|down_0|NZ_CP020814.1_2880652_2881201_+	PRK00131, aroK, shikimate kinase; Reviewed	NA|186aa|down_1|NZ_CP020814.1_2883698_2884256_-	COG3090, DctM, TRAP-type C4-dicarboxylate transport system, small permease component [Carbohydrate transport and metabolism]	NA|333aa|down_2|NZ_CP020814.1_2884399_2885398_-	cd13679, PBP2_TRAP_YiaO_like, Substrate-binding domain of 2,3-diketo-L-gulonate-binding Tripartite ATP-independent  Periplasmic transport system and related proteins; the type 2 periplasmic-binding protein fold	NA|86aa|down_3|NZ_CP020814.1_2885619_2885877_-	pfam00381, PTS-HPr, PTS HPr component phosphorylation site	NA|96aa|down_4|NZ_CP020814.1_2886048_2886336_-	pfam07875, Coat_F, Coat F domain	NA|68aa|down_5|NZ_CP020814.1_2886351_2886555_-	NA	NA|303aa|down_6|NZ_CP020814.1_2886720_2887629_+	COG2508, COG2508, Regulator of polyketide synthase expression [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|366aa|down_7|NZ_CP020814.1_2887740_2888838_+	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC	NA|725aa|down_8|NZ_CP020814.1_2888892_2891067_-	PRK02999, PRK02999, malate synthase G; Provisional	NA|323aa|down_9|NZ_CP020814.1_2892637_2893606_-	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]
GCF_002109385.1_ASM210938v1	NZ_CP020814	Alkalihalobacillus krulwichiae strain AM31D chromosome, complete genome	3	3444754-3444825	3	CRISPRCasFinder	no		cas3,RT,PrimPol,csa3,DEDDh,DinG	Orphan	CCTGAAGACGATTCTAAAGAAGAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,PrimPol,csa3,DEDDh,DinG	NA,NA	NA|273aa|up_9|NZ_CP020814.1_3429830_3430649_+	PRK03501, ppnK, NAD kinase	NA|70aa|up_8|NZ_CP020814.1_3432467_3432677_-	pfam00269, SASP, Small, acid-soluble spore proteins, alpha/beta type	NA|401aa|up_7|NZ_CP020814.1_3432791_3433994_-	PRK01565, PRK01565, thiamine biosynthesis protein ThiI; Provisional	NA|381aa|up_6|NZ_CP020814.1_3433990_3435133_-	COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]	NA|562aa|up_5|NZ_CP020814.1_3435291_3436977_-	PRK04778, PRK04778, septation ring formation regulator EzrA; Provisional	NA|276aa|up_4|NZ_CP020814.1_3437265_3438093_+	PRK08123, PRK08123, histidinol-phosphatase HisJ	NA|216aa|up_3|NZ_CP020814.1_3438081_3438729_-	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|160aa|up_2|NZ_CP020814.1_3438969_3439449_+	COG1956, COG1956, GAF domain-containing protein [Signal transduction mechanisms]	NA|201aa|up_1|NZ_CP020814.1_3439727_3440330_+	PRK05327, rpsD, 30S ribosomal protein S4; Validated	NA|414aa|up_0|NZ_CP020814.1_3440417_3441659_-	PRK05912, PRK05912, tyrosyl-tRNA synthetase; Validated	NA|572aa|down_0|NZ_CP020814.1_3445120_3446836_-	PRK04319, PRK04319, acetyl-CoA synthetase; Provisional	NA|211aa|down_1|NZ_CP020814.1_3447042_3447675_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|213aa|down_2|NZ_CP020814.1_3447696_3448335_+	cd04584, CBS_pair_AcuB_like, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the ACT domain	NA|389aa|down_3|NZ_CP020814.1_3448331_3449498_+	cd09994, HDAC_AcuC_like, Class I histone deacetylase AcuC (Acetoin utilization protein)-like enzymes	NA|232aa|down_4|NZ_CP020814.1_3449544_3450240_-	cd09008, MTAN, 5'-methylthioadenosine/S-adenosylhomocysteine nucleosidases	NA|253aa|down_5|NZ_CP020814.1_3450305_3451064_-	PRK06925, PRK06925, flagellar motor protein MotB	NA|268aa|down_6|NZ_CP020814.1_3451053_3451857_-	PRK06926, PRK06926, flagellar motor protein MotP; Reviewed	NA|334aa|down_7|NZ_CP020814.1_3451913_3452915_-	TIGR01481, catabolite_control_protein_A, catabolite control protein A	NA|359aa|down_8|NZ_CP020814.1_3453150_3454227_-	PRK12595, PRK12595, bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase; Reviewed	NA|149aa|down_9|NZ_CP020814.1_3454574_3455021_-	COG4980, GvpP, Gas vesicle protein [General function prediction only]
GCF_002109385.1_ASM210938v1	NZ_CP020814	Alkalihalobacillus krulwichiae strain AM31D chromosome, complete genome	4	3909620-3909958	1	CRT	no		cas3,RT,PrimPol,csa3,DEDDh,DinG	Orphan	TGTCCTTTAGNGCCCTCCTATTGGACAN	28	0	0	NA	NA	NA	6	6	Orphan	cas3,RT,PrimPol,csa3,DEDDh,DinG	NA|53aa|up_5|NZ_CP020814.1_3903230_3903389_+,NA|153aa|up_2|NZ_CP020814.1_3907191_3907650_+,NA|114aa|down_9|NZ_CP020814.1_3919507_3919849_-	NA|387aa|up_9|NZ_CP020814.1_3898040_3899201_-	cd03794, GT4_WbuB-like, Escherichia coli WbuB and similar proteins	NA|374aa|up_8|NZ_CP020814.1_3899234_3900356_-	cd03811, GT4_GT28_WabH-like, family 4 and family 28 glycosyltransferases similar to Klebsiella WabH	NA|537aa|up_7|NZ_CP020814.1_3900641_3902252_+	sd00006, TPR, Tetratricopeptide repeat	NA|259aa|up_6|NZ_CP020814.1_3902279_3903056_+	pfam13472, Lipase_GDSL_2, GDSL-like Lipase/Acylhydrolase family	NA|53aa|up_5|NZ_CP020814.1_3903230_3903389_+	NA	NA|476aa|up_4|NZ_CP020814.1_3903729_3905157_+	pfam01235, Na_Ala_symp, Sodium:alanine symporter family	NA|529aa|up_3|NZ_CP020814.1_3905445_3907032_+	PRK14016, PRK14016, cyanophycin synthetase; Provisional	NA|153aa|up_2|NZ_CP020814.1_3907191_3907650_+	NA	NA|303aa|up_1|NZ_CP020814.1_3907699_3908608_-	PRK09379, PRK09379, LytR family transcriptional regulator	NA|209aa|up_0|NZ_CP020814.1_3908845_3909472_+	TIGR02869, Spore_cortex-lytic_enzyme, spore cortex-lytic enzyme	NA|252aa|down_0|NZ_CP020814.1_3909960_3910716_-	PRK12429, PRK12429, 3-hydroxybutyrate dehydrogenase; Provisional	NA|60aa|down_1|NZ_CP020814.1_3910974_3911154_+	pfam08141, SspH, Small acid-soluble spore protein H family	NA|217aa|down_2|NZ_CP020814.1_3911648_3912299_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|570aa|down_3|NZ_CP020814.1_3912801_3914511_-	COG1001, AdeC, Adenine deaminase [Nucleotide transport and metabolism]	NA|354aa|down_4|NZ_CP020814.1_3914526_3915588_-	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|263aa|down_5|NZ_CP020814.1_3915602_3916391_-	COG1177, PotC, ABC-type spermidine/putrescine transport system, permease component II [Amino acid transport and metabolism]	NA|274aa|down_6|NZ_CP020814.1_3916398_3917220_-	COG1176, PotB, ABC-type spermidine/putrescine transport system, permease component I [Amino acid transport and metabolism]	NA|312aa|down_7|NZ_CP020814.1_3917289_3918225_-	COG1957, URH1, Inosine-uridine nucleoside N-ribohydrolase [Nucleotide transport and metabolism]	NA|350aa|down_8|NZ_CP020814.1_3918240_3919290_-	cd13589, PBP2_polyamine_RpCGA009, The periplasmic-binding component of an uncharacterized ABC transport system from Rhodopseudomonas palustris CGA009 and related proteins; contains the type 2 periplasmic-binding fold	NA|114aa|down_9|NZ_CP020814.1_3919507_3919849_-	NA
GCF_002109385.1_ASM210938v1	NZ_CP020814	Alkalihalobacillus krulwichiae strain AM31D chromosome, complete genome	5	3963307-3963385	4	CRISPRCasFinder	no		cas3,RT,PrimPol,csa3,DEDDh,DinG	Orphan	TTTGTCCTTTAGACGCTTCCTCTCGGA	27	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,PrimPol,csa3,DEDDh,DinG	NA|144aa|up_7|NZ_CP020814.1_3953247_3953679_-,NA|240aa|down_3|NZ_CP020814.1_3968940_3969660_-,NA|47aa|down_5|NZ_CP020814.1_3970900_3971041_+	NA|1078aa|up_9|NZ_CP020814.1_3948204_3951438_-	cd06244, M14-like, Peptidase M14-like domain; uncharacterized subgroup	NA|107aa|up_8|NZ_CP020814.1_3952830_3953151_-	pfam10925, DUF2680, Protein of unknown function (DUF2680)	NA|144aa|up_7|NZ_CP020814.1_3953247_3953679_-	NA	NA|309aa|up_6|NZ_CP020814.1_3953740_3954667_-	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|291aa|up_5|NZ_CP020814.1_3954834_3955707_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|187aa|up_4|NZ_CP020814.1_3955737_3956298_-	cd03357, LbH_MAT_GAT, Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates	NA|409aa|up_3|NZ_CP020814.1_3956517_3957744_-	cd03682, ClC_sycA_like, ClC sycA-like chloride channel proteins	NA|841aa|up_2|NZ_CP020814.1_3958089_3960612_-	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|248aa|up_1|NZ_CP020814.1_3960676_3961420_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|394aa|up_0|NZ_CP020814.1_3961813_3962995_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|821aa|down_0|NZ_CP020814.1_3963460_3965923_-	cd01948, EAL, EAL domain	NA|475aa|down_1|NZ_CP020814.1_3966366_3967791_-	TIGR00931, Uncharacterized_Na+/H+_antiporter_HI_1107, Na+/H+ antiporter NhaC	NA|373aa|down_2|NZ_CP020814.1_3967825_3968944_-	cd08018, M20_Acy1_amhX-like, M20 Peptidase aminoacylase 1 amhX-like subfamily	NA|240aa|down_3|NZ_CP020814.1_3968940_3969660_-	NA	NA|373aa|down_4|NZ_CP020814.1_3969661_3970780_-	cd03319, L-Ala-DL-Glu_epimerase, L-Ala-D/L-Glu epimerase catalyzes the epimerization of L-Ala-D/L-Glu and other dipeptides	NA|47aa|down_5|NZ_CP020814.1_3970900_3971041_+	NA	NA|352aa|down_6|NZ_CP020814.1_3971157_3972213_+	pfam13545, HTH_Crp_2, Crp-like helix-turn-helix domain	NA|192aa|down_7|NZ_CP020814.1_3972251_3972827_-	pfam10648, Gmad2, Immunoglobulin-like domain of bacterial spore germination	NA|95aa|down_8|NZ_CP020814.1_3972780_3973065_-	pfam01476, LysM, LysM domain	NA|327aa|down_9|NZ_CP020814.1_3974755_3975736_-	COG0506, PutA, Proline dehydrogenase [Amino acid transport and metabolism]
