assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000015465.1_ASM1546v1	NC_008784	Burkholderia mallei SAVP1 chromosome II, complete sequence	1	20888-21050	1	PILER-CR	no		csa3,cas3	Orphan	GGCGGGCCGGCCGCCG---------------------------CGCGGGCG	51	0	0	NA	NA	NA	2	2	Orphan	WYL,cas3,DEDDh,DinG,csa3	NA|62aa|up_6|NC_008784.1_16248_16434_-,NA|95aa|up_5|NC_008784.1_16450_16735_-,NA|47aa|up_4|NC_008784.1_17132_17273_-,NA|77aa|up_3|NC_008784.1_17725_17956_+,NA|105aa|down_7|NC_008784.1_33857_34172_+	NA|1078aa|up_9|NC_008784.1_10070_13304_-	TIGR00914, Nickel-cobalt-cadmium_resistance_protein_NccA, heavy metal efflux pump, CzcA family	NA|490aa|up_8|NC_008784.1_13349_14819_-	TIGR00999, Nickel-cobalt-cadmium_resistance_protein_NccB, Membrane Fusion Protein cluster 2 (function with RND porters)	NA|453aa|up_7|NC_008784.1_14829_16188_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|62aa|up_6|NC_008784.1_16248_16434_-	NA	NA|95aa|up_5|NC_008784.1_16450_16735_-	NA	NA|47aa|up_4|NC_008784.1_17132_17273_-	NA	NA|77aa|up_3|NC_008784.1_17725_17956_+	NA	NA|163aa|up_2|NC_008784.1_17943_18432_-	pfam04008, Adenosine_kin, Adenosine specific kinase	NA|162aa|up_1|NC_008784.1_18758_19244_+	cd04332, YbaK_like, YbaK-like	NA|452aa|up_0|NC_008784.1_19511_20867_+	TIGR03810, arginine-ornithine_antiporter, arginine-ornithine antiporter	NA|203aa|down_0|NC_008784.1_21166_21775_+	COG0705, COG0705, Membrane associated serine protease [Amino acid transport and metabolism]	NA|667aa|down_1|NC_008784.1_21913_23914_-	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|211aa|down_2|NC_008784.1_24529_25162_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|224aa|down_3|NC_008784.1_27170_27842_-	COG2345, COG2345, Predicted transcriptional regulator [Transcription]	NA|366aa|down_4|NC_008784.1_27992_29090_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|1034aa|down_5|NC_008784.1_29086_32188_+	PRK09579, PRK09579, multidrug efflux RND transporter permease subunit	NA|513aa|down_6|NC_008784.1_32322_33861_+	PRK09837, PRK09837, Cu(I)/Ag(I) efflux RND transporter outer membrane protein	NA|105aa|down_7|NC_008784.1_33857_34172_+	NA	NA|215aa|down_8|NC_008784.1_34190_34835_+	PRK00393, ribA, GTP cyclohydrolase II RibA	NA|566aa|down_9|NC_008784.1_34831_36529_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment
GCF_000015465.1_ASM1546v1	NC_008784	Burkholderia mallei SAVP1 chromosome II, complete sequence	2	1056180-1056325	2,1	PILER-CR,CRISPRCasFinder	no		csa3,cas3	Orphan	GCAGAGGCGCCGCCGCCGGAATACCCGATCATCGGGATGCTCGATCGCCGAAG,CGGAATGCCCGATCATCGGGATGC	53,24	0	0	NA	NA	NA:NA	2,1	2	Orphan	WYL,cas3,DEDDh,DinG,csa3	NA|109aa|up_9|NC_008784.1_1045640_1045967_+,NA|37aa|up_5|NC_008784.1_1048640_1048751_-,NA|144aa|up_3|NC_008784.1_1049994_1050426_+,NA|125aa|down_5|NC_008784.1_1065536_1065911_+	NA|109aa|up_9|NC_008784.1_1045640_1045967_+	NA	NA|102aa|up_8|NC_008784.1_1046298_1046604_-	cd05299, CtBP_dh, C-terminal binding protein (CtBP), D-isomer-specific 2-hydroxyacid dehydrogenases related repressor	NA|147aa|up_7|NC_008784.1_1046689_1047130_+	cd07821, PYR_PYL_RCAR_like, Pyrabactin resistance 1 (PYR1), PYR1-like (PYL), regulatory component of abscisic acid receptors (RCARs), and related proteins	NA|372aa|up_6|NC_008784.1_1047464_1048580_-	pfam10604, Polyketide_cyc2, Polyketide cyclase / dehydrase and lipid transport	NA|37aa|up_5|NC_008784.1_1048640_1048751_-	NA	NA|312aa|up_4|NC_008784.1_1048877_1049813_+	cd08417, PBP2_Nitroaromatics_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators that involved in the catabolism of nitroaromatic/naphthalene compounds and that of related regulators; contains the type 2 periplasmic binding fold	NA|144aa|up_3|NC_008784.1_1049994_1050426_+	NA	NA|553aa|up_2|NC_008784.1_1050725_1052384_+	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|758aa|up_1|NC_008784.1_1052694_1054968_-	pfam07992, Pyr_redox_2, Pyridine nucleotide-disulphide oxidoreductase	NA|132aa|up_0|NC_008784.1_1055100_1055496_-	COG1734, DksA, DnaK suppressor protein [Signal transduction mechanisms]	NA|422aa|down_0|NC_008784.1_1056882_1058148_+	TIGR01849, unnamed_protein_product, polyhydroxyalkanoate depolymerase, intracellular	NA|595aa|down_1|NC_008784.1_1058453_1060238_+	PRK06948, PRK06948, ribonucleotide reductase-like protein; Provisional	NA|601aa|down_2|NC_008784.1_1060312_1062115_+	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]	NA|344aa|down_3|NC_008784.1_1062643_1063675_+	PRK07877, PRK07877, Rv1355c family protein	NA|611aa|down_4|NC_008784.1_1063671_1065504_+	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|125aa|down_5|NC_008784.1_1065536_1065911_+	NA	NA|150aa|down_6|NC_008784.1_1065883_1066333_-	cd17775, CBS_pair_bact_arch, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains  present in bacteria and archaea	NA|218aa|down_7|NC_008784.1_1066786_1067440_+	cd02142, McbC_SagB-like_oxidoreductase, oxidase similar to the microcin B17 processing protein McbC	NA|145aa|down_8|NC_008784.1_1067507_1067942_-	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|376aa|down_9|NC_008784.1_1068347_1069475_-	cd00342, gram_neg_porins, Porins form aqueous channels for the diffusion of small hydrophillic molecules across the outer membrane
GCF_000015465.1_ASM1546v1	NC_008784	Burkholderia mallei SAVP1 chromosome II, complete sequence	3	1288637-1288729	2	CRISPRCasFinder	no		csa3,cas3	Orphan	TCGCCGCCGCGCGCGACGACGCGC	24	0	0	NA	NA	NA	1	1	Orphan	WYL,cas3,DEDDh,DinG,csa3	NA|86aa|up_6|NC_008784.1_1284404_1284662_-,NA|168aa|down_3|NC_008784.1_1293338_1293842_+,NA|170aa|down_4|NC_008784.1_1293899_1294409_-,NA|53aa|down_6|NC_008784.1_1296606_1296765_-	NA|393aa|up_9|NC_008784.1_1279890_1281069_-	PRK07058, PRK07058, acetate/propionate family kinase	NA|477aa|up_8|NC_008784.1_1281065_1282496_-	PRK08190, PRK08190, bifunctional enoyl-CoA hydratase/phosphate acetyltransferase; Validated	NA|598aa|up_7|NC_008784.1_1282479_1284273_-	TIGR01838, Poly-beta-hydroxybutyrate_polymerase, poly(R)-hydroxyalkanoic acid synthase, class I	NA|86aa|up_6|NC_008784.1_1284404_1284662_-	NA	NA|535aa|up_5|NC_008784.1_1284633_1286238_+	PRK12597, PRK12597, F0F1 ATP synthase subunit beta; Provisional	NA|152aa|up_4|NC_008784.1_1286227_1286683_+	PRK13447, PRK13447, F0F1 ATP synthase subunit epsilon; Provisional	NA|164aa|up_3|NC_008784.1_1286679_1287171_+	pfam09527, ATPase_gene1, Putative F0F1-ATPase subunit Ca2+/Mg2+ transporter	NA|103aa|up_2|NC_008784.1_1287163_1287472_+	TIGR03165, F1F0_chp_2, F1/F0 ATPase, Methanosarcina type, subunit 2	NA|234aa|up_1|NC_008784.1_1287468_1288170_+	PRK13421, PRK13421, F0F1 ATP synthase subunit A; Provisional	NA|84aa|up_0|NC_008784.1_1288166_1288418_+	PRK13468, PRK13468, F0F1 ATP synthase subunit C; Provisional	NA|671aa|down_0|NC_008784.1_1289164_1291177_+	PRK13343, PRK13343, F0F1 ATP synthase subunit alpha; Provisional	NA|283aa|down_1|NC_008784.1_1291173_1292022_+	pfam00231, ATP-synt, ATP synthase	NA|342aa|down_2|NC_008784.1_1292236_1293262_+	cd08297, CAD3, Cinnamyl alcohol dehydrogenases (CAD)	NA|168aa|down_3|NC_008784.1_1293338_1293842_+	NA	NA|170aa|down_4|NC_008784.1_1293899_1294409_-	NA	NA|535aa|down_5|NC_008784.1_1294910_1296515_+	PRK15048, PRK15048, methyl-accepting chemotaxis protein II; Provisional	NA|53aa|down_6|NC_008784.1_1296606_1296765_-	NA	NA|612aa|down_7|NC_008784.1_1296864_1298700_-	PRK13557, PRK13557, histidine kinase; Provisional	NA|375aa|down_8|NC_008784.1_1298849_1299974_-	COG0842, COG0842, ABC-type multidrug transport system, permease component [Defense mechanisms]	NA|1072aa|down_9|NC_008784.1_1299976_1303192_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]
