assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001718695.1_ASM171869v1	NZ_CP013421	Burkholderia ubonensis strain MSMB0783 chromosome 3, complete sequence	1	135533-135620	1	CRISPRCasFinder	no		PD-DExK,RT	Orphan	GCCCGACCTGCGACGGCGATGCG	23	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,csa3,cas3,DinG,PD-DExK,RT	NA,NA	NA|161aa|up_9|NZ_CP013421.1_124884_125367_-	pfam13276, HTH_21, HTH-like domain	NA|537aa|up_8|NZ_CP013421.1_125730_127341_-	COG2192, COG2192, Predicted carbamoyl transferase, NodU family [Posttranslational modification, protein turnover, chaperones]	NA|356aa|up_7|NZ_CP013421.1_127372_128440_-	COG3376, HoxN, High-affinity nickel permease [Inorganic ion transport and metabolism]	NA|358aa|up_6|NZ_CP013421.1_129098_130172_+	COG1740, HyaA, Ni,Fe-hydrogenase I small subunit [Energy production and conversion]	NA|619aa|up_5|NZ_CP013421.1_130229_132086_+	pfam00374, NiFeSe_Hases, Nickel-dependent hydrogenase	NA|237aa|up_4|NZ_CP013421.1_132204_132915_+	TIGR02125, CytB-hydogenase, Ni/Fe-hydrogenase, b-type cytochrome subunit	NA|226aa|up_3|NZ_CP013421.1_132922_133600_+	TIGR00140, Hydrogenase_expression/formation_protein_HupD, hydrogenase expression/formation protein	NA|117aa|up_2|NZ_CP013421.1_133590_133941_+	pfam01455, HupF_HypC, HupF/HypC family	NA|178aa|up_1|NZ_CP013421.1_133937_134471_+	cd02965, HyaE, HyaE family; HyaE is also called HupG and HoxO	NA|288aa|up_0|NZ_CP013421.1_134488_135352_+	pfam04809, HupH_C, HupH hydrogenase expression protein, C-terminal conserved region	NA|418aa|down_0|NZ_CP013421.1_136129_137383_+	COG3259, FrhA, Coenzyme F420-reducing hydrogenase, alpha subunit [Energy production and conversion]	NA|116aa|down_1|NZ_CP013421.1_137370_137718_+	pfam01155, HypA, Hydrogenase/urease nickel incorporation, metallochaperone, hypA	NA|346aa|down_2|NZ_CP013421.1_137747_138785_+	PRK10463, PRK10463, hydrogenase nickel incorporation protein HypB; Provisional	NA|392aa|down_3|NZ_CP013421.1_138781_139957_+	COG0068, HypF, Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]	NA|86aa|down_4|NZ_CP013421.1_139947_140205_+	pfam01455, HupF_HypC, HupF/HypC family	NA|379aa|down_5|NZ_CP013421.1_140201_141338_+	PRK15062, PRK15062, hydrogenase isoenzymes formation protein HypD; Provisional	NA|359aa|down_6|NZ_CP013421.1_141334_142411_+	cd02197, HypE, HypE (Hydrogenase expression/formation protein)	NA|501aa|down_7|NZ_CP013421.1_142531_144034_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|346aa|down_8|NZ_CP013421.1_144030_145068_+	COG1740, HyaA, Ni,Fe-hydrogenase I small subunit [Energy production and conversion]	NA|486aa|down_9|NZ_CP013421.1_145076_146534_+	NF033181, NiFeSe_hydrog, nickel-dependent hydrogenase large subunit
GCF_001718695.1_ASM171869v1	NZ_CP013421	Burkholderia ubonensis strain MSMB0783 chromosome 3, complete sequence	2	322412-322662	1	PILER-CR	no		PD-DExK,RT	Orphan	GCCCGTGGAGAGCGAGCTCACGCCGGTCGACAGCGAGCTGGCCGTCGTCGACGTCGAGG	59	2	7	322471-322510|322471-322510|322471-322510|322471-322510|322471-322510|322471-322510|322570-322606	NZ_CP013421.1_319624-319663|NZ_CP013421.1_319912-319951|NZ_CP013421.1_320296-320335|NZ_CP013421.1_320647-320686|NZ_CP013421.1_323713-323752|NZ_CP013421.1_320392-320431|NZ_CP013421.1_320491-320527	NA	2	2	Orphan	WYL,DEDDh,csa3,cas3,DinG,PD-DExK,RT	NA|79aa|up_9|NZ_CP013421.1_307317_307554_-,NA|104aa|up_7|NZ_CP013421.1_309587_309899_-,NA|116aa|up_6|NZ_CP013421.1_309936_310284_-,NA|209aa|up_3|NZ_CP013421.1_312292_312919_-,NA|158aa|down_0|NZ_CP013421.1_325747_326221_-	NA|79aa|up_9|NZ_CP013421.1_307317_307554_-	NA	NA|623aa|up_8|NZ_CP013421.1_307674_309543_-	sd00006, TPR, Tetratricopeptide repeat	NA|104aa|up_7|NZ_CP013421.1_309587_309899_-	NA	NA|116aa|up_6|NZ_CP013421.1_309936_310284_-	NA	NA|284aa|up_5|NZ_CP013421.1_310297_311149_-	TIGR02925, hypothetical_protein_Nmul_A0241, peptidyl-prolyl cis-trans isomerase, EpsD family	NA|381aa|up_4|NZ_CP013421.1_311153_312296_-	pfam10933, DUF2827, Protein of unknown function (DUF2827)	NA|209aa|up_3|NZ_CP013421.1_312292_312919_-	NA	NA|374aa|up_2|NZ_CP013421.1_312915_314037_-	pfam10933, DUF2827, Protein of unknown function (DUF2827)	NA|378aa|up_1|NZ_CP013421.1_314033_315167_-	pfam10933, DUF2827, Protein of unknown function (DUF2827)	NA|232aa|up_0|NZ_CP013421.1_315355_316051_-	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|158aa|down_0|NZ_CP013421.1_325747_326221_-	NA	NA|236aa|down_1|NZ_CP013421.1_326265_326973_-	pfam00563, EAL, EAL domain	NA|208aa|down_2|NZ_CP013421.1_327475_328099_+	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|458aa|down_3|NZ_CP013421.1_328142_329516_+	pfam05637, Glyco_transf_34, galactosyl transferase GMA12/MNN10 family	NA|471aa|down_4|NZ_CP013421.1_329488_330901_+	COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer membrane]	NA|79aa|down_5|NZ_CP013421.1_331637_331874_-	pfam03544, TonB_C, Gram-negative bacterial TonB protein C-terminal	NA|281aa|down_6|NZ_CP013421.1_332158_333001_-	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|392aa|down_7|NZ_CP013421.1_334861_336036_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|319aa|down_8|NZ_CP013421.1_337229_338186_-	pfam08843, AbiEii, Nucleotidyl transferase AbiEii toxin, Type IV TA system	NA|274aa|down_9|NZ_CP013421.1_338169_338991_-	pfam11459, AbiEi_3, Transcriptional regulator, AbiEi antitoxin, Type IV TA system
GCF_001718695.1_ASM171869v1	NZ_CP013420	Burkholderia ubonensis strain MSMB0783 chromosome 1, complete sequence	1	1709939-1710046	1	CRISPRCasFinder	no		WYL,DEDDh,csa3,cas3	Orphan	TGCCGAGTGCGCCGGTCAATGCGCCGGCCGGGTTACCGC	39	0	0	NA	NA	NA	1	1	Orphan	WYL,DEDDh,csa3,cas3,DinG,PD-DExK,RT	NA,NA	NA|57aa|up_9|NZ_CP013420.1_1701262_1701433_-	COG1773, COG1773, Rubredoxin [Energy production and conversion]	NA|289aa|up_8|NZ_CP013420.1_1701652_1702519_+	cd01169, HMPP_kinase, 4-amino-5-hydroxymethyl-2-methyl-pyrimidine phosphate kinase (HMPP-kinase) catalyzes two consecutive phosphorylation steps in the thiamine phosphate biosynthesis pathway, leading to the synthesis of vitamin B1	NA|547aa|up_7|NZ_CP013420.1_1702665_1704306_-	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|98aa|up_6|NZ_CP013420.1_1704352_1704646_-	PRK00364, groES, co-chaperonin GroES; Reviewed	NA|170aa|up_5|NZ_CP013420.1_1705019_1705529_-	cd07824, SRPBCC_6, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|211aa|up_4|NZ_CP013420.1_1705552_1706185_-	pfam07608, DUF1571, Protein of unknown function (DUF1571)	NA|140aa|up_3|NZ_CP013420.1_1706181_1706601_-	cd04223, N2OR_C, The C-terminal cupredoxin domain of Nitrous-oxide reductase	NA|264aa|up_2|NZ_CP013420.1_1706632_1707424_-	COG5662, COG5662, Predicted transmembrane transcriptional regulator (anti-sigma factor) [Transcription]	NA|172aa|up_1|NZ_CP013420.1_1707423_1707939_-	PRK12511, PRK12511, RNA polymerase sigma factor; Provisional	NA|120aa|up_0|NZ_CP013420.1_1707988_1708348_-	COG4315, COG4315, Uncharacterized protein conserved in bacteria [Function unknown]	NA|216aa|down_0|NZ_CP013420.1_1711191_1711839_+	COG2964, COG2964, Uncharacterized protein conserved in bacteria [Function unknown]	NA|244aa|down_1|NZ_CP013420.1_1711896_1712628_+	cd03024, DsbA_FrnE, DsbA family, FrnE subfamily; FrnE is a DsbA-like protein containing a CXXC motif	NA|481aa|down_2|NZ_CP013420.1_1712731_1714174_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|263aa|down_3|NZ_CP013420.1_1714332_1715121_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|611aa|down_4|NZ_CP013420.1_1715241_1717074_-	COG2939, COG2939, Carboxypeptidase C (cathepsin A) [Amino acid transport and metabolism]	NA|700aa|down_5|NZ_CP013420.1_1717677_1719777_-	pfam10605, 3HBOH, 3HB-oligomer hydrolase (3HBOH)	NA|125aa|down_6|NZ_CP013420.1_1719886_1720261_-	pfam04972, BON, BON domain	NA|475aa|down_7|NZ_CP013420.1_1720699_1722124_+	cd13520, PBP2_TAXI_TRAP, Substrate binding domain of TAXI proteins of the tripartite ATP-independent periplasmic transporters; the type 2 periplasmic binding protein fold	NA|213aa|down_8|NZ_CP013420.1_1722236_1722875_+	cd03022, DsbA_HCCA_Iso, DsbA family, 2-hydroxychromene-2-carboxylate (HCCA) isomerase subfamily; HCCA isomerase is a glutathione (GSH) dependent enzyme involved in the naphthalene catabolic pathway	NA|134aa|down_9|NZ_CP013420.1_1723966_1724368_-	COG4460, COG4460, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_001718695.1_ASM171869v1	NZ_CP013422	Burkholderia ubonensis strain MSMB0783 chromosome 2, complete sequence	1	2029084-2029305	1	CRISPRCasFinder	no		DEDDh,cas3,csa3,DinG	Orphan	GCCTGCGCCGCCGCCGAGCGCGCCGCCGAT	30	0	0	NA	NA	NA	3	3	Orphan	WYL,DEDDh,csa3,cas3,DinG,PD-DExK,RT	NA|126aa|up_5|NZ_CP013422.1_2024224_2024602_+,NA|76aa|up_4|NZ_CP013422.1_2024673_2024901_+,NA|124aa|up_2|NZ_CP013422.1_2026070_2026442_-,NA|63aa|down_0|NZ_CP013422.1_2030045_2030234_-,NA|193aa|down_5|NZ_CP013422.1_2034843_2035422_-	NA|372aa|up_9|NZ_CP013422.1_2020176_2021292_+	cd06250, M14_PaAOTO_like, Peptidase M14 Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA)-like subfamily; subgroup includes Pseudomonas aeruginosa AotO	NA|270aa|up_8|NZ_CP013422.1_2021386_2022196_-	COG1464, NlpA, ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism]	NA|383aa|up_7|NZ_CP013422.1_2022512_2023661_+	cd00342, gram_neg_porins, Porins form aqueous channels for the diffusion of small hydrophillic molecules across the outer membrane	NA|87aa|up_6|NZ_CP013422.1_2023874_2024135_+	pfam11065, DUF2866, Protein of unknown function (DUF2866)	NA|126aa|up_5|NZ_CP013422.1_2024224_2024602_+	NA	NA|76aa|up_4|NZ_CP013422.1_2024673_2024901_+	NA	NA|204aa|up_3|NZ_CP013422.1_2025448_2026060_-	COG2860, COG2860, Predicted membrane protein [Function unknown]	NA|124aa|up_2|NZ_CP013422.1_2026070_2026442_-	NA	NA|268aa|up_1|NZ_CP013422.1_2026660_2027464_+	COG3570, StrB, Streptomycin 6-kinase [Defense mechanisms]	NA|279aa|up_0|NZ_CP013422.1_2027542_2028379_-	PRK11171, PRK11171, (S)-ureidoglycine aminohydrolase	NA|63aa|down_0|NZ_CP013422.1_2030045_2030234_-	NA	NA|140aa|down_1|NZ_CP013422.1_2030488_2030908_-	COG0537, Hit, Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases [Nucleotide transport and metabolism / Carbohydrate transport and metabolism / General function prediction only]	NA|469aa|down_2|NZ_CP013422.1_2031160_2032567_+	COG1680, AmpC, Beta-lactamase class C and other penicillin binding proteins [Defense mechanisms]	NA|208aa|down_3|NZ_CP013422.1_2032619_2033243_-	cd03180, GST_C_2, C-terminal, alpha helical domain of an unknown subfamily 2 of Glutathione S-transferases	NA|401aa|down_4|NZ_CP013422.1_2033555_2034758_+	PRK13656, PRK13656, enoyl-[acyl-carrier-protein] reductase FabV	NA|193aa|down_5|NZ_CP013422.1_2034843_2035422_-	NA	NA|262aa|down_6|NZ_CP013422.1_2035534_2036320_-	pfam10086, DUF2324, Putative membrane peptidase family (DUF2324)	NA|178aa|down_7|NZ_CP013422.1_2036377_2036911_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|159aa|down_8|NZ_CP013422.1_2037392_2037869_+	cd08895, SRPBCC_CalC_Aha1-like_2, Putative hydrophobic ligand-binding SRPBCC domain of an uncharacterized subgroup of CalC- and Aha1-like proteins	NA|213aa|down_9|NZ_CP013422.1_2038005_2038644_+	cd03022, DsbA_HCCA_Iso, DsbA family, 2-hydroxychromene-2-carboxylate (HCCA) isomerase subfamily; HCCA isomerase is a glutathione (GSH) dependent enzyme involved in the naphthalene catabolic pathway
