assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000215065.1_ASM21506v1	NC_015587	Hydrogenobaculum sp. SHO, complete sequence	1	199538-199636	1	CRISPRCasFinder	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	AAATGCGGACGTTACCGCTCTATT	24	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|152aa|up_5|NC_015587.1_195220_195676_+,NA|76aa|up_0|NC_015587.1_198969_199197_-,NA|207aa|down_0|NC_015587.1_199879_200500_-,NA|65aa|down_1|NC_015587.1_200511_200706_-,NA|193aa|down_2|NC_015587.1_200751_201330_-,NA|138aa|down_9|NC_015587.1_206142_206556_+	NA|155aa|up_9|NC_015587.1_190666_191131_+	COG1905, NuoE, NADH:ubiquinone oxidoreductase 24 kD subunit [Energy production and conversion]	NA|426aa|up_8|NC_015587.1_191105_192383_+	COG1894, NuoF, NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [Energy production and conversion]	NA|555aa|up_7|NC_015587.1_192399_194064_+	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|377aa|up_6|NC_015587.1_194073_195204_+	PRK00112, tgt, queuine tRNA-ribosyltransferase; Provisional	NA|152aa|up_5|NC_015587.1_195220_195676_+	NA	NA|218aa|up_4|NC_015587.1_195751_196405_-	TIGR01093, 3-dehydroquinate_dehydratase, 3-dehydroquinate dehydratase, type I	NA|386aa|up_3|NC_015587.1_196404_197562_-	PRK05382, PRK05382, chorismate synthase; Validated	NA|171aa|up_2|NC_015587.1_197898_198411_-	cd04645, LbH_gamma_CA_like, Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E	NA|149aa|up_1|NC_015587.1_198436_198883_-	cd00851, MTH1175, This uncharacterized conserved protein belongs to a family of iron-molybdenum cluster-binding proteins that includes NifX, NifB, and NifY, all of which are involved in the synthesis of an iron-molybdenum cofactor (FeMo-co) that binds the active site of the dinitrogenase enzyme	NA|76aa|up_0|NC_015587.1_198969_199197_-	NA	NA|207aa|down_0|NC_015587.1_199879_200500_-	NA	NA|65aa|down_1|NC_015587.1_200511_200706_-	NA	NA|193aa|down_2|NC_015587.1_200751_201330_-	NA	NA|311aa|down_3|NC_015587.1_201624_202557_+	TIGR02197, heptose_epim, ADP-L-glycero-D-manno-heptose-6-epimerase	NA|322aa|down_4|NC_015587.1_202553_203519_+	TIGR01138, Cysteine_synthase_B, cysteine synthase B	NA|172aa|down_5|NC_015587.1_203520_204036_+	pfam02620, DUF177, Uncharacterized ACR, COG1399	NA|61aa|down_6|NC_015587.1_204016_204199_+	PRK12286, rpmF, 50S ribosomal protein L32; Reviewed	NA|340aa|down_7|NC_015587.1_204210_205230_+	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|304aa|down_8|NC_015587.1_205229_206141_+	PRK09352, PRK09352, beta-ketoacyl-ACP synthase 3	NA|138aa|down_9|NC_015587.1_206142_206556_+	NA
GCF_000215065.1_ASM21506v1	NC_015587	Hydrogenobaculum sp. SHO, complete sequence	2	340588-340887	1,2,1	CRT,CRISPRCasFinder,PILER-CR	no	cas6,cas4,cas1,csm3gr7,DEDDh	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	ANGTTTTNTNTGTNCCTATAGGGGATTGAAAC,GTTTTTTGTGTACCTATAGGGGATTGAAAC,GGTTTTTTGTGTACCTATAGGGGATTGAAACG	32,30,32	0	0	NA	NA	NA:NA:NA	4,4,2	4	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|139aa|up_7|NC_015587.1_332924_333341_+,NA|400aa|down_7|NC_015587.1_350236_351436_-	NA|117aa|up_9|NC_015587.1_331673_332024_+	cd06664, IscU_like, Iron-sulfur cluster scaffold-like proteins	NA|309aa|up_8|NC_015587.1_332001_332928_+	pfam04463, DUF523, Protein of unknown function (DUF523)	NA|139aa|up_7|NC_015587.1_332924_333341_+	NA	NA|536aa|up_6|NC_015587.1_333350_334958_+	PRK01611, argS, arginyl-tRNA synthetase; Reviewed	NA|244aa|up_5|NC_015587.1_335030_335762_+	cd13519, PBP2_PEB3_AcfC, Ligand-binding domain of a glycoprotein adhesion and an accessory colonization factor, a member of the type 2 periplasmic binding fold superfamily	NA|230aa|up_4|NC_015587.1_335924_336614_-	cd18669, M20_18_42, M20, M18 and M42 Zn-peptidases include aminopeptidases and carboxypeptidases	NA|512aa|up_3|NC_015587.1_336854_338390_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	cas6|175aa|up_2|NC_015587.1_338463_338988_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas6|72aa|up_1|NC_015587.1_338984_339200_+	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|166aa|up_0|NC_015587.1_339204_339702_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	csm3gr7|440aa|down_0|NC_015587.1_342310_343630_+	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	NA|203aa|down_1|NC_015587.1_344158_344767_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	DEDDh|197aa|down_2|NC_015587.1_344786_345377_-	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	NA|317aa|down_3|NC_015587.1_345390_346341_-	pfam09312, SurA_N, SurA N-terminal domain	NA|422aa|down_4|NC_015587.1_347568_348834_-	cd18773, PDC1_HK_sensor, first PDC (PhoQ/DcuS/CitA) domain of methyl-accepting chemotaxis proteins, diguanylate-cyclase and similar domains	NA|190aa|down_5|NC_015587.1_348820_349390_-	cd02165, NMNAT, Nicotinamide/nicotinate mononucleotide adenylyltransferase	NA|268aa|down_6|NC_015587.1_349380_350184_-	pfam01904, DUF72, Protein of unknown function DUF72	NA|400aa|down_7|NC_015587.1_350236_351436_-	NA	NA|326aa|down_8|NC_015587.1_351416_352394_-	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]	NA|620aa|down_9|NC_015587.1_352386_354246_-	COG1034, NuoG, NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) [Energy production and conversion]
GCF_000215065.1_ASM21506v1	NC_015587	Hydrogenobaculum sp. SHO, complete sequence	3	346996-347159	3,2	CRISPRCasFinder,PILER-CR	no	cas6,cas4,cas1,csm3gr7,DEDDh	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	GAGTTTCATCTGAACCGTGTGGGTTAAGAA,GAGTTTCATCTGAACCGTGTGGGTTAAGAAGC	30,32	0	0	NA	NA	NA:NA	2,2	2	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA,NA|400aa|down_3|NC_015587.1_350236_351436_-	NA|230aa|up_9|NC_015587.1_335924_336614_-	cd18669, M20_18_42, M20, M18 and M42 Zn-peptidases include aminopeptidases and carboxypeptidases	NA|512aa|up_8|NC_015587.1_336854_338390_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	cas6|175aa|up_7|NC_015587.1_338463_338988_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas6|72aa|up_6|NC_015587.1_338984_339200_+	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|166aa|up_5|NC_015587.1_339204_339702_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|301aa|up_4|NC_015587.1_339705_340608_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	csm3gr7|440aa|up_3|NC_015587.1_342310_343630_+	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	NA|203aa|up_2|NC_015587.1_344158_344767_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	DEDDh|197aa|up_1|NC_015587.1_344786_345377_-	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	NA|317aa|up_0|NC_015587.1_345390_346341_-	pfam09312, SurA_N, SurA N-terminal domain	NA|422aa|down_0|NC_015587.1_347568_348834_-	cd18773, PDC1_HK_sensor, first PDC (PhoQ/DcuS/CitA) domain of methyl-accepting chemotaxis proteins, diguanylate-cyclase and similar domains	NA|190aa|down_1|NC_015587.1_348820_349390_-	cd02165, NMNAT, Nicotinamide/nicotinate mononucleotide adenylyltransferase	NA|268aa|down_2|NC_015587.1_349380_350184_-	pfam01904, DUF72, Protein of unknown function DUF72	NA|400aa|down_3|NC_015587.1_350236_351436_-	NA	NA|326aa|down_4|NC_015587.1_351416_352394_-	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]	NA|620aa|down_5|NC_015587.1_352386_354246_-	COG1034, NuoG, NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) [Energy production and conversion]	NA|225aa|down_6|NC_015587.1_354235_354910_-	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase	NA|284aa|down_7|NC_015587.1_354902_355754_-	COG2445, COG2445, Uncharacterized conserved protein [Function unknown]	NA|611aa|down_8|NC_015587.1_355798_357631_-	PRK05192, PRK05192, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis enzyme MnmG	NA|175aa|down_9|NC_015587.1_357636_358161_-	COG2143, COG2143, Thioredoxin-related protein [Posttranslational modification, protein turnover, chaperones]
GCF_000215065.1_ASM21506v1	NC_015587	Hydrogenobaculum sp. SHO, complete sequence	4	591474-592162	3,4,2	PILER-CR,CRISPRCasFinder,CRT	no	Cas14b_CAS-V-F,cas6,cas7,cas5,cas3,cas4,cas14j	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	GTTTCATCTGAACCGTGTGGGATATAAA,GTTTCATCTGAACCGTGTGGGATATAAAGT,GTTTCATCTGAACCGTGTGGGATATAAA	28,30,28	0	0	NA	NA	NA:NA:NA	10,10,10	10	TypeV	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|116aa|up_9|NC_015587.1_582569_582917_-,NA|49aa|up_8|NC_015587.1_582923_583070_-,NA|54aa|up_7|NC_015587.1_583071_583233_-,NA|144aa|up_0|NC_015587.1_589040_589472_-,NA|53aa|down_6|NC_015587.1_599521_599680_+	NA|116aa|up_9|NC_015587.1_582569_582917_-	NA	NA|49aa|up_8|NC_015587.1_582923_583070_-	NA	NA|54aa|up_7|NC_015587.1_583071_583233_-	NA	NA|453aa|up_6|NC_015587.1_583529_584888_-	sd00006, TPR, Tetratricopeptide repeat	NA|277aa|up_5|NC_015587.1_584925_585756_-	TIGR02163, Ferredoxin-type_protein_NapH_homolog, ferredoxin-type protein, NapH/MauN family	NA|133aa|up_4|NC_015587.1_585850_586249_-	pfam09969, DUF2203, Uncharacterized conserved protein (DUF2203)	NA|305aa|up_3|NC_015587.1_586296_587211_-	cd03789, GT9_LPS_heptosyltransferase, lipopolysaccharide heptosyltransferase and similar proteins	NA|74aa|up_2|NC_015587.1_587203_587425_-	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|465aa|up_1|NC_015587.1_587421_588816_-	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|144aa|up_0|NC_015587.1_589040_589472_-	NA	cas6|266aa|down_0|NC_015587.1_592453_593251_+	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|556aa|down_1|NC_015587.1_593247_594915_+	pfam09706, Cas_CXXC_CXXC, CRISPR-associated protein (Cas_CXXC_CXXC)	cas7|329aa|down_2|NC_015587.1_594952_595939_+	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	cas5|224aa|down_3|NC_015587.1_595946_596618_+	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas3|764aa|down_4|NC_015587.1_596605_598897_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas4|72aa|down_5|NC_015587.1_598893_599109_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	NA|53aa|down_6|NC_015587.1_599521_599680_+	NA	cas14j|407aa|down_7|NC_015587.1_599794_601015_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|260aa|down_8|NC_015587.1_601023_601803_+	smart00756, VKc, Family of likely enzymes that includes the catalytic subunit of vitamin K epoxide reductase	NA|983aa|down_9|NC_015587.1_601883_604832_+	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB
GCF_000215065.1_ASM21506v1	NC_015587	Hydrogenobaculum sp. SHO, complete sequence	5	928075-929220	5,3,4	CRISPRCasFinder,CRT,PILER-CR	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	CTTCTTAACCCACACGGTTCAGATGAAAC,CTTCTTAACCCACACGGTTCAGATGAAAC,GTTTCATCTGAACCGTGTGGGTTAAGAAG	29,29,29	2	2	929025-929060|929026-929061	NC_015587.1_1337724-1337759|NC_015587.1_1337724-1337759	NA:NA:NA	17,17,17	17	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA,NA|102aa|down_2|NC_015587.1_932785_933091_-,NA|59aa|down_6|NC_015587.1_937888_938065_-	NA|332aa|up_9|NC_015587.1_919770_920766_+	TIGR00433, biotin_synthase, biotin synthase	NA|148aa|up_8|NC_015587.1_920756_921200_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|134aa|up_7|NC_015587.1_921204_921606_+	pfam01894, UPF0047, Uncharacterized protein family UPF0047	NA|439aa|up_6|NC_015587.1_921602_922919_+	TIGR00054, Putative_zinc_metalloprotease_slr1821, RIP metalloprotease RseP	NA|313aa|up_5|NC_015587.1_922936_923875_+	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|566aa|up_4|NC_015587.1_923846_925544_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|167aa|up_3|NC_015587.1_925536_926037_-	pfam14385, DUF4416, Domain of unknown function (DUF4416)	NA|335aa|up_2|NC_015587.1_926033_927038_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|147aa|up_1|NC_015587.1_927022_927463_-	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|145aa|up_0|NC_015587.1_927475_927910_-	PRK09216, rplM, 50S ribosomal protein L13; Reviewed	NA|320aa|down_0|NC_015587.1_929806_930766_-	pfam04754, Transposase_31, Putative transposase, YhgA-like	NA|344aa|down_1|NC_015587.1_931757_932789_-	COG0787, Alr, Alanine racemase [Cell envelope biogenesis, outer membrane]	NA|102aa|down_2|NC_015587.1_932785_933091_-	NA	NA|262aa|down_3|NC_015587.1_933083_933869_-	COG3494, COG3494, Uncharacterized protein conserved in bacteria [Function unknown]	NA|441aa|down_4|NC_015587.1_933884_935207_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|843aa|down_5|NC_015587.1_935213_937742_-	PRK05306, infB, translation initiation factor IF-2; Validated	NA|59aa|down_6|NC_015587.1_937888_938065_-	NA	NA|503aa|down_7|NC_015587.1_938195_939704_-	PRK09281, PRK09281, F0F1 ATP synthase subunit alpha; Validated	NA|183aa|down_8|NC_015587.1_939810_940359_-	COG0712, AtpH, F0F1-type ATP synthase, delta subunit (mitochondrial oligomycin sensitivity protein) [Energy production and conversion]	NA|161aa|down_9|NC_015587.1_940355_940838_-	COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production and conversion]
GCF_000215065.1_ASM21506v1	NC_015587	Hydrogenobaculum sp. SHO, complete sequence	6	930815-931239	5,6,4	PILER-CR,CRISPRCasFinder,CRT	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	GTTTCATCTGAACCGTGTGGGTTAAGAAG,CTTCTTAACCCACACGGTTCAGATGAAAC,CTTCTTAACCCACACGGTTCAGATGAAAC	29,29,29	0	0	NA	NA	NA:NA:NA	6,6,6	6	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA,NA|102aa|down_1|NC_015587.1_932785_933091_-,NA|59aa|down_5|NC_015587.1_937888_938065_-	NA|148aa|up_9|NC_015587.1_920756_921200_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|134aa|up_8|NC_015587.1_921204_921606_+	pfam01894, UPF0047, Uncharacterized protein family UPF0047	NA|439aa|up_7|NC_015587.1_921602_922919_+	TIGR00054, Putative_zinc_metalloprotease_slr1821, RIP metalloprotease RseP	NA|313aa|up_6|NC_015587.1_922936_923875_+	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|566aa|up_5|NC_015587.1_923846_925544_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|167aa|up_4|NC_015587.1_925536_926037_-	pfam14385, DUF4416, Domain of unknown function (DUF4416)	NA|335aa|up_3|NC_015587.1_926033_927038_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|147aa|up_2|NC_015587.1_927022_927463_-	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|145aa|up_1|NC_015587.1_927475_927910_-	PRK09216, rplM, 50S ribosomal protein L13; Reviewed	NA|320aa|up_0|NC_015587.1_929806_930766_-	pfam04754, Transposase_31, Putative transposase, YhgA-like	NA|344aa|down_0|NC_015587.1_931757_932789_-	COG0787, Alr, Alanine racemase [Cell envelope biogenesis, outer membrane]	NA|102aa|down_1|NC_015587.1_932785_933091_-	NA	NA|262aa|down_2|NC_015587.1_933083_933869_-	COG3494, COG3494, Uncharacterized protein conserved in bacteria [Function unknown]	NA|441aa|down_3|NC_015587.1_933884_935207_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|843aa|down_4|NC_015587.1_935213_937742_-	PRK05306, infB, translation initiation factor IF-2; Validated	NA|59aa|down_5|NC_015587.1_937888_938065_-	NA	NA|503aa|down_6|NC_015587.1_938195_939704_-	PRK09281, PRK09281, F0F1 ATP synthase subunit alpha; Validated	NA|183aa|down_7|NC_015587.1_939810_940359_-	COG0712, AtpH, F0F1-type ATP synthase, delta subunit (mitochondrial oligomycin sensitivity protein) [Energy production and conversion]	NA|161aa|down_8|NC_015587.1_940355_940838_-	COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production and conversion]	NA|143aa|down_9|NC_015587.1_940842_941271_-	cd06503, ATP-synt_Fo_b, F-type ATP synthase, membrane subunit b
GCF_000215065.1_ASM21506v1	NC_015587	Hydrogenobaculum sp. SHO, complete sequence	7	1024000-1024496	7,5,6	CRISPRCasFinder,CRT,PILER-CR	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	GTTTCAATCCCCTATAGGTACAAACAAAAC,GTTTCAATCCCCTATAGGTACAAACAAAAC,GTTTTGTTTGTACCTATAGGGGATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	7,7,6	7	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|129aa|up_4|NC_015587.1_1018977_1019364_-,NA|200aa|up_2|NC_015587.1_1021023_1021623_-,NA|140aa|down_8|NC_015587.1_1032071_1032491_-	NA|530aa|up_9|NC_015587.1_1012128_1013718_-	COG1009, NuoL, NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit [Energy production and conversion / Inorganic ion transport and metabolism]	NA|234aa|up_8|NC_015587.1_1014060_1014762_-	PRK09362, PRK09362, phosphoribosylaminoimidazole-succinocarboxamide synthase; Reviewed	NA|194aa|up_7|NC_015587.1_1014758_1015340_-	pfam13174, TPR_6, Tetratricopeptide repeat	NA|94aa|up_6|NC_015587.1_1015326_1015608_-	pfam01649, Ribosomal_S20p, Ribosomal protein S20	NA|1088aa|up_5|NC_015587.1_1015717_1018981_-	cd18808, SF1_C_Upf1, C-terminal helicase domain of Upf1-like family helicases	NA|129aa|up_4|NC_015587.1_1018977_1019364_-	NA	NA|328aa|up_3|NC_015587.1_1019778_1020762_+	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|200aa|up_2|NC_015587.1_1021023_1021623_-	NA	NA|217aa|up_1|NC_015587.1_1021749_1022400_-	sd00010, SLR, Sel1-like repeat	NA|316aa|up_0|NC_015587.1_1022941_1023889_-	pfam04754, Transposase_31, Putative transposase, YhgA-like	NA|555aa|down_0|NC_015587.1_1024888_1026553_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|242aa|down_1|NC_015587.1_1026622_1027348_-	pfam00902, TatC, Sec-independent protein translocase protein (TatC)	NA|93aa|down_2|NC_015587.1_1027310_1027589_-	COG1826, TatA, Sec-independent protein secretion pathway components [Intracellular trafficking and secretion]	NA|273aa|down_3|NC_015587.1_1027579_1028398_-	pfam06750, DiS_P_DiS, Bacterial Peptidase A24 N-terminal domain	NA|501aa|down_4|NC_015587.1_1028394_1029897_-	PRK05812, secD, preprotein translocase subunit SecD; Reviewed	NA|201aa|down_5|NC_015587.1_1029899_1030502_-	TIGR00741, Probable_sigma54_modulation_protein_ORF3_ORF95	NA|252aa|down_6|NC_015587.1_1030505_1031261_-	TIGR01352, Protein_TonB, TonB family C-terminal domain	NA|258aa|down_7|NC_015587.1_1031296_1032070_-	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|140aa|down_8|NC_015587.1_1032071_1032491_-	NA	NA|408aa|down_9|NC_015587.1_1032487_1033711_-	COG1686, DacC, D-alanyl-D-alanine carboxypeptidase [Cell envelope biogenesis, outer membrane]
GCF_000215065.1_ASM21506v1	NC_015587	Hydrogenobaculum sp. SHO, complete sequence	8	1088875-1092207	7,8,6	PILER-CR,CRISPRCasFinder,CRT	no	cas7,cas8b2,cas3,cas4,cas1,cas2,TnpB_regular.1	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	GTTTCTAATTAACCGTGTGGAGTTGAAAG,GTTTCTAATTAACCGTGTGGAGTTGAAAG,GTTTCTAATTAACCGTGTGGAGTTGAAAG	29,29,29	1	1	1088904-1088942	NC_015587.1_578152-578190	NA:NA:NA	50,50,50	50	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|95aa|up_8|NC_015587.1_1079629_1079914_+,NA|64aa|up_0|NC_015587.1_1088448_1088640_-,NA|143aa|down_1|NC_015587.1_1092991_1093420_-,NA|322aa|down_2|NC_015587.1_1093401_1094367_-	NA|905aa|up_9|NC_015587.1_1076898_1079613_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|95aa|up_8|NC_015587.1_1079629_1079914_+	NA	NA|188aa|up_7|NC_015587.1_1079913_1080477_+	cd00564, TMP_TenI, Thiamine monophosphate synthase (TMP synthase)/TenI	cas7|329aa|up_6|NC_015587.1_1080522_1081509_+	TIGR01875, CRISPR-associated_protein_Cas7/Cst2/DevR, CRISPR-associated autoregulator DevR family	cas8b2|741aa|up_5|NC_015587.1_1081495_1083718_+	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas3|736aa|up_4|NC_015587.1_1083689_1085897_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas4|171aa|up_3|NC_015587.1_1085898_1086411_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|318aa|up_2|NC_015587.1_1086403_1087357_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|94aa|up_1|NC_015587.1_1087353_1087635_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|64aa|up_0|NC_015587.1_1088448_1088640_-	NA	NA|262aa|down_0|NC_015587.1_1092209_1092995_-	COG3031, PulC, Type II secretory pathway, component PulC [Intracellular trafficking and secretion]	NA|143aa|down_1|NC_015587.1_1092991_1093420_-	NA	NA|322aa|down_2|NC_015587.1_1093401_1094367_-	NA	NA|129aa|down_3|NC_015587.1_1094367_1094754_-	PRK13258, PRK13258, 7-cyano-7-deazaguanine reductase; Provisional	TnpB_regular.1|485aa|down_4|NC_015587.1_1094944_1096399_+	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|582aa|down_5|NC_015587.1_1096442_1098188_+	cd08500, PBP2_NikA_DppA_OppA_like_4, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|516aa|down_6|NC_015587.1_1098287_1099835_+	PRK00915, PRK00915, 2-isopropylmalate synthase; Validated	NA|274aa|down_7|NC_015587.1_1099831_1100653_-	PRK00258, aroE, shikimate 5-dehydrogenase; Reviewed	NA|360aa|down_8|NC_015587.1_1100649_1101729_-	PRK00108, mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase; Provisional	NA|188aa|down_9|NC_015587.1_1101729_1102293_-	pfam09936, Methyltrn_RNA_4, SAM-dependent RNA methyltransferase
GCF_000215065.1_ASM21506v1	NC_015587	Hydrogenobaculum sp. SHO, complete sequence	9	1151818-1151924	9	CRISPRCasFinder	no	cas3	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	AGAAATGCCGTTGGCACACTAGAGCG	26	0	0	NA	NA	NA	1	1	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|115aa|up_5|NC_015587.1_1147504_1147849_-,NA|120aa|down_8|NC_015587.1_1159197_1159557_+,NA|94aa|down_9|NC_015587.1_1159540_1159822_+	NA|287aa|up_9|NC_015587.1_1142782_1143643_+	PRK00072, hemC, porphobilinogen deaminase; Reviewed	NA|348aa|up_8|NC_015587.1_1143651_1144695_+	cd07432, PHP_HisPPase, Polymerase and Histidinol Phosphatase domain of Histidinol phosphate phosphatase	NA|455aa|up_7|NC_015587.1_1144686_1146051_-	COG0084, TatD, Mg-dependent DNase [DNA replication, recombination, and repair]	NA|454aa|up_6|NC_015587.1_1146103_1147465_-	PRK06292, PRK06292, dihydrolipoamide dehydrogenase; Validated	NA|115aa|up_5|NC_015587.1_1147504_1147849_-	NA	NA|380aa|up_4|NC_015587.1_1147865_1149005_-	pfam13975, gag-asp_proteas, gag-polyprotein putative aspartyl protease	NA|259aa|up_3|NC_015587.1_1149025_1149802_-	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]	NA|311aa|up_2|NC_015587.1_1149798_1150731_-	PRK07259, PRK07259, dihydroorotate dehydrogenase	NA|115aa|up_1|NC_015587.1_1151024_1151369_-	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]	NA|110aa|up_0|NC_015587.1_1151365_1151695_-	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|566aa|down_0|NC_015587.1_1151966_1153664_-	PRK14667, uvrC, excinuclease ABC subunit C; Provisional	NA|358aa|down_1|NC_015587.1_1153690_1154764_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|223aa|down_2|NC_015587.1_1154744_1155413_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|263aa|down_3|NC_015587.1_1155409_1156198_-	PRK13111, trpA, tryptophan synthase subunit alpha; Provisional	NA|183aa|down_4|NC_015587.1_1156194_1156743_-	PRK00083, frr, ribosome recycling factor; Reviewed	NA|90aa|down_5|NC_015587.1_1156825_1157095_+	pfam17209, Hfq, Hfq protein	NA|240aa|down_6|NC_015587.1_1157134_1157854_+	pfam01255, Prenyltransf, Putative undecaprenyl diphosphate synthase	NA|426aa|down_7|NC_015587.1_1157957_1159235_+	PRK00077, eno, enolase; Provisional	NA|120aa|down_8|NC_015587.1_1159197_1159557_+	NA	NA|94aa|down_9|NC_015587.1_1159540_1159822_+	NA
GCF_000215065.1_ASM21506v1	NC_015587	Hydrogenobaculum sp. SHO, complete sequence	10	1214987-1215088	10	CRISPRCasFinder	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	GAAATGCGATTGCATCGCTCTAGT	24	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|120aa|up_7|NC_015587.1_1203486_1203846_+,NA|408aa|up_6|NC_015587.1_1203889_1205113_+,NA|69aa|down_3|NC_015587.1_1218815_1219022_+	NA|87aa|up_9|NC_015587.1_1201851_1202112_-	pfam01381, HTH_3, Helix-turn-helix	NA|396aa|up_8|NC_015587.1_1202122_1203310_-	pfam07804, HipA_C, HipA-like C-terminal domain	NA|120aa|up_7|NC_015587.1_1203486_1203846_+	NA	NA|408aa|up_6|NC_015587.1_1203889_1205113_+	NA	NA|165aa|up_5|NC_015587.1_1205198_1205693_-	COG3260, COG3260, Ni,Fe-hydrogenase III small subunit [Energy production and conversion]	NA|479aa|up_4|NC_015587.1_1205702_1207139_-	COG3261, HycE, Ni,Fe-hydrogenase III large subunit [Energy production and conversion]	NA|481aa|up_3|NC_015587.1_1207135_1208578_-	PRK06458, PRK06458, hydrogenase 4 subunit F; Validated	NA|223aa|up_2|NC_015587.1_1208574_1209243_-	COG4237, HyfE, Hydrogenase 4 membrane component (E) [Energy production and conversion]	NA|306aa|up_1|NC_015587.1_1209252_1210170_-	COG0650, HyfC, Formate hydrogenlyase subunit 4 [Energy production and conversion]	NA|622aa|up_0|NC_015587.1_1210162_1212028_-	PRK06521, PRK06521, hydrogenase 4 subunit B; Validated	NA|273aa|down_0|NC_015587.1_1215289_1216108_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|182aa|down_1|NC_015587.1_1216104_1216650_-	PRK05456, PRK05456, ATP-dependent protease subunit HslV	NA|655aa|down_2|NC_015587.1_1216640_1218605_-	COG0021, TktA, Transketolase [Carbohydrate transport and metabolism]	NA|69aa|down_3|NC_015587.1_1218815_1219022_+	NA	NA|100aa|down_4|NC_015587.1_1219245_1219545_+	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|308aa|down_5|NC_015587.1_1219531_1220455_-	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|279aa|down_6|NC_015587.1_1220466_1221303_-	pfam04454, Linocin_M18, Encapsulating protein for peroxidase	NA|515aa|down_7|NC_015587.1_1221356_1222901_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	NA|175aa|down_8|NC_015587.1_1225050_1225575_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|354aa|down_9|NC_015587.1_1225609_1226671_-	pfam07680, DoxA, TQO small subunit DoxA
