assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000341855.1_ASM34185v1	NC_020411	Hydrogenobaculum sp. HO, complete sequence	1	199537-199635	1	CRISPRCasFinder	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	AAATGCGGACGTTACCGCTCTATT	24	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|152aa|up_5|NC_020411.1_195219_195675_+,NA|76aa|up_0|NC_020411.1_198968_199196_-,NA|207aa|down_0|NC_020411.1_199878_200499_-,NA|65aa|down_1|NC_020411.1_200510_200705_-,NA|193aa|down_2|NC_020411.1_200750_201329_-,NA|138aa|down_9|NC_020411.1_206141_206555_+	NA|155aa|up_9|NC_020411.1_190665_191130_+	COG1905, NuoE, NADH:ubiquinone oxidoreductase 24 kD subunit [Energy production and conversion]	NA|426aa|up_8|NC_020411.1_191104_192382_+	COG1894, NuoF, NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [Energy production and conversion]	NA|555aa|up_7|NC_020411.1_192398_194063_+	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|377aa|up_6|NC_020411.1_194072_195203_+	PRK00112, tgt, queuine tRNA-ribosyltransferase; Provisional	NA|152aa|up_5|NC_020411.1_195219_195675_+	NA	NA|218aa|up_4|NC_020411.1_195750_196404_-	TIGR01093, 3-dehydroquinate_dehydratase, 3-dehydroquinate dehydratase, type I	NA|386aa|up_3|NC_020411.1_196403_197561_-	PRK05382, PRK05382, chorismate synthase; Validated	NA|171aa|up_2|NC_020411.1_197897_198410_-	cd04645, LbH_gamma_CA_like, Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E	NA|149aa|up_1|NC_020411.1_198435_198882_-	cd00851, MTH1175, This uncharacterized conserved protein belongs to a family of iron-molybdenum cluster-binding proteins that includes NifX, NifB, and NifY, all of which are involved in the synthesis of an iron-molybdenum cofactor (FeMo-co) that binds the active site of the dinitrogenase enzyme	NA|76aa|up_0|NC_020411.1_198968_199196_-	NA	NA|207aa|down_0|NC_020411.1_199878_200499_-	NA	NA|65aa|down_1|NC_020411.1_200510_200705_-	NA	NA|193aa|down_2|NC_020411.1_200750_201329_-	NA	NA|311aa|down_3|NC_020411.1_201623_202556_+	TIGR02197, heptose_epim, ADP-L-glycero-D-manno-heptose-6-epimerase	NA|322aa|down_4|NC_020411.1_202552_203518_+	TIGR01138, Cysteine_synthase_B, cysteine synthase B	NA|172aa|down_5|NC_020411.1_203519_204035_+	pfam02620, DUF177, Uncharacterized ACR, COG1399	NA|61aa|down_6|NC_020411.1_204015_204198_+	PRK12286, rpmF, 50S ribosomal protein L32; Reviewed	NA|340aa|down_7|NC_020411.1_204209_205229_+	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|304aa|down_8|NC_020411.1_205228_206140_+	PRK09352, PRK09352, beta-ketoacyl-ACP synthase 3	NA|138aa|down_9|NC_020411.1_206141_206555_+	NA
GCF_000341855.1_ASM34185v1	NC_020411	Hydrogenobaculum sp. HO, complete sequence	2	340583-340882	1,2,1	CRT,CRISPRCasFinder,PILER-CR	no	cas6,cas4,cas1,csm3gr7,DEDDh	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	ANGTTTTNTNTGTNCCTATAGGGGATTGAAAC,GTTTTTTGTGTACCTATAGGGGATTGAAAC,GGTTTTTTGTGTACCTATAGGGGATTGAAACG	32,30,32	0	0	NA	NA	NA:NA:NA	4,4,2	4	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|139aa|up_7|NC_020411.1_332919_333336_+,NA|400aa|down_7|NC_020411.1_350231_351431_-	NA|117aa|up_9|NC_020411.1_331668_332019_+	cd06664, IscU_like, Iron-sulfur cluster scaffold-like proteins	NA|309aa|up_8|NC_020411.1_331996_332923_+	pfam04463, DUF523, Protein of unknown function (DUF523)	NA|139aa|up_7|NC_020411.1_332919_333336_+	NA	NA|536aa|up_6|NC_020411.1_333345_334953_+	PRK01611, argS, arginyl-tRNA synthetase; Reviewed	NA|244aa|up_5|NC_020411.1_335025_335757_+	cd13519, PBP2_PEB3_AcfC, Ligand-binding domain of a glycoprotein adhesion and an accessory colonization factor, a member of the type 2 periplasmic binding fold superfamily	NA|230aa|up_4|NC_020411.1_335919_336609_-	cd18669, M20_18_42, M20, M18 and M42 Zn-peptidases include aminopeptidases and carboxypeptidases	NA|512aa|up_3|NC_020411.1_336849_338385_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	cas6|175aa|up_2|NC_020411.1_338458_338983_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas6|72aa|up_1|NC_020411.1_338979_339195_+	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|166aa|up_0|NC_020411.1_339199_339697_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	csm3gr7|440aa|down_0|NC_020411.1_342305_343625_+	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	NA|203aa|down_1|NC_020411.1_344153_344762_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	DEDDh|197aa|down_2|NC_020411.1_344781_345372_-	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	NA|317aa|down_3|NC_020411.1_345385_346336_-	pfam09312, SurA_N, SurA N-terminal domain	NA|422aa|down_4|NC_020411.1_347563_348829_-	cd18773, PDC1_HK_sensor, first PDC (PhoQ/DcuS/CitA) domain of methyl-accepting chemotaxis proteins, diguanylate-cyclase and similar domains	NA|190aa|down_5|NC_020411.1_348815_349385_-	cd02165, NMNAT, Nicotinamide/nicotinate mononucleotide adenylyltransferase	NA|268aa|down_6|NC_020411.1_349375_350179_-	pfam01904, DUF72, Protein of unknown function DUF72	NA|400aa|down_7|NC_020411.1_350231_351431_-	NA	NA|326aa|down_8|NC_020411.1_351411_352389_-	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]	NA|620aa|down_9|NC_020411.1_352381_354241_-	COG1034, NuoG, NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) [Energy production and conversion]
GCF_000341855.1_ASM34185v1	NC_020411	Hydrogenobaculum sp. HO, complete sequence	3	346991-347154	3,2	CRISPRCasFinder,PILER-CR	no	cas6,cas4,cas1,csm3gr7,DEDDh	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	GAGTTTCATCTGAACCGTGTGGGTTAAGAA,GAGTTTCATCTGAACCGTGTGGGTTAAGAAGC	30,32	0	0	NA	NA	NA:NA	2,2	2	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA,NA|400aa|down_3|NC_020411.1_350231_351431_-	NA|230aa|up_9|NC_020411.1_335919_336609_-	cd18669, M20_18_42, M20, M18 and M42 Zn-peptidases include aminopeptidases and carboxypeptidases	NA|512aa|up_8|NC_020411.1_336849_338385_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	cas6|175aa|up_7|NC_020411.1_338458_338983_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas6|72aa|up_6|NC_020411.1_338979_339195_+	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|166aa|up_5|NC_020411.1_339199_339697_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|301aa|up_4|NC_020411.1_339700_340603_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	csm3gr7|440aa|up_3|NC_020411.1_342305_343625_+	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	NA|203aa|up_2|NC_020411.1_344153_344762_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	DEDDh|197aa|up_1|NC_020411.1_344781_345372_-	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	NA|317aa|up_0|NC_020411.1_345385_346336_-	pfam09312, SurA_N, SurA N-terminal domain	NA|422aa|down_0|NC_020411.1_347563_348829_-	cd18773, PDC1_HK_sensor, first PDC (PhoQ/DcuS/CitA) domain of methyl-accepting chemotaxis proteins, diguanylate-cyclase and similar domains	NA|190aa|down_1|NC_020411.1_348815_349385_-	cd02165, NMNAT, Nicotinamide/nicotinate mononucleotide adenylyltransferase	NA|268aa|down_2|NC_020411.1_349375_350179_-	pfam01904, DUF72, Protein of unknown function DUF72	NA|400aa|down_3|NC_020411.1_350231_351431_-	NA	NA|326aa|down_4|NC_020411.1_351411_352389_-	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]	NA|620aa|down_5|NC_020411.1_352381_354241_-	COG1034, NuoG, NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) [Energy production and conversion]	NA|225aa|down_6|NC_020411.1_354230_354905_-	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase	NA|284aa|down_7|NC_020411.1_354897_355749_-	COG2445, COG2445, Uncharacterized conserved protein [Function unknown]	NA|175aa|down_8|NC_020411.1_357630_358155_-	COG2143, COG2143, Thioredoxin-related protein [Posttranslational modification, protein turnover, chaperones]	NA|391aa|down_9|NC_020411.1_358151_359324_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional
GCF_000341855.1_ASM34185v1	NC_020411	Hydrogenobaculum sp. HO, complete sequence	4	591447-592135	3,4,2	PILER-CR,CRISPRCasFinder,CRT	no	Cas14b_CAS-V-F,cas6,cas7,cas5,cas3,cas4,cas14j	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	GTTTCATCTGAACCGTGTGGGATATAAA,GTTTCATCTGAACCGTGTGGGATATAAAGT,GTTTCATCTGAACCGTGTGGGATATAAA	28,30,28	0	0	NA	NA	NA:NA:NA	10,10,10	10	TypeV	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|116aa|up_9|NC_020411.1_582542_582890_-,NA|49aa|up_8|NC_020411.1_582896_583043_-,NA|54aa|up_7|NC_020411.1_583044_583206_-,NA|144aa|up_0|NC_020411.1_589013_589445_-,NA|53aa|down_6|NC_020411.1_599494_599653_+	NA|116aa|up_9|NC_020411.1_582542_582890_-	NA	NA|49aa|up_8|NC_020411.1_582896_583043_-	NA	NA|54aa|up_7|NC_020411.1_583044_583206_-	NA	NA|453aa|up_6|NC_020411.1_583502_584861_-	sd00006, TPR, Tetratricopeptide repeat	NA|277aa|up_5|NC_020411.1_584898_585729_-	TIGR02163, Ferredoxin-type_protein_NapH_homolog, ferredoxin-type protein, NapH/MauN family	NA|133aa|up_4|NC_020411.1_585823_586222_-	pfam09969, DUF2203, Uncharacterized conserved protein (DUF2203)	NA|305aa|up_3|NC_020411.1_586269_587184_-	cd03789, GT9_LPS_heptosyltransferase, lipopolysaccharide heptosyltransferase and similar proteins	NA|74aa|up_2|NC_020411.1_587176_587398_-	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|465aa|up_1|NC_020411.1_587394_588789_-	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|144aa|up_0|NC_020411.1_589013_589445_-	NA	cas6|266aa|down_0|NC_020411.1_592426_593224_+	cd09652, Cas6-I-III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	NA|556aa|down_1|NC_020411.1_593220_594888_+	pfam09706, Cas_CXXC_CXXC, CRISPR-associated protein (Cas_CXXC_CXXC)	cas7|329aa|down_2|NC_020411.1_594925_595912_+	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	cas5|224aa|down_3|NC_020411.1_595919_596591_+	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas3|764aa|down_4|NC_020411.1_596578_598870_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas4|72aa|down_5|NC_020411.1_598866_599082_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	NA|53aa|down_6|NC_020411.1_599494_599653_+	NA	cas14j|407aa|down_7|NC_020411.1_599767_600988_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|260aa|down_8|NC_020411.1_600996_601776_+	smart00756, VKc, Family of likely enzymes that includes the catalytic subunit of vitamin K epoxide reductase	NA|983aa|down_9|NC_020411.1_601856_604805_+	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB
GCF_000341855.1_ASM34185v1	NC_020411	Hydrogenobaculum sp. HO, complete sequence	5	928155-929300	5,3,4	CRISPRCasFinder,CRT,PILER-CR	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	CTTCTTAACCCACACGGTTCAGATGAAAC,CTTCTTAACCCACACGGTTCAGATGAAAC,GTTTCATCTGAACCGTGTGGGTTAAGAAG	29,29,29	2	2	929105-929140|929106-929141	NC_020411.1_1338049-1338084|NC_020411.1_1338049-1338084	NA:NA:NA	17,17,17	17	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA,NA|102aa|down_2|NC_020411.1_932865_933171_-,NA|59aa|down_6|NC_020411.1_937968_938145_-	NA|332aa|up_9|NC_020411.1_919850_920846_+	TIGR00433, biotin_synthase, biotin synthase	NA|148aa|up_8|NC_020411.1_920836_921280_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|134aa|up_7|NC_020411.1_921284_921686_+	pfam01894, UPF0047, Uncharacterized protein family UPF0047	NA|439aa|up_6|NC_020411.1_921682_922999_+	TIGR00054, Putative_zinc_metalloprotease_slr1821, RIP metalloprotease RseP	NA|313aa|up_5|NC_020411.1_923016_923955_+	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|566aa|up_4|NC_020411.1_923926_925624_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|167aa|up_3|NC_020411.1_925616_926117_-	pfam14385, DUF4416, Domain of unknown function (DUF4416)	NA|335aa|up_2|NC_020411.1_926113_927118_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|147aa|up_1|NC_020411.1_927102_927543_-	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|145aa|up_0|NC_020411.1_927555_927990_-	PRK09216, rplM, 50S ribosomal protein L13; Reviewed	NA|320aa|down_0|NC_020411.1_929886_930846_-	pfam04754, Transposase_31, Putative transposase, YhgA-like	NA|344aa|down_1|NC_020411.1_931837_932869_-	COG0787, Alr, Alanine racemase [Cell envelope biogenesis, outer membrane]	NA|102aa|down_2|NC_020411.1_932865_933171_-	NA	NA|262aa|down_3|NC_020411.1_933163_933949_-	COG3494, COG3494, Uncharacterized protein conserved in bacteria [Function unknown]	NA|441aa|down_4|NC_020411.1_933964_935287_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|843aa|down_5|NC_020411.1_935293_937822_-	PRK05306, infB, translation initiation factor IF-2; Validated	NA|59aa|down_6|NC_020411.1_937968_938145_-	NA	NA|503aa|down_7|NC_020411.1_938275_939784_-	PRK09281, PRK09281, F0F1 ATP synthase subunit alpha; Validated	NA|183aa|down_8|NC_020411.1_939890_940439_-	COG0712, AtpH, F0F1-type ATP synthase, delta subunit (mitochondrial oligomycin sensitivity protein) [Energy production and conversion]	NA|161aa|down_9|NC_020411.1_940435_940918_-	COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production and conversion]
GCF_000341855.1_ASM34185v1	NC_020411	Hydrogenobaculum sp. HO, complete sequence	6	930895-931319	5,6,4	PILER-CR,CRISPRCasFinder,CRT	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	GTTTCATCTGAACCGTGTGGGTTAAGAAG,CTTCTTAACCCACACGGTTCAGATGAAAC,CTTCTTAACCCACACGGTTCAGATGAAAC	29,29,29	0	0	NA	NA	NA:NA:NA	6,6,6	6	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA,NA|102aa|down_1|NC_020411.1_932865_933171_-,NA|59aa|down_5|NC_020411.1_937968_938145_-	NA|148aa|up_9|NC_020411.1_920836_921280_+	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|134aa|up_8|NC_020411.1_921284_921686_+	pfam01894, UPF0047, Uncharacterized protein family UPF0047	NA|439aa|up_7|NC_020411.1_921682_922999_+	TIGR00054, Putative_zinc_metalloprotease_slr1821, RIP metalloprotease RseP	NA|313aa|up_6|NC_020411.1_923016_923955_+	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|566aa|up_5|NC_020411.1_923926_925624_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|167aa|up_4|NC_020411.1_925616_926117_-	pfam14385, DUF4416, Domain of unknown function (DUF4416)	NA|335aa|up_3|NC_020411.1_926113_927118_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|147aa|up_2|NC_020411.1_927102_927543_-	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|145aa|up_1|NC_020411.1_927555_927990_-	PRK09216, rplM, 50S ribosomal protein L13; Reviewed	NA|320aa|up_0|NC_020411.1_929886_930846_-	pfam04754, Transposase_31, Putative transposase, YhgA-like	NA|344aa|down_0|NC_020411.1_931837_932869_-	COG0787, Alr, Alanine racemase [Cell envelope biogenesis, outer membrane]	NA|102aa|down_1|NC_020411.1_932865_933171_-	NA	NA|262aa|down_2|NC_020411.1_933163_933949_-	COG3494, COG3494, Uncharacterized protein conserved in bacteria [Function unknown]	NA|441aa|down_3|NC_020411.1_933964_935287_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|843aa|down_4|NC_020411.1_935293_937822_-	PRK05306, infB, translation initiation factor IF-2; Validated	NA|59aa|down_5|NC_020411.1_937968_938145_-	NA	NA|503aa|down_6|NC_020411.1_938275_939784_-	PRK09281, PRK09281, F0F1 ATP synthase subunit alpha; Validated	NA|183aa|down_7|NC_020411.1_939890_940439_-	COG0712, AtpH, F0F1-type ATP synthase, delta subunit (mitochondrial oligomycin sensitivity protein) [Energy production and conversion]	NA|161aa|down_8|NC_020411.1_940435_940918_-	COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production and conversion]	NA|143aa|down_9|NC_020411.1_940922_941351_-	cd06503, ATP-synt_Fo_b, F-type ATP synthase, membrane subunit b
GCF_000341855.1_ASM34185v1	NC_020411	Hydrogenobaculum sp. HO, complete sequence	7	1024077-1024573	7,5,6	CRISPRCasFinder,CRT,PILER-CR	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	GTTTCAATCCCCTATAGGTACAAACAAAAC,GTTTCAATCCCCTATAGGTACAAACAAAAC,GTTTTGTTTGTACCTATAGGGGATTGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	7,7,6	7	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|129aa|up_4|NC_020411.1_1019054_1019441_-,NA|200aa|up_2|NC_020411.1_1021100_1021700_-,NA|140aa|down_8|NC_020411.1_1032148_1032568_-	NA|530aa|up_9|NC_020411.1_1012205_1013795_-	COG1009, NuoL, NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit [Energy production and conversion / Inorganic ion transport and metabolism]	NA|234aa|up_8|NC_020411.1_1014137_1014839_-	PRK09362, PRK09362, phosphoribosylaminoimidazole-succinocarboxamide synthase; Reviewed	NA|194aa|up_7|NC_020411.1_1014835_1015417_-	pfam13174, TPR_6, Tetratricopeptide repeat	NA|94aa|up_6|NC_020411.1_1015403_1015685_-	pfam01649, Ribosomal_S20p, Ribosomal protein S20	NA|1088aa|up_5|NC_020411.1_1015794_1019058_-	cd18808, SF1_C_Upf1, C-terminal helicase domain of Upf1-like family helicases	NA|129aa|up_4|NC_020411.1_1019054_1019441_-	NA	NA|328aa|up_3|NC_020411.1_1019855_1020839_+	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|200aa|up_2|NC_020411.1_1021100_1021700_-	NA	NA|217aa|up_1|NC_020411.1_1021826_1022477_-	sd00010, SLR, Sel1-like repeat	NA|316aa|up_0|NC_020411.1_1023018_1023966_-	pfam04754, Transposase_31, Putative transposase, YhgA-like	NA|555aa|down_0|NC_020411.1_1024965_1026630_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|242aa|down_1|NC_020411.1_1026699_1027425_-	pfam00902, TatC, Sec-independent protein translocase protein (TatC)	NA|93aa|down_2|NC_020411.1_1027387_1027666_-	COG1826, TatA, Sec-independent protein secretion pathway components [Intracellular trafficking and secretion]	NA|273aa|down_3|NC_020411.1_1027656_1028475_-	pfam06750, DiS_P_DiS, Bacterial Peptidase A24 N-terminal domain	NA|501aa|down_4|NC_020411.1_1028471_1029974_-	PRK05812, secD, preprotein translocase subunit SecD; Reviewed	NA|201aa|down_5|NC_020411.1_1029976_1030579_-	TIGR00741, Probable_sigma54_modulation_protein_ORF3_ORF95	NA|252aa|down_6|NC_020411.1_1030582_1031338_-	TIGR01352, Protein_TonB, TonB family C-terminal domain	NA|258aa|down_7|NC_020411.1_1031373_1032147_-	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|140aa|down_8|NC_020411.1_1032148_1032568_-	NA	NA|408aa|down_9|NC_020411.1_1032564_1033788_-	COG1686, DacC, D-alanyl-D-alanine carboxypeptidase [Cell envelope biogenesis, outer membrane]
GCF_000341855.1_ASM34185v1	NC_020411	Hydrogenobaculum sp. HO, complete sequence	8	1088952-1092284	7,8,6	PILER-CR,CRISPRCasFinder,CRT	no	cas7,cas8b2,cas3,cas4,cas1,cas2,TnpB_regular.1	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	GTTTCTAATTAACCGTGTGGAGTTGAAAG,GTTTCTAATTAACCGTGTGGAGTTGAAAG,GTTTCTAATTAACCGTGTGGAGTTGAAAG	29,29,29	1	1	1088981-1089019	NC_020411.1_578125-578163	NA:NA:NA	50,50,50	50	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|95aa|up_8|NC_020411.1_1079706_1079991_+,NA|64aa|up_0|NC_020411.1_1088525_1088717_-,NA|143aa|down_1|NC_020411.1_1093068_1093497_-,NA|322aa|down_2|NC_020411.1_1093478_1094444_-	NA|905aa|up_9|NC_020411.1_1076975_1079690_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|95aa|up_8|NC_020411.1_1079706_1079991_+	NA	NA|188aa|up_7|NC_020411.1_1079990_1080554_+	cd00564, TMP_TenI, Thiamine monophosphate synthase (TMP synthase)/TenI	cas7|329aa|up_6|NC_020411.1_1080599_1081586_+	TIGR01875, CRISPR-associated_protein_Cas7/Cst2/DevR, CRISPR-associated autoregulator DevR family	cas8b2|741aa|up_5|NC_020411.1_1081572_1083795_+	TIGR01895, conserved_hypothetical_protein, CRISPR-associated protein Cas5, subtype I-B/TNEAP	cas3|736aa|up_4|NC_020411.1_1083766_1085974_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas4|171aa|up_3|NC_020411.1_1085975_1086488_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|318aa|up_2|NC_020411.1_1086480_1087434_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|94aa|up_1|NC_020411.1_1087430_1087712_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|64aa|up_0|NC_020411.1_1088525_1088717_-	NA	NA|262aa|down_0|NC_020411.1_1092286_1093072_-	COG3031, PulC, Type II secretory pathway, component PulC [Intracellular trafficking and secretion]	NA|143aa|down_1|NC_020411.1_1093068_1093497_-	NA	NA|322aa|down_2|NC_020411.1_1093478_1094444_-	NA	NA|129aa|down_3|NC_020411.1_1094444_1094831_-	PRK13258, PRK13258, 7-cyano-7-deazaguanine reductase; Provisional	TnpB_regular.1|485aa|down_4|NC_020411.1_1095021_1096476_+	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|582aa|down_5|NC_020411.1_1096519_1098265_+	cd08500, PBP2_NikA_DppA_OppA_like_4, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|516aa|down_6|NC_020411.1_1098364_1099912_+	PRK00915, PRK00915, 2-isopropylmalate synthase; Validated	NA|274aa|down_7|NC_020411.1_1099908_1100730_-	PRK00258, aroE, shikimate 5-dehydrogenase; Reviewed	NA|360aa|down_8|NC_020411.1_1100726_1101806_-	PRK00108, mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase; Provisional	NA|188aa|down_9|NC_020411.1_1101806_1102370_-	pfam09936, Methyltrn_RNA_4, SAM-dependent RNA methyltransferase
GCF_000341855.1_ASM34185v1	NC_020411	Hydrogenobaculum sp. HO, complete sequence	9	1151895-1152001	9	CRISPRCasFinder	no	cas3	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Unclear	AGAAATGCCGTTGGCACACTAGAGCG	26	0	0	NA	NA	NA	1	1	Unclear	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|115aa|up_5|NC_020411.1_1147581_1147926_-,NA|120aa|down_8|NC_020411.1_1159274_1159634_+,NA|94aa|down_9|NC_020411.1_1159617_1159899_+	NA|287aa|up_9|NC_020411.1_1142859_1143720_+	PRK00072, hemC, porphobilinogen deaminase; Reviewed	NA|348aa|up_8|NC_020411.1_1143728_1144772_+	cd07432, PHP_HisPPase, Polymerase and Histidinol Phosphatase domain of Histidinol phosphate phosphatase	NA|455aa|up_7|NC_020411.1_1144763_1146128_-	COG0084, TatD, Mg-dependent DNase [DNA replication, recombination, and repair]	NA|454aa|up_6|NC_020411.1_1146180_1147542_-	PRK06292, PRK06292, dihydrolipoamide dehydrogenase; Validated	NA|115aa|up_5|NC_020411.1_1147581_1147926_-	NA	NA|380aa|up_4|NC_020411.1_1147942_1149082_-	pfam13975, gag-asp_proteas, gag-polyprotein putative aspartyl protease	NA|259aa|up_3|NC_020411.1_1149102_1149879_-	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]	NA|311aa|up_2|NC_020411.1_1149875_1150808_-	PRK07259, PRK07259, dihydroorotate dehydrogenase	NA|115aa|up_1|NC_020411.1_1151101_1151446_-	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]	NA|110aa|up_0|NC_020411.1_1151442_1151772_-	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|566aa|down_0|NC_020411.1_1152043_1153741_-	PRK14667, uvrC, excinuclease ABC subunit C; Provisional	NA|358aa|down_1|NC_020411.1_1153767_1154841_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|223aa|down_2|NC_020411.1_1154821_1155490_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|263aa|down_3|NC_020411.1_1155486_1156275_-	PRK13111, trpA, tryptophan synthase subunit alpha; Provisional	NA|183aa|down_4|NC_020411.1_1156271_1156820_-	PRK00083, frr, ribosome recycling factor; Reviewed	NA|90aa|down_5|NC_020411.1_1156902_1157172_+	pfam17209, Hfq, Hfq protein	NA|240aa|down_6|NC_020411.1_1157211_1157931_+	pfam01255, Prenyltransf, Putative undecaprenyl diphosphate synthase	NA|426aa|down_7|NC_020411.1_1158034_1159312_+	PRK00077, eno, enolase; Provisional	NA|120aa|down_8|NC_020411.1_1159274_1159634_+	NA	NA|94aa|down_9|NC_020411.1_1159617_1159899_+	NA
GCF_000341855.1_ASM34185v1	NC_020411	Hydrogenobaculum sp. HO, complete sequence	10	1215307-1215408	10	CRISPRCasFinder	no		Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	Orphan	GAAATGCGATTGCATCGCTCTAGT	24	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,Cas14b_CAS-V-F,cas3,cas6,cas4,cas1,csm3gr7,DEDDh,DinG,csa3,cas7,cas5,cas14j,cas8b2,cas2,TnpB_regular.1	NA|120aa|up_7|NC_020411.1_1203806_1204166_+,NA|408aa|up_6|NC_020411.1_1204209_1205433_+,NA|69aa|down_3|NC_020411.1_1219135_1219342_+	NA|87aa|up_9|NC_020411.1_1202171_1202432_-	pfam01381, HTH_3, Helix-turn-helix	NA|396aa|up_8|NC_020411.1_1202442_1203630_-	pfam07804, HipA_C, HipA-like C-terminal domain	NA|120aa|up_7|NC_020411.1_1203806_1204166_+	NA	NA|408aa|up_6|NC_020411.1_1204209_1205433_+	NA	NA|165aa|up_5|NC_020411.1_1205518_1206013_-	COG3260, COG3260, Ni,Fe-hydrogenase III small subunit [Energy production and conversion]	NA|479aa|up_4|NC_020411.1_1206022_1207459_-	COG3261, HycE, Ni,Fe-hydrogenase III large subunit [Energy production and conversion]	NA|481aa|up_3|NC_020411.1_1207455_1208898_-	PRK06458, PRK06458, hydrogenase 4 subunit F; Validated	NA|223aa|up_2|NC_020411.1_1208894_1209563_-	COG4237, HyfE, Hydrogenase 4 membrane component (E) [Energy production and conversion]	NA|306aa|up_1|NC_020411.1_1209572_1210490_-	COG0650, HyfC, Formate hydrogenlyase subunit 4 [Energy production and conversion]	NA|622aa|up_0|NC_020411.1_1210482_1212348_-	PRK06521, PRK06521, hydrogenase 4 subunit B; Validated	NA|273aa|down_0|NC_020411.1_1215609_1216428_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|182aa|down_1|NC_020411.1_1216424_1216970_-	PRK05456, PRK05456, ATP-dependent protease subunit HslV	NA|655aa|down_2|NC_020411.1_1216960_1218925_-	COG0021, TktA, Transketolase [Carbohydrate transport and metabolism]	NA|69aa|down_3|NC_020411.1_1219135_1219342_+	NA	NA|100aa|down_4|NC_020411.1_1219565_1219865_+	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|308aa|down_5|NC_020411.1_1219851_1220775_-	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|279aa|down_6|NC_020411.1_1220786_1221623_-	pfam04454, Linocin_M18, Encapsulating protein for peroxidase	NA|515aa|down_7|NC_020411.1_1221676_1223221_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	NA|175aa|down_8|NC_020411.1_1225370_1225895_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|354aa|down_9|NC_020411.1_1225929_1226991_-	pfam07680, DoxA, TQO small subunit DoxA
