assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	1	397394-397479	1	CRISPRCasFinder	no		PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Orphan	GTCATTTTTACCCGGTCCTTGTC	23	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|80aa|up_8|NZ_CP021983.1_387600_387840_-,NA|78aa|down_0|NZ_CP021983.1_397648_397882_+,NA|121aa|down_7|NZ_CP021983.1_407068_407431_+	NA|187aa|up_9|NZ_CP021983.1_386957_387518_-	cd03017, PRX_BCP, Peroxiredoxin (PRX) family, Bacterioferritin comigratory protein (BCP) subfamily; composed of  thioredoxin-dependent thiol peroxidases, widely expressed in pathogenic bacteria, that protect cells against toxicity from reactive oxygen species by reducing and detoxifying hydroperoxides	NA|80aa|up_8|NZ_CP021983.1_387600_387840_-	NA	NA|515aa|up_7|NZ_CP021983.1_388073_389618_+	COG0644, FixC, Dehydrogenases (flavoproteins) [Energy production and conversion]	NA|277aa|up_6|NZ_CP021983.1_389619_390450_+	COG2912, COG2912, Uncharacterized conserved protein [Function unknown]	NA|261aa|up_5|NZ_CP021983.1_390466_391249_-	COG3694, COG3694, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|277aa|up_4|NZ_CP021983.1_391248_392079_-	COG4587, COG4587, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|334aa|up_3|NZ_CP021983.1_392209_393211_+	TIGR04122, hypothetical_protein, putative exonuclease, DNA ligase-associated	NA|127aa|up_2|NZ_CP021983.1_393600_393981_+	pfam07784, DUF1622, Protein of unknown function (DUF1622)	NA|106aa|up_1|NZ_CP021983.1_394129_394447_-	pfam06967, Mo-nitro_C, Mo-dependent nitrogenase C-terminus	NA|663aa|up_0|NZ_CP021983.1_395116_397105_-	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|78aa|down_0|NZ_CP021983.1_397648_397882_+	NA	NA|188aa|down_1|NZ_CP021983.1_397894_398458_-	COG4968, PilE, Tfp pilus assembly protein PilE [Cell motility and secretion / Intracellular trafficking and secretion]	NA|328aa|down_2|NZ_CP021983.1_399137_400121_+	COG4586, COG4586, ABC-type uncharacterized transport system, ATPase component [General function prediction only]	NA|584aa|down_3|NZ_CP021983.1_400179_401931_-	cd03528, Rieske_RO_ferredoxin, Rieske non-heme iron oxygenase (RO) family, Rieske ferredoxin component; composed of the Rieske ferredoxin component of some three-component RO systems including biphenyl dioxygenase (BPDO) and carbazole 1,9a-dioxygenase (CARDO)	NA|372aa|down_4|NZ_CP021983.1_402593_403709_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|224aa|down_5|NZ_CP021983.1_405647_406319_+	PRK00102, rnc, ribonuclease III; Reviewed	NA|96aa|down_6|NZ_CP021983.1_406385_406673_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|121aa|down_7|NZ_CP021983.1_407068_407431_+	NA	NA|94aa|down_8|NZ_CP021983.1_407427_407709_+	pfam01844, HNH, HNH endonuclease	NA|217aa|down_9|NZ_CP021983.1_407762_408413_+	pfam05685, Uma2, Putative restriction endonuclease
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	2	497487-497735	1,2,1	CRT,CRISPRCasFinder,PILER-CR	no	cas2,cas1,PD-DExK	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Unclear	GAAGGTTTCCGTCCCCTTGCGGGGTAATGGATTTTGAAC,GTTTCCGTCCCCTTGCGGGGTAATGGATTTTGAAC,GAAGGTTTCCGTCCCCTTGCGGGGTAATGGATTTTGAACA	39,35,40	6	9	497596-497624|497664-497695|497664-497695|497596-497628|497664-497699|497664-497699|497597-497624|497665-497695|497665-497695	NZ_CP021983.1_138399-138427|NZ_CP021983.1_1212532-1212563|NZ_CP021983.1_1705852-1705883|NZ_CP021983.1_138399-138431|NZ_CP021983.1_1212532-1212567|NZ_CP021983.1_1705852-1705887|NZ_CP021983.1_138400-138427|NZ_CP021983.1_1212533-1212563|NZ_CP021983.1_1705853-1705883	NA:NA:NA	3,3,2	3	Unclear	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|199aa|up_6|NZ_CP021983.1_491363_491960_-,NA|83aa|up_4|NZ_CP021983.1_492383_492632_+,NA|156aa|down_8|NZ_CP021983.1_505266_505734_-,NA|342aa|down_9|NZ_CP021983.1_505746_506772_-	NA|212aa|up_9|NZ_CP021983.1_487486_488122_-	TIGR04155, hypothetical_protein, PEP-CTERM protein sorting domain, cyanobacterial subclass	NA|270aa|up_8|NZ_CP021983.1_488433_489243_+	pfam14014, DUF4230, Protein of unknown function (DUF4230)	NA|431aa|up_7|NZ_CP021983.1_490074_491367_+	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|199aa|up_6|NZ_CP021983.1_491363_491960_-	NA	NA|71aa|up_5|NZ_CP021983.1_492004_492217_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|83aa|up_4|NZ_CP021983.1_492383_492632_+	NA	NA|268aa|up_3|NZ_CP021983.1_492869_493673_-	pfam13267, DUF4058, Protein of unknown function (DUF4058)	NA|735aa|up_2|NZ_CP021983.1_493701_495906_-	sd00006, TPR, Tetratricopeptide repeat	NA|261aa|up_1|NZ_CP021983.1_495936_496719_-	pfam13267, DUF4058, Protein of unknown function (DUF4058)	NA|119aa|up_0|NZ_CP021983.1_496788_497145_-	TIGR02436, S23_ribosomal_protein, four helix bundle protein	NA|263aa|down_0|NZ_CP021983.1_497963_498752_-	pfam10087, DUF2325, Uncharacterized protein conserved in bacteria (DUF2325)	NA|132aa|down_1|NZ_CP021983.1_498754_499150_-	pfam05635, 23S_rRNA_IVP, 23S rRNA-intervening sequence protein	NA|140aa|down_2|NZ_CP021983.1_499193_499613_-	pfam05635, 23S_rRNA_IVP, 23S rRNA-intervening sequence protein	cas2|93aa|down_3|NZ_CP021983.1_499627_499906_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|655aa|down_4|NZ_CP021983.1_499905_501870_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|457aa|down_5|NZ_CP021983.1_502184_503555_+	pfam13191, AAA_16, AAA ATPase domain	NA|161aa|down_6|NZ_CP021983.1_503743_504226_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|174aa|down_7|NZ_CP021983.1_504515_505037_-	COG2091, Sfp, Phosphopantetheinyl transferase [Coenzyme metabolism]	NA|156aa|down_8|NZ_CP021983.1_505266_505734_-	NA	NA|342aa|down_9|NZ_CP021983.1_505746_506772_-	NA
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	3	704215-707055	2,3,2	PILER-CR,CRISPRCasFinder,CRT	no	cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Type III-B,Type III-D,Type III-C,Type III-A, Type III-C?	GTTTCCATTAATTCGACTTCCGAAGAAGTT,GTTTCCATTAATTCGACTTCCGAAGAAGTTGGCGAC,GTTTCCATTAATTCGACTTCCGAAGAAGTTNNNG	30,36,34	0	0	NA	NA	NA:NA:NA	38,38,38	38	TypeIII-B,TypeIII-C?,TypeIII-D,TypeIII-C,TypeIII-A	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|117aa|up_9|NZ_CP021983.1_694859_695210_-,NA|154aa|up_6|NZ_CP021983.1_698420_698882_-,NA|71aa|up_2|NZ_CP021983.1_701305_701518_+,NA|261aa|down_0|NZ_CP021983.1_707317_708100_+,NA|68aa|down_3|NZ_CP021983.1_709575_709779_+,NA|75aa|down_5|NZ_CP021983.1_710274_710499_+,NA|72aa|down_6|NZ_CP021983.1_710912_711128_+,NA|78aa|down_7|NZ_CP021983.1_711407_711641_+	NA|117aa|up_9|NZ_CP021983.1_694859_695210_-	NA	NA|115aa|up_8|NZ_CP021983.1_695249_695594_-	pfam15643, Tox-PL-2, Papain fold toxin 2	NA|858aa|up_7|NZ_CP021983.1_695773_698347_+	pfam12770, CHAT, CHAT domain	NA|154aa|up_6|NZ_CP021983.1_698420_698882_-	NA	NA|72aa|up_5|NZ_CP021983.1_699147_699363_+	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|195aa|up_4|NZ_CP021983.1_699689_700274_-	pfam05685, Uma2, Putative restriction endonuclease	NA|62aa|up_3|NZ_CP021983.1_701058_701244_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|71aa|up_2|NZ_CP021983.1_701305_701518_+	NA	NA|48aa|up_1|NZ_CP021983.1_701560_701704_+	COG1569, COG1569, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|498aa|up_0|NZ_CP021983.1_701936_703430_+	pfam13267, DUF4058, Protein of unknown function (DUF4058)	NA|261aa|down_0|NZ_CP021983.1_707317_708100_+	NA	NA|175aa|down_1|NZ_CP021983.1_708322_708847_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|222aa|down_2|NZ_CP021983.1_708854_709520_-	pfam05685, Uma2, Putative restriction endonuclease	NA|68aa|down_3|NZ_CP021983.1_709575_709779_+	NA	NA|156aa|down_4|NZ_CP021983.1_709802_710270_+	cd18687, PIN_VapC-like, uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|75aa|down_5|NZ_CP021983.1_710274_710499_+	NA	NA|72aa|down_6|NZ_CP021983.1_710912_711128_+	NA	NA|78aa|down_7|NZ_CP021983.1_711407_711641_+	NA	NA|137aa|down_8|NZ_CP021983.1_711637_712048_+	cd18745, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|122aa|down_9|NZ_CP021983.1_712205_712571_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	4	713023-713160	4	CRISPRCasFinder	no	cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Type III-B,Type III-D,Type III-C,Type III-A, Type III-C?	TCTATGAAAATATTAAGCAGTTGTCGGAGCGACTGCTAAGGGTTGGAT	48	0	0	NA	NA	NA	1	1	TypeIII-B,TypeIII-C?,TypeIII-D,TypeIII-C,TypeIII-A	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|68aa|up_7|NZ_CP021983.1_709575_709779_+,NA|75aa|up_5|NZ_CP021983.1_710274_710499_+,NA|72aa|up_4|NZ_CP021983.1_710912_711128_+,NA|78aa|up_3|NZ_CP021983.1_711407_711641_+,NA|278aa|down_0|NZ_CP021983.1_713249_714083_-,cmr5gr11|138aa|down_2|NZ_CP021983.1_716205_716619_-,NA|66aa|down_4|NZ_CP021983.1_717746_717944_-	NA|175aa|up_9|NZ_CP021983.1_708322_708847_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|222aa|up_8|NZ_CP021983.1_708854_709520_-	pfam05685, Uma2, Putative restriction endonuclease	NA|68aa|up_7|NZ_CP021983.1_709575_709779_+	NA	NA|156aa|up_6|NZ_CP021983.1_709802_710270_+	cd18687, PIN_VapC-like, uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|75aa|up_5|NZ_CP021983.1_710274_710499_+	NA	NA|72aa|up_4|NZ_CP021983.1_710912_711128_+	NA	NA|78aa|up_3|NZ_CP021983.1_711407_711641_+	NA	NA|137aa|up_2|NZ_CP021983.1_711637_712048_+	cd18745, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|122aa|up_1|NZ_CP021983.1_712205_712571_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|111aa|up_0|NZ_CP021983.1_712585_712918_-	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|278aa|down_0|NZ_CP021983.1_713249_714083_-	NA	cmr6gr7|710aa|down_1|NZ_CP021983.1_714121_716251_-	COG1604, COG1604, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cmr5gr11|138aa|down_2|NZ_CP021983.1_716205_716619_-	NA	cmr4gr7|327aa|down_3|NZ_CP021983.1_716705_717686_-	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	NA|66aa|down_4|NZ_CP021983.1_717746_717944_-	NA	cmr3gr5|400aa|down_5|NZ_CP021983.1_718029_719229_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cas10|1096aa|down_6|NZ_CP021983.1_719225_722513_-	pfam12469, DUF3692, CRISPR-associated protein	NA|183aa|down_7|NZ_CP021983.1_722962_723511_+	cd02232, cupin_ARD, acireductone dioxygenase (ARD), cupin domain	NA|727aa|down_8|NZ_CP021983.1_723563_725744_-	cd07302, CHD, cyclase homology domain	NA|351aa|down_9|NZ_CP021983.1_726721_727774_-	pfam10017, Methyltransf_33, Histidine-specific methyltransferase, SAM-dependent
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	5	805950-806042	5	CRISPRCasFinder	no	cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas1,cas2	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Type III-B,Type III-D,Type III-C,Type III-A	GGGCGCCCAGGCTATTGGTCGGGGCGCACGG	31	0	0	NA	NA	NA	1	1	TypeIII-A,TypeIII-D,TypeIII-C,TypeIII-B	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|67aa|up_7|NZ_CP021983.1_799828_800029_+,NA|157aa|up_1|NZ_CP021983.1_804292_804763_-,NA|394aa|down_2|NZ_CP021983.1_809407_810589_+,NA|232aa|down_4|NZ_CP021983.1_811216_811912_-,NA|356aa|down_5|NZ_CP021983.1_811964_813031_-,NA|411aa|down_6|NZ_CP021983.1_813049_814282_-,cmr5gr11|141aa|down_8|NZ_CP021983.1_816344_816767_-	NA|1005aa|up_9|NZ_CP021983.1_795490_798505_+	PRK05306, infB, translation initiation factor IF-2; Validated	NA|365aa|up_8|NZ_CP021983.1_798504_799599_+	NF033183, colliding_TM, low-complexity tail membrane protein	NA|67aa|up_7|NZ_CP021983.1_799828_800029_+	NA	NA|295aa|up_6|NZ_CP021983.1_800225_801110_+	pfam14257, DUF4349, Domain of unknown function (DUF4349)	NA|305aa|up_5|NZ_CP021983.1_801702_802617_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|140aa|up_4|NZ_CP021983.1_802691_803111_+	COG3682, COG3682, Predicted transcriptional regulator [Transcription]	NA|284aa|up_3|NZ_CP021983.1_803110_803962_+	cd07326, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|64aa|up_2|NZ_CP021983.1_803996_804188_-	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|157aa|up_1|NZ_CP021983.1_804292_804763_-	NA	NA|252aa|up_0|NZ_CP021983.1_804870_805626_-	PRK07994, PRK07994, DNA polymerase III subunits gamma and tau; Validated	NA|185aa|down_0|NZ_CP021983.1_807837_808392_-	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|195aa|down_1|NZ_CP021983.1_808397_808982_-	pfam05685, Uma2, Putative restriction endonuclease	NA|394aa|down_2|NZ_CP021983.1_809407_810589_+	NA	NA|93aa|down_3|NZ_CP021983.1_810790_811069_+	COG2804, PulE, Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB [Cell motility and secretion / Intracellular trafficking and secretion]	NA|232aa|down_4|NZ_CP021983.1_811216_811912_-	NA	NA|356aa|down_5|NZ_CP021983.1_811964_813031_-	NA	NA|411aa|down_6|NZ_CP021983.1_813049_814282_-	NA	NA|571aa|down_7|NZ_CP021983.1_814400_816113_-	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	cmr5gr11|141aa|down_8|NZ_CP021983.1_816344_816767_-	NA	cmr4gr7|350aa|down_9|NZ_CP021983.1_816785_817835_-	COG1336, COG1336, CRISPR system related protein, RAMP superfamily [Defense mechanisms]
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	6	826073-832012	6,3,3,7	CRISPRCasFinder,CRT,PILER-CR,CRISPRCasFinder	no	cmr5gr11,cmr4gr7,cmr3gr5,cas10,cas1,cas2	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Type III-B,Type III-D,Type III-C,Type III-A	CCCCACCGATTGGGTTAATTCGGAATAGTTGGAAAC,NNCCCCACCGATTGGGTTAATTCGGAATAGTTGGAAA,TTCCGAATTAACCCAATCG,CCCCACCGATTGGGTTAATTCGGAATAGTTGGAAAC	36,37,19,36	0	0	NA	NA	NA:NA:NA:NA	75,77,76,75	77	TypeIII-A,TypeIII-D,TypeIII-C,TypeIII-B	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	cmr5gr11|141aa|up_8|NZ_CP021983.1_816344_816767_-,NA|238aa|up_4|NZ_CP021983.1_821386_822100_-,NA|140aa|up_2|NZ_CP021983.1_823943_824363_+,NA|84aa|down_7|NZ_CP021983.1_840458_840710_-,NA|127aa|down_8|NZ_CP021983.1_840735_841116_-	NA|571aa|up_9|NZ_CP021983.1_814400_816113_-	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	cmr5gr11|141aa|up_8|NZ_CP021983.1_816344_816767_-	NA	cmr4gr7|350aa|up_7|NZ_CP021983.1_816785_817835_-	COG1336, COG1336, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cmr3gr5|397aa|up_6|NZ_CP021983.1_817964_819155_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cas10|633aa|up_5|NZ_CP021983.1_819175_821074_-	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	NA|238aa|up_4|NZ_CP021983.1_821386_822100_-	NA	NA|206aa|up_3|NZ_CP021983.1_822428_823046_+	pfam05685, Uma2, Putative restriction endonuclease	NA|140aa|up_2|NZ_CP021983.1_823943_824363_+	NA	cas1|335aa|up_1|NZ_CP021983.1_824591_825596_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|92aa|up_0|NZ_CP021983.1_825595_825871_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|115aa|down_0|NZ_CP021983.1_833018_833363_+	COG2852, COG2852, Very-short-patch-repair endonuclease [Replication, recombination,    and repair]	NA|349aa|down_1|NZ_CP021983.1_834015_835062_-	pfam05598, DUF772, Transposase domain (DUF772)	NA|384aa|down_2|NZ_CP021983.1_835427_836579_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|391aa|down_3|NZ_CP021983.1_837129_838302_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|221aa|down_4|NZ_CP021983.1_838896_839559_-	PRK12567, PRK12567, putative monovalent cation/H+ antiporter subunit B; Reviewed	NA|212aa|down_5|NZ_CP021983.1_839555_840191_-	PRK07377, PRK07377, hypothetical protein; Provisional	NA|93aa|down_6|NZ_CP021983.1_840183_840462_-	pfam03334, PhaG_MnhG_YufB, Na+/H+ antiporter subunit	NA|84aa|down_7|NZ_CP021983.1_840458_840710_-	NA	NA|127aa|down_8|NZ_CP021983.1_840735_841116_-	NA	NA|478aa|down_9|NZ_CP021983.1_841112_842546_-	PRK07234, PRK07234, putative monovalent cation/H+ antiporter subunit D; Reviewed
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	7	1091429-1091515	8	CRISPRCasFinder	no		PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Orphan	TGCGTCCCAAATAAAACTCTATGG	24	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|62aa|up_1|NZ_CP021983.1_1088773_1088959_-,NA|462aa|down_1|NZ_CP021983.1_1092319_1093705_-,NA|84aa|down_3|NZ_CP021983.1_1094732_1094984_+	NA|77aa|up_9|NZ_CP021983.1_1083713_1083944_-	pfam13701, DDE_Tnp_1_4, Transposase DDE domain group 1	NA|337aa|up_8|NZ_CP021983.1_1083915_1084926_-	TIGR04025, hypothetical_protein, PPOX class probable FMN-dependent enzyme, DR_2398 family	NA|200aa|up_7|NZ_CP021983.1_1084965_1085565_-	cd03206, GST_C_7, C-terminal, alpha helical domain of an unknown subfamily 7 of Glutathione S-transferases	NA|183aa|up_6|NZ_CP021983.1_1085712_1086261_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|251aa|up_5|NZ_CP021983.1_1086351_1087104_+	COG0580, GlpF, Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) [Carbohydrate transport and metabolism]	NA|232aa|up_4|NZ_CP021983.1_1087160_1087856_-	PRK11752, PRK11752, putative S-transferase; Provisional	NA|116aa|up_3|NZ_CP021983.1_1088030_1088378_-	PRK09907, PRK09907, endoribonuclease MazF	NA|81aa|up_2|NZ_CP021983.1_1088371_1088614_-	COG2336, MazE, Growth regulator [Signal transduction mechanisms]	NA|62aa|up_1|NZ_CP021983.1_1088773_1088959_-	NA	NA|706aa|up_0|NZ_CP021983.1_1089107_1091225_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|133aa|down_0|NZ_CP021983.1_1091789_1092188_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|462aa|down_1|NZ_CP021983.1_1092319_1093705_-	NA	NA|178aa|down_2|NZ_CP021983.1_1093804_1094338_-	COG0783, Dps, DNA-binding ferritin-like protein (oxidative damage protectant) [Inorganic ion transport and metabolism]	NA|84aa|down_3|NZ_CP021983.1_1094732_1094984_+	NA	NA|664aa|down_4|NZ_CP021983.1_1095252_1097244_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|474aa|down_5|NZ_CP021983.1_1097250_1098672_-	COG0469, PykF, Pyruvate kinase [Carbohydrate transport and metabolism]	NA|454aa|down_6|NZ_CP021983.1_1098936_1100298_-	cd00880, Era_like, E	NA|507aa|down_7|NZ_CP021983.1_1100492_1102013_-	pfam05128, DUF697, Domain of unknown function (DUF697)	NA|1046aa|down_8|NZ_CP021983.1_1102660_1105798_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|84aa|down_9|NZ_CP021983.1_1105810_1106062_+	pfam14026, DUF4242, Protein of unknown function (DUF4242)
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	8	1166486-1166561	9	CRISPRCasFinder	no		PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Orphan	CCTTGCCGATGCGACGCGATCTAT	24	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|137aa|up_9|NZ_CP021983.1_1154800_1155211_+,NA|235aa|up_6|NZ_CP021983.1_1158167_1158872_+,NA	NA|137aa|up_9|NZ_CP021983.1_1154800_1155211_+	NA	NA|809aa|up_8|NZ_CP021983.1_1155251_1157678_+	cd03875, M28_Fxna_like, M28 Zn-peptidase Endoplasmic reticulum metallopeptidase 1	NA|77aa|up_7|NZ_CP021983.1_1157719_1157950_+	pfam12676, DUF3796, Protein of unknown function (DUF3796)	NA|235aa|up_6|NZ_CP021983.1_1158167_1158872_+	NA	NA|273aa|up_5|NZ_CP021983.1_1158871_1159690_+	cd01935, Ntn_CGH_like, Choloylglycine hydrolase (CGH)_like	NA|337aa|up_4|NZ_CP021983.1_1159803_1160814_-	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|137aa|up_3|NZ_CP021983.1_1160901_1161312_-	cd06154, YjgF_YER057c_UK114_like_6, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function	NA|201aa|up_2|NZ_CP021983.1_1161333_1161936_-	cd01835, SGNH_hydrolase_like_3, SGNH_hydrolase subfamily	NA|393aa|up_1|NZ_CP021983.1_1162454_1163633_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|848aa|up_0|NZ_CP021983.1_1163645_1166189_-	pfam00343, Phosphorylase, Carbohydrate phosphorylase	NA|82aa|down_0|NZ_CP021983.1_1166823_1167069_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|73aa|down_1|NZ_CP021983.1_1167185_1167404_-	pfam13619, KTSC, KTSC domain	NA|397aa|down_2|NZ_CP021983.1_1167575_1168766_-	PRK14012, PRK14012, IscS subfamily cysteine desulfurase	NA|617aa|down_3|NZ_CP021983.1_1168806_1170657_-	COG0370, FeoB, Fe2+ transport system protein B [Inorganic ion transport and metabolism]	NA|301aa|down_4|NZ_CP021983.1_1170891_1171794_+	pfam08447, PAS_3, PAS fold	NA|354aa|down_5|NZ_CP021983.1_1171860_1172921_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|381aa|down_6|NZ_CP021983.1_1172991_1174134_+	cd01948, EAL, EAL domain	NA|124aa|down_7|NZ_CP021983.1_1174105_1174477_-	cd19937, REC_OmpR_BsPhoP-like, phosphoacceptor receiver (REC) domain of BsPhoP-like OmpR family response regulators	NA|407aa|down_8|NZ_CP021983.1_1174525_1175746_-	pfam05626, DUF790, Protein of unknown function (DUF790)	NA|261aa|down_9|NZ_CP021983.1_1175790_1176573_-	pfam02548, Pantoate_transf, Ketopantoate hydroxymethyltransferase
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	9	1269698-1269795	10	CRISPRCasFinder	no	csa3	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Type I-A	CTTAGGTTTCAGCCCTACACGCA	23	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|708aa|up_0|NZ_CP021983.1_1266914_1269038_+,NA|73aa|down_0|NZ_CP021983.1_1269869_1270088_+,NA|80aa|down_1|NZ_CP021983.1_1270185_1270425_+,NA|132aa|down_2|NZ_CP021983.1_1270448_1270844_+,NA|105aa|down_3|NZ_CP021983.1_1270988_1271303_+,NA|221aa|down_5|NZ_CP021983.1_1272352_1273015_+,NA|267aa|down_7|NZ_CP021983.1_1273587_1274388_+,NA|84aa|down_8|NZ_CP021983.1_1275320_1275572_-	NA|426aa|up_9|NZ_CP021983.1_1258115_1259393_-	pfam09506, Salt_tol_Pase, Glucosylglycerol-phosphate phosphatase (Salt_tol_Pase)	NA|62aa|up_8|NZ_CP021983.1_1259834_1260020_+	pfam14105, DUF4278, Domain of unknown function (DUF4278)	NA|190aa|up_7|NZ_CP021983.1_1260172_1260742_-	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|167aa|up_6|NZ_CP021983.1_1260961_1261462_-	cd00483, HPPK, 7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase (HPPK)	NA|106aa|up_5|NZ_CP021983.1_1261550_1261868_-	TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin	NA|249aa|up_4|NZ_CP021983.1_1262048_1262795_+	PRK05557, fabG, 3-ketoacyl-(acyl-carrier-protein) reductase; Validated	NA|592aa|up_3|NZ_CP021983.1_1262784_1264560_-	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|119aa|up_2|NZ_CP021983.1_1264602_1264959_-	PRK07451, PRK07451, translation initiation factor	NA|577aa|up_1|NZ_CP021983.1_1265133_1266864_+	COG2831, FhaC, Hemolysin activation/secretion protein [Intracellular trafficking and secretion]	NA|708aa|up_0|NZ_CP021983.1_1266914_1269038_+	NA	NA|73aa|down_0|NZ_CP021983.1_1269869_1270088_+	NA	NA|80aa|down_1|NZ_CP021983.1_1270185_1270425_+	NA	NA|132aa|down_2|NZ_CP021983.1_1270448_1270844_+	NA	NA|105aa|down_3|NZ_CP021983.1_1270988_1271303_+	NA	NA|177aa|down_4|NZ_CP021983.1_1271450_1271981_-	cd02042, ParAB_family, partition proteins ParAB family	NA|221aa|down_5|NZ_CP021983.1_1272352_1273015_+	NA	NA|143aa|down_6|NZ_CP021983.1_1273011_1273440_-	TIGR02225, Tyrosine_recombinase_XerD, tyrosine recombinase XerD	NA|267aa|down_7|NZ_CP021983.1_1273587_1274388_+	NA	NA|84aa|down_8|NZ_CP021983.1_1275320_1275572_-	NA	NA|219aa|down_9|NZ_CP021983.1_1275612_1276269_-	cd01197, INT_FimBE_like, FimB and FimE and related proteins, integrase/recombinases
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	10	1277480-1277627	11	CRISPRCasFinder	no	csa3	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Type I-A	CACTCCCCCCCGCAGGTCGGGGGCGGG	27	0	0	NA	NA	NA	2	2	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|80aa|up_9|NZ_CP021983.1_1270185_1270425_+,NA|132aa|up_8|NZ_CP021983.1_1270448_1270844_+,NA|105aa|up_7|NZ_CP021983.1_1270988_1271303_+,NA|221aa|up_5|NZ_CP021983.1_1272352_1273015_+,NA|267aa|up_3|NZ_CP021983.1_1273587_1274388_+,NA|84aa|up_2|NZ_CP021983.1_1275320_1275572_-,NA|304aa|up_0|NZ_CP021983.1_1276421_1277333_-,NA|82aa|down_0|NZ_CP021983.1_1277678_1277924_-,NA|139aa|down_3|NZ_CP021983.1_1279768_1280185_-,NA|175aa|down_4|NZ_CP021983.1_1280181_1280706_-,NA|438aa|down_5|NZ_CP021983.1_1280686_1282000_-,NA|216aa|down_6|NZ_CP021983.1_1281996_1282644_-,NA|106aa|down_7|NZ_CP021983.1_1282647_1282965_-,NA|61aa|down_9|NZ_CP021983.1_1283280_1283463_-	NA|80aa|up_9|NZ_CP021983.1_1270185_1270425_+	NA	NA|132aa|up_8|NZ_CP021983.1_1270448_1270844_+	NA	NA|105aa|up_7|NZ_CP021983.1_1270988_1271303_+	NA	NA|177aa|up_6|NZ_CP021983.1_1271450_1271981_-	cd02042, ParAB_family, partition proteins ParAB family	NA|221aa|up_5|NZ_CP021983.1_1272352_1273015_+	NA	NA|143aa|up_4|NZ_CP021983.1_1273011_1273440_-	TIGR02225, Tyrosine_recombinase_XerD, tyrosine recombinase XerD	NA|267aa|up_3|NZ_CP021983.1_1273587_1274388_+	NA	NA|84aa|up_2|NZ_CP021983.1_1275320_1275572_-	NA	NA|219aa|up_1|NZ_CP021983.1_1275612_1276269_-	cd01197, INT_FimBE_like, FimB and FimE and related proteins, integrase/recombinases	NA|304aa|up_0|NZ_CP021983.1_1276421_1277333_-	NA	NA|82aa|down_0|NZ_CP021983.1_1277678_1277924_-	NA	NA|103aa|down_1|NZ_CP021983.1_1277936_1278245_-	pfam01541, GIY-YIG, GIY-YIG catalytic domain	NA|403aa|down_2|NZ_CP021983.1_1278603_1279812_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|139aa|down_3|NZ_CP021983.1_1279768_1280185_-	NA	NA|175aa|down_4|NZ_CP021983.1_1280181_1280706_-	NA	NA|438aa|down_5|NZ_CP021983.1_1280686_1282000_-	NA	NA|216aa|down_6|NZ_CP021983.1_1281996_1282644_-	NA	NA|106aa|down_7|NZ_CP021983.1_1282647_1282965_-	NA	NA|79aa|down_8|NZ_CP021983.1_1282968_1283205_-	PRK00432, PRK00432, 30S ribosomal protein S27ae; Validated	NA|61aa|down_9|NZ_CP021983.1_1283280_1283463_-	NA
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	11	1482357-1482438	12	CRISPRCasFinder	no	WYL,PD-DExK	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Unclear	GGACTCCGGCTTTGAGCAAACCCGCCAA	28	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA,NA|242aa|down_4|NZ_CP021983.1_1487860_1488586_-,NA|61aa|down_7|NZ_CP021983.1_1492044_1492227_+,PD-DExK|207aa|down_9|NZ_CP021983.1_1493814_1494435_+	NA|418aa|up_9|NZ_CP021983.1_1464495_1465749_+	cd03825, GT4_WcaC-like, putative colanic acid biosynthesis glycosyl transferase WcaC and similar proteins	NA|115aa|up_8|NZ_CP021983.1_1465765_1466110_-	TIGR02181, GRX_bact, Glutaredoxin, GrxC family	NA|168aa|up_7|NZ_CP021983.1_1466492_1466996_+	sd00006, TPR, Tetratricopeptide repeat	NA|750aa|up_6|NZ_CP021983.1_1467257_1469507_+	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB	NA|458aa|up_5|NZ_CP021983.1_1469503_1470877_-	PRK02705, murD, UDP-N-acetylmuramoyl-L-alanine--D-glutamate ligase	NA|719aa|up_4|NZ_CP021983.1_1470880_1473037_-	PRK01233, glyS, glycyl-tRNA synthetase subunit beta; Validated	NA|571aa|up_3|NZ_CP021983.1_1473184_1474897_-	pfam09818, ABC_ATPase, Predicted ATPase of the ABC class	NA|669aa|up_2|NZ_CP021983.1_1475593_1477600_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|133aa|up_1|NZ_CP021983.1_1479749_1480148_+	pfam03912, Psb28, Psb28 protein	NA|309aa|up_0|NZ_CP021983.1_1480144_1481071_+	cd05242, SDR_a8, atypical (a) SDRs, subgroup 8	NA|207aa|down_0|NZ_CP021983.1_1483018_1483639_+	COG2119, COG2119, Predicted membrane protein [Function unknown]	NA|463aa|down_1|NZ_CP021983.1_1483646_1485035_-	COG3379, COG3379, Uncharacterized conserved protein [Function unknown]	NA|394aa|down_2|NZ_CP021983.1_1485073_1486255_-	COG1453, COG1453, Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only]	WYL|242aa|down_3|NZ_CP021983.1_1486906_1487632_-	TIGR03985, hypothetical_protein_sll7078, CRISPR-associated protein, TIGR03985 family	NA|242aa|down_4|NZ_CP021983.1_1487860_1488586_-	NA	NA|131aa|down_5|NZ_CP021983.1_1488578_1488971_-	pfam18182, mCpol, minimal CRISPR polymerase domain	NA|457aa|down_6|NZ_CP021983.1_1489701_1491072_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|61aa|down_7|NZ_CP021983.1_1492044_1492227_+	NA	NA|423aa|down_8|NZ_CP021983.1_1492449_1493718_+	pfam13676, TIR_2, TIR domain	PD-DExK|207aa|down_9|NZ_CP021983.1_1493814_1494435_+	NA
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	12	1777108-1777357	4,13,4	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Orphan	GCTTCAATGGGGCCGCTCATTTAGAGAGCGGTGCGAC,GCTTCAATGGGGCCGCTCATTTAGAGAGCGGTGCGAC,GCTTCAATGGGGCCGCTCATTTAGAGAGCGGTGCGAC	37,37,37	0	0	NA	NA	NA:NA:NA	2,2,3	3	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|96aa|up_9|NZ_CP021983.1_1767058_1767346_+,NA|279aa|up_8|NZ_CP021983.1_1767424_1768261_+,NA|92aa|up_7|NZ_CP021983.1_1768555_1768831_+,NA|73aa|up_2|NZ_CP021983.1_1774389_1774608_+,NA|66aa|down_0|NZ_CP021983.1_1777424_1777622_+,NA|125aa|down_1|NZ_CP021983.1_1778354_1778729_-	NA|96aa|up_9|NZ_CP021983.1_1767058_1767346_+	NA	NA|279aa|up_8|NZ_CP021983.1_1767424_1768261_+	NA	NA|92aa|up_7|NZ_CP021983.1_1768555_1768831_+	NA	NA|766aa|up_6|NZ_CP021983.1_1768953_1771251_+	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB	NA|226aa|up_5|NZ_CP021983.1_1771867_1772545_-	PRK00058, PRK00058, peptide-methionine (S)-S-oxide reductase MsrA	NA|255aa|up_4|NZ_CP021983.1_1772746_1773511_-	COG1836, COG1836, Predicted membrane protein [Function unknown]	NA|145aa|up_3|NZ_CP021983.1_1773714_1774149_+	cd08357, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) familyprotein, glyoxalase I, and type I ring-cleaving dioxygenases	NA|73aa|up_2|NZ_CP021983.1_1774389_1774608_+	NA	NA|233aa|up_1|NZ_CP021983.1_1774706_1775405_-	PRK01112, PRK01112, 2,3-bisphosphoglycerate-dependent phosphoglycerate mutase	NA|204aa|up_0|NZ_CP021983.1_1776202_1776814_-	pfam05685, Uma2, Putative restriction endonuclease	NA|66aa|down_0|NZ_CP021983.1_1777424_1777622_+	NA	NA|125aa|down_1|NZ_CP021983.1_1778354_1778729_-	NA	NA|443aa|down_2|NZ_CP021983.1_1779171_1780500_-	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|157aa|down_3|NZ_CP021983.1_1781547_1782018_-	pfam08670, MEKHLA, MEKHLA domain	NA|174aa|down_4|NZ_CP021983.1_1782103_1782625_-	pfam09988, DUF2227, Uncharacterized metal-binding protein (DUF2227)	NA|1133aa|down_5|NZ_CP021983.1_1783995_1787394_+	cd09178, PLDc_N_Snf2_like, N-terminal putative catalytic domain of uncharacterized HKD family nucleases fused to putative helicases from the Snf2-like family	NA|185aa|down_6|NZ_CP021983.1_1787418_1787973_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|97aa|down_7|NZ_CP021983.1_1787990_1788281_+	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|457aa|down_8|NZ_CP021983.1_1788349_1789720_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|302aa|down_9|NZ_CP021983.1_1791841_1792747_-	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	13	2138193-2138305	14	CRISPRCasFinder	no		PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Orphan	CGCAGCGCTCTACCAACTGAGCTAATTCCCC	31	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|161aa|up_8|NZ_CP021983.1_2118355_2118838_-,NA|141aa|up_7|NZ_CP021983.1_2118862_2119285_-,NA|132aa|down_1|NZ_CP021983.1_2140051_2140447_+,NA|142aa|down_2|NZ_CP021983.1_2141447_2141873_+,NA|68aa|down_6|NZ_CP021983.1_2147993_2148197_+	NA|395aa|up_9|NZ_CP021983.1_2117167_2118352_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|161aa|up_8|NZ_CP021983.1_2118355_2118838_-	NA	NA|141aa|up_7|NZ_CP021983.1_2118862_2119285_-	NA	NA|82aa|up_6|NZ_CP021983.1_2119299_2119545_-	pfam00550, PP-binding, Phosphopantetheine attachment site	NA|285aa|up_5|NZ_CP021983.1_2119565_2120420_-	PRK05808, PRK05808, 3-hydroxybutyryl-CoA dehydrogenase; Validated	NA|1838aa|up_4|NZ_CP021983.1_2120441_2125955_-	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|1772aa|up_3|NZ_CP021983.1_2125955_2131271_-	PRK05691, PRK05691, peptide synthase; Validated	NA|707aa|up_2|NZ_CP021983.1_2133900_2136021_+	pfam01804, Penicil_amidase, Penicillin amidase	NA|205aa|up_1|NZ_CP021983.1_2136307_2136922_+	pfam05685, Uma2, Putative restriction endonuclease	NA|201aa|up_0|NZ_CP021983.1_2136957_2137560_-	pfam05685, Uma2, Putative restriction endonuclease	NA|255aa|down_0|NZ_CP021983.1_2138319_2139084_-	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|132aa|down_1|NZ_CP021983.1_2140051_2140447_+	NA	NA|142aa|down_2|NZ_CP021983.1_2141447_2141873_+	NA	NA|206aa|down_3|NZ_CP021983.1_2143188_2143806_+	PRK14730, coaE, dephospho-CoA kinase; Provisional	NA|455aa|down_4|NZ_CP021983.1_2143808_2145173_-	COG2133, COG2133, Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]	NA|78aa|down_5|NZ_CP021983.1_2145191_2145425_-	pfam02672, CP12, CP12 domain	NA|68aa|down_6|NZ_CP021983.1_2147993_2148197_+	NA	NA|87aa|down_7|NZ_CP021983.1_2148700_2148961_+	smart00966, SpoVT_AbrB, SpoVT / AbrB like domain	NA|144aa|down_8|NZ_CP021983.1_2148947_2149379_+	cd18683, PIN_VapC-like, Uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|124aa|down_9|NZ_CP021983.1_2149442_2149814_+	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	14	2245320-2247343	15,5,5	CRISPRCasFinder,CRT,PILER-CR	no	WYL,cas6,cas3,cas8b3,cas7,cas1,cas2	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Unclear	CCGCCAAACCTCTGATGCCGCAAGGCGTTGAGCAC,CCGCCAAACCTCTGATGCCGCAAGGCGTTGAGCAC,CCGCCAAACCTCTGATGCCGCAAGGCGTTGAGCAC	35,35,35	1	1	2245425-2245461	NZ_CP021983.1_1276826-1276862	I-A,I-B,II-B:I-A,I-B,II-B:I-A,I-B,II-B	27,28,26	28	Unclear	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|73aa|up_8|NZ_CP021983.1_2238202_2238421_-,NA|165aa|up_6|NZ_CP021983.1_2238708_2239203_+,NA|79aa|up_2|NZ_CP021983.1_2242552_2242789_+,NA|66aa|down_5|NZ_CP021983.1_2250285_2250483_+	cas3|802aa|up_9|NZ_CP021983.1_2235792_2238198_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	NA|73aa|up_8|NZ_CP021983.1_2238202_2238421_-	NA	NA|92aa|up_7|NZ_CP021983.1_2238404_2238680_-	pfam04365, BrnT_toxin, Ribonuclease toxin, BrnT, of type II toxin-antitoxin system	NA|165aa|up_6|NZ_CP021983.1_2238708_2239203_+	NA	cas8b3|557aa|up_5|NZ_CP021983.1_2239205_2240876_+	TIGR04413, hypothetical_protein_LEP1GSC082_4029, CRISPR type MYXAN-associated protein Cmx8	cas7|294aa|up_4|NZ_CP021983.1_2240902_2241784_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	NA|247aa|up_3|NZ_CP021983.1_2241796_2242537_+	TIGR02593, CRISPR-associated_protein_Cas5, CRISPR-associated protein Cas5, N-terminal domain	NA|79aa|up_2|NZ_CP021983.1_2242552_2242789_+	NA	cas1|556aa|up_1|NZ_CP021983.1_2243100_2244768_+	TIGR03983, hypothetical_protein_LA3181, CRISPR-associated endonuclease Cas1, subtype MYXAN	cas2|98aa|up_0|NZ_CP021983.1_2244777_2245071_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|133aa|down_0|NZ_CP021983.1_2247410_2247809_+	pfam10049, DUF2283, Protein of unknown function (DUF2283)	NA|65aa|down_1|NZ_CP021983.1_2247756_2247951_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|354aa|down_2|NZ_CP021983.1_2248139_2249200_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|82aa|down_3|NZ_CP021983.1_2249438_2249684_+	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|168aa|down_4|NZ_CP021983.1_2249676_2250180_+	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|66aa|down_5|NZ_CP021983.1_2250285_2250483_+	NA	NA|65aa|down_6|NZ_CP021983.1_2250489_2250684_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|115aa|down_7|NZ_CP021983.1_2250792_2251137_-	cd05403, NT_KNTase_like, Nucleotidyltransferase (NT) domain of Staphylococcus aureus kanamycin nucleotidyltransferase, and similar proteins	NA|191aa|down_8|NZ_CP021983.1_2251348_2251921_+	pfam05685, Uma2, Putative restriction endonuclease	NA|677aa|down_9|NZ_CP021983.1_2252287_2254318_-	COG1118, CysA, ABC-type sulfate/molybdate transport systems, ATPase component [Inorganic ion transport and metabolism]
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	15	2291345-2291594	6,6,16	PILER-CR,CRT,CRISPRCasFinder	no	cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,cas1,cas2	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Type III-B,Type III-D,Type III-C,Type III-A	CCTTCCCACTCAGT-GGGAAACTAATTGAATGGAAAC,CCTTCCCACTCAGTGGGAAACTAATTGAATGGAAAC,CTTCCCACTCAGTGGGAAACTAATTGAATGGAAAC	37,36,35	0	0	NA	NA	NA:NA:NA	3,3,3	3	TypeIII-A,TypeIII-D,TypeIII-C,TypeIII-B	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	cmr5gr11|151aa|up_9|NZ_CP021983.1_2281597_2282050_+,csx1|496aa|up_7|NZ_CP021983.1_2283864_2285352_+,NA|479aa|up_3|NZ_CP021983.1_2287035_2288472_+,NA|140aa|up_0|NZ_CP021983.1_2289695_2290115_-,NA	cmr5gr11|151aa|up_9|NZ_CP021983.1_2281597_2282050_+	NA	NA|565aa|up_8|NZ_CP021983.1_2282033_2283728_+	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	csx1|496aa|up_7|NZ_CP021983.1_2283864_2285352_+	NA	NA|103aa|up_6|NZ_CP021983.1_2285611_2285920_-	cd10456, GIY-YIG_UPF0213, The GIY-YIG domain of uncharacterized protein family UPF0213 related to structure-specific endonuclease SLX1	NA|108aa|up_5|NZ_CP021983.1_2286321_2286645_+	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|121aa|up_4|NZ_CP021983.1_2286649_2287012_+	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|479aa|up_3|NZ_CP021983.1_2287035_2288472_+	NA	NA|151aa|up_2|NZ_CP021983.1_2288542_2288995_-	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|73aa|up_1|NZ_CP021983.1_2288991_2289210_-	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|140aa|up_0|NZ_CP021983.1_2289695_2290115_-	NA	cas1|331aa|down_0|NZ_CP021983.1_2293250_2294243_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|93aa|down_1|NZ_CP021983.1_2294247_2294526_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|811aa|down_2|NZ_CP021983.1_2297065_2299498_+	PRK00409, PRK00409, recombination and DNA strand exchange inhibitor protein; Reviewed	NA|482aa|down_3|NZ_CP021983.1_2299505_2300951_-	TIGR03556, photolyase_8HDF, deoxyribodipyrimidine photo-lyase, 8-HDF type	NA|395aa|down_4|NZ_CP021983.1_2301926_2303111_+	PRK12309, PRK12309, transaldolase	NA|268aa|down_5|NZ_CP021983.1_2303193_2303997_+	cd05358, GlcDH_SDR_c, glucose 1 dehydrogenase (GlcDH), classical (c) SDRs	NA|562aa|down_6|NZ_CP021983.1_2304406_2306092_+	cd11333, AmyAc_SI_OligoGlu_DGase, Alpha amylase catalytic domain found in Sucrose isomerases, oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), dextran glucosidase (also called glucan 1,6-alpha-glucosidase), and related proteins	NA|676aa|down_7|NZ_CP021983.1_2306241_2308269_+	pfam06202, GDE_C, Amylo-alpha-1,6-glucosidase	NA|288aa|down_8|NZ_CP021983.1_2308274_2309138_-	cd01637, IMPase_like, Inositol-monophosphatase-like domains	NA|229aa|down_9|NZ_CP021983.1_2309246_2309933_-	PRK05653, fabG, 3-oxoacyl-ACP reductase FabG
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	16	2294711-2296526	7,17,7,8	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas10,cmr3gr5,cmr4gr7,cmr5gr11,csx1,cas1,cas2	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Type III-B,Type III-D,Type III-C,Type III-A	GTCCCCACTCGTTGGGGAAACTAATTGAATGGAAAC,GTCCCCACTCGTTGGGGAAACTAATTGAATGGAAAC,CCCCACTCGTTGGGGAAACTAATTGAATGGAAA,CCCACTCGTTGGGGAAACTAATTGAATGGAAAC	36,36,33,33	0	0	NA	NA	NA:NA:NA:NA	22,24,24,22	24	TypeIII-A,TypeIII-D,TypeIII-C,TypeIII-B	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	csx1|496aa|up_9|NZ_CP021983.1_2283864_2285352_+,NA|479aa|up_5|NZ_CP021983.1_2287035_2288472_+,NA|140aa|up_2|NZ_CP021983.1_2289695_2290115_-,NA	csx1|496aa|up_9|NZ_CP021983.1_2283864_2285352_+	NA	NA|103aa|up_8|NZ_CP021983.1_2285611_2285920_-	cd10456, GIY-YIG_UPF0213, The GIY-YIG domain of uncharacterized protein family UPF0213 related to structure-specific endonuclease SLX1	NA|108aa|up_7|NZ_CP021983.1_2286321_2286645_+	pfam04255, DUF433, Protein of unknown function (DUF433)	NA|121aa|up_6|NZ_CP021983.1_2286649_2287012_+	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|479aa|up_5|NZ_CP021983.1_2287035_2288472_+	NA	NA|151aa|up_4|NZ_CP021983.1_2288542_2288995_-	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|73aa|up_3|NZ_CP021983.1_2288991_2289210_-	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|140aa|up_2|NZ_CP021983.1_2289695_2290115_-	NA	cas1|331aa|up_1|NZ_CP021983.1_2293250_2294243_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|93aa|up_0|NZ_CP021983.1_2294247_2294526_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|811aa|down_0|NZ_CP021983.1_2297065_2299498_+	PRK00409, PRK00409, recombination and DNA strand exchange inhibitor protein; Reviewed	NA|482aa|down_1|NZ_CP021983.1_2299505_2300951_-	TIGR03556, photolyase_8HDF, deoxyribodipyrimidine photo-lyase, 8-HDF type	NA|395aa|down_2|NZ_CP021983.1_2301926_2303111_+	PRK12309, PRK12309, transaldolase	NA|268aa|down_3|NZ_CP021983.1_2303193_2303997_+	cd05358, GlcDH_SDR_c, glucose 1 dehydrogenase (GlcDH), classical (c) SDRs	NA|562aa|down_4|NZ_CP021983.1_2304406_2306092_+	cd11333, AmyAc_SI_OligoGlu_DGase, Alpha amylase catalytic domain found in Sucrose isomerases, oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), dextran glucosidase (also called glucan 1,6-alpha-glucosidase), and related proteins	NA|676aa|down_5|NZ_CP021983.1_2306241_2308269_+	pfam06202, GDE_C, Amylo-alpha-1,6-glucosidase	NA|288aa|down_6|NZ_CP021983.1_2308274_2309138_-	cd01637, IMPase_like, Inositol-monophosphatase-like domains	NA|229aa|down_7|NZ_CP021983.1_2309246_2309933_-	PRK05653, fabG, 3-oxoacyl-ACP reductase FabG	NA|1792aa|down_8|NZ_CP021983.1_2310020_2315396_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|1216aa|down_9|NZ_CP021983.1_2315792_2319440_-	TIGR02082, Methionine_synthase, 5-methyltetrahydrofolate--homocysteine methyltransferase
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	17	2567342-2567427	18	CRISPRCasFinder	no	csa3	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Type I-A	TTTTTGCCGGTCCTTCTCTATAGT	24	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|300aa|up_7|NZ_CP021983.1_2554875_2555775_+,NA|121aa|down_4|NZ_CP021983.1_2572673_2573036_-,NA|89aa|down_7|NZ_CP021983.1_2574557_2574824_-,NA|105aa|down_8|NZ_CP021983.1_2574964_2575279_-	NA|318aa|up_9|NZ_CP021983.1_2552742_2553696_+	cd06433, GT_2_WfgS_like, WfgS and WfeV are involved in O-antigen biosynthesis	NA|398aa|up_8|NZ_CP021983.1_2553673_2554867_+	cd05844, GT4-like, glycosyltransferase family 4 proteins	NA|300aa|up_7|NZ_CP021983.1_2554875_2555775_+	NA	NA|296aa|up_6|NZ_CP021983.1_2555785_2556673_+	pfam13489, Methyltransf_23, Methyltransferase domain	NA|394aa|up_5|NZ_CP021983.1_2556676_2557858_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|266aa|up_4|NZ_CP021983.1_2557899_2558697_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|259aa|up_3|NZ_CP021983.1_2559351_2560128_+	COG1922, WecG, Teichoic acid biosynthesis proteins [Cell envelope biogenesis, outer membrane]	NA|472aa|up_2|NZ_CP021983.1_2560181_2561597_+	TIGR03025, EPS_sugtrans, exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase	NA|918aa|up_1|NZ_CP021983.1_2561866_2564620_+	TIGR00947, 2A73, putative bicarbonate transporter, IctB family	NA|841aa|up_0|NZ_CP021983.1_2564623_2567146_+	pfam04932, Wzy_C, O-Antigen ligase	NA|369aa|down_0|NZ_CP021983.1_2567824_2568931_-	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|670aa|down_1|NZ_CP021983.1_2569217_2571227_+	pfam13699, DUF4157, Domain of unknown function (DUF4157)	NA|304aa|down_2|NZ_CP021983.1_2571306_2572218_+	pfam08894, DUF1838, Protein of unknown function (DUF1838)	NA|64aa|down_3|NZ_CP021983.1_2572250_2572442_-	pfam11165, DUF2949, Protein of unknown function (DUF2949)	NA|121aa|down_4|NZ_CP021983.1_2572673_2573036_-	NA	NA|153aa|down_5|NZ_CP021983.1_2573394_2573853_-	cd07254, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	csa3|129aa|down_6|NZ_CP021983.1_2574043_2574430_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|89aa|down_7|NZ_CP021983.1_2574557_2574824_-	NA	NA|105aa|down_8|NZ_CP021983.1_2574964_2575279_-	NA	NA|182aa|down_9|NZ_CP021983.1_2575881_2576427_+	pfam13767, DUF4168, Domain of unknown function (DUF4168)
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	18	2822496-2823556	9,19,8	PILER-CR,CRISPRCasFinder,CRT	no	cas3,WYL,cas10d,csc2gr7,csc1gr5,cas6,cas4,cas1,cas2	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Type I-D	CTCGCCAACTTCTAAATCTCGGCAACGAGACTGAAA,CTCGCCAACTTCTAAATCTCGGCAACGAGACTGAAAC,CTCGCCAACTTCTAAATCTCGGCAACGAGACTGAAAC	36,37,37	0	0	NA	NA	NA:NA:NA	13,14,14	14	TypeI-D	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|87aa|up_4|NZ_CP021983.1_2819597_2819858_+,NA|92aa|up_3|NZ_CP021983.1_2819934_2820210_+,NA|235aa|down_3|NZ_CP021983.1_2826810_2827515_+,NA|179aa|down_5|NZ_CP021983.1_2829793_2830330_+,NA|193aa|down_6|NZ_CP021983.1_2830366_2830945_-	cas3|707aa|up_9|NZ_CP021983.1_2811884_2814005_+	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	cas10d|995aa|up_8|NZ_CP021983.1_2814018_2817003_+	cd09712, Cas10d_I-D, CRISPR/Cas system-associated protein Cas10d	csc2gr7|331aa|up_7|NZ_CP021983.1_2817005_2817998_+	pfam18320, Csc2, Csc2 Crispr	csc1gr5|253aa|up_6|NZ_CP021983.1_2818018_2818777_+	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	cas6|282aa|up_5|NZ_CP021983.1_2818715_2819561_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|87aa|up_4|NZ_CP021983.1_2819597_2819858_+	NA	NA|92aa|up_3|NZ_CP021983.1_2819934_2820210_+	NA	cas4|194aa|up_2|NZ_CP021983.1_2820371_2820953_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas1|326aa|up_1|NZ_CP021983.1_2820955_2821933_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|99aa|up_0|NZ_CP021983.1_2821990_2822287_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|256aa|down_0|NZ_CP021983.1_2823666_2824434_-	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|555aa|down_1|NZ_CP021983.1_2824548_2826213_-	PRK05380, pyrG, CTP synthetase; Validated	NA|78aa|down_2|NZ_CP021983.1_2826564_2826798_+	cd19101, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|235aa|down_3|NZ_CP021983.1_2826810_2827515_+	NA	NA|198aa|down_4|NZ_CP021983.1_2829186_2829780_+	pfam09367, CpeS, CpeS-like protein	NA|179aa|down_5|NZ_CP021983.1_2829793_2830330_+	NA	NA|193aa|down_6|NZ_CP021983.1_2830366_2830945_-	NA	NA|190aa|down_7|NZ_CP021983.1_2831357_2831927_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|210aa|down_8|NZ_CP021983.1_2832342_2832972_-	PRK05953, PRK05953, Precorrin-8X methylmutase	NA|112aa|down_9|NZ_CP021983.1_2834606_2834942_-	COG4980, GvpP, Gas vesicle protein [General function prediction only]
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	19	2828011-2828403	10,20,9	PILER-CR,CRISPRCasFinder,CRT	no	WYL,cas3,cas10d,csc2gr7,csc1gr5,cas6,cas4,cas1,cas2	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Type I-D	CTTCCCACTCAGT-GGGAAACTAATTGAATGGAAAC,CTTCCCACTCAGTGGGAAACTAATTGAATGGAAAC,CTTCCCACTCAGTGGGAAACTAATTGAATGGAAAC	36,35,35	0	0	NA	NA	NA:NA:NA	5,5,5	5	TypeI-D	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|87aa|up_8|NZ_CP021983.1_2819597_2819858_+,NA|92aa|up_7|NZ_CP021983.1_2819934_2820210_+,NA|235aa|up_0|NZ_CP021983.1_2826810_2827515_+,NA|179aa|down_1|NZ_CP021983.1_2829793_2830330_+,NA|193aa|down_2|NZ_CP021983.1_2830366_2830945_-,NA|110aa|down_6|NZ_CP021983.1_2835023_2835353_-	cas6|282aa|up_9|NZ_CP021983.1_2818715_2819561_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|87aa|up_8|NZ_CP021983.1_2819597_2819858_+	NA	NA|92aa|up_7|NZ_CP021983.1_2819934_2820210_+	NA	cas4|194aa|up_6|NZ_CP021983.1_2820371_2820953_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas1|326aa|up_5|NZ_CP021983.1_2820955_2821933_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|99aa|up_4|NZ_CP021983.1_2821990_2822287_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|256aa|up_3|NZ_CP021983.1_2823666_2824434_-	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|555aa|up_2|NZ_CP021983.1_2824548_2826213_-	PRK05380, pyrG, CTP synthetase; Validated	NA|78aa|up_1|NZ_CP021983.1_2826564_2826798_+	cd19101, AKR_unchar, uncharacterized aldo-keto reductase (AKR) superfamily protein	NA|235aa|up_0|NZ_CP021983.1_2826810_2827515_+	NA	NA|198aa|down_0|NZ_CP021983.1_2829186_2829780_+	pfam09367, CpeS, CpeS-like protein	NA|179aa|down_1|NZ_CP021983.1_2829793_2830330_+	NA	NA|193aa|down_2|NZ_CP021983.1_2830366_2830945_-	NA	NA|190aa|down_3|NZ_CP021983.1_2831357_2831927_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|210aa|down_4|NZ_CP021983.1_2832342_2832972_-	PRK05953, PRK05953, Precorrin-8X methylmutase	NA|112aa|down_5|NZ_CP021983.1_2834606_2834942_-	COG4980, GvpP, Gas vesicle protein [General function prediction only]	NA|110aa|down_6|NZ_CP021983.1_2835023_2835353_-	NA	NA|734aa|down_7|NZ_CP021983.1_2835539_2837741_+	pfam07602, DUF1565, Protein of unknown function (DUF1565)	NA|112aa|down_8|NZ_CP021983.1_2838401_2838737_+	PRK13612, PRK13612, photosystem II reaction center protein Psb28; Provisional	NA|166aa|down_9|NZ_CP021983.1_2838753_2839251_+	cd00886, MogA_MoaB, MogA_MoaB family
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	20	2878020-2878091	21	CRISPRCasFinder	no	csa3	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Type I-A	GTGATGGGGTGTTGGGGTGATGGG	24	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA,NA|106aa|down_9|NZ_CP021983.1_2887194_2887512_-	NA|74aa|up_9|NZ_CP021983.1_2867707_2867929_-	pfam10047, DUF2281, Protein of unknown function (DUF2281)	NA|124aa|up_8|NZ_CP021983.1_2867992_2868364_-	pfam07845, DUF1636, Protein of unknown function (DUF1636)	NA|100aa|up_7|NZ_CP021983.1_2868369_2868669_-	COG0614, FepB, ABC-type Fe3+-hydroxamate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|178aa|up_6|NZ_CP021983.1_2868814_2869348_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|191aa|up_5|NZ_CP021983.1_2869307_2869880_+	COG5433, COG5433, Transposase [DNA replication, recombination, and repair]	NA|183aa|up_4|NZ_CP021983.1_2869965_2870514_+	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|336aa|up_3|NZ_CP021983.1_2872082_2873090_-	COG4759, COG4759, Uncharacterized protein conserved in bacteria containing thioredoxin-like domain [Posttranslational modification, protein turnover, chaperones]	NA|341aa|up_2|NZ_CP021983.1_2873183_2874206_-	cd01146, FhuD, Fe3+-siderophore binding domain FhuD	NA|876aa|up_1|NZ_CP021983.1_2874262_2876890_-	TIGR01783, Ferrienterobactin_receptor, TonB-dependent siderophore receptor	NA|194aa|up_0|NZ_CP021983.1_2877393_2877975_-	pfam05685, Uma2, Putative restriction endonuclease	NA|320aa|down_0|NZ_CP021983.1_2878235_2879195_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|245aa|down_1|NZ_CP021983.1_2879451_2880186_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|112aa|down_2|NZ_CP021983.1_2880498_2880834_-	pfam05168, HEPN, HEPN domain	NA|107aa|down_3|NZ_CP021983.1_2880826_2881147_-	cd05403, NT_KNTase_like, Nucleotidyltransferase (NT) domain of Staphylococcus aureus kanamycin nucleotidyltransferase, and similar proteins	NA|92aa|down_4|NZ_CP021983.1_2881179_2881455_-	pfam07845, DUF1636, Protein of unknown function (DUF1636)	NA|318aa|down_5|NZ_CP021983.1_2881502_2882456_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|292aa|down_6|NZ_CP021983.1_2882828_2883704_+	cd14656, Imelysin-like_EfeO, EfeO is a component of the EfeUOB operon	NA|215aa|down_7|NZ_CP021983.1_2883891_2884536_-	sd00006, TPR, Tetratricopeptide repeat	NA|613aa|down_8|NZ_CP021983.1_2885147_2886986_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|106aa|down_9|NZ_CP021983.1_2887194_2887512_-	NA
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	21	2964669-2964754	22	CRISPRCasFinder	no		PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Orphan	CCCCATCACCCCATCACCCCATCTAC	26	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|61aa|up_1|NZ_CP021983.1_2962866_2963049_+,NA|75aa|down_0|NZ_CP021983.1_2965414_2965639_+,NA|64aa|down_4|NZ_CP021983.1_2968420_2968612_+,NA|122aa|down_5|NZ_CP021983.1_2968604_2968970_-,NA|78aa|down_7|NZ_CP021983.1_2969707_2969941_+	NA|587aa|up_9|NZ_CP021983.1_2951642_2953403_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|484aa|up_8|NZ_CP021983.1_2953875_2955327_+	COG1596, Wza, Periplasmic protein involved in polysaccharide export, contains    SLBB domain of b-grasp fold [Cell wall/membrane/envelope biogenesis]	NA|754aa|up_7|NZ_CP021983.1_2955609_2957871_+	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|272aa|up_6|NZ_CP021983.1_2958329_2959145_-	COG0561, Cof, Predicted hydrolases of the HAD superfamily [General function prediction only]	NA|554aa|up_5|NZ_CP021983.1_2959215_2960877_+	pfam08291, Peptidase_M15_3, Peptidase M15	NA|189aa|up_4|NZ_CP021983.1_2960883_2961450_-	pfam14015, DUF4231, Protein of unknown function (DUF4231)	NA|145aa|up_3|NZ_CP021983.1_2961597_2962032_+	COG1525, COG1525, Micrococcal nuclease (thermonuclease) homologs [DNA replication, recombination, and repair]	NA|267aa|up_2|NZ_CP021983.1_2962046_2962847_-	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|61aa|up_1|NZ_CP021983.1_2962866_2963049_+	NA	NA|333aa|up_0|NZ_CP021983.1_2963048_2964047_+	cd16282, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|75aa|down_0|NZ_CP021983.1_2965414_2965639_+	NA	NA|283aa|down_1|NZ_CP021983.1_2965713_2966562_+	cd09993, HDAC_classIV, Histone deacetylase class IV also known as histone deacetylase 11	NA|270aa|down_2|NZ_CP021983.1_2966605_2967415_+	COG1054, COG1054, Predicted sulfurtransferase [General function prediction only]	NA|219aa|down_3|NZ_CP021983.1_2967432_2968089_-	pfam01596, Methyltransf_3, O-methyltransferase	NA|64aa|down_4|NZ_CP021983.1_2968420_2968612_+	NA	NA|122aa|down_5|NZ_CP021983.1_2968604_2968970_-	NA	NA|191aa|down_6|NZ_CP021983.1_2968963_2969536_-	cd08866, SRPBCC_11, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|78aa|down_7|NZ_CP021983.1_2969707_2969941_+	NA	NA|340aa|down_8|NZ_CP021983.1_2969973_2970993_-	pfam03631, Virul_fac_BrkB, Virulence factor BrkB	NA|391aa|down_9|NZ_CP021983.1_2971106_2972279_-	PRK09303, PRK09303, histidine kinase
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	22	3044329-3045219	11,23,10	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Orphan	CTTCCCACTAGGT-GGGAAACTAATTGAATGGAAAC,CTTCCCACTAGGTGGGAAACTAATTGAATGGAAAC,CTTCCCACTAGGTGGGAAACTAATTGAATGGAAAC	36,35,35	0	0	NA	NA	NA:NA:NA	12,12,12	12	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|75aa|up_2|NZ_CP021983.1_3041562_3041787_+,NA|194aa|down_0|NZ_CP021983.1_3045309_3045891_-,NA|83aa|down_7|NZ_CP021983.1_3051123_3051372_-	NA|141aa|up_9|NZ_CP021983.1_3037222_3037645_+	COG3795, COG3795, Uncharacterized protein conserved in bacteria [Function unknown]	NA|82aa|up_8|NZ_CP021983.1_3038378_3038624_+	COG4319, COG4319, Ketosteroid isomerase homolog [Function unknown]	NA|287aa|up_7|NZ_CP021983.1_3038677_3039538_+	COG2602, COG2602, Beta-lactamase class D [Defense mechanisms]	NA|139aa|up_6|NZ_CP021983.1_3039564_3039981_+	COG3791, COG3791, Uncharacterized conserved protein [Function unknown]	NA|137aa|up_5|NZ_CP021983.1_3039999_3040410_+	COG4319, COG4319, Ketosteroid isomerase homolog [Function unknown]	NA|144aa|up_4|NZ_CP021983.1_3040666_3041098_+	cd08355, TioX_like, Micromonospora sp	NA|139aa|up_3|NZ_CP021983.1_3041132_3041549_+	cd06588, PhnB_like, Escherichia coli PhnB and similar proteins	NA|75aa|up_2|NZ_CP021983.1_3041562_3041787_+	NA	NA|119aa|up_1|NZ_CP021983.1_3041788_3042145_+	pfam07617, DUF1579, Protein of unknown function (DUF1579)	NA|213aa|up_0|NZ_CP021983.1_3042754_3043393_+	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|194aa|down_0|NZ_CP021983.1_3045309_3045891_-	NA	NA|145aa|down_1|NZ_CP021983.1_3046033_3046468_+	COG1047, SlpA, FKBP-type peptidyl-prolyl cis-trans isomerases 2 [Posttranslational modification, protein turnover, chaperones]	NA|320aa|down_2|NZ_CP021983.1_3046464_3047424_-	TIGR00950, Uncharacterized_inner_membrane_transporter_YicL, Carboxylate/Amino Acid/Amine Transporter	NA|612aa|down_3|NZ_CP021983.1_3047472_3049308_-	sd00006, TPR, Tetratricopeptide repeat	NA|194aa|down_4|NZ_CP021983.1_3049769_3050351_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|132aa|down_5|NZ_CP021983.1_3050360_3050756_-	TIGR03323, alt_F1F0_F1_gam, alternate F1F0 ATPase, F1 subunit gamma	NA|81aa|down_6|NZ_CP021983.1_3050836_3051079_-	TIGR03323, alt_F1F0_F1_gam, alternate F1F0 ATPase, F1 subunit gamma	NA|83aa|down_7|NZ_CP021983.1_3051123_3051372_-	NA	NA|276aa|down_8|NZ_CP021983.1_3051681_3052509_-	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|256aa|down_9|NZ_CP021983.1_3052753_3053521_+	cd14498, DSP, dual-specificity phosphatase domain
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	23	3276075-3276171	24	CRISPRCasFinder	no		PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Orphan	GTTCCAATCAACAAAAACCCTTC	23	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA,NA	NA|319aa|up_9|NZ_CP021983.1_3259499_3260456_+	cd06581, TM_PBP1_LivM_like, Transmembrane subunit (TM) of Escherichia coli LivM and related proteins	NA|155aa|up_8|NZ_CP021983.1_3260830_3261295_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|408aa|up_7|NZ_CP021983.1_3261435_3262659_-	cd06346, PBP1_ABC_ligand_binding-like, type 1 periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|260aa|up_6|NZ_CP021983.1_3263391_3264171_+	cd03219, ABC_Mj1267_LivG_branched, ATP-binding cassette component of branched chain amino acids transport system	NA|234aa|up_5|NZ_CP021983.1_3264167_3264869_+	cd03224, ABC_TM1139_LivF_branched, ATP-binding cassette domain of branched-chain amino acid transporter	NA|145aa|up_4|NZ_CP021983.1_3267204_3267639_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|797aa|up_3|NZ_CP021983.1_3268259_3270650_-	pfam13111, DUF3962, Protein of unknown function (DUF3962)	NA|883aa|up_2|NZ_CP021983.1_3270652_3273301_-	COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair]	NA|181aa|up_1|NZ_CP021983.1_3273297_3273840_-	pfam18155, pPIWI_RE_Z, pPIWI RE three-gene island domain Z	NA|359aa|up_0|NZ_CP021983.1_3273826_3274903_-	pfam18154, pPIWI_RE_REase, REase associating with pPIWI_RE	NA|115aa|down_0|NZ_CP021983.1_3276187_3276532_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|132aa|down_1|NZ_CP021983.1_3277938_3278334_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|141aa|down_2|NZ_CP021983.1_3278507_3278930_-	COG3464, COG3464, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|192aa|down_3|NZ_CP021983.1_3279666_3280242_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|155aa|down_4|NZ_CP021983.1_3280522_3280987_-	pfam09655, Nitr_red_assoc, Conserved nitrate reductase-associated protein (Nitr_red_assoc)	NA|735aa|down_5|NZ_CP021983.1_3281093_3283298_-	cd02754, MopB_Nitrate-R-NapA-like, Nitrate reductases, NapA (Nitrate-R-NapA), NasA, and NarB catalyze the reduction of nitrate to nitrite	NA|512aa|down_6|NZ_CP021983.1_3283482_3285018_-	TIGR00886, Nitrate/nitrite_transporter_NarK, nitrite extrusion protein (nitrite facilitator)	NA|529aa|down_7|NZ_CP021983.1_3285166_3286753_-	PRK09566, nirA, ferredoxin-nitrite reductase; Reviewed	NA|368aa|down_8|NZ_CP021983.1_3287250_3288354_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|304aa|down_9|NZ_CP021983.1_3288461_3289373_+	COG0583, LysR, Transcriptional regulator [Transcription]
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	24	3278981-3279075	25	CRISPRCasFinder	no		PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Orphan	CAACCCTTCTTAGGGATTGAAAC	23	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|278aa|up_3|NZ_CP021983.1_3275309_3276143_-,NA	NA|234aa|up_9|NZ_CP021983.1_3264167_3264869_+	cd03224, ABC_TM1139_LivF_branched, ATP-binding cassette domain of branched-chain amino acid transporter	NA|145aa|up_8|NZ_CP021983.1_3267204_3267639_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|797aa|up_7|NZ_CP021983.1_3268259_3270650_-	pfam13111, DUF3962, Protein of unknown function (DUF3962)	NA|883aa|up_6|NZ_CP021983.1_3270652_3273301_-	COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair]	NA|181aa|up_5|NZ_CP021983.1_3273297_3273840_-	pfam18155, pPIWI_RE_Z, pPIWI RE three-gene island domain Z	NA|359aa|up_4|NZ_CP021983.1_3273826_3274903_-	pfam18154, pPIWI_RE_REase, REase associating with pPIWI_RE	NA|278aa|up_3|NZ_CP021983.1_3275309_3276143_-	NA	NA|115aa|up_2|NZ_CP021983.1_3276187_3276532_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|132aa|up_1|NZ_CP021983.1_3277938_3278334_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|141aa|up_0|NZ_CP021983.1_3278507_3278930_-	COG3464, COG3464, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|192aa|down_0|NZ_CP021983.1_3279666_3280242_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|155aa|down_1|NZ_CP021983.1_3280522_3280987_-	pfam09655, Nitr_red_assoc, Conserved nitrate reductase-associated protein (Nitr_red_assoc)	NA|735aa|down_2|NZ_CP021983.1_3281093_3283298_-	cd02754, MopB_Nitrate-R-NapA-like, Nitrate reductases, NapA (Nitrate-R-NapA), NasA, and NarB catalyze the reduction of nitrate to nitrite	NA|512aa|down_3|NZ_CP021983.1_3283482_3285018_-	TIGR00886, Nitrate/nitrite_transporter_NarK, nitrite extrusion protein (nitrite facilitator)	NA|529aa|down_4|NZ_CP021983.1_3285166_3286753_-	PRK09566, nirA, ferredoxin-nitrite reductase; Reviewed	NA|368aa|down_5|NZ_CP021983.1_3287250_3288354_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|304aa|down_6|NZ_CP021983.1_3288461_3289373_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|358aa|down_7|NZ_CP021983.1_3289470_3290544_+	PRK07394, PRK07394, hypothetical protein; Provisional	NA|198aa|down_8|NZ_CP021983.1_3290540_3291134_-	PRK02726, PRK02726, molybdenum cofactor guanylyltransferase	NA|216aa|down_9|NZ_CP021983.1_3291300_3291948_+	pfam11866, DUF3386, Protein of unknown function (DUF3386)
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	25	4121034-4121145	26	CRISPRCasFinder	no		PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Orphan	CTGGTATGGCACCTCACCCACTGAGTTGCGCCAGCACT	38	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|162aa|up_4|NZ_CP021983.1_4114169_4114655_-,NA|131aa|up_1|NZ_CP021983.1_4119787_4120180_+,NA|64aa|down_7|NZ_CP021983.1_4132637_4132829_+	NA|243aa|up_9|NZ_CP021983.1_4109579_4110308_-	cd07709, flavodiiron_proteins_MBL-fold, catalytic domain of flavodiiron proteins (FDPs) and related proteins; MBL-fold metallo-hydrolase domain	NA|142aa|up_8|NZ_CP021983.1_4110968_4111394_-	COG3791, COG3791, Uncharacterized conserved protein [Function unknown]	NA|318aa|up_7|NZ_CP021983.1_4111599_4112553_-	pfam00685, Sulfotransfer_1, Sulfotransferase domain	NA|97aa|up_6|NZ_CP021983.1_4112598_4112889_-	pfam07045, DUF1330, Domain of unknown function (DUF1330)	NA|296aa|up_5|NZ_CP021983.1_4113276_4114164_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|162aa|up_4|NZ_CP021983.1_4114169_4114655_-	NA	NA|1306aa|up_3|NZ_CP021983.1_4115037_4118955_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|146aa|up_2|NZ_CP021983.1_4119162_4119600_+	pfam13301, DUF4079, Protein of unknown function (DUF4079)	NA|131aa|up_1|NZ_CP021983.1_4119787_4120180_+	NA	NA|265aa|up_0|NZ_CP021983.1_4120125_4120920_+	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|222aa|down_0|NZ_CP021983.1_4121358_4122024_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|158aa|down_1|NZ_CP021983.1_4122541_4123015_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|179aa|down_2|NZ_CP021983.1_4123335_4123872_+	COG0783, Dps, DNA-binding ferritin-like protein (oxidative damage protectant) [Inorganic ion transport and metabolism]	NA|547aa|down_3|NZ_CP021983.1_4124601_4126242_-	PRK09328, PRK09328, N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase; Provisional	NA|228aa|down_4|NZ_CP021983.1_4127805_4128489_-	cd01948, EAL, EAL domain	NA|662aa|down_5|NZ_CP021983.1_4128858_4130844_-	PRK05218, PRK05218, heat shock protein 90; Provisional	NA|348aa|down_6|NZ_CP021983.1_4131595_4132639_-	cd03822, GT4_mannosyltransferase-like, mannosyltransferases of glycosyltransferase family 4 and similar proteins	NA|64aa|down_7|NZ_CP021983.1_4132637_4132829_+	NA	NA|335aa|down_8|NZ_CP021983.1_4132829_4133834_-	cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold	NA|894aa|down_9|NZ_CP021983.1_4134050_4136732_-	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	26	4429520-4429615	27	CRISPRCasFinder	no		PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Orphan	TGAACCACTAGGGGTGTCTAGTATGGTGG	29	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|78aa|up_9|NZ_CP021983.1_4420947_4421181_-,NA|100aa|up_1|NZ_CP021983.1_4427465_4427765_-,NA|84aa|down_1|NZ_CP021983.1_4430320_4430572_+	NA|78aa|up_9|NZ_CP021983.1_4420947_4421181_-	NA	NA|261aa|up_8|NZ_CP021983.1_4421279_4422062_-	cd01839, SGNH_arylesterase_like, SGNH_hydrolase subfamily, similar to arylesterase (7-aminocephalosporanic acid-deacetylating enzyme) of A	NA|155aa|up_7|NZ_CP021983.1_4422165_4422630_-	COG3631, COG3631, Ketosteroid isomerase-related protein [General function prediction only]	NA|178aa|up_6|NZ_CP021983.1_4422749_4423283_-	COG3631, COG3631, Ketosteroid isomerase-related protein [General function prediction only]	NA|354aa|up_5|NZ_CP021983.1_4423335_4424396_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|288aa|up_4|NZ_CP021983.1_4424462_4425326_-	PRK06180, PRK06180, short chain dehydrogenase; Provisional	NA|352aa|up_3|NZ_CP021983.1_4425353_4426409_-	cd08948, 5beta-POR_like_SDR_a, progesterone 5-beta-reductase-like proteins (5beta-POR), atypical (a) SDRs	NA|339aa|up_2|NZ_CP021983.1_4426413_4427430_-	cd08276, MDR7, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|100aa|up_1|NZ_CP021983.1_4427465_4427765_-	NA	NA|485aa|up_0|NZ_CP021983.1_4427961_4429416_+	COG1167, ARO8, Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs [Transcription / Amino acid transport and metabolism]	NA|111aa|down_0|NZ_CP021983.1_4429959_4430292_-	cd05403, NT_KNTase_like, Nucleotidyltransferase (NT) domain of Staphylococcus aureus kanamycin nucleotidyltransferase, and similar proteins	NA|84aa|down_1|NZ_CP021983.1_4430320_4430572_+	NA	NA|311aa|down_2|NZ_CP021983.1_4431030_4431963_+	COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]	NA|207aa|down_3|NZ_CP021983.1_4432355_4432976_+	COG1280, RhtB, Putative threonine efflux protein [Amino acid transport and metabolism]	NA|368aa|down_4|NZ_CP021983.1_4432997_4434101_-	COG0520, csdA, Selenocysteine lyase/Cysteine desulfurase [Posttranslational modification, protein turnover, chaperones]	NA|484aa|down_5|NZ_CP021983.1_4434601_4436053_+	TIGR03491, TIGR03491, RecB family nuclease, putative, TM0106 family	NA|272aa|down_6|NZ_CP021983.1_4436165_4436981_+	PRK12461, PRK12461, UDP-N-acetylglucosamine acyltransferase; Provisional	NA|147aa|down_7|NZ_CP021983.1_4438028_4438469_+	PLN03088, PLN03088, SGT1,  suppressor of G2 allele of SKP1; Provisional	NA|275aa|down_8|NZ_CP021983.1_4438474_4439299_+	COG1189, COG1189, Predicted rRNA methylase [Translation, ribosomal structure and biogenesis]	NA|437aa|down_9|NZ_CP021983.1_4439896_4441207_+	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	27	4878256-4878366	28	CRISPRCasFinder	no	csa3	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Type I-A	CCCACAGCCGAAGCACCTCAAAGGTGTGACGATTCTCCG	39	0	0	NA	NA	NA	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|178aa|up_1|NZ_CP021983.1_4876590_4877124_-,NA|149aa|down_1|NZ_CP021983.1_4884278_4884725_-,NA|171aa|down_2|NZ_CP021983.1_4884621_4885134_-,NA|112aa|down_3|NZ_CP021983.1_4885269_4885605_+,NA|353aa|down_5|NZ_CP021983.1_4887202_4888261_-,NA|94aa|down_7|NZ_CP021983.1_4889674_4889956_-	NA|349aa|up_9|NZ_CP021983.1_4867010_4868057_+	PRK06256, PRK06256, biotin synthase; Validated	NA|250aa|up_8|NZ_CP021983.1_4868467_4869217_+	cd00144, MPP_PPP_family, phosphoprotein phosphatases of the metallophosphatase superfamily, metallophosphatase domain	NA|152aa|up_7|NZ_CP021983.1_4869289_4869745_-	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|157aa|up_6|NZ_CP021983.1_4869744_4870215_-	COG1225, Bcp, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]	NA|398aa|up_5|NZ_CP021983.1_4870338_4871532_+	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|462aa|up_4|NZ_CP021983.1_4872400_4873786_-	cd06176, MFS_BCD_PucC-like, Bacteriochlorophyll delivery (BCD) family, also called PucC family, of the Major Facilitator Superfamily	NA|373aa|up_3|NZ_CP021983.1_4874085_4875204_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|249aa|up_2|NZ_CP021983.1_4875293_4876040_-	pfam13267, DUF4058, Protein of unknown function (DUF4058)	NA|178aa|up_1|NZ_CP021983.1_4876590_4877124_-	NA	NA|144aa|up_0|NZ_CP021983.1_4877156_4877588_-	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|1314aa|down_0|NZ_CP021983.1_4878500_4882442_-	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|149aa|down_1|NZ_CP021983.1_4884278_4884725_-	NA	NA|171aa|down_2|NZ_CP021983.1_4884621_4885134_-	NA	NA|112aa|down_3|NZ_CP021983.1_4885269_4885605_+	NA	NA|488aa|down_4|NZ_CP021983.1_4885749_4887213_-	cd04659, Piwi_piwi-like_ProArk, Piwi_piwi-like_ProArk: PIWI domain, Piwi-like subfamily found in Archaea and Bacteria	NA|353aa|down_5|NZ_CP021983.1_4887202_4888261_-	NA	NA|88aa|down_6|NZ_CP021983.1_4888293_4888557_-	pfam14280, DUF4365, Domain of unknown function (DUF4365)	NA|94aa|down_7|NZ_CP021983.1_4889674_4889956_-	NA	NA|192aa|down_8|NZ_CP021983.1_4891823_4892399_-	sd00033, LRR_RI, leucine-rich repeats, ribonuclease inhibitor (RI)-like subfamily	csa3|120aa|down_9|NZ_CP021983.1_4892441_4892801_-	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor
GCF_002075285.2_ASM207528v2	NZ_CP021983	Halomicronema hongdechloris C2206 genome	28	4961134-4961222	29	CRISPRCasFinder	no		PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	Orphan	CTGGATTCCCGCCGGCGCGGGAATGAC	27	0	0	NA	NA	I-E	1	1	Orphan	PD-DExK,cas2,cas1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,RT,DinG,DEDDh,WYL,csa3,cas6,cas3,cas8b3,cas7,csx1,cas10d,csc2gr7,csc1gr5,cas4	NA|158aa|up_4|NZ_CP021983.1_4950900_4951374_+,NA|73aa|down_4|NZ_CP021983.1_4967640_4967859_+,NA|89aa|down_9|NZ_CP021983.1_4971728_4971995_+	NA|255aa|up_9|NZ_CP021983.1_4945216_4945981_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|345aa|up_8|NZ_CP021983.1_4946018_4947053_+	sd00006, TPR, Tetratricopeptide repeat	NA|191aa|up_7|NZ_CP021983.1_4947093_4947666_-	pfam11371, DUF3172, Protein of unknown function (DUF3172)	NA|313aa|up_6|NZ_CP021983.1_4948053_4948992_+	pfam12974, Phosphonate-bd, ABC transporter, phosphonate, periplasmic substrate-binding protein	NA|613aa|up_5|NZ_CP021983.1_4948864_4950703_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|158aa|up_4|NZ_CP021983.1_4950900_4951374_+	NA	NA|202aa|up_3|NZ_CP021983.1_4951881_4952487_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|1607aa|up_2|NZ_CP021983.1_4953656_4958477_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|141aa|up_1|NZ_CP021983.1_4958480_4958903_+	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|697aa|up_0|NZ_CP021983.1_4958905_4960996_+	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|365aa|down_0|NZ_CP021983.1_4961887_4962982_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|778aa|down_1|NZ_CP021983.1_4963088_4965422_-	pfam02026, RyR, RyR domain	NA|389aa|down_2|NZ_CP021983.1_4965632_4966799_+	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|188aa|down_3|NZ_CP021983.1_4966802_4967366_-	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|73aa|down_4|NZ_CP021983.1_4967640_4967859_+	NA	NA|300aa|down_5|NZ_CP021983.1_4967887_4968787_+	PRK12928, PRK12928, lipoyl synthase; Provisional	NA|372aa|down_6|NZ_CP021983.1_4968887_4970003_+	cd13682, PBP2_TRAP_alpha-ketoacid, Substrate-binding component of an alpha-keto acid binding Tripartite ATP-independent Periplasmic transporter and related proteins; contains the type 2 periplasmic-binding protein fold	NA|251aa|down_7|NZ_CP021983.1_4970079_4970832_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|222aa|down_8|NZ_CP021983.1_4970984_4971650_-	pfam05685, Uma2, Putative restriction endonuclease	NA|89aa|down_9|NZ_CP021983.1_4971728_4971995_+	NA
