assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	1	184209-184295	1	CRISPRCasFinder	no		DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Orphan	ACCACGAACGGCTTCAGGCTCCA	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA|111aa|up_6|NC_018012.1_179619_179952_-,NA|50aa|up_1|NC_018012.1_183706_183856_-,NA|80aa|up_0|NC_018012.1_183874_184114_-,NA|153aa|down_1|NC_018012.1_186825_187284_-,NA|205aa|down_5|NC_018012.1_191341_191956_-,NA|620aa|down_6|NC_018012.1_191952_193812_-,NA|181aa|down_8|NC_018012.1_195775_196318_-	NA|448aa|up_9|NC_018012.1_176410_177754_-	COG3266, DamX, Uncharacterized protein conserved in bacteria [Function unknown]	NA|163aa|up_8|NC_018012.1_178341_178830_-	PRK09267, PRK09267, flavodoxin FldA; Validated	NA|97aa|up_7|NC_018012.1_179151_179442_+	TIGR02607, Virulence-associated_protein_I, addiction module antidote protein, HigA family	NA|111aa|up_6|NC_018012.1_179619_179952_-	NA	NA|329aa|up_5|NC_018012.1_180195_181182_-	pfam07433, DUF1513, Protein of unknown function (DUF1513)	NA|149aa|up_4|NC_018012.1_181183_181630_-	cd14659, Imelysin-like_IPPA, Imelysin-like protein	NA|190aa|up_3|NC_018012.1_181708_182278_-	cd14659, Imelysin-like_IPPA, Imelysin-like protein	NA|365aa|up_2|NC_018012.1_182598_183693_-	cd14659, Imelysin-like_IPPA, Imelysin-like protein	NA|50aa|up_1|NC_018012.1_183706_183856_-	NA	NA|80aa|up_0|NC_018012.1_183874_184114_-	NA	NA|727aa|down_0|NC_018012.1_184330_186511_-	TIGR02517, Putative_type_II_secretion_system_protein_D, type II secretion system protein D	NA|153aa|down_1|NC_018012.1_186825_187284_-	NA	NA|406aa|down_2|NC_018012.1_187304_188522_-	COG1459, PulF, Type II secretory pathway, component PulF [Cell motility and secretion / Intracellular trafficking and secretion]	NA|542aa|down_3|NC_018012.1_188523_190149_-	COG2804, PulE, Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB [Cell motility and secretion / Intracellular trafficking and secretion]	NA|395aa|down_4|NC_018012.1_190148_191333_-	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|205aa|down_5|NC_018012.1_191341_191956_-	NA	NA|620aa|down_6|NC_018012.1_191952_193812_-	NA	NA|498aa|down_7|NC_018012.1_194228_195722_-	COG2165, PulG, Type II secretory pathway, pseudopilin PulG [Cell motility and secretion / Intracellular trafficking and secretion]	NA|181aa|down_8|NC_018012.1_195775_196318_-	NA	NA|312aa|down_9|NC_018012.1_196850_197786_-	COG2165, PulG, Type II secretory pathway, pseudopilin PulG [Cell motility and secretion / Intracellular trafficking and secretion]
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	2	480145-482861	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Type I-E	GTGTTCCCCGCGCTCGCGGGGATGAACCG,GTGTTCCCCGCGCTCGCGGGGATGAACCG,GTGTTCCCCGCGCTCGCGGGGATGAACCG	29,29,29	0	0	NA	NA	I-E:I-E:I-E	10,44,44	44	TypeI-E	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA|76aa|up_3|NC_018012.1_478163_478391_+,NA	cas3|918aa|up_9|NC_018012.1_469346_472100_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|518aa|up_8|NC_018012.1_472083_473637_+	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cse2gr11|186aa|up_7|NC_018012.1_473651_474209_+	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas7|366aa|up_6|NC_018012.1_474235_475333_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|269aa|up_5|NC_018012.1_475329_476136_+	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|246aa|up_4|NC_018012.1_476162_476900_+	smart01101, CRISPR_assoc, This domain forms an anti-parallel beta strand structure with flanking alpha helical regions	NA|76aa|up_3|NC_018012.1_478163_478391_+	NA	NA|146aa|up_2|NC_018012.1_478380_478818_+	cd18692, PIN_VapC-like, uncharacterized subfamily of the VapC-like nuclease family of the PIN domain superfamily	cas1|306aa|up_1|NC_018012.1_478853_479771_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|100aa|up_0|NC_018012.1_479751_480051_+	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	NA|279aa|down_0|NC_018012.1_482880_483717_-	PRK00166, apaH, symmetrical bis(5'-nucleosyl)-tetraphosphatase	NA|125aa|down_1|NC_018012.1_483922_484297_-	PRK05461, apaG, CO2+/MG2+ efflux protein ApaG; Reviewed	NA|328aa|down_2|NC_018012.1_484696_485680_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|115aa|down_3|NC_018012.1_485742_486087_-	pfam07238, PilZ, PilZ domain	NA|512aa|down_4|NC_018012.1_486173_487709_-	cd13123, MATE_MurJ_like, MurJ/MviN, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|352aa|down_5|NC_018012.1_487852_488908_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|88aa|down_6|NC_018012.1_489650_489914_+	PRK00239, rpsT, 30S ribosomal protein S20; Reviewed	NA|248aa|down_7|NC_018012.1_489954_490698_+	COG1125, OpuBA, ABC-type proline/glycine betaine transport systems, ATPase components [Amino acid transport and metabolism]	NA|484aa|down_8|NC_018012.1_490742_492194_+	cd13607, PBP2_AfProX_like, Substrate-binding protein ProX of ABC-type osmoregulatory transporter from Archaeoglobus fulgidus and its related proteins; the type 2 periplasmic-binding protein fold	NA|273aa|down_9|NC_018012.1_492318_493137_-	pfam13612, DDE_Tnp_1_3, Transposase DDE domain
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	3	904322-904435	3	CRISPRCasFinder	no		DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Orphan	GTACTTTCTCCGTAAACGACGTAGTCTCCATTGCTACCC	39	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA|67aa|up_9|NC_018012.1_889117_889318_-,NA|60aa|down_0|NC_018012.1_904722_904902_-,NA|94aa|down_6|NC_018012.1_910783_911065_-,NA|251aa|down_9|NC_018012.1_913134_913887_-	NA|67aa|up_9|NC_018012.1_889117_889318_-	NA	NA|75aa|up_8|NC_018012.1_889358_889583_-	pfam10047, DUF2281, Protein of unknown function (DUF2281)	NA|496aa|up_7|NC_018012.1_889679_891167_-	COG0433, COG0433,  HerA helicase [Replication, recombination, and repair]	NA|765aa|up_6|NC_018012.1_893706_896001_-	COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair]	NA|206aa|up_5|NC_018012.1_896039_896657_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|938aa|up_4|NC_018012.1_897071_899885_-	pfam14820, SPRR2, Small proline-rich 2	NA|300aa|up_3|NC_018012.1_899881_900781_-	pfam13182, DUF4007, Protein of unknown function (DUF4007)	NA|87aa|up_2|NC_018012.1_901238_901499_-	COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]	NA|144aa|up_1|NC_018012.1_902797_903229_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|325aa|up_0|NC_018012.1_903311_904286_-	COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]	NA|60aa|down_0|NC_018012.1_904722_904902_-	NA	NA|720aa|down_1|NC_018012.1_904921_907081_-	pfam01471, PG_binding_1, Putative peptidoglycan binding domain	NA|144aa|down_2|NC_018012.1_907339_907771_-	PRK10996, PRK10996, thioredoxin 2; Provisional	NA|288aa|down_3|NC_018012.1_907906_908770_+	COG3118, COG3118, Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones]	NA|146aa|down_4|NC_018012.1_908893_909331_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|476aa|down_5|NC_018012.1_909284_910712_-	PRK10660, tilS, tRNA(Ile)-lysidine synthetase; Provisional	NA|94aa|down_6|NC_018012.1_910783_911065_-	NA	NA|320aa|down_7|NC_018012.1_911116_912076_-	PRK05724, PRK05724, acetyl-CoA carboxylase carboxyltransferase subunit alpha; Validated	NA|286aa|down_8|NC_018012.1_912143_913001_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|251aa|down_9|NC_018012.1_913134_913887_-	NA
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	4	1161625-1162811	4,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Type I-E	GGGCCTATCCCCGCGAGCGCGGGGGAACC,GGGCCTATCCCCGCGAGCGCGGGGGAACC,GGCCTATCCCCGCGAGCGCGGGG--GAACC	29,29,30	0	0	NA	NA	NA:NA:NA	19,19,14	19	TypeI-E	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA|149aa|up_1|NC_018012.1_1160315_1160762_+,NA|55aa|down_6|NC_018012.1_1167453_1167618_-	NA|445aa|up_9|NC_018012.1_1149162_1150497_-	pfam04403, PqiA, Paraquat-inducible protein A	NA|1337aa|up_8|NC_018012.1_1150516_1154527_-	PRK05673, dnaE, DNA polymerase III subunit alpha; Validated	NA|174aa|up_7|NC_018012.1_1154629_1155151_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|457aa|up_6|NC_018012.1_1155325_1156696_+	pfam03631, Virul_fac_BrkB, Virulence factor BrkB	NA|183aa|up_5|NC_018012.1_1156685_1157234_-	pfam03843, Slp, Outer membrane lipoprotein Slp family	NA|176aa|up_4|NC_018012.1_1157408_1157936_-	pfam03843, Slp, Outer membrane lipoprotein Slp family	NA|66aa|up_3|NC_018012.1_1158008_1158206_-	COG2835, COG2835, Uncharacterized conserved protein [Function unknown]	NA|526aa|up_2|NC_018012.1_1158541_1160119_+	COG2267, PldB, Lysophospholipase [Lipid metabolism]	NA|149aa|up_1|NC_018012.1_1160315_1160762_+	NA	NA|161aa|up_0|NC_018012.1_1160817_1161300_+	cd03769, SR_IS607_transposase_like, Serine Recombinase (SR) family, IS607-like transposase subfamily, catalytic domain; members contain a DNA binding domain with homology to MerR/SoxR located N-terminal to the catalytic domain	cas2|99aa|down_0|NC_018012.1_1162877_1163174_-	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	cas1|297aa|down_1|NC_018012.1_1163145_1164036_-	cd09719, Cas1_I-E, CRISPR/Cas system-associated protein Cas1	cas6e|239aa|down_2|NC_018012.1_1164029_1164746_-	pfam08798, CRISPR_assoc, CRISPR associated protein	cas5|239aa|down_3|NC_018012.1_1164742_1165459_-	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas7|428aa|down_4|NC_018012.1_1165460_1166744_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|192aa|down_5|NC_018012.1_1166781_1167357_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	NA|55aa|down_6|NC_018012.1_1167453_1167618_-	NA	cas8e|560aa|down_7|NC_018012.1_1167722_1169402_-	TIGR02547, CRISPR_system_Cascade_subunit_CasA, CRISPR type I-E/ECOLI-associated protein CasA/Cse1	cas3|878aa|down_8|NC_018012.1_1169417_1172051_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|156aa|down_9|NC_018012.1_1172324_1172792_+	pfam10027, DUF2269, Predicted integral membrane protein (DUF2269)
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	5	1333353-1333539	5	CRISPRCasFinder	no	cas3	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Unclear	GGGTAGCCATCACTGCCATTCAGTTAAGCCGTTCGTGGTGAGGCA	45	0	0	NA	NA	NA	1	1	Unclear	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA,NA|48aa|down_1|NC_018012.1_1333898_1334042_+,NA|82aa|down_2|NC_018012.1_1334114_1334360_+,NA|178aa|down_6|NC_018012.1_1336088_1336622_+,NA|260aa|down_7|NC_018012.1_1336773_1337553_+,NA|88aa|down_8|NC_018012.1_1337674_1337938_+,NA|102aa|down_9|NC_018012.1_1338011_1338317_+	NA|330aa|up_9|NC_018012.1_1317131_1318121_-	COG3437, COG3437, Response regulator containing a CheY-like receiver domain and an HD-GYP domain [Transcription / Signal transduction mechanisms]	NA|516aa|up_8|NC_018012.1_1318267_1319815_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|349aa|up_7|NC_018012.1_1319885_1320932_-	PRK00742, PRK00742, chemotaxis-specific protein-glutamate methyltransferase CheB	NA|396aa|up_6|NC_018012.1_1321003_1322191_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|204aa|up_5|NC_018012.1_1322249_1322861_-	PRK13487, PRK13487, chemoreceptor glutamine deamidase CheD; Provisional	NA|284aa|up_4|NC_018012.1_1322862_1323714_-	PRK10611, PRK10611, protein-glutamate O-methyltransferase CheR	NA|745aa|up_3|NC_018012.1_1323694_1325929_-	PRK15048, PRK15048, methyl-accepting chemotaxis protein II; Provisional	NA|168aa|up_2|NC_018012.1_1326020_1326524_-	PRK10612, PRK10612, chemotaxis protein CheW	NA|760aa|up_1|NC_018012.1_1326552_1328832_-	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|122aa|up_0|NC_018012.1_1328866_1329232_-	cd17562, REC_CheY4-like, phosphoacceptor receiver (REC) domain of chemotaxis response regulator CheY4 and similar CheY family proteins	NA|71aa|down_0|NC_018012.1_1333689_1333902_+	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|48aa|down_1|NC_018012.1_1333898_1334042_+	NA	NA|82aa|down_2|NC_018012.1_1334114_1334360_+	NA	NA|63aa|down_3|NC_018012.1_1334359_1334548_+	cd05267, SDR_a6, atypical (a) SDRs, subgroup 6	NA|242aa|down_4|NC_018012.1_1334650_1335376_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|91aa|down_5|NC_018012.1_1335480_1335753_+	pfam04365, BrnT_toxin, Ribonuclease toxin, BrnT, of type II toxin-antitoxin system	NA|178aa|down_6|NC_018012.1_1336088_1336622_+	NA	NA|260aa|down_7|NC_018012.1_1336773_1337553_+	NA	NA|88aa|down_8|NC_018012.1_1337674_1337938_+	NA	NA|102aa|down_9|NC_018012.1_1338011_1338317_+	NA
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	6	1764703-1764792	6	CRISPRCasFinder	no		DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Orphan	GCGCCGAACCGGGCGCGCGCGGGTGCTGC	29	1	1	1764732-1764763	NC_018012.1_752340-752371	NA	1	1	Orphan	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA|59aa|up_9|NC_018012.1_1750433_1750610_-,NA|86aa|up_8|NC_018012.1_1750619_1750877_-,NA|146aa|up_0|NC_018012.1_1759761_1760199_-,NA|46aa|down_0|NC_018012.1_1766306_1766444_+,NA|128aa|down_4|NC_018012.1_1771686_1772070_-,NA|76aa|down_7|NC_018012.1_1774955_1775183_-	NA|59aa|up_9|NC_018012.1_1750433_1750610_-	NA	NA|86aa|up_8|NC_018012.1_1750619_1750877_-	NA	NA|370aa|up_7|NC_018012.1_1751135_1752245_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|138aa|up_6|NC_018012.1_1752312_1752726_-	pfam13700, DUF4158, Domain of unknown function (DUF4158)	NA|198aa|up_5|NC_018012.1_1752866_1753460_+	smart00857, Resolvase, Resolvase, N terminal domain	NA|824aa|up_4|NC_018012.1_1753770_1756242_+	pfam12228, DUF3604, Protein of unknown function (DUF3604)	NA|370aa|up_3|NC_018012.1_1756154_1757264_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|307aa|up_2|NC_018012.1_1757461_1758382_+	pfam13145, Rotamase_2, PPIC-type PPIASE domain	NA|332aa|up_1|NC_018012.1_1758378_1759374_+	pfam13795, HupE_UreJ_2, HupE / UreJ protein	NA|146aa|up_0|NC_018012.1_1759761_1760199_-	NA	NA|46aa|down_0|NC_018012.1_1766306_1766444_+	NA	NA|374aa|down_1|NC_018012.1_1766744_1767866_-	PRK00035, hemH, ferrochelatase; Reviewed	NA|188aa|down_2|NC_018012.1_1767862_1768426_-	COG3945, COG3945, Uncharacterized conserved protein [Function unknown]	NA|569aa|down_3|NC_018012.1_1769967_1771674_-	pfam02254, TrkA_N, TrkA-N domain	NA|128aa|down_4|NC_018012.1_1771686_1772070_-	NA	NA|110aa|down_5|NC_018012.1_1772230_1772560_+	smart00899, FeoA, This entry represents the core domain of the ferrous iron (Fe2+) transport protein FeoA found in bacteria	NA|776aa|down_6|NC_018012.1_1772573_1774901_+	PRK09554, feoB, Fe(2+) transporter permease subunit FeoB	NA|76aa|down_7|NC_018012.1_1774955_1775183_-	NA	NA|65aa|down_8|NC_018012.1_1775857_1776052_-	cd01142, TroA_e, Periplasmic binding protein TroA_e	NA|110aa|down_9|NC_018012.1_1776048_1776378_-	cd01142, TroA_e, Periplasmic binding protein TroA_e
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	7	1782233-1782359	7	CRISPRCasFinder	no	RT	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Unclear	CGCGGAGGAGGAGTGCAGAAATTCCTCCTAGAGCAGC	37	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA|76aa|up_9|NC_018012.1_1774955_1775183_-,NA|67aa|up_1|NC_018012.1_1780153_1780354_-,NA|149aa|down_4|NC_018012.1_1786446_1786893_+,NA|404aa|down_5|NC_018012.1_1787787_1788999_+,NA|71aa|down_7|NC_018012.1_1792003_1792216_+	NA|76aa|up_9|NC_018012.1_1774955_1775183_-	NA	NA|65aa|up_8|NC_018012.1_1775857_1776052_-	cd01142, TroA_e, Periplasmic binding protein TroA_e	NA|110aa|up_7|NC_018012.1_1776048_1776378_-	cd01142, TroA_e, Periplasmic binding protein TroA_e	NA|249aa|up_6|NC_018012.1_1776831_1777578_-	COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]	NA|151aa|up_5|NC_018012.1_1777873_1778326_+	cd04784, HTH_CadR-PbrR, Helix-Turn-Helix DNA binding domain of the CadR and PbrR transcription regulators	NA|244aa|up_4|NC_018012.1_1778491_1779223_+	TIGR01172, Serine_acetyltransferase, serine O-acetyltransferase	NA|90aa|up_3|NC_018012.1_1779404_1779674_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|93aa|up_2|NC_018012.1_1779670_1779949_+	COG3041, COG3041, Uncharacterized protein conserved in bacteria [Function unknown]	NA|67aa|up_1|NC_018012.1_1780153_1780354_-	NA	NA|395aa|up_0|NC_018012.1_1780337_1781522_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|87aa|down_0|NC_018012.1_1783297_1783558_-	smart00966, SpoVT_AbrB, SpoVT / AbrB like domain	NA|240aa|down_1|NC_018012.1_1783706_1784426_-	pfam09582, AnfO_nitrog, Iron only nitrogenase protein AnfO (AnfO_nitrog)	NA|169aa|down_2|NC_018012.1_1784463_1784970_-	COG3467, COG3467, Predicted flavin-nucleotide-binding protein [General function prediction only]	NA|156aa|down_3|NC_018012.1_1785897_1786365_-	COG1846, MarR, Transcriptional regulators [Transcription]	NA|149aa|down_4|NC_018012.1_1786446_1786893_+	NA	NA|404aa|down_5|NC_018012.1_1787787_1788999_+	NA	NA|929aa|down_6|NC_018012.1_1789027_1791814_+	sd00006, TPR, Tetratricopeptide repeat	NA|71aa|down_7|NC_018012.1_1792003_1792216_+	NA	NA|455aa|down_8|NC_018012.1_1792722_1794087_+	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|291aa|down_9|NC_018012.1_1794083_1794956_+	pfam14491, DUF4435, Protein of unknown function (DUF4435)
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	8	1942936-1943045	8	CRISPRCasFinder	no		DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Orphan	CTTAGAGTGCTATCGGAAAACTAGGAGCGTAT	32	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA|106aa|up_9|NC_018012.1_1935110_1935428_+,NA|77aa|up_2|NC_018012.1_1941891_1942122_+,NA|62aa|up_1|NC_018012.1_1942371_1942557_+,NA|151aa|down_0|NC_018012.1_1945093_1945546_+,NA|71aa|down_1|NC_018012.1_1946026_1946239_+	NA|106aa|up_9|NC_018012.1_1935110_1935428_+	NA	NA|352aa|up_8|NC_018012.1_1935499_1936555_+	pfam01992, vATP-synt_AC39, ATP synthase (C/AC39) subunit	NA|632aa|up_7|NC_018012.1_1936614_1938510_+	PRK05771, PRK05771, V-type ATP synthase subunit I; Validated	NA|150aa|up_6|NC_018012.1_1938542_1938992_+	cd18120, ATP-synt_Vo_Ao_c, Membrane-bound Vo/Ao complexes of V/A-type ATP synthases, subunit c	NA|124aa|up_5|NC_018012.1_1939002_1939374_+	pfam01990, ATP-synt_F, ATP synthase (F/14-kDa) subunit	NA|216aa|up_4|NC_018012.1_1939370_1940018_+	PRK03963, PRK03963, V-type ATP synthase subunit E; Provisional	NA|611aa|up_3|NC_018012.1_1940025_1941858_+	PRK04192, PRK04192, V-type ATP synthase subunit A; Provisional	NA|77aa|up_2|NC_018012.1_1941891_1942122_+	NA	NA|62aa|up_1|NC_018012.1_1942371_1942557_+	NA	NA|100aa|up_0|NC_018012.1_1942544_1942844_+	COG3668, ParE, Plasmid stabilization system protein [General function prediction only]	NA|151aa|down_0|NC_018012.1_1945093_1945546_+	NA	NA|71aa|down_1|NC_018012.1_1946026_1946239_+	NA	NA|460aa|down_2|NC_018012.1_1946427_1947807_+	PRK04196, PRK04196, V-type ATP synthase subunit B; Provisional	NA|99aa|down_3|NC_018012.1_1947929_1948226_+	COG0727, COG0727, Predicted Fe-S-cluster oxidoreductase [General function prediction only]	NA|215aa|down_4|NC_018012.1_1948246_1948891_+	pfam01813, ATP-synt_D, ATP synthase subunit D	NA|237aa|down_5|NC_018012.1_1949058_1949769_-	cd01741, GATase1_1, Subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain	NA|93aa|down_6|NC_018012.1_1949806_1950085_-	pfam11455, DUF3018, Protein of unknown function (DUF3018)	NA|284aa|down_7|NC_018012.1_1950089_1950941_-	TIGR02795, Uncharacterized_protein_in_oprL_3'region, tol-pal system protein YbgF	NA|183aa|down_8|NC_018012.1_1950968_1951517_-	TIGR02802, Peptidoglycan-associated_lipoprotein, peptidoglycan-associated lipoprotein	NA|437aa|down_9|NC_018012.1_1951794_1953105_-	PRK04922, tolB, Tol-Pal system beta propeller repeat protein TolB
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	9	2332454-2332547	9	CRISPRCasFinder	no		DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Orphan	CCTACTTATAGGCCAGGTTAGCGTCACTG	29	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA,NA	NA|222aa|up_9|NC_018012.1_2324053_2324719_-	TIGR02135, Uncharacterized_protein, phosphate transport system regulatory protein PhoU	NA|504aa|up_8|NC_018012.1_2325099_2326611_+	TIGR01290, FeMo_cofactor_biosynthesis_protein_NifB, nitrogenase cofactor biosynthesis protein NifB	NA|93aa|up_7|NC_018012.1_2326654_2326933_+	COG1142, HycB, Fe-S-cluster-containing hydrogenase components 2 [Energy production and conversion]	NA|148aa|up_6|NC_018012.1_2326935_2327379_+	cd03033, ArsC_15kD, Arsenate Reductase (ArsC) family, 15kD protein subfamily; composed of proteins of unknown function with similarity to thioredoxin-fold arsenic reductases, ArsC	NA|430aa|up_5|NC_018012.1_2327375_2328665_+	COG0426, FpaA, Uncharacterized flavoproteins [Energy production and conversion]	NA|95aa|up_4|NC_018012.1_2329048_2329333_+	cd00207, fer2, 2Fe-2S iron-sulfur cluster binding domain	NA|117aa|up_3|NC_018012.1_2329346_2329697_+	cd00207, fer2, 2Fe-2S iron-sulfur cluster binding domain	NA|195aa|up_2|NC_018012.1_2329701_2330286_+	pfam04891, NifQ, NifQ	NA|314aa|up_1|NC_018012.1_2330260_2331202_+	TIGR02662, ADP-ribosyl-_glycohydrolase, ADP-ribosyl-[dinitrogen reductase] hydrolase	NA|288aa|up_0|NC_018012.1_2331539_2332403_+	pfam11103, DUF2887, Protein of unknown function (DUF2887)	NA|370aa|down_0|NC_018012.1_2332825_2333935_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|159aa|down_1|NC_018012.1_2335067_2335544_+	TIGR02523, probable_type-4_fimbrial_biogenesis_protein, type IV pilus modification protein PilV	NA|327aa|down_2|NC_018012.1_2335549_2336530_+	COG4966, PilW, Tfp pilus assembly protein PilW [Cell motility and secretion / Intracellular trafficking and secretion]	NA|195aa|down_3|NC_018012.1_2336542_2337127_+	COG4726, PilX, Tfp pilus assembly protein PilX [Cell motility and secretion / Intracellular trafficking and secretion]	NA|1441aa|down_4|NC_018012.1_2337137_2341460_+	COG3419, PilY1, Tfp pilus assembly protein, tip-associated adhesin PilY1 [Cell motility and secretion / Intracellular trafficking and secretion]	NA|134aa|down_5|NC_018012.1_2341463_2341865_+	COG4968, PilE, Tfp pilus assembly protein PilE [Cell motility and secretion / Intracellular trafficking and secretion]	NA|137aa|down_6|NC_018012.1_2343215_2343626_-	TIGR02481, hemeryth_dom, hemerythrin-like metal-binding domain	NA|118aa|down_7|NC_018012.1_2343938_2344292_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|578aa|down_8|NC_018012.1_2344400_2346134_+	COG0028, IlvB, Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] [Amino acid transport and metabolism / Coenzyme metabolism]	NA|351aa|down_9|NC_018012.1_2346193_2347246_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	10	2830403-2830509	10	CRISPRCasFinder	no		DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Orphan	CAGTTCCTCGTTGACCGATTGCA	23	1	1	2830468-2830486	NC_018012.1_3271813-3271831	NA	2	2	Orphan	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA|57aa|up_3|NC_018012.1_2820942_2821113_+,NA|98aa|down_5|NC_018012.1_2839077_2839371_+	NA|78aa|up_9|NC_018012.1_2815321_2815555_+	PRK10540, PRK10540, osmotically-inducible lipoprotein OsmB	NA|575aa|up_8|NC_018012.1_2815633_2817358_+	COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and secretion / Signal transduction mechanisms]	NA|234aa|up_7|NC_018012.1_2817705_2818407_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|245aa|up_6|NC_018012.1_2818792_2819527_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|155aa|up_5|NC_018012.1_2819862_2820327_+	pfam13565, HTH_32, Homeodomain-like domain	NA|129aa|up_4|NC_018012.1_2820413_2820800_+	COG3118, COG3118, Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones]	NA|57aa|up_3|NC_018012.1_2820942_2821113_+	NA	NA|1092aa|up_2|NC_018012.1_2821383_2824659_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|774aa|up_1|NC_018012.1_2824919_2827241_-	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|587aa|up_0|NC_018012.1_2827806_2829567_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|550aa|down_0|NC_018012.1_2833066_2834716_+	COG5305, COG5305, Predicted membrane protein [Function unknown]	NA|879aa|down_1|NC_018012.1_2835037_2837674_+	cd06435, CESA_NdvC_like, NdvC_like  proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase	NA|81aa|down_2|NC_018012.1_2837762_2838005_+	smart00966, SpoVT_AbrB, SpoVT / AbrB like domain	NA|137aa|down_3|NC_018012.1_2837982_2838393_+	cd18683, PIN_VapC-like, Uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|132aa|down_4|NC_018012.1_2838604_2839000_+	COG0797, RlpA, Lipoproteins [Cell envelope biogenesis, outer membrane]	NA|98aa|down_5|NC_018012.1_2839077_2839371_+	NA	NA|341aa|down_6|NC_018012.1_2839385_2840408_-	COG0842, COG0842, ABC-type multidrug transport system, permease component [Defense mechanisms]	NA|294aa|down_7|NC_018012.1_2840410_2841292_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|85aa|down_8|NC_018012.1_2841489_2841744_-	PRK11130, moaD, molybdopterin synthase small subunit; Provisional	NA|419aa|down_9|NC_018012.1_2841771_2843028_-	cd00887, MoeA, MoeA family
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	11	3684693-3684802	11	CRISPRCasFinder	no		DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Orphan	CGATTCGGGAACTGGCCGAACGCGGGCAT	29	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA,NA|149aa|down_4|NC_018012.1_3689152_3689599_+	NA|149aa|up_9|NC_018012.1_3674139_3674586_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|537aa|up_8|NC_018012.1_3674680_3676291_+	PRK13981, PRK13981, NAD synthetase; Provisional	NA|113aa|up_7|NC_018012.1_3676381_3676720_+	COG0347, GlnK, Nitrogen regulatory protein PII [Amino acid transport and metabolism]	NA|276aa|up_6|NC_018012.1_3676716_3677544_-	TIGR03302, OM_YfiO, outer membrane assembly lipoprotein YfiO	NA|327aa|up_5|NC_018012.1_3677658_3678639_+	PRK11180, rluD, 23S rRNA pseudouridine(1911/1915/1917) synthase RluD	NA|266aa|up_4|NC_018012.1_3678619_3679417_+	PRK10723, PRK10723, polyphenol oxidase	NA|174aa|up_3|NC_018012.1_3679445_3679967_+	COG2716, GcvR, Glycine cleavage system regulatory protein [Amino acid transport and metabolism]	NA|168aa|up_2|NC_018012.1_3679970_3680474_+	COG0242, Def, N-formylmethionyl-tRNA deformylase [Translation, ribosomal structure and biogenesis]	NA|700aa|up_1|NC_018012.1_3680602_3682702_+	PRK11186, PRK11186, carboxy terminal-processing peptidase	NA|528aa|up_0|NC_018012.1_3682704_3684288_+	PRK02107, PRK02107, glutamate--cysteine ligase; Provisional	NA|466aa|down_0|NC_018012.1_3685453_3686851_-	PRK05249, PRK05249, Si-specific NAD(P)(+) transhydrogenase	NA|144aa|down_1|NC_018012.1_3686935_3687367_-	pfam08897, DUF1841, Domain of unknown function (DUF1841)	NA|212aa|down_2|NC_018012.1_3687467_3688103_-	PRK10702, PRK10702, endonuclease III; Provisional	NA|260aa|down_3|NC_018012.1_3688223_3689003_+	cd00254, LT-like, lytic transglycosylase(LT)-like domain	NA|149aa|down_4|NC_018012.1_3689152_3689599_+	NA	NA|496aa|down_5|NC_018012.1_3689698_3691186_-	cd03088, ManB, ManB is a bacterial phosphomannomutase (PMM) that catalyzes the conversion of mannose 6-phosphate to mannose-1-phosphate in the second of three steps in the GDP-mannose pathway, in which GDP-D-mannose is synthesized from fructose-6-phosphate	NA|277aa|down_6|NC_018012.1_3691506_3692337_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|220aa|down_7|NC_018012.1_3692582_3693242_-	PRK00454, engB, GTP-binding protein YsxC; Reviewed	NA|162aa|down_8|NC_018012.1_3693250_3693736_-	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|183aa|down_9|NC_018012.1_3693735_3694284_-	PRK01250, PRK01250, inorganic diphosphatase
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	12	3895246-3895334	12	CRISPRCasFinder	no		DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Orphan	GCTCCCTATGGCTACGGCGGCCC	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA|224aa|up_1|NC_018012.1_3890800_3891472_+,NA|260aa|down_0|NC_018012.1_3895834_3896614_+,NA|105aa|down_2|NC_018012.1_3899025_3899340_-,NA|60aa|down_9|NC_018012.1_3906518_3906698_+	NA|364aa|up_9|NC_018012.1_3880751_3881843_+	cd13682, PBP2_TRAP_alpha-ketoacid, Substrate-binding component of an alpha-keto acid binding Tripartite ATP-independent Periplasmic transporter and related proteins; contains the type 2 periplasmic-binding protein fold	NA|484aa|up_8|NC_018012.1_3881874_3883326_-	COG4664, FcbT3, TRAP-type mannitol/chloroaromatic compound transport system, large permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|178aa|up_7|NC_018012.1_3883327_3883861_-	COG4665, FcbT2, TRAP-type mannitol/chloroaromatic compound transport system, small permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|186aa|up_6|NC_018012.1_3883884_3884442_-	COG3038, CybB, Cytochrome B561 [Energy production and conversion]	NA|398aa|up_5|NC_018012.1_3884645_3885839_+	cd12828, TmCorA-like_1, Thermotoga maritima CorA_like subfamily	NA|332aa|up_4|NC_018012.1_3885908_3886904_+	COG1835, COG1835, Predicted acyltransferases [Lipid metabolism]	NA|595aa|up_3|NC_018012.1_3887119_3888904_+	cd01465, vWA_subgroup, VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|501aa|up_2|NC_018012.1_3889092_3890595_+	pfam13531, SBP_bac_11, Bacterial extracellular solute-binding protein	NA|224aa|up_1|NC_018012.1_3890800_3891472_+	NA	NA|525aa|up_0|NC_018012.1_3893002_3894577_+	cd17486, MFS_AmpG_like, AmpG and similar transporters of the Major Facilitator Superfamily	NA|260aa|down_0|NC_018012.1_3895834_3896614_+	NA	NA|733aa|down_1|NC_018012.1_3896694_3898893_+	pfam03030, H_PPase, Inorganic H+ pyrophosphatase	NA|105aa|down_2|NC_018012.1_3899025_3899340_-	NA	NA|116aa|down_3|NC_018012.1_3899669_3900017_+	pfam04930, FUN14, FUN14 family	NA|330aa|down_4|NC_018012.1_3900013_3901003_-	TIGR00452, methyltransferase_putative, tRNA (mo5U34)-methyltransferase	NA|244aa|down_5|NC_018012.1_3901175_3901907_-	TIGR00740, tRNA_cmo5U34-methyltransferase, tRNA (cmo5U34)-methyltransferase	NA|748aa|down_6|NC_018012.1_3902045_3904289_+	PRK05443, PRK05443, polyphosphate kinase; Provisional	NA|114aa|down_7|NC_018012.1_3904438_3904780_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|507aa|down_8|NC_018012.1_3904801_3906322_-	COG0248, GppA, Exopolyphosphatase [Nucleotide transport and metabolism / Inorganic ion transport and metabolism]	NA|60aa|down_9|NC_018012.1_3906518_3906698_+	NA
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	13	4763203-4763358	13	CRISPRCasFinder	no		DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Orphan	CCAGCCGGGAACCGGAGCCGTCCGGGTCCGGGTTGGCCTCGCTCGGGGAACCG	53	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA|186aa|up_9|NC_018012.1_4751857_4752415_+,NA|233aa|up_2|NC_018012.1_4760988_4761687_+,NA|55aa|up_1|NC_018012.1_4761683_4761848_+,NA|197aa|up_0|NC_018012.1_4761834_4762425_+,NA|209aa|down_6|NC_018012.1_4769265_4769892_+,NA|90aa|down_8|NC_018012.1_4770222_4770492_+,NA|89aa|down_9|NC_018012.1_4770484_4770751_+	NA|186aa|up_9|NC_018012.1_4751857_4752415_+	NA	NA|201aa|up_8|NC_018012.1_4752862_4753465_+	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|546aa|up_7|NC_018012.1_4753490_4755128_+	pfam00665, rve, Integrase core domain	NA|322aa|up_6|NC_018012.1_4755133_4756099_+	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|82aa|up_5|NC_018012.1_4756291_4756537_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|142aa|up_4|NC_018012.1_4756816_4757242_+	TIGR01391, DNA_primase, DNA primase, catalytic core	NA|781aa|up_3|NC_018012.1_4758649_4760992_+	pfam06048, DUF927, Domain of unknown function (DUF927)	NA|233aa|up_2|NC_018012.1_4760988_4761687_+	NA	NA|55aa|up_1|NC_018012.1_4761683_4761848_+	NA	NA|197aa|up_0|NC_018012.1_4761834_4762425_+	NA	NA|143aa|down_0|NC_018012.1_4763860_4764289_+	pfam07508, Recombinase, Recombinase	NA|228aa|down_1|NC_018012.1_4764484_4765168_-	COG2932, COG2932, Predicted transcriptional regulator [Transcription]	NA|112aa|down_2|NC_018012.1_4765389_4765725_+	pfam13693, HTH_35, Winged helix-turn-helix DNA-binding	NA|109aa|down_3|NC_018012.1_4765721_4766048_+	pfam02316, HTH_Tnp_Mu_1, Mu DNA-binding domain	NA|770aa|down_4|NC_018012.1_4766044_4768354_+	pfam09299, Mu-transpos_C, Mu transposase, C-terminal	NA|255aa|down_5|NC_018012.1_4768501_4769266_+	pfam13401, AAA_22, AAA domain	NA|209aa|down_6|NC_018012.1_4769265_4769892_+	NA	NA|114aa|down_7|NC_018012.1_4769884_4770226_+	pfam01381, HTH_3, Helix-turn-helix	NA|90aa|down_8|NC_018012.1_4770222_4770492_+	NA	NA|89aa|down_9|NC_018012.1_4770484_4770751_+	NA
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	14	4773487-4773574	14	CRISPRCasFinder	no		DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Orphan	CCCCGCTCGCCCCGGAGGACGCCCCATG	28	0	0	NA	NA	NA	1	1	Orphan	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA|209aa|up_8|NC_018012.1_4769265_4769892_+,NA|90aa|up_6|NC_018012.1_4770222_4770492_+,NA|89aa|up_5|NC_018012.1_4770484_4770751_+,NA|197aa|up_4|NC_018012.1_4770705_4771296_+,NA|205aa|up_3|NC_018012.1_4771292_4771907_+,NA|52aa|up_2|NC_018012.1_4771903_4772059_+,NA|123aa|up_1|NC_018012.1_4772051_4772420_+,NA|112aa|down_2|NC_018012.1_4774845_4775181_+,NA|87aa|down_3|NC_018012.1_4775254_4775515_-,NA|75aa|down_4|NC_018012.1_4775530_4775755_+,NA|158aa|down_5|NC_018012.1_4776136_4776610_+,NA|96aa|down_8|NC_018012.1_4778112_4778400_+,NA|84aa|down_9|NC_018012.1_4778390_4778642_+	NA|255aa|up_9|NC_018012.1_4768501_4769266_+	pfam13401, AAA_22, AAA domain	NA|209aa|up_8|NC_018012.1_4769265_4769892_+	NA	NA|114aa|up_7|NC_018012.1_4769884_4770226_+	pfam01381, HTH_3, Helix-turn-helix	NA|90aa|up_6|NC_018012.1_4770222_4770492_+	NA	NA|89aa|up_5|NC_018012.1_4770484_4770751_+	NA	NA|197aa|up_4|NC_018012.1_4770705_4771296_+	NA	NA|205aa|up_3|NC_018012.1_4771292_4771907_+	NA	NA|52aa|up_2|NC_018012.1_4771903_4772059_+	NA	NA|123aa|up_1|NC_018012.1_4772051_4772420_+	NA	NA|222aa|up_0|NC_018012.1_4772476_4773142_+	pfam11363, DUF3164, Protein of unknown function (DUF3164)	NA|141aa|down_0|NC_018012.1_4773957_4774380_+	pfam06252, DUF1018, Protein of unknown function (DUF1018)	NA|152aa|down_1|NC_018012.1_4774376_4774832_+	pfam08765, Mor, Mor transcription activator family	NA|112aa|down_2|NC_018012.1_4774845_4775181_+	NA	NA|87aa|down_3|NC_018012.1_4775254_4775515_-	NA	NA|75aa|down_4|NC_018012.1_4775530_4775755_+	NA	NA|158aa|down_5|NC_018012.1_4776136_4776610_+	NA	NA|176aa|down_6|NC_018012.1_4776624_4777152_+	pfam13511, DUF4124, Domain of unknown function (DUF4124)	NA|217aa|down_7|NC_018012.1_4777465_4778116_+	pfam09669, Phage_pRha, Phage regulatory protein Rha (Phage_pRha)	NA|96aa|down_8|NC_018012.1_4778112_4778400_+	NA	NA|84aa|down_9|NC_018012.1_4778390_4778642_+	NA
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	15	4879408-4879738	15,3,3	CRISPRCasFinder,CRT,PILER-CR	no	csx16,cas4,cas2,cas1,cas6,cmr3gr5,cas10,cmr1gr7,csx1	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Type III-C,Type III-A,Type III-B,Type III-D	GTTTCAATCCTTGTTGTATTGGATTGGGTGCTGCAGC,GTTTCAATCCTTGTTGTATTGGATTGGGTGCTGCAGCC,GTTTCAATCCTTGTTGTATTGGATTGGGTGCTGCAGCC	37,38,38	0	0	NA	NA	I-B:I-B:I-B	4,4,3	4	TypeIII-A,TypeIII-B,TypeIII-D,TypeIII-C	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA|140aa|up_8|NC_018012.1_4869910_4870330_+,NA|275aa|up_7|NC_018012.1_4870353_4871178_-,NA|118aa|down_7|NC_018012.1_4887024_4887378_-	NA|414aa|up_9|NC_018012.1_4868669_4869911_+	PRK00725, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|140aa|up_8|NC_018012.1_4869910_4870330_+	NA	NA|275aa|up_7|NC_018012.1_4870353_4871178_-	NA	NA|485aa|up_6|NC_018012.1_4871298_4872753_-	cd00198, vWFA, Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|254aa|up_5|NC_018012.1_4872814_4873576_+	COG2231, COG2231, Uncharacterized protein related to Endonuclease III [DNA replication, recombination, and repair]	NA|660aa|up_4|NC_018012.1_4873666_4875646_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|566aa|up_3|NC_018012.1_4875799_4877497_+	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|116aa|up_2|NC_018012.1_4878019_4878367_+	pfam11756, YgbA_NO, Nitrous oxide-stimulated promoter	NA|180aa|up_1|NC_018012.1_4878393_4878933_-	pfam09851, SHOCT, Short C-terminal domain	csx16|103aa|up_0|NC_018012.1_4879031_4879340_-	pfam09652, Cas_VVA1548, Putative CRISPR-associated protein (Cas_VVA1548)	NA|395aa|down_0|NC_018012.1_4879750_4880935_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|395aa|down_1|NC_018012.1_4882233_4883418_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	cas4|189aa|down_2|NC_018012.1_4883452_4884019_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas2|102aa|down_3|NC_018012.1_4884015_4884321_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|343aa|down_4|NC_018012.1_4884342_4885371_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|190aa|down_5|NC_018012.1_4885700_4886270_-	cd06260, DUF820, Domain of unknown function (DUF820)	cas6|247aa|down_6|NC_018012.1_4886296_4887037_-	pfam17262, DUF5328, Family of unknown function (DUF5328)	NA|118aa|down_7|NC_018012.1_4887024_4887378_-	NA	NA|370aa|down_8|NC_018012.1_4887499_4888609_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	cmr3gr5|199aa|down_9|NC_018012.1_4888521_4889118_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3
GCF_000227745.2_ASM22774v3	NC_018012	Thiocystis violascens DSM 198, complete sequence	16	4881006-4881922	4,4,16	CRT,PILER-CR,CRISPRCasFinder	no	csx16,cas4,cas2,cas1,cas6,cmr3gr5,cas10,cmr1gr7,csx1	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	Type III-C,Type III-A,Type III-B,Type III-D	GTTTCAATCCTTGTTGTATTGGATTGGGTGCTGCAGC,GTTTCAATCCTTGTTGTATTGGATTGGGTGCTGCAGC,GTTTCAATCCTTGTTGTATTGGATTGGGTGCTGCAGC	37,37,37	0	0	NA	NA	I-B:I-B:I-B	12,11,11	12	TypeIII-A,TypeIII-B,TypeIII-D,TypeIII-C	DEDDh,RT,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,cas6,csa3,DinG,csx16,cas4,cmr3gr5,cas10,cmr1gr7,csx1	NA|140aa|up_9|NC_018012.1_4869910_4870330_+,NA|275aa|up_8|NC_018012.1_4870353_4871178_-,NA|118aa|down_6|NC_018012.1_4887024_4887378_-	NA|140aa|up_9|NC_018012.1_4869910_4870330_+	NA	NA|275aa|up_8|NC_018012.1_4870353_4871178_-	NA	NA|485aa|up_7|NC_018012.1_4871298_4872753_-	cd00198, vWFA, Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|254aa|up_6|NC_018012.1_4872814_4873576_+	COG2231, COG2231, Uncharacterized protein related to Endonuclease III [DNA replication, recombination, and repair]	NA|660aa|up_5|NC_018012.1_4873666_4875646_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|566aa|up_4|NC_018012.1_4875799_4877497_+	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|116aa|up_3|NC_018012.1_4878019_4878367_+	pfam11756, YgbA_NO, Nitrous oxide-stimulated promoter	NA|180aa|up_2|NC_018012.1_4878393_4878933_-	pfam09851, SHOCT, Short C-terminal domain	csx16|103aa|up_1|NC_018012.1_4879031_4879340_-	pfam09652, Cas_VVA1548, Putative CRISPR-associated protein (Cas_VVA1548)	NA|395aa|up_0|NC_018012.1_4879750_4880935_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|395aa|down_0|NC_018012.1_4882233_4883418_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	cas4|189aa|down_1|NC_018012.1_4883452_4884019_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas2|102aa|down_2|NC_018012.1_4884015_4884321_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|343aa|down_3|NC_018012.1_4884342_4885371_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|190aa|down_4|NC_018012.1_4885700_4886270_-	cd06260, DUF820, Domain of unknown function (DUF820)	cas6|247aa|down_5|NC_018012.1_4886296_4887037_-	pfam17262, DUF5328, Family of unknown function (DUF5328)	NA|118aa|down_6|NC_018012.1_4887024_4887378_-	NA	NA|370aa|down_7|NC_018012.1_4887499_4888609_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	cmr3gr5|199aa|down_8|NC_018012.1_4888521_4889118_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cas10|650aa|down_9|NC_018012.1_4889114_4891064_-	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10
