assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	1	1202094-1202351	1	PILER-CR	no		csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Orphan	ACTTTTTCATCCTTGAGGATATTAGCGATGTCAGGGACATACTTGGCT	48	0	0	NA	NA	NA	2	2	Orphan	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA|262aa|up_8|NC_007413.1_1187370_1188156_+,NA|137aa|down_4|NC_007413.1_1208608_1209019_-,NA|51aa|down_5|NC_007413.1_1209417_1209570_-	NA|75aa|up_9|NC_007413.1_1186904_1187129_+	COG1146, COG1146, Ferredoxin [Energy production and conversion]	NA|262aa|up_8|NC_007413.1_1187370_1188156_+	NA	NA|248aa|up_7|NC_007413.1_1188236_1188980_+	COG0410, LivF, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]	NA|386aa|up_6|NC_007413.1_1189056_1190214_+	cd00997, PBP2_GluR0, Bacterial GluR0 ligand-binding domain; the type 2 periplasmic binding protein fold	NA|244aa|up_5|NC_007413.1_1190854_1191586_+	cd03378, beta_CA_cladeC, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|245aa|up_4|NC_007413.1_1191960_1192695_+	cd03378, beta_CA_cladeC, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|738aa|up_3|NC_007413.1_1193090_1195304_+	cd07550, P-type_ATPase_HM, P-type heavy metal-transporting ATPase; uncharacterized subfamily	NA|166aa|up_2|NC_007413.1_1195521_1196019_+	cd08070, MPN_like, Mpr1p, Pad1p N-terminal (MPN) domains with catalytic isopeptidase activity (metal-binding)	NA|391aa|up_1|NC_007413.1_1196082_1197255_+	PRK07411, PRK07411, molybdopterin-synthase adenylyltransferase MoeB	NA|552aa|up_0|NC_007413.1_1197571_1199227_-	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|552aa|down_0|NC_007413.1_1203458_1205114_+	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|190aa|down_1|NC_007413.1_1205239_1205809_+	pfam05685, Uma2, Putative restriction endonuclease	NA|391aa|down_2|NC_007413.1_1206024_1207197_+	cd17313, MFS_SLC45_SUC, Solute carrier family 45 and similar sugar transporters of the Major Facilitator Superfamily of transporters	NA|273aa|down_3|NC_007413.1_1207206_1208025_-	cd05243, SDR_a5, atypical (a) SDRs, subgroup 5	NA|137aa|down_4|NC_007413.1_1208608_1209019_-	NA	NA|51aa|down_5|NC_007413.1_1209417_1209570_-	NA	NA|301aa|down_6|NC_007413.1_1210127_1211030_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|752aa|down_7|NC_007413.1_1211369_1213625_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|152aa|down_8|NC_007413.1_1213707_1214163_+	cd17557, REC_Rcp-like, phosphoacceptor receiver (REC) domain of cyanobacterial phytochrome response regulator Rcp and similar domains	NA|647aa|down_9|NC_007413.1_1214165_1216106_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	2	1234556-1237182	2,1,1	PILER-CR,CRISPRCasFinder,CRT	no		csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Orphan	GTTTTAATTAACAAAAATCCCTATCAGGGATTGAAAC,GTTTTAATTAACAAAAATCCCTATCAGGGATTGAAAC,GTTTTAATTAACAAAAATCCCTATCAGGGATTGAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	35,36,36	36	Orphan	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA|169aa|up_8|NC_007413.1_1223252_1223759_-,NA|65aa|up_7|NC_007413.1_1223842_1224037_+,NA|126aa|up_6|NC_007413.1_1224019_1224397_-,NA	NA|260aa|up_9|NC_007413.1_1222394_1223174_-	TIGR03069, RNA-binding_S4_domain-containing_protein, photosystem II S4 domain protein	NA|169aa|up_8|NC_007413.1_1223252_1223759_-	NA	NA|65aa|up_7|NC_007413.1_1223842_1224037_+	NA	NA|126aa|up_6|NC_007413.1_1224019_1224397_-	NA	NA|732aa|up_5|NC_007413.1_1224464_1226660_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|520aa|up_4|NC_007413.1_1226764_1228324_-	TIGR02655, Circadian_clock_protein_kinase_KaiC, circadian clock protein KaiC	NA|109aa|up_3|NC_007413.1_1228398_1228725_-	PRK09301, PRK09301, circadian clock protein KaiB; Provisional	NA|265aa|up_2|NC_007413.1_1228978_1229772_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|90aa|up_1|NC_007413.1_1229781_1230051_-	pfam07688, KaiA, KaiA C-terminal domain	NA|1122aa|up_0|NC_007413.1_1230922_1234288_+	PRK11091, PRK11091, aerobic respiration control sensor protein ArcB; Provisional	NA|211aa|down_0|NC_007413.1_1237485_1238118_-	COG3932, COG3932, Uncharacterized ABC-type transport system, permease components [General function prediction only]	NA|292aa|down_1|NC_007413.1_1238239_1239115_-	PRK13057, PRK13057, lipid kinase	NA|289aa|down_2|NC_007413.1_1239534_1240401_-	TIGR01184, Nitrate_transport_ATP-binding_protein_NrtC, nitrate transport ATP-binding subunits C and D	NA|668aa|down_3|NC_007413.1_1240535_1242539_-	TIGR01184, Nitrate_transport_ATP-binding_protein_NrtC, nitrate transport ATP-binding subunits C and D	NA|280aa|down_4|NC_007413.1_1242620_1243460_-	TIGR01183, Nitrate_transport_permease_protein_NrtB, nitrate ABC transporter, permease protein	NA|459aa|down_5|NC_007413.1_1243573_1244950_-	pfam13379, NMT1_2, NMT1-like family	NA|486aa|down_6|NC_007413.1_1245529_1246987_+	pfam02696, UPF0061, Uncharacterized ACR, YdiU/UPF0061 family	NA|1821aa|down_7|NC_007413.1_1247779_1253242_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|331aa|down_8|NC_007413.1_1253238_1254231_+	COG3706, PleD, Response regulator containing a CheY-like receiver domain and a GGDEF domain [Signal transduction mechanisms]	NA|336aa|down_9|NC_007413.1_1254274_1255282_-	COG4240, COG4240, Predicted kinase [General function prediction only]
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	3	1423865-1424049	3	PILER-CR	no	c2c5_V-U5	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Type V-U5	AGTTTCAACACCCCTCCCGAAGTGGGGCGGGTTGAAAG	38	0	0	NA	NA	V-U5	2	2	TypeV-U5	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA|157aa|up_7|NC_007413.1_1412056_1412527_-,c2c5_V-U5|643aa|up_0|NC_007413.1_1421471_1423400_+,NA	NA|168aa|up_9|NC_007413.1_1410105_1410609_+	pfam06527, TniQ, TniQ	NA|355aa|up_8|NC_007413.1_1410991_1412056_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|157aa|up_7|NC_007413.1_1412056_1412527_-	NA	NA|112aa|up_6|NC_007413.1_1412572_1412908_-	cd07344, M48_yhfN_like, Peptidase M48 YhfN-like, a novel minigluzincin	NA|975aa|up_5|NC_007413.1_1412991_1415916_-	TIGR00348, R_protein, type I site-specific deoxyribonuclease, HsdR family	NA|688aa|up_4|NC_007413.1_1415937_1418001_-	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|455aa|up_3|NC_007413.1_1418018_1419383_-	cd17278, RMtype1_S_LdeBORF1052P-TRD2-CR2, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Lactobacillus delbrueckii subsp	NA|517aa|up_2|NC_007413.1_1419379_1420930_-	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|151aa|up_1|NC_007413.1_1420949_1421402_-	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	c2c5_V-U5|643aa|up_0|NC_007413.1_1421471_1423400_+	NA	NA|292aa|down_0|NC_007413.1_1424493_1425369_+	PRK05481, PRK05481, lipoyl synthase; Provisional	NA|308aa|down_1|NC_007413.1_1425420_1426344_+	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|417aa|down_2|NC_007413.1_1426866_1428117_+	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|152aa|down_3|NC_007413.1_1428659_1429115_+	pfam01475, FUR, Ferric uptake regulator family	NA|413aa|down_4|NC_007413.1_1429244_1430483_-	COG3409, COG3409, Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]	NA|1289aa|down_5|NC_007413.1_1430501_1434368_-	PRK05989, cobN, cobaltochelatase subunit CobN; Reviewed	NA|677aa|down_6|NC_007413.1_1434452_1436483_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|303aa|down_7|NC_007413.1_1436583_1437492_-	PRK14950, PRK14950, DNA polymerase III subunits gamma and tau; Provisional	NA|301aa|down_8|NC_007413.1_1437624_1438527_+	cd07378, MPP_ACP5, Homo sapiens acid phosphatase 5 and related proteins, metallophosphatase domain	NA|393aa|down_9|NC_007413.1_1438714_1439893_+	PRK03080, PRK03080, phosphoserine transaminase
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	4	1436927-1437019	2	CRISPRCasFinder	no	c2c5_V-U5	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Type V-U5	GCTTCTGGTTCCGATGTAATTTCT	24	0	0	NA	NA	NA	1	1	TypeV-U5	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	c2c5_V-U5|643aa|up_7|NC_007413.1_1421471_1423400_+,NA	NA|517aa|up_9|NC_007413.1_1419379_1420930_-	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|151aa|up_8|NC_007413.1_1420949_1421402_-	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	c2c5_V-U5|643aa|up_7|NC_007413.1_1421471_1423400_+	NA	NA|292aa|up_6|NC_007413.1_1424493_1425369_+	PRK05481, PRK05481, lipoyl synthase; Provisional	NA|308aa|up_5|NC_007413.1_1425420_1426344_+	PRK14619, PRK14619, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Provisional	NA|417aa|up_4|NC_007413.1_1426866_1428117_+	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|152aa|up_3|NC_007413.1_1428659_1429115_+	pfam01475, FUR, Ferric uptake regulator family	NA|413aa|up_2|NC_007413.1_1429244_1430483_-	COG3409, COG3409, Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]	NA|1289aa|up_1|NC_007413.1_1430501_1434368_-	PRK05989, cobN, cobaltochelatase subunit CobN; Reviewed	NA|677aa|up_0|NC_007413.1_1434452_1436483_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|301aa|down_0|NC_007413.1_1437624_1438527_+	cd07378, MPP_ACP5, Homo sapiens acid phosphatase 5 and related proteins, metallophosphatase domain	NA|393aa|down_1|NC_007413.1_1438714_1439893_+	PRK03080, PRK03080, phosphoserine transaminase	NA|451aa|down_2|NC_007413.1_1440516_1441869_+	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|357aa|down_3|NC_007413.1_1442039_1443110_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|334aa|down_4|NC_007413.1_1443152_1444154_+	PRK00856, pyrB, aspartate carbamoyltransferase catalytic subunit	NA|373aa|down_5|NC_007413.1_1444564_1445683_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|85aa|down_6|NC_007413.1_1446096_1446351_+	COG4327, COG4327, Predicted membrane protein [Function unknown]	NA|560aa|down_7|NC_007413.1_1446360_1448040_+	TIGR03648, Na_symport_lg, probable sodium:solute symporter, VC_2705 subfamily	NA|281aa|down_8|NC_007413.1_1448108_1448951_-	sd00006, TPR, Tetratricopeptide repeat	NA|223aa|down_9|NC_007413.1_1449705_1450374_-	PRK00058, PRK00058, peptide-methionine (S)-S-oxide reductase MsrA
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	5	1622725-1622907	4	PILER-CR	no	c2c5_V-U5	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Type V-U5	AGTTTCAACGACCATCCCGGCTAGGGGCGGGTTGAAAGATT	41	0	0	NA	NA	V-U5	2	2	TypeV-U5	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA,NA|139aa|down_8|NC_007413.1_1644520_1644937_+,NA|123aa|down_9|NC_007413.1_1645076_1645445_-	NA|205aa|up_9|NC_007413.1_1611024_1611639_-	pfam13525, YfiO, Outer membrane lipoprotein	NA|910aa|up_8|NC_007413.1_1611652_1614382_-	sd00006, TPR, Tetratricopeptide repeat	NA|149aa|up_7|NC_007413.1_1614381_1614828_-	PRK11198, PRK11198, LysM domain/BON superfamily protein; Provisional	NA|738aa|up_6|NC_007413.1_1614824_1617038_-	cd10170, HSP70_NBD, Nucleotide-binding domain of the HSP70 family	NA|141aa|up_5|NC_007413.1_1617234_1617657_-	COG4270, COG4270, Predicted membrane protein [Function unknown]	NA|164aa|up_4|NC_007413.1_1617873_1618365_-	COG4446, COG4446, Uncharacterized protein conserved in bacteria [Function unknown]	NA|187aa|up_3|NC_007413.1_1618503_1619064_+	pfam06271, RDD, RDD family	NA|65aa|up_2|NC_007413.1_1619108_1619303_+	CHL00104, rpl33, ribosomal protein L33	NA|72aa|up_1|NC_007413.1_1619305_1619521_+	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|687aa|up_0|NC_007413.1_1619735_1621796_+	COG0557, VacB, Exoribonuclease R [Transcription]	c2c5_V-U5|636aa|down_0|NC_007413.1_1623401_1625309_-	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|1083aa|down_1|NC_007413.1_1626203_1629452_+	cd18011, DEXDc_RapA, DEXH-box helicase domain of RapA	NA|1322aa|down_2|NC_007413.1_1629465_1633431_+	NF033451, BREX_2_MTaseX, BREX-2 system adenine-specific DNA-methyltransferase PglX	NA|135aa|down_3|NC_007413.1_1633767_1634172_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|1160aa|down_4|NC_007413.1_1634347_1637827_+	smart00490, HELICc, helicase superfamily c-terminal domain	NA|621aa|down_5|NC_007413.1_1637823_1639686_+	pfam09369, DUF1998, Domain of unknown function (DUF1998)	NA|906aa|down_6|NC_007413.1_1640234_1642952_+	pfam08722, Tn7_Tnp_TnsA_N, TnsA endonuclease N terminal	NA|493aa|down_7|NC_007413.1_1642951_1644430_+	pfam13401, AAA_22, AAA domain	NA|139aa|down_8|NC_007413.1_1644520_1644937_+	NA	NA|123aa|down_9|NC_007413.1_1645076_1645445_-	NA
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	6	2101782-2101868	3	CRISPRCasFinder	no		csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Orphan	GGTGCAGAGGTGCAGGGGAGAGA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA|163aa|up_7|NC_007413.1_2090745_2091234_+,NA|73aa|up_3|NC_007413.1_2095758_2095977_+,NA|167aa|down_1|NC_007413.1_2103683_2104184_+,NA|69aa|down_3|NC_007413.1_2105157_2105364_+	NA|198aa|up_9|NC_007413.1_2088704_2089298_+	COG4636, Uma2, Endonuclease, Uma2 family (restriction endonuclease fold) [General function prediction only]	NA|348aa|up_8|NC_007413.1_2089331_2090375_+	pfam14903, WG_beta_rep, WG containing repeat	NA|163aa|up_7|NC_007413.1_2090745_2091234_+	NA	NA|254aa|up_6|NC_007413.1_2091351_2092113_+	COG1045, CysE, Serine acetyltransferase [Amino acid transport and metabolism]	NA|195aa|up_5|NC_007413.1_2092355_2092940_-	TIGR04026, hypothetical_protein, PPOX class probable FMN-dependent enzyme, alr4036 family	NA|766aa|up_4|NC_007413.1_2093334_2095632_+	TIGR02505, RTPR, ribonucleoside-triphosphate reductase, adenosylcobalamin-dependent	NA|73aa|up_3|NC_007413.1_2095758_2095977_+	NA	NA|857aa|up_2|NC_007413.1_2096217_2098788_+	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|397aa|up_1|NC_007413.1_2098854_2100045_+	cd17485, MFS_MFSD3, Major facilitator superfamily domain containing 3 protein	NA|481aa|up_0|NC_007413.1_2100329_2101772_+	pfam01654, Cyt_bd_oxida_I, Cytochrome bd terminal oxidase subunit I	NA|338aa|down_0|NC_007413.1_2102010_2103024_+	COG1294, AppB, Cytochrome bd-type quinol oxidase, subunit 2 [Energy production and conversion]	NA|167aa|down_1|NC_007413.1_2103683_2104184_+	NA	NA|221aa|down_2|NC_007413.1_2104447_2105110_+	sd00006, TPR, Tetratricopeptide repeat	NA|69aa|down_3|NC_007413.1_2105157_2105364_+	NA	NA|95aa|down_4|NC_007413.1_2105882_2106167_+	cd12399, RRM_HP0827_like, RNA recognition motif in Helicobacter pylori HP0827 protein and similar proteins	NA|350aa|down_5|NC_007413.1_2107030_2108080_+	PRK09293, PRK09293, class 1 fructose-bisphosphatase	NA|382aa|down_6|NC_007413.1_2108286_2109432_+	PRK03343, PRK03343, transaldolase; Validated	NA|510aa|down_7|NC_007413.1_2109574_2111104_+	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|459aa|down_8|NC_007413.1_2111261_2112638_+	COG3429, COG3429, Glucose-6-P dehydrogenase subunit [Carbohydrate transport and metabolism]	NA|354aa|down_9|NC_007413.1_2112927_2113989_+	TIGR02595, conserved_hypothetical_protein, PEP-CTERM protein-sorting domain
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	7	2314524-2314619	4	CRISPRCasFinder	no		csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Orphan	AGGCGATCGCTCACCCTTTTTACTCAAAGTTTC	33	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA,NA|66aa|down_3|NC_007413.1_2316988_2317186_-,NA|456aa|down_9|NC_007413.1_2324130_2325498_-	NA|41aa|up_9|NC_007413.1_2309839_2309962_-	CHL00108, psbJ, photosystem II protein J	NA|41aa|up_8|NC_007413.1_2310031_2310154_-	PRK00753, psbL, photosystem II reaction center L; Provisional	NA|46aa|up_7|NC_007413.1_2310171_2310309_-	PRK02561, psbF, cytochrome b559 subunit beta; Provisional	NA|83aa|up_6|NC_007413.1_2310318_2310567_-	PRK02557, psbE, cytochrome b559 subunit alpha; Provisional	NA|340aa|up_5|NC_007413.1_2310672_2311692_-	PRK13684, PRK13684, photosynthesis system II assembly factor Ycf48	NA|112aa|up_4|NC_007413.1_2311850_2312186_-	COG1773, COG1773, Rubredoxin [Energy production and conversion]	NA|121aa|up_3|NC_007413.1_2312381_2312744_+	CHL00022, ndhC, NADH dehydrogenase subunit 3	NA|246aa|up_2|NC_007413.1_2312734_2313472_+	CHL00023, ndhK, NADH dehydrogenase subunit K	NA|176aa|up_1|NC_007413.1_2313464_2313992_+	PRK12494, PRK12494, NAD(P)H-quinone oxidoreductase subunit J	NA|78aa|up_0|NC_007413.1_2314170_2314404_+	pfam13374, TPR_10, Tetratricopeptide repeat	NA|281aa|down_0|NC_007413.1_2314674_2315517_+	pfam11103, DUF2887, Protein of unknown function (DUF2887)	NA|270aa|down_1|NC_007413.1_2315625_2316435_+	cd05358, GlcDH_SDR_c, glucose 1 dehydrogenase (GlcDH), classical (c) SDRs	NA|177aa|down_2|NC_007413.1_2316461_2316992_+	pfam07176, DUF1400, Alpha/beta hydrolase of unknown function (DUF1400)	NA|66aa|down_3|NC_007413.1_2316988_2317186_-	NA	NA|204aa|down_4|NC_007413.1_2317192_2317804_-	NF033183, colliding_TM, low-complexity tail membrane protein	NA|1039aa|down_5|NC_007413.1_2317995_2321112_-	PRK05306, infB, translation initiation factor IF-2; Validated	NA|90aa|down_6|NC_007413.1_2321604_2321874_-	pfam04296, DUF448, Protein of unknown function (DUF448)	NA|424aa|down_7|NC_007413.1_2322022_2323294_-	PRK12329, nusA, transcription termination factor NusA	NA|154aa|down_8|NC_007413.1_2323458_2323920_-	PRK00092, PRK00092, ribosome maturation protein RimP; Reviewed	NA|456aa|down_9|NC_007413.1_2324130_2325498_-	NA
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	8	2395670-2398703	5,2,5	CRISPRCasFinder,CRT,PILER-CR	no		csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Orphan	GTTTCAATCCCTGATAGGGATTTTAGAGGGTTTTAAC,GTTTCAATCCCTGATAGGGATTTTAGAGGGTTTTAAC,GTTAAAACCCTCTAAAATCCCTATCAGGGATTGAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	42,42,42	42	Orphan	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA|64aa|up_1|NC_007413.1_2393482_2393674_+,NA|528aa|up_0|NC_007413.1_2393871_2395455_-,NA|431aa|down_0|NC_007413.1_2400939_2402232_-,NA|82aa|down_7|NC_007413.1_2408514_2408760_+	NA|306aa|up_9|NC_007413.1_2384704_2385622_+	PRK02649, ppnK, NAD(+) kinase	NA|229aa|up_8|NC_007413.1_2385704_2386391_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|174aa|up_7|NC_007413.1_2386762_2387284_+	COG1430, COG1430, Uncharacterized conserved protein [Function unknown]	NA|76aa|up_6|NC_007413.1_2387787_2388015_+	pfam11165, DUF2949, Protein of unknown function (DUF2949)	NA|83aa|up_5|NC_007413.1_2388267_2388516_-	pfam17275, DUF5340, Family of unknown function (DUF5340)	NA|297aa|up_4|NC_007413.1_2388860_2389751_-	PRK00278, trpC, indole-3-glycerol phosphate synthase TrpC	NA|476aa|up_3|NC_007413.1_2389846_2391274_-	PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed	NA|614aa|up_2|NC_007413.1_2391462_2393304_+	COG4715, COG4715, Uncharacterized conserved protein [Function unknown]	NA|64aa|up_1|NC_007413.1_2393482_2393674_+	NA	NA|528aa|up_0|NC_007413.1_2393871_2395455_-	NA	NA|431aa|down_0|NC_007413.1_2400939_2402232_-	NA	NA|491aa|down_1|NC_007413.1_2402390_2403863_-	COG1167, ARO8, Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs [Transcription / Amino acid transport and metabolism]	NA|224aa|down_2|NC_007413.1_2403971_2404643_+	pfam12900, Pyridox_ox_2, Pyridoxamine 5'-phosphate oxidase	NA|552aa|down_3|NC_007413.1_2404789_2406445_-	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|185aa|down_4|NC_007413.1_2406728_2407283_+	pfam01243, Putative_PNPOx, Pyridoxamine 5'-phosphate oxidase	NA|62aa|down_5|NC_007413.1_2407674_2407860_-	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|74aa|down_6|NC_007413.1_2407859_2408081_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|82aa|down_7|NC_007413.1_2408514_2408760_+	NA	NA|1221aa|down_8|NC_007413.1_2409008_2412671_-	TIGR02025, Magnesium-chelatase_subunit_H, magnesium chelatase, H subunit	NA|605aa|down_9|NC_007413.1_2412948_2414763_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	9	2672847-2672927	6	CRISPRCasFinder	no		csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Orphan	ACCAATGATTTGGGATAATTATCTGCGT	28	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA,NA	NA|408aa|up_9|NC_007413.1_2659157_2660381_+	cd19920, REC_PA4781-like, phosphoacceptor receiver (REC) domain of cyclic di-GMP phosphodiesterase PA4781 and similar domains	NA|444aa|up_8|NC_007413.1_2660862_2662194_+	cd19920, REC_PA4781-like, phosphoacceptor receiver (REC) domain of cyclic di-GMP phosphodiesterase PA4781 and similar domains	NA|384aa|up_7|NC_007413.1_2662439_2663591_+	COG3287, COG3287, Uncharacterized conserved protein [Function unknown]	NA|343aa|up_6|NC_007413.1_2663606_2664635_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|521aa|up_5|NC_007413.1_2664695_2666258_-	PRK02504, PRK02504, NAD(P)H-quinone oxidoreductase subunit N	NA|511aa|up_4|NC_007413.1_2666562_2668095_-	cd02142, McbC_SagB-like_oxidoreductase, oxidase similar to the microcin B17 processing protein McbC	NA|292aa|up_3|NC_007413.1_2668303_2669179_-	pfam14257, DUF4349, Domain of unknown function (DUF4349)	NA|153aa|up_2|NC_007413.1_2669271_2669730_-	pfam12158, DUF3592, Protein of unknown function (DUF3592)	NA|489aa|up_1|NC_007413.1_2669902_2671369_-	COG1982, LdcC, Arginine/lysine/ornithine decarboxylases [Amino acid transport and metabolism]	NA|287aa|up_0|NC_007413.1_2671845_2672706_+	pfam06485, DUF1092, Protein of unknown function (DUF1092)	NA|863aa|down_0|NC_007413.1_2674415_2677004_+	cd01031, EriC, ClC chloride channel EriC	NA|326aa|down_1|NC_007413.1_2677389_2678367_+	pfam05982, Sbt_1, Na+-dependent bicarbonate transporter superfamily	NA|98aa|down_2|NC_007413.1_2678370_2678664_+	COG0347, GlnK, Nitrogen regulatory protein PII [Amino acid transport and metabolism]	NA|225aa|down_3|NC_007413.1_2678939_2679614_+	cd00884, beta_CA_cladeB, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|457aa|down_4|NC_007413.1_2679791_2681162_-	COG4370, COG4370, Uncharacterized protein conserved in bacteria [Function unknown]	NA|134aa|down_5|NC_007413.1_2681427_2681829_-	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]	NA|473aa|down_6|NC_007413.1_2681951_2683370_-	COG3670, COG3670, Lignostilbene-alpha,beta-dioxygenase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|377aa|down_7|NC_007413.1_2683617_2684748_+	cd02801, DUS_like_FMN, Dihydrouridine synthase-like (DUS-like) FMN-binding domain	NA|355aa|down_8|NC_007413.1_2684782_2685847_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|344aa|down_9|NC_007413.1_2686211_2687243_+	cd06321, PBP1_ABC_sugar_binding-like, periplasmic sugar-binding domain of uncharacterized ABC-type transport systems
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	10	2727347-2727599	6,3,7	PILER-CR,CRT,CRISPRCasFinder	no	c2c5_V-U5	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Type V-U5	AAGGTGACAATAGCCCTTCCCGTGTTGAGCGGGTTGAAAGG,GGTGACAATAGCCCTTCCCGTGTTGAGCGGGTTGAAAG,GTGACAATAGCCCTTCCCGTGTTGAGCGGGTTGAAAG	41,38,37	0	0	NA	NA	V-U5:V-U5:V-U5	2,3,3	3	TypeV-U5	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA|86aa|up_3|NC_007413.1_2723713_2723971_-,NA|102aa|up_2|NC_007413.1_2724168_2724474_+,c2c5_V-U5|644aa|up_0|NC_007413.1_2724971_2726903_+,NA|238aa|down_7|NC_007413.1_2738286_2739000_-	NA|551aa|up_9|NC_007413.1_2709588_2711241_+	pfam09299, Mu-transpos_C, Mu transposase, C-terminal	NA|276aa|up_8|NC_007413.1_2711250_2712078_+	pfam05621, TniB, Bacterial TniB protein	NA|174aa|up_7|NC_007413.1_2712079_2712601_+	pfam06527, TniQ, TniQ	NA|1173aa|up_6|NC_007413.1_2712907_2716426_+	cd18011, DEXDc_RapA, DEXH-box helicase domain of RapA	NA|793aa|up_5|NC_007413.1_2716682_2719061_+	COG1743, COG1743, Adenine-specific DNA methylase containing a Zn-ribbon [DNA replication, recombination, and repair]	NA|1110aa|up_4|NC_007413.1_2720097_2723427_+	COG1483, COG1483, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|86aa|up_3|NC_007413.1_2723713_2723971_-	NA	NA|102aa|up_2|NC_007413.1_2724168_2724474_+	NA	NA|84aa|up_1|NC_007413.1_2724598_2724850_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	c2c5_V-U5|644aa|up_0|NC_007413.1_2724971_2726903_+	NA	NA|307aa|down_0|NC_007413.1_2728482_2729403_+	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|202aa|down_1|NC_007413.1_2729549_2730155_+	COG1974, LexA, SOS-response transcriptional repressors (RecA-mediated autopeptidases) [Transcription / Signal transduction mechanisms]	NA|493aa|down_2|NC_007413.1_2730266_2731745_+	TIGR04095, type_III_restriction_protein_res_subunit, DNA phosphorothioation system restriction enzyme	NA|691aa|down_3|NC_007413.1_2732242_2734315_+	TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD	NA|189aa|down_4|NC_007413.1_2734476_2735043_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|154aa|down_5|NC_007413.1_2735769_2736231_+	TIGR04062, hypothetical_protein_CY0110_29519, dnd system-associated protein 4	NA|665aa|down_6|NC_007413.1_2736286_2738281_-	pfam14072, DndB, DNA-sulfur modification-associated	NA|238aa|down_7|NC_007413.1_2738286_2739000_-	NA	NA|533aa|down_8|NC_007413.1_2739047_2740646_-	TIGR03187, hypothetical_protein, DGQHR domain	NA|567aa|down_9|NC_007413.1_2740803_2742504_+	PRK06850, PRK06850, hypothetical protein; Provisional
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	11	4230039-4230111	8	CRISPRCasFinder	no		csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Orphan	TCTACAGATGGTTTATTTGTCGGA	24	1	14	4230063-4230087|4230063-4230087|4230063-4230087|4230063-4230087|4230063-4230087|4230063-4230087|4230063-4230087|4230063-4230087|4230063-4230087|4230063-4230087|4230063-4230087|4230063-4230087|4230063-4230087|4230063-4230087	NC_007413.1_252438-252414|NC_007413.1_779818-779794|NC_007413.1_267349-267373|NC_007413.1_3217845-3217869|NC_007413.1_4752276-4752300|NC_007413.1_1790215-1790191|NC_007413.1_3874514-3874490|NC_007413.1_1910847-1910871|NC_007413.1_2106322-2106346|NC_007413.1_5401661-5401685|NC_007413.1_3199828-3199804|NC_007413.1_6179789-6179765|NC_007413.1_2929157-2929181|NC_007413.1_2959242-2959266	NA	1	1	Orphan	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA,NA|86aa|down_0|NC_007413.1_4230298_4230556_+,NA|73aa|down_2|NC_007413.1_4231308_4231527_+,NA|76aa|down_9|NC_007413.1_4239454_4239682_-	NA|560aa|up_9|NC_007413.1_4220632_4222312_+	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|216aa|up_8|NC_007413.1_4222477_4223125_+	TIGR04155, hypothetical_protein, PEP-CTERM protein sorting domain, cyanobacterial subclass	NA|313aa|up_7|NC_007413.1_4223250_4224189_-	pfam13354, Beta-lactamase2, Beta-lactamase enzyme family	NA|235aa|up_6|NC_007413.1_4224195_4224900_+	pfam00877, NLPC_P60, NlpC/P60 family	NA|338aa|up_5|NC_007413.1_4224914_4225928_+	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|412aa|up_4|NC_007413.1_4226007_4227243_+	cd17489, MFS_YfcJ_like, Escherichia coli YfcJ, YhhS, and similar transporters of the Major Facilitator Superfamily	NA|169aa|up_3|NC_007413.1_4227405_4227912_+	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|66aa|up_2|NC_007413.1_4227913_4228111_-	pfam11387, DUF2795, Protein of unknown function (DUF2795)	NA|396aa|up_1|NC_007413.1_4228409_4229597_+	COG3284, AcoR, Transcriptional activator of acetoin/glycerol metabolism [Secondary metabolites biosynthesis, transport, and catabolism / Transcription]	NA|128aa|up_0|NC_007413.1_4229603_4229987_+	pfam00072, Response_reg, Response regulator receiver domain	NA|86aa|down_0|NC_007413.1_4230298_4230556_+	NA	NA|103aa|down_1|NC_007413.1_4230675_4230984_+	pfam11378, DUF3181, Protein of unknown function (DUF3181)	NA|73aa|down_2|NC_007413.1_4231308_4231527_+	NA	NA|157aa|down_3|NC_007413.1_4231606_4232077_-	pfam14108, DUF4281, Domain of unknown function (DUF4281)	NA|347aa|down_4|NC_007413.1_4232174_4233215_-	TIGR02475, Probable_cobalamine_biosynthesis_protein, cobalamin biosynthesis protein CobW	NA|300aa|down_5|NC_007413.1_4233437_4234337_+	COG1295, Rbn, Ribonuclease BN family enzyme [Replication, recombination, and repair]	NA|173aa|down_6|NC_007413.1_4234390_4234909_+	COG2323, COG2323, Predicted membrane protein [Function unknown]	NA|432aa|down_7|NC_007413.1_4235011_4236307_+	PRK07380, PRK07380, adenylosuccinate lyase; Provisional	NA|426aa|down_8|NC_007413.1_4236710_4237988_-	cd03800, GT4_sucrose_synthase, sucrose-phosphate synthase and similar proteins	NA|76aa|down_9|NC_007413.1_4239454_4239682_-	NA
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	12	4244524-4244631	9	CRISPRCasFinder	no		csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Orphan	ATTGTTTCCATCCCCGTGAGGGGTAAAGGAATTAAAACC	39	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA|76aa|up_3|NC_007413.1_4239454_4239682_-,NA|131aa|up_1|NC_007413.1_4242790_4243183_-,NA|285aa|up_0|NC_007413.1_4243365_4244220_-,NA|143aa|down_1|NC_007413.1_4247526_4247955_+	NA|157aa|up_9|NC_007413.1_4231606_4232077_-	pfam14108, DUF4281, Domain of unknown function (DUF4281)	NA|347aa|up_8|NC_007413.1_4232174_4233215_-	TIGR02475, Probable_cobalamine_biosynthesis_protein, cobalamin biosynthesis protein CobW	NA|300aa|up_7|NC_007413.1_4233437_4234337_+	COG1295, Rbn, Ribonuclease BN family enzyme [Replication, recombination, and repair]	NA|173aa|up_6|NC_007413.1_4234390_4234909_+	COG2323, COG2323, Predicted membrane protein [Function unknown]	NA|432aa|up_5|NC_007413.1_4235011_4236307_+	PRK07380, PRK07380, adenylosuccinate lyase; Provisional	NA|426aa|up_4|NC_007413.1_4236710_4237988_-	cd03800, GT4_sucrose_synthase, sucrose-phosphate synthase and similar proteins	NA|76aa|up_3|NC_007413.1_4239454_4239682_-	NA	NA|784aa|up_2|NC_007413.1_4240073_4242425_+	sd00006, TPR, Tetratricopeptide repeat	NA|131aa|up_1|NC_007413.1_4242790_4243183_-	NA	NA|285aa|up_0|NC_007413.1_4243365_4244220_-	NA	NA|552aa|down_0|NC_007413.1_4245148_4246804_+	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|143aa|down_1|NC_007413.1_4247526_4247955_+	NA	NA|394aa|down_2|NC_007413.1_4248021_4249203_-	pfam01139, RtcB, tRNA-splicing ligase RtcB	NA|300aa|down_3|NC_007413.1_4249651_4250551_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|243aa|down_4|NC_007413.1_4250547_4251276_+	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|275aa|down_5|NC_007413.1_4251794_4252619_+	COG1354, scpA, Rec8/ScpA/Scc1-like protein (kleisin family) [Replication,    recombination, and repair]	NA|390aa|down_6|NC_007413.1_4253006_4254176_+	COG1208, GCD1, Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis]	NA|672aa|down_7|NC_007413.1_4254241_4256257_-	PRK05354, PRK05354, biosynthetic arginine decarboxylase	NA|150aa|down_8|NC_007413.1_4256488_4256938_+	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	NA|241aa|down_9|NC_007413.1_4257032_4257755_-	COG0861, TerC, Membrane protein TerC, possibly involved in tellurium resistance [Inorganic ion transport and metabolism]
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	13	4247274-4247383	10	CRISPRCasFinder	no		csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Orphan	TTTCTCATTTGCTGTAAATGCTTTCT	26	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA|76aa|up_4|NC_007413.1_4239454_4239682_-,NA|131aa|up_2|NC_007413.1_4242790_4243183_-,NA|285aa|up_1|NC_007413.1_4243365_4244220_-,NA|143aa|down_0|NC_007413.1_4247526_4247955_+	NA|347aa|up_9|NC_007413.1_4232174_4233215_-	TIGR02475, Probable_cobalamine_biosynthesis_protein, cobalamin biosynthesis protein CobW	NA|300aa|up_8|NC_007413.1_4233437_4234337_+	COG1295, Rbn, Ribonuclease BN family enzyme [Replication, recombination, and repair]	NA|173aa|up_7|NC_007413.1_4234390_4234909_+	COG2323, COG2323, Predicted membrane protein [Function unknown]	NA|432aa|up_6|NC_007413.1_4235011_4236307_+	PRK07380, PRK07380, adenylosuccinate lyase; Provisional	NA|426aa|up_5|NC_007413.1_4236710_4237988_-	cd03800, GT4_sucrose_synthase, sucrose-phosphate synthase and similar proteins	NA|76aa|up_4|NC_007413.1_4239454_4239682_-	NA	NA|784aa|up_3|NC_007413.1_4240073_4242425_+	sd00006, TPR, Tetratricopeptide repeat	NA|131aa|up_2|NC_007413.1_4242790_4243183_-	NA	NA|285aa|up_1|NC_007413.1_4243365_4244220_-	NA	NA|552aa|up_0|NC_007413.1_4245148_4246804_+	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|143aa|down_0|NC_007413.1_4247526_4247955_+	NA	NA|394aa|down_1|NC_007413.1_4248021_4249203_-	pfam01139, RtcB, tRNA-splicing ligase RtcB	NA|300aa|down_2|NC_007413.1_4249651_4250551_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|243aa|down_3|NC_007413.1_4250547_4251276_+	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|275aa|down_4|NC_007413.1_4251794_4252619_+	COG1354, scpA, Rec8/ScpA/Scc1-like protein (kleisin family) [Replication,    recombination, and repair]	NA|390aa|down_5|NC_007413.1_4253006_4254176_+	COG1208, GCD1, Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis]	NA|672aa|down_6|NC_007413.1_4254241_4256257_-	PRK05354, PRK05354, biosynthetic arginine decarboxylase	NA|150aa|down_7|NC_007413.1_4256488_4256938_+	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	NA|241aa|down_8|NC_007413.1_4257032_4257755_-	COG0861, TerC, Membrane protein TerC, possibly involved in tellurium resistance [Inorganic ion transport and metabolism]	NA|312aa|down_9|NC_007413.1_4258156_4259092_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	14	4334197-4334377	7,11	PILER-CR,CRISPRCasFinder	no	cas6,cas8b3,cas7,cas5	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Unclear	GTGCTTTAACATTAGATGTCGTTAGGCGTTGAGCAGG,GTGCTTTAACATTAGATGTCGTTAGGCGTTGAGCA	37,35	0	0	NA	NA	I-A,I-B,II-B:I-A,I-B,II-B	2,2	2	Unclear	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA|743aa|up_6|NC_007413.1_4325001_4327230_+,NA|53aa|down_0|NC_007413.1_4334445_4334604_-,NA|77aa|down_1|NC_007413.1_4334633_4334864_-,NA|101aa|down_3|NC_007413.1_4337112_4337415_-	NA|82aa|up_9|NC_007413.1_4321577_4321823_+	CHL00065, psaC, photosystem I subunit VII	NA|634aa|up_8|NC_007413.1_4322125_4324027_+	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|273aa|up_7|NC_007413.1_4324160_4324979_+	pfam08721, Tn7_Tnp_TnsA_C, TnsA endonuclease C terminal	NA|743aa|up_6|NC_007413.1_4325001_4327230_+	NA	NA|557aa|up_5|NC_007413.1_4327216_4328887_+	pfam13401, AAA_22, AAA domain	NA|317aa|up_4|NC_007413.1_4328894_4329845_+	pfam06527, TniQ, TniQ	cas6|205aa|up_3|NC_007413.1_4330082_4330697_+	TIGR02807, hypothetical_protein_LA_3189, CRISPR-associated protein Cas6, subtype MYXAN	cas8b3|495aa|up_2|NC_007413.1_4330768_4332253_+	cd09713, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas7|325aa|up_1|NC_007413.1_4332311_4333286_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas5|211aa|up_0|NC_007413.1_4333287_4333920_+	cd09688, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	NA|53aa|down_0|NC_007413.1_4334445_4334604_-	NA	NA|77aa|down_1|NC_007413.1_4334633_4334864_-	NA	NA|637aa|down_2|NC_007413.1_4335159_4337070_+	pfam15978, TnsD, Tn7-like transposition protein D	NA|101aa|down_3|NC_007413.1_4337112_4337415_-	NA	NA|98aa|down_4|NC_007413.1_4337547_4337841_-	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|695aa|down_5|NC_007413.1_4338038_4340123_+	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|406aa|down_6|NC_007413.1_4340129_4341347_+	cd17512, RMtype1_S_BceB55ORF5615P-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Bacillus cereus HuB5-5 S subunit (S	NA|82aa|down_7|NC_007413.1_4341372_4341618_+	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|166aa|down_8|NC_007413.1_4341614_4342112_+	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|1080aa|down_9|NC_007413.1_4342243_4345483_+	COG0610, COG0610, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	15	4821250-4823752	12,4,8	CRISPRCasFinder,CRT,PILER-CR	no		csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Orphan	GTTGCAACACCACATAATCCCTATTAGGGATTGAAAC,GTTGCAACACCACATAATCCCTATTAGGGATTGAAAC,GTTGCAACACCACATAATCCCTATTAGGGATTGAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	34,34,31	34	Orphan	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA|96aa|up_5|NC_007413.1_4812516_4812804_-,NA|144aa|down_2|NC_007413.1_4825483_4825915_-,NA|266aa|down_4|NC_007413.1_4826809_4827607_+	NA|371aa|up_9|NC_007413.1_4808604_4809717_+	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|182aa|up_8|NC_007413.1_4809703_4810249_+	COG1257, HMG1, Hydroxymethylglutaryl-CoA reductase [Lipid metabolism]	NA|369aa|up_7|NC_007413.1_4810315_4811422_+	COG2082, CobH, Precorrin isomerase [Coenzyme metabolism]	NA|348aa|up_6|NC_007413.1_4811467_4812511_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|96aa|up_5|NC_007413.1_4812516_4812804_-	NA	NA|126aa|up_4|NC_007413.1_4813218_4813596_+	pfam07100, ASRT, Anabaena sensory rhodopsin transducer	NA|421aa|up_3|NC_007413.1_4813601_4814864_-	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|100aa|up_2|NC_007413.1_4815180_4815480_+	PRK02724, PRK02724, 30S ribosomal protein PSRP-3	NA|561aa|up_1|NC_007413.1_4815602_4817285_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|1022aa|up_0|NC_007413.1_4817863_4820929_+	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|304aa|down_0|NC_007413.1_4823841_4824753_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|135aa|down_1|NC_007413.1_4824941_4825346_+	COG3011, COG3011, Predicted thiol-disulfide oxidoreductase [General function    prediction only]	NA|144aa|down_2|NC_007413.1_4825483_4825915_-	NA	NA|185aa|down_3|NC_007413.1_4826044_4826599_-	PRK05800, cobU, adenosylcobinamide kinase/adenosylcobinamide-phosphate guanylyltransferase; Validated	NA|266aa|down_4|NC_007413.1_4826809_4827607_+	NA	NA|314aa|down_5|NC_007413.1_4827756_4828698_+	cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold	NA|314aa|down_6|NC_007413.1_4828718_4829660_+	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|188aa|down_7|NC_007413.1_4829694_4830258_+	PRK10502, PRK10502, putative acyl transferase; Provisional	NA|268aa|down_8|NC_007413.1_4830422_4831226_-	pfam06051, DUF928, Domain of Unknown Function (DUF928)	NA|783aa|down_9|NC_007413.1_4831330_4833679_-	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	16	5215638-5215733	13	CRISPRCasFinder	no	WYL,cas3,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Type I-D	GTTTCAATCCCTGATAGGGATTT	23	0	0	NA	NA	I-D,II-B	1	1	TypeI-D	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA|51aa|up_7|NC_007413.1_5186505_5186658_-,NA|249aa|up_4|NC_007413.1_5208394_5209141_-,NA|292aa|up_3|NC_007413.1_5210340_5211216_+,NA|68aa|down_0|NC_007413.1_5215966_5216170_+	NA|462aa|up_9|NC_007413.1_5183333_5184719_-	TIGR01137, Cystathionine_beta-synthase, cystathionine beta-synthase	NA|158aa|up_8|NC_007413.1_5185981_5186455_+	cd14503, PTP-bact, bacterial tyrosine-protein phosphataseS similar to Neisseria NMA1982	NA|51aa|up_7|NC_007413.1_5186505_5186658_-	NA	NA|313aa|up_6|NC_007413.1_5187008_5187947_-	pfam11949, DUF3466, Protein of unknown function (DUF3466)	NA|6582aa|up_5|NC_007413.1_5187999_5207745_-	pfam06346, Drf_FH1, Formin Homology Region 1	NA|249aa|up_4|NC_007413.1_5208394_5209141_-	NA	NA|292aa|up_3|NC_007413.1_5210340_5211216_+	NA	NA|373aa|up_2|NC_007413.1_5211296_5212415_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	WYL|289aa|up_1|NC_007413.1_5212491_5213358_-	pfam13280, WYL, WYL domain	cas3|726aa|up_0|NC_007413.1_5213458_5215636_+	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	NA|68aa|down_0|NC_007413.1_5215966_5216170_+	NA	NA|82aa|down_1|NC_007413.1_5216397_5216643_+	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|120aa|down_2|NC_007413.1_5216645_5217005_+	COG4634, COG4634, Uncharacterized protein conserved in bacteria [Function unknown]	PD-DExK|340aa|down_3|NC_007413.1_5217052_5218072_+	pfam06250, DUF1016, Protein of unknown function (DUF1016)	cas10d|1092aa|down_4|NC_007413.1_5218090_5221366_+	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	csc2gr7|339aa|down_5|NC_007413.1_5221411_5222428_+	pfam18320, Csc2, Csc2 Crispr	csc1gr5|236aa|down_6|NC_007413.1_5222427_5223135_+	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	2OG_CAS|204aa|down_7|NC_007413.1_5223245_5223857_+	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	cas6|291aa|down_8|NC_007413.1_5223899_5224772_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|197aa|down_9|NC_007413.1_5224856_5225447_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	17	5227213-5229282	9,14,5	PILER-CR,CRISPRCasFinder,CRT	no	WYL,cas3,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas6,cas4,cas1,cas2	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Type I-D	GTTTCTATTAACACAAATCCCTATCAGGGATTGAAAG,GTTTCTATTAACACAAATCCCTATCAGGGATTGAAAG,GTTTCTATTAACACAAATCCCTATCAGGGATTGAAAG	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	27,28,28	28	TypeI-D	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA,NA|159aa|down_9|NC_007413.1_5246697_5247174_+	NA|120aa|up_9|NC_007413.1_5216645_5217005_+	COG4634, COG4634, Uncharacterized protein conserved in bacteria [Function unknown]	PD-DExK|340aa|up_8|NC_007413.1_5217052_5218072_+	pfam06250, DUF1016, Protein of unknown function (DUF1016)	cas10d|1092aa|up_7|NC_007413.1_5218090_5221366_+	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	csc2gr7|339aa|up_6|NC_007413.1_5221411_5222428_+	pfam18320, Csc2, Csc2 Crispr	csc1gr5|236aa|up_5|NC_007413.1_5222427_5223135_+	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	2OG_CAS|204aa|up_4|NC_007413.1_5223245_5223857_+	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	cas6|291aa|up_3|NC_007413.1_5223899_5224772_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|197aa|up_2|NC_007413.1_5224856_5225447_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas1|334aa|up_1|NC_007413.1_5225620_5226622_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|91aa|up_0|NC_007413.1_5226683_5226956_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|195aa|down_0|NC_007413.1_5229674_5230259_-	pfam14015, DUF4231, Protein of unknown function (DUF4231)	NA|246aa|down_1|NC_007413.1_5230268_5231006_-	pfam18171, LSDAT_prok, SLOG in TRPM, prokaryote	NA|783aa|down_2|NC_007413.1_5231390_5233739_+	TIGR01901, Heme/hemopexin-binding_protein, filamentous hemagglutinin family N-terminal domain	NA|609aa|down_3|NC_007413.1_5233735_5235562_+	COG2831, FhaC, Hemolysin activation/secretion protein [Intracellular trafficking and secretion]	NA|922aa|down_4|NC_007413.1_5235669_5238435_+	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1049aa|down_5|NC_007413.1_5238785_5241932_+	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|259aa|down_6|NC_007413.1_5241955_5242732_+	pfam06051, DUF928, Domain of Unknown Function (DUF928)	NA|615aa|down_7|NC_007413.1_5243541_5245386_+	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|367aa|down_8|NC_007413.1_5245549_5246650_+	pfam12902, Ferritin-like, Ferritin-like	NA|159aa|down_9|NC_007413.1_5246697_5247174_+	NA
GCF_000204075.1_ASM20407v1	NC_007413	Trichormus variabilis ATCC 29413, complete sequence	18	5764010-5766428	10,15,6,11	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no		csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k	Orphan	ATTGCAATTAACTAAAATCCCTATCAGGGATTGAAAC,ATTGCAATTAACTAAAATCCCTATCAGGGATTGAAAC,ATTGCAATTAACTAAAATCCCTATCAGGGATTGAAAC,ATTGCAATTAACTAAAATCCCTATCAGGGATTGAAAC	37,37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B:I-D,II-B	31,33,33,31	33	Orphan	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA|71aa|up_0|NC_007413.1_5763787_5764000_-,NA|300aa|down_5|NC_007413.1_5772670_5773570_-,NA|105aa|down_7|NC_007413.1_5776375_5776690_-,NA|63aa|down_8|NC_007413.1_5777346_5777535_-,NA|226aa|down_9|NC_007413.1_5778014_5778692_+	NA|159aa|up_9|NC_007413.1_5755093_5755570_+	COG0694, COG0694, Thioredoxin-like proteins and domains [Posttranslational modification, protein turnover, chaperones]	NA|391aa|up_8|NC_007413.1_5755595_5756768_+	cd05819, NHL, NHL repeat unit of beta-propeller proteins	NA|787aa|up_7|NC_007413.1_5756791_5759152_+	COG0068, HypF, Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]	NA|87aa|up_6|NC_007413.1_5759241_5759502_+	pfam01455, HupF_HypC, HupF/HypC family	NA|384aa|up_5|NC_007413.1_5759736_5760888_+	PRK15062, PRK15062, hydrogenase isoenzymes formation protein HypD; Provisional	NA|72aa|up_4|NC_007413.1_5760941_5761157_+	COG1942, COG1942, Uncharacterized protein, 4-oxalocrotonate tautomerase homolog [General function prediction only]	NA|368aa|up_3|NC_007413.1_5761448_5762552_+	TIGR02124, Hydrogenase_expression/formation_protein_HypE, hydrogenase expression/formation protein HypE	NA|114aa|up_2|NC_007413.1_5762570_5762912_+	pfam01155, HypA, Hydrogenase/urease nickel incorporation, metallochaperone, hypA	NA|282aa|up_1|NC_007413.1_5762902_5763748_+	PRK10463, PRK10463, hydrogenase nickel incorporation protein HypB; Provisional	NA|71aa|up_0|NC_007413.1_5763787_5764000_-	NA	NA|414aa|down_0|NC_007413.1_5766607_5767849_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|124aa|down_1|NC_007413.1_5767970_5768342_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|325aa|down_2|NC_007413.1_5768414_5769389_-	COG2421, COG2421, Predicted acetamidase/formamidase [Energy production and conversion]	NA|357aa|down_3|NC_007413.1_5769519_5770590_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|641aa|down_4|NC_007413.1_5770735_5772658_-	COG0443, DnaK, Molecular chaperone [Posttranslational modification, protein turnover, chaperones]	NA|300aa|down_5|NC_007413.1_5772670_5773570_-	NA	NA|763aa|down_6|NC_007413.1_5773718_5776007_-	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB	NA|105aa|down_7|NC_007413.1_5776375_5776690_-	NA	NA|63aa|down_8|NC_007413.1_5777346_5777535_-	NA	NA|226aa|down_9|NC_007413.1_5778014_5778692_+	NA
GCF_000204075.1_ASM20407v1	NC_007410	Trichormus variabilis ATCC 29413 plasmid A, complete sequence	1	53505-53673	1	CRISPRCasFinder	no	Cas9_archaeal	RT,Cas9_archaeal,cas14k,Cas14u_CAS-V,cas14j,DEDDh	Type II-A, or Type II-C?, Type II-B	CCCGAAACACCCCCGAAACACCCC	24	0	0	NA	NA	NA	3	3	TypeII-A,orTypeII-C?,TypeII-B	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA|198aa|up_8|NC_007410.1_44122_44716_-,NA|109aa|up_5|NC_007410.1_46475_46802_-,NA|219aa|up_4|NC_007410.1_47274_47931_+,NA|188aa|down_0|NC_007410.1_53742_54306_+,NA|137aa|down_3|NC_007410.1_57428_57839_+,NA|127aa|down_4|NC_007410.1_58023_58404_+,NA|177aa|down_5|NC_007410.1_58480_59011_-,NA|127aa|down_6|NC_007410.1_63041_63422_+,NA|206aa|down_8|NC_007410.1_65076_65694_+	NA|221aa|up_9|NC_007410.1_43447_44110_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|198aa|up_8|NC_007410.1_44122_44716_-	NA	NA|236aa|up_7|NC_007410.1_44715_45423_-	cd05386, TraL, transfer origin protein TraL	NA|341aa|up_6|NC_007410.1_45406_46429_-	cd05386, TraL, transfer origin protein TraL	NA|109aa|up_5|NC_007410.1_46475_46802_-	NA	NA|219aa|up_4|NC_007410.1_47274_47931_+	NA	NA|187aa|up_3|NC_007410.1_48069_48630_-	COG1430, COG1430, Uncharacterized conserved protein [Function unknown]	NA|549aa|up_2|NC_007410.1_48626_50273_-	COG0464, SpoVK, ATPases of the AAA+ class [Posttranslational modification, protein turnover, chaperones]	NA|170aa|up_1|NC_007410.1_50380_50890_-	pfam13154, DUF3991, Protein of unknown function (DUF3991)	NA|357aa|up_0|NC_007410.1_50943_52014_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|188aa|down_0|NC_007410.1_53742_54306_+	NA	NA|400aa|down_1|NC_007410.1_54328_55528_+	cd10227, ParM_like, Plasmid segregation protein ParM and similar proteins	NA|401aa|down_2|NC_007410.1_56224_57427_+	pfam10592, AIPR, AIPR protein	NA|137aa|down_3|NC_007410.1_57428_57839_+	NA	NA|127aa|down_4|NC_007410.1_58023_58404_+	NA	NA|177aa|down_5|NC_007410.1_58480_59011_-	NA	NA|127aa|down_6|NC_007410.1_63041_63422_+	NA	NA|463aa|down_7|NC_007410.1_63558_64947_+	PLN02829, PLN02829, Probable galacturonosyltransferase	NA|206aa|down_8|NC_007410.1_65076_65694_+	NA	NA|317aa|down_9|NC_007410.1_65808_66759_+	TIGR02997, RNA_polymerase_sigma_subunit_sigma70/sigma32, RNA polymerase sigma factor, cyanobacterial RpoD-like family
GCF_000204075.1_ASM20407v1	NC_007412	Trichormus variabilis ATCC 29413 plasmid C, complete sequence	1	155934-156045	1	CRISPRCasFinder	no		cas14j,RT,cas14k,cas3,PD-DExK	Orphan	GCCGCTGTACGCATCGTCTCTTCCGGATTACCGCGCG	37	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,c2c5_V-U5,Cas14c_CAS-V-F,cas14j,DinG,RT,cas6,cas8b3,cas7,cas5,WYL,PD-DExK,cas10d,csc2gr7,csc1gr5,2OG_CAS,cas4,cas1,cas2,Cas9_archaeal,cas14k,Cas14u_CAS-V,DEDDh	NA|392aa|up_7|NC_007412.1_146345_147521_-,NA|140aa|up_5|NC_007412.1_148213_148633_+,NA|185aa|down_2|NC_007412.1_158919_159474_-	NA|210aa|up_9|NC_007412.1_138275_138905_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|2362aa|up_8|NC_007412.1_138917_146003_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|392aa|up_7|NC_007412.1_146345_147521_-	NA	NA|128aa|up_6|NC_007412.1_147756_148140_+	PRK07459, PRK07459, single-stranded DNA-binding protein; Provisional	NA|140aa|up_5|NC_007412.1_148213_148633_+	NA	NA|265aa|up_4|NC_007412.1_148771_149566_+	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|428aa|up_3|NC_007412.1_149773_151057_+	pfam14239, RRXRR, RRXRR protein	NA|329aa|up_2|NC_007412.1_151141_152128_+	TIGR04285, parB-like_partition_protein, nucleoid occlusion protein	NA|199aa|up_1|NC_007412.1_152435_153032_-	cd04182, GT_2_like_f, GT_2_like_f is a subfamily of the glycosyltransferase family 2 (GT-2) with unknown function	NA|387aa|up_0|NC_007412.1_153010_154171_-	pfam13478, XdhC_C, XdhC Rossmann domain	NA|331aa|down_0|NC_007412.1_156538_157531_-	COG1319, CoxM, Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs [Energy production and conversion]	NA|281aa|down_1|NC_007412.1_157517_158360_-	PRK11433, PRK11433, aldehyde oxidoreductase 2Fe-2S subunit; Provisional	NA|185aa|down_2|NC_007412.1_158919_159474_-	NA	NA|187aa|down_3|NC_007412.1_159660_160221_-	COG3247, HdeD, Uncharacterized conserved protein [Function unknown]	NA|874aa|down_4|NC_007412.1_160326_162948_-	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB	NA|106aa|down_5|NC_007412.1_162992_163310_-	pfam13591, MerR_2, MerR HTH family regulatory protein	NA|336aa|down_6|NC_007412.1_163306_164314_-	PRK14299, PRK14299, chaperone protein DnaJ; Provisional	NA|325aa|down_7|NC_007412.1_164398_165373_-	pfam10938, YfdX, YfdX protein	NA|301aa|down_8|NC_007412.1_165838_166741_-	pfam10938, YfdX, YfdX protein	NA|280aa|down_9|NC_007412.1_166929_167769_-	cd19138, AKR_YeaE, Escherichia coli YeaE and similar proteins
