assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000092605.1_ASM9260v1	NC_014153	Thiomonas intermedia K12, complete genome	1	157050-157134	1	CRISPRCasFinder	no		c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	Orphan	CACCTCCTCCACTTGTGGGGAGG	23	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	NA,NA|224aa|down_0|NC_014153.1_157213_157885_-,NA|100aa|down_7|NC_014153.1_165778_166078_+	NA|320aa|up_9|NC_014153.1_145751_146711_-	cd08419, PBP2_CbbR_RubisCO_like, The C-terminal substrate binding of LysR-type transcriptional regulator (CbbR) of RubisCO operon, which is involved in the carbon dioxide fixation, contains the type 2 periplasmic binding fold	NA|360aa|up_8|NC_014153.1_147119_148199_+	PRK09293, PRK09293, class 1 fructose-bisphosphatase	NA|292aa|up_7|NC_014153.1_148355_149231_+	PRK15453, PRK15453, phosphoribulokinase; Provisional	NA|676aa|up_6|NC_014153.1_149262_151290_+	PRK12753, PRK12753, transketolase; Reviewed	NA|236aa|up_5|NC_014153.1_151312_152020_+	PRK13222, PRK13222, N-acetylmuramic acid 6-phosphate phosphatase MupP	NA|336aa|up_4|NC_014153.1_152091_153099_+	COG0057, GapA, Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase [Carbohydrate transport and metabolism]	NA|402aa|up_3|NC_014153.1_153169_154375_+	PRK00073, pgk, phosphoglycerate kinase; Provisional	NA|355aa|up_2|NC_014153.1_154428_155493_+	PRK09196, PRK09196, fructose-bisphosphate aldolase class II	NA|186aa|up_1|NC_014153.1_155602_156160_+	cd02969, PRX_like1, Peroxiredoxin (PRX)-like 1 family; hypothetical proteins that show sequence similarity to PRXs	NA|255aa|up_0|NC_014153.1_156156_156921_+	cd07528, HAD_CbbY-like, subfamily of beta-phosphoglucomutase-like family, similar to Rhodobacter sphaeroides xylulose-1,5-bisphosphate phosphatase CbbY	NA|224aa|down_0|NC_014153.1_157213_157885_-	NA	NA|364aa|down_1|NC_014153.1_158002_159094_-	PRK05574, holA, DNA polymerase III subunit delta; Reviewed	NA|163aa|down_2|NC_014153.1_159093_159582_-	COG2980, RlpB, Rare lipoprotein B [Cell envelope biogenesis, outer membrane]	NA|877aa|down_3|NC_014153.1_159582_162213_-	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|444aa|down_4|NC_014153.1_162364_163696_+	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|119aa|down_5|NC_014153.1_163797_164154_+	pfam16998, 17kDa_Anti_2, 17 kDa outer membrane surface antigen	NA|497aa|down_6|NC_014153.1_164196_165687_+	cd01299, Met_dep_hydrolase_A, Metallo-dependent hydrolases, subgroup A is part of the superfamily of metallo-dependent hydrolases, a large group of proteins that show conservation in their 3-dimensional fold (TIM barrel) and in details of their active site	NA|100aa|down_7|NC_014153.1_165778_166078_+	NA	NA|131aa|down_8|NC_014153.1_166177_166570_+	cd07262, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|483aa|down_9|NC_014153.1_166662_168111_-	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB
GCF_000092605.1_ASM9260v1	NC_014153	Thiomonas intermedia K12, complete genome	2	423058-423159	2	CRISPRCasFinder	no		c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	Orphan	CGGTTTACCTCGCAAGCCCGCGCCGTTGCTTGC	33	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	NA|80aa|up_6|NC_014153.1_416083_416323_-,NA|174aa|up_4|NC_014153.1_417284_417806_-,NA|71aa|up_3|NC_014153.1_417855_418068_-,NA|136aa|down_0|NC_014153.1_423237_423645_-,NA|63aa|down_1|NC_014153.1_423646_423835_-,NA|83aa|down_2|NC_014153.1_423831_424080_-,NA|122aa|down_3|NC_014153.1_424495_424861_-	NA|407aa|up_9|NC_014153.1_411006_412227_-	COG1459, PulF, Type II secretory pathway, component PulF [Cell motility and secretion / Intracellular trafficking and secretion]	NA|580aa|up_8|NC_014153.1_412241_413981_-	TIGR02538, Type_IV_pilus_assembly_protein_PilF, type IV-A pilus assembly ATPase PilB	NA|408aa|up_7|NC_014153.1_414652_415876_+	PRK09692, PRK09692, integrase; Provisional	NA|80aa|up_6|NC_014153.1_416083_416323_-	NA	NA|165aa|up_5|NC_014153.1_416730_417225_-	COG3133, SlyB, Outer membrane lipoprotein [Cell envelope biogenesis, outer membrane]	NA|174aa|up_4|NC_014153.1_417284_417806_-	NA	NA|71aa|up_3|NC_014153.1_417855_418068_-	NA	NA|946aa|up_2|NC_014153.1_418274_421112_-	pfam08751, TrwC, TrwC relaxase	NA|536aa|up_1|NC_014153.1_421115_422723_-	pfam10412, TrwB_AAD_bind, Type IV secretion-system coupling protein DNA-binding domain	NA|78aa|up_0|NC_014153.1_422731_422965_-	COG0143, MetG, Methionyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]	NA|136aa|down_0|NC_014153.1_423237_423645_-	NA	NA|63aa|down_1|NC_014153.1_423646_423835_-	NA	NA|83aa|down_2|NC_014153.1_423831_424080_-	NA	NA|122aa|down_3|NC_014153.1_424495_424861_-	NA	NA|324aa|down_4|NC_014153.1_424857_425829_-	pfam16793, RepB_primase, RepB DNA-primase from phage plasmid	NA|343aa|down_5|NC_014153.1_425948_426977_-	pfam06504, RepC, Replication protein C (RepC)	NA|503aa|down_6|NC_014153.1_426963_428472_-	cd01125, repA, Hexameric Replicative Helicase RepA	NA|72aa|down_7|NC_014153.1_428565_428781_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|252aa|down_8|NC_014153.1_429204_429960_+	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional	NA|284aa|down_9|NC_014153.1_429956_430808_+	COG2518, Pcm, Protein-L-isoaspartate carboxylmethyltransferase [Posttranslational modification, protein turnover, chaperones]
GCF_000092605.1_ASM9260v1	NC_014153	Thiomonas intermedia K12, complete genome	3	533351-533447	3	CRISPRCasFinder	no		c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	Orphan	AGCGCTTCGCGTTCCCTTCGGATGCCCCTCCCGG	34	1	3	533385-533413|533385-533413|533385-533413	NC_014153.1_157135-157107|NC_014153.1_616881-616853|NC_014153.1_1498689-1498661	NA	1	1	Orphan	c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	NA,NA|67aa|down_0|NC_014153.1_534427_534628_+,NA|127aa|down_4|NC_014153.1_536840_537221_+,NA|162aa|down_8|NC_014153.1_542964_543450_-	NA|812aa|up_9|NC_014153.1_520772_523208_+	TIGR02504, ribonucleotide_reductase, ribonucleoside-diphosphate reductase, adenosylcobalamin-dependent	NA|86aa|up_8|NC_014153.1_523259_523517_-	PRK13989, PRK13989, cell division topological specificity factor MinE; Provisional	NA|271aa|up_7|NC_014153.1_523531_524344_-	COG2894, MinD, Septum formation inhibitor-activating ATPase [Cell division and chromosome partitioning]	NA|272aa|up_6|NC_014153.1_524362_525178_-	PRK01973, PRK01973, septum site-determining protein MinC	NA|228aa|up_5|NC_014153.1_525272_525956_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|289aa|up_4|NC_014153.1_526180_527047_-	cd00293, USP_Like, Usp: Universal stress protein family	NA|493aa|up_3|NC_014153.1_527057_528536_-	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|612aa|up_2|NC_014153.1_528757_530593_-	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|503aa|up_1|NC_014153.1_530589_532098_-	TIGR04181, DegT/DnrJ/EryC1/StrS_aminotransferase, aminotransferase, LLPSF_NHT_00031 family	NA|208aa|up_0|NC_014153.1_532094_532718_-	cd03360, LbH_AT_putative, Putative Acyltransferase (AT), Left-handed parallel beta-Helix (LbH) domain; This group is composed of mostly uncharacterized proteins containing an N-terminal helical subdomain followed by a LbH domain	NA|67aa|down_0|NC_014153.1_534427_534628_+	NA	NA|165aa|down_1|NC_014153.1_535244_535739_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|78aa|down_2|NC_014153.1_535846_536080_+	TIGR02384, Putative_antitoxin_RelB, addiction module antitoxin, RelB/DinJ family	NA|97aa|down_3|NC_014153.1_536076_536367_+	pfam15738, YafQ_toxin, Bacterial toxin of type II toxin-antitoxin system, YafQ	NA|127aa|down_4|NC_014153.1_536840_537221_+	NA	NA|439aa|down_5|NC_014153.1_537222_538539_+	pfam07804, HipA_C, HipA-like C-terminal domain	NA|274aa|down_6|NC_014153.1_539288_540110_-	pfam01695, IstB_IS21, IstB-like ATP binding protein	NA|92aa|down_7|NC_014153.1_542227_542503_+	pfam08808, RES, RES domain	NA|162aa|down_8|NC_014153.1_542964_543450_-	NA	NA|614aa|down_9|NC_014153.1_543456_545298_-	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain
GCF_000092605.1_ASM9260v1	NC_014153	Thiomonas intermedia K12, complete genome	4	1967896-1968021	4	CRISPRCasFinder	no		c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	Orphan	CCCCAAGTTTCCTTGACCCCCTCGGGGGGCGGGCTGGGCGAAGCCC	46	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	NA|58aa|up_3|NC_014153.1_1963845_1964019_+,NA	NA|183aa|up_9|NC_014153.1_1957479_1958028_+	cd16345, LMWP_ArsC, Arsenate reductase of the LMWP family	NA|434aa|up_8|NC_014153.1_1958114_1959416_+	PRK15445, PRK15445, arsenical efflux pump membrane protein ArsB	NA|155aa|up_7|NC_014153.1_1959490_1959955_+	cd07254, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|201aa|up_6|NC_014153.1_1960465_1961068_+	COG1279, COG1279, Lysine efflux permease [General function prediction only]	NA|473aa|up_5|NC_014153.1_1961086_1962505_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|371aa|up_4|NC_014153.1_1962560_1963673_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|58aa|up_3|NC_014153.1_1963845_1964019_+	NA	NA|301aa|up_2|NC_014153.1_1964035_1964938_-	PRK11761, cysM, cysteine synthase CysM	NA|408aa|up_1|NC_014153.1_1964991_1966215_-	TIGR00540, TPR_hemY_coli, heme biosynthesis-associated TPR protein	NA|384aa|up_0|NC_014153.1_1966214_1967366_-	PRK06975, PRK06975, bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed	NA|318aa|down_0|NC_014153.1_1968422_1969376_-	PRK00072, hemC, porphobilinogen deaminase; Reviewed	NA|937aa|down_1|NC_014153.1_1969573_1972384_+	PRK00009, PRK00009, phosphoenolpyruvate carboxylase; Reviewed	NA|327aa|down_2|NC_014153.1_1972390_1973371_-	cd07228, Pat_NTE_like_bacteria, Bacterial patatin-like phospholipase domain containing protein 6	NA|112aa|down_3|NC_014153.1_1973474_1973810_+	pfam03413, PepSY, Peptidase propeptide and YPEB domain	NA|206aa|down_4|NC_014153.1_1973899_1974517_+	PRK11837, PRK11837, undecaprenyl pyrophosphate phosphatase; Provisional	NA|476aa|down_5|NC_014153.1_1974513_1975941_+	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|357aa|down_6|NC_014153.1_1975937_1977008_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|1095aa|down_7|NC_014153.1_1977004_1980289_+	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|226aa|down_8|NC_014153.1_1980292_1980970_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|447aa|down_9|NC_014153.1_1980966_1982307_+	PRK10337, PRK10337, sensor protein QseC; Provisional
GCF_000092605.1_ASM9260v1	NC_014153	Thiomonas intermedia K12, complete genome	5	2166179-2166348	5	CRISPRCasFinder	no	csa3	c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	Type I-A	GGGCGTCACAGGTGACGCGTCAC	23	0	0	NA	NA	NA	2	2	Orphan	c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	NA|47aa|up_9|NC_014153.1_2152794_2152935_+,NA|81aa|up_2|NC_014153.1_2163772_2164015_-,NA|188aa|up_1|NC_014153.1_2164023_2164587_-,NA|69aa|down_4|NC_014153.1_2168596_2168803_+,NA|71aa|down_5|NC_014153.1_2168971_2169184_+,NA|91aa|down_6|NC_014153.1_2169199_2169472_+	NA|47aa|up_9|NC_014153.1_2152794_2152935_+	NA	NA|1042aa|up_8|NC_014153.1_2152956_2156082_-	pfam04851, ResIII, Type III restriction enzyme, res subunit	NA|1139aa|up_7|NC_014153.1_2156083_2159500_-	COG2189, COG2189, Adenine specific DNA methylase Mod [DNA replication, recombination, and repair]	NA|276aa|up_6|NC_014153.1_2159715_2160543_-	COG3177, COG3177, Fic family protein [Function unknown]	NA|724aa|up_5|NC_014153.1_2160559_2162731_-	PRK07726, PRK07726, DNA topoisomerase 3	NA|189aa|up_4|NC_014153.1_2162854_2163421_+	cd16892, LT_VirB1-like, VirB1-like subfamily	NA|112aa|up_3|NC_014153.1_2163422_2163758_-	pfam01817, CM_2, Chorismate mutase type II	NA|81aa|up_2|NC_014153.1_2163772_2164015_-	NA	NA|188aa|up_1|NC_014153.1_2164023_2164587_-	NA	NA|321aa|up_0|NC_014153.1_2164579_2165542_-	smart00470, ParB, ParB-like nuclease domain	NA|54aa|down_0|NC_014153.1_2166810_2166972_-	COG3311, AlpA, Predicted transcriptional regulator [Transcription]	NA|98aa|down_1|NC_014153.1_2167012_2167306_-	pfam15943, YdaS_antitoxin, Putative antitoxin of bacterial toxin-antitoxin system, YdaS/YdaT	NA|225aa|down_2|NC_014153.1_2167381_2168056_+	COG2932, COG2932, Predicted transcriptional regulator [Transcription]	NA|143aa|down_3|NC_014153.1_2168171_2168600_+	TIGR00621, Single-stranded_DNA-binding_protein, single stranded DNA-binding protein (ssb)	NA|69aa|down_4|NC_014153.1_2168596_2168803_+	NA	NA|71aa|down_5|NC_014153.1_2168971_2169184_+	NA	NA|91aa|down_6|NC_014153.1_2169199_2169472_+	NA	NA|334aa|down_7|NC_014153.1_2169743_2170745_-	cd19076, AKR_AKR13A_13D, AKR13A and AKR13D families of aldo-keto reductase (AKR)	NA|303aa|down_8|NC_014153.1_2170846_2171755_+	cd08474, PBP2_CrgA_like_5, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator CrgA-like, contains the type 2 periplasmic binding fold	NA|124aa|down_9|NC_014153.1_2171799_2172171_+	COG3613, COG3613, Nucleoside 2-deoxyribosyltransferase [Nucleotide transport and metabolism]
GCF_000092605.1_ASM9260v1	NC_014153	Thiomonas intermedia K12, complete genome	6	2220675-2220769	6	CRISPRCasFinder	no		c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	Orphan	AGCAATCCTTGCGCTCCGTCCAA	23	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	NA|412aa|up_9|NC_014153.1_2211407_2212643_+,NA|126aa|up_4|NC_014153.1_2216202_2216580_-,NA|240aa|down_0|NC_014153.1_2220848_2221568_+	NA|412aa|up_9|NC_014153.1_2211407_2212643_+	NA	NA|130aa|up_8|NC_014153.1_2212652_2213042_+	COG2863, COG2863, Cytochrome c553 [Energy production and conversion]	NA|446aa|up_7|NC_014153.1_2213052_2214390_+	COG0446, HcaD, Uncharacterized NAD(FAD)-dependent dehydrogenases [General function prediction only]	NA|253aa|up_6|NC_014153.1_2214386_2215145_+	COG2863, COG2863, Cytochrome c553 [Energy production and conversion]	NA|335aa|up_5|NC_014153.1_2215187_2216192_-	cd02549, Peptidase_C39A, A sub-family of peptidase family C39	NA|126aa|up_4|NC_014153.1_2216202_2216580_-	NA	NA|269aa|up_3|NC_014153.1_2216675_2217482_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|430aa|up_2|NC_014153.1_2217621_2218911_-	COG0446, HcaD, Uncharacterized NAD(FAD)-dependent dehydrogenases [General function prediction only]	NA|170aa|up_1|NC_014153.1_2219336_2219846_+	pfam04143, Sulf_transp, Sulphur transport	NA|202aa|up_0|NC_014153.1_2219842_2220448_+	pfam04143, Sulf_transp, Sulphur transport	NA|240aa|down_0|NC_014153.1_2220848_2221568_+	NA	NA|329aa|down_1|NC_014153.1_2221633_2222620_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|206aa|down_2|NC_014153.1_2222914_2223532_+	TIGR03027, pepcterm_export, putative polysaccharide export protein, PEP-CTERM sytem-associated	NA|553aa|down_3|NC_014153.1_2223573_2225232_+	TIGR03016, hypothetical_protein, uncharacterized protein, PEP-CTERM system associated	NA|716aa|down_4|NC_014153.1_2225241_2227389_+	TIGR03015, pepcterm_ATPase, putative secretion ATPase, PEP-CTERM locus subfamily	NA|536aa|down_5|NC_014153.1_2227441_2229049_+	TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant protein, PEP-CTERM locus subfamily	NA|317aa|down_6|NC_014153.1_2229058_2230009_+	TIGR03018, pepcterm_TyrKin, exopolysaccharide/PEP-CTERM locus tyrosine autokinase	NA|2485aa|down_7|NC_014153.1_2230136_2237591_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|1651aa|down_8|NC_014153.1_2237598_2242551_+	PRK12467, PRK12467, peptide synthase; Provisional	NA|348aa|down_9|NC_014153.1_2242601_2243645_+	pfam13621, Cupin_8, Cupin-like domain
GCF_000092605.1_ASM9260v1	NC_014153	Thiomonas intermedia K12, complete genome	7	2311189-2311316	7	CRISPRCasFinder	no		c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	Orphan	CCCCTCAAGGGGCGACACGCCCTTGGGGCGGCCCGGCGGGCGTG	44	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	NA,NA	NA|203aa|up_9|NC_014153.1_2301224_2301833_+	pfam11304, DUF3106, Protein of unknown function (DUF3106)	NA|183aa|up_8|NC_014153.1_2301817_2302366_+	pfam06271, RDD, RDD family	NA|197aa|up_7|NC_014153.1_2302372_2302963_+	cd03395, PAP2_like_4, PAP2_like_4 proteins	NA|303aa|up_6|NC_014153.1_2303993_2304902_+	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain	NA|362aa|up_5|NC_014153.1_2304952_2306038_-	cd13682, PBP2_TRAP_alpha-ketoacid, Substrate-binding component of an alpha-keto acid binding Tripartite ATP-independent Periplasmic transporter and related proteins; contains the type 2 periplasmic-binding protein fold	NA|361aa|up_4|NC_014153.1_2306172_2307255_-	cd13682, PBP2_TRAP_alpha-ketoacid, Substrate-binding component of an alpha-keto acid binding Tripartite ATP-independent Periplasmic transporter and related proteins; contains the type 2 periplasmic-binding protein fold	NA|452aa|up_3|NC_014153.1_2307486_2308842_-	PRK11040, PRK11040, peptidase PmbA; Provisional	NA|191aa|up_2|NC_014153.1_2308932_2309505_+	cd16331, YjgA-like, uncharacterized proteins similar to Escherichia coli YjgA	NA|208aa|up_1|NC_014153.1_2309482_2310106_+	PRK09417, mogA, molybdenum cofactor biosynthesis protein MogA; Provisional	NA|320aa|up_0|NC_014153.1_2310194_2311154_+	cd13540, PBP2_ModA_WtpA, Substrate binding domain of ModA/WtpA from Pyrococcus furiosus and its closest homologs;the type 2 periplasmic binding protein fold	NA|275aa|down_0|NC_014153.1_2311349_2312174_+	COG0555, CysU, ABC-type sulfate transport system, permease component [Posttranslational modification, protein turnover, chaperones]	NA|384aa|down_1|NC_014153.1_2312161_2313313_+	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|631aa|down_2|NC_014153.1_2313330_2315223_-	COG3706, PleD, Response regulator containing a CheY-like receiver domain and a GGDEF domain [Signal transduction mechanisms]	NA|329aa|down_3|NC_014153.1_2315324_2316311_-	cd19076, AKR_AKR13A_13D, AKR13A and AKR13D families of aldo-keto reductase (AKR)	NA|488aa|down_4|NC_014153.1_2316508_2317972_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|333aa|down_5|NC_014153.1_2318011_2319010_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|836aa|down_6|NC_014153.1_2319174_2321682_+	COG1080, PtsA, Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [Carbohydrate transport and metabolism]	NA|313aa|down_7|NC_014153.1_2321678_2322617_+	cd01164, FruK_PfkB_like, 1-phosphofructokinase (FruK), minor 6-phosphofructokinase (pfkB) and related sugar kinases	NA|571aa|down_8|NC_014153.1_2322642_2324355_+	PRK10712, PRK10712, PTS system fructose-specific transporter subunits IIBC; Provisional	NA|433aa|down_9|NC_014153.1_2324381_2325680_+	pfam04966, OprB, Carbohydrate-selective porin, OprB family
GCF_000092605.1_ASM9260v1	NC_014153	Thiomonas intermedia K12, complete genome	8	2887174-2887280	8	CRISPRCasFinder	no		c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	Orphan	AGAAAACGTAGGGCCGCCCCAAGTTT	26	1	3	2887200-2887254|2887200-2887254|2887200-2887254	NC_014153.1_270611-270557|NC_014153.1_366584-366530|NC_014153.1_2105855-2105801	NA	1	1	Orphan	c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	NA,NA	NA|518aa|up_9|NC_014153.1_2876302_2877856_-	COG1858, MauG, Cytochrome c peroxidase [Inorganic ion transport and metabolism]	NA|583aa|up_8|NC_014153.1_2877971_2879720_-	cd16013, AcpA, acid phosphatase A	NA|191aa|up_7|NC_014153.1_2879953_2880526_-	PRK00116, ruvA, Holliday junction branch migration protein RuvA	NA|172aa|up_6|NC_014153.1_2880586_2881102_+	PRK05174, PRK05174, bifunctional 3-hydroxydecanoyl-ACP dehydratase/trans-2-decenoyl-ACP isomerase	NA|344aa|up_5|NC_014153.1_2881214_2882246_+	cd16282, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|383aa|up_4|NC_014153.1_2882287_2883436_-	cd06342, PBP1_ABC_LIVBP-like, type 1 periplasmic ligand-binding domain of ABC (Atpase Binding Cassette)-type active transport systems involved in the transport of all three branched chain aliphatic amino acids (leucine, isoleucine and valine)	NA|196aa|up_3|NC_014153.1_2883538_2884126_-	PRK00039, ruvC, Holliday junction resolvase; Reviewed	NA|517aa|up_2|NC_014153.1_2884142_2885693_-	PRK00881, purH, bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase; Provisional	NA|82aa|up_1|NC_014153.1_2885736_2885982_-	PRK01905, PRK01905, Fis family transcriptional regulator	NA|332aa|up_0|NC_014153.1_2886009_2887005_-	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|73aa|down_0|NC_014153.1_2887467_2887686_-	pfam06945, DUF1289, Protein of unknown function (DUF1289)	NA|171aa|down_1|NC_014153.1_2887682_2888195_-	cd04333, ProX_deacylase, This CD, composed mainly of bacterial single-domain proteins, includes the Thermus thermophilus (Tt) YbaK-like protein, a homolog of the trans-acting Escherichia coli YbaK Cys-tRNA(Pro) deacylase and the Agrobacterium tumefaciens  ProX Ala-tRNA(Pro) deacylase and also the cis-acting prolyl-tRNA synthetase-editing domain (ProRS-INS)	NA|316aa|down_2|NC_014153.1_2888206_2889154_-	PRK05692, PRK05692, hydroxymethylglutaryl-CoA lyase; Provisional	NA|317aa|down_3|NC_014153.1_2889167_2890118_-	cd12164, GDH_like_2, Putative glycerate dehydrogenase and related proteins of the D-specific 2-hydroxy dehydrogenase family	NA|694aa|down_4|NC_014153.1_2890114_2892196_-	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|226aa|down_5|NC_014153.1_2892222_2892900_-	pfam13548, DUF4126, Domain of unknown function (DUF4126)	NA|343aa|down_6|NC_014153.1_2892896_2893925_-	TIGR00433, biotin_synthase, biotin synthase	NA|267aa|down_7|NC_014153.1_2893951_2894752_-	PRK05995, PRK05995, enoyl-CoA hydratase; Provisional	NA|176aa|down_8|NC_014153.1_2894748_2895276_-	COG2318, DinB, Uncharacterized protein conserved in bacteria [Function unknown]	NA|133aa|down_9|NC_014153.1_2895304_2895703_-	cd18760, PIN_MtVapC3-like, uncharacterized subgroup of the VapC3-like nuclease subfamily of the PIN domain superfamily
GCF_000092605.1_ASM9260v1	NC_014153	Thiomonas intermedia K12, complete genome	9	3057618-3057744	9	CRISPRCasFinder	no		c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	Orphan	TTTGCAGCGAAGACATCAATTATTGCACTACTTTTAAT	38	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,Cas14u_CAS-V,csa3,DEDDh,DinG	NA,NA	NA|92aa|up_9|NC_014153.1_3044814_3045090_-	PRK00357, rpsS, 30S ribosomal protein S19; Reviewed	NA|275aa|up_8|NC_014153.1_3045100_3045925_-	PRK09374, rplB, 50S ribosomal protein L2; Validated	NA|105aa|up_7|NC_014153.1_3045925_3046240_-	PRK05738, rplW, 50S ribosomal protein L23; Reviewed	NA|207aa|up_6|NC_014153.1_3046236_3046857_-	PRK05319, rplD, 50S ribosomal protein L4; Provisional	NA|221aa|up_5|NC_014153.1_3046856_3047519_-	PRK00001, rplC, 50S ribosomal protein L3; Validated	NA|104aa|up_4|NC_014153.1_3047693_3048005_-	PRK00596, rpsJ, 30S ribosomal protein S10; Reviewed	NA|397aa|up_3|NC_014153.1_3048073_3049264_-	PRK00049, PRK00049, elongation factor Tu; Reviewed	NA|702aa|up_2|NC_014153.1_3049316_3051422_-	PRK00007, PRK00007, elongation factor G; Reviewed	NA|157aa|up_1|NC_014153.1_3051442_3051913_-	PRK05302, PRK05302, 30S ribosomal protein S7; Validated	NA|125aa|up_0|NC_014153.1_3052009_3052384_-	PRK05163, rpsL, 30S ribosomal protein S12; Validated	NA|292aa|down_0|NC_014153.1_3057904_3058780_-	cd01558, D-AAT_like, D-Alanine aminotransferase (D-AAT_like): D-amino acid aminotransferase catalyzes transamination between D-amino acids and their respective alpha-keto acids	NA|224aa|down_1|NC_014153.1_3058823_3059495_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|344aa|down_2|NC_014153.1_3059632_3060664_+	COG1702, PhoH, Phosphate starvation-inducible protein PhoH, predicted ATPase [Signal transduction mechanisms]	NA|176aa|down_3|NC_014153.1_3060660_3061188_+	PRK13963, PRK13963, rRNA maturation RNase YbeY	NA|551aa|down_4|NC_014153.1_3061232_3062885_+	PRK00302, lnt, apolipoprotein N-acyltransferase; Reviewed	NA|153aa|down_5|NC_014153.1_3062776_3063235_-	pfam10861, DUF2784, Protein of Unknown function (DUF2784)	NA|215aa|down_6|NC_014153.1_3063422_3064067_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|440aa|down_7|NC_014153.1_3064080_3065400_+	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|308aa|down_8|NC_014153.1_3065497_3066421_-	COG3258, COG3258, Cytochrome c [Energy production and conversion]	NA|218aa|down_9|NC_014153.1_3066454_3067108_-	COG2863, COG2863, Cytochrome c553 [Energy production and conversion]
