assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000321415.2_ASM32141v2	NC_019902	Thioalkalivibrio nitratireducens DSM 14787, complete sequence	1	14392-14467	1	CRISPRCasFinder	no		cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	Orphan	GGGGTCAGGTCTCGCCTTGTAGCA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	NA|522aa|up_4|NC_019902.2_8446_10012_-,NA|70aa|up_2|NC_019902.2_13232_13442_-,NA	NA|420aa|up_9|NC_019902.2_2561_3821_+	cd03811, GT4_GT28_WabH-like, family 4 and family 28 glycosyltransferases similar to Klebsiella WabH	NA|337aa|up_8|NC_019902.2_3765_4776_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|97aa|up_7|NC_019902.2_5257_5548_-	pfam14384, BrnA_antitoxin, BrnA antitoxin of type II toxin-antitoxin system	NA|304aa|up_6|NC_019902.2_5663_6575_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|206aa|up_5|NC_019902.2_6980_7598_+	COG2184, Fic, Protein involved in cell division [Cell division and chromosome partitioning]	NA|522aa|up_4|NC_019902.2_8446_10012_-	NA	NA|191aa|up_3|NC_019902.2_10245_10818_-	pfam14261, DUF4351, Domain of unknown function (DUF4351)	NA|70aa|up_2|NC_019902.2_13232_13442_-	NA	NA|77aa|up_1|NC_019902.2_13634_13865_+	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|135aa|up_0|NC_019902.2_13866_14271_+	cd18735, PIN_HiVapC1-like, VapC-like PIN domain of Haemophilus influenzae VapC1 and related proteins	NA|97aa|down_0|NC_019902.2_14614_14905_+	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|137aa|down_1|NC_019902.2_14901_15312_+	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]	NA|453aa|down_2|NC_019902.2_15395_16754_+	pfam01610, DDE_Tnp_ISL3, Transposase	NA|163aa|down_3|NC_019902.2_17099_17588_-	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|115aa|down_4|NC_019902.2_17569_17914_-	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|139aa|down_5|NC_019902.2_18496_18913_-	cd18683, PIN_VapC-like, Uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|88aa|down_6|NC_019902.2_18899_19163_-	COG2002, AbrB, Regulators of stationary/sporulation gene expression [Transcription]	NA|136aa|down_7|NC_019902.2_19393_19801_-	pfam18765, Polbeta, Polymerase beta, Nucleotidyltransferase	NA|83aa|down_8|NC_019902.2_19997_20246_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|143aa|down_9|NC_019902.2_20238_20667_+	COG1569, COG1569, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]
GCF_000321415.2_ASM32141v2	NC_019902	Thioalkalivibrio nitratireducens DSM 14787, complete sequence	2	290068-290205	2	CRISPRCasFinder	no		cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	Orphan	GTTAGCGTGAAAATACGCCGCCCTAGAATCGTATGTCTCTGATT	44	1	19	290112-290161|290112-290161|290112-290161|290112-290161|290112-290161|290112-290161|290112-290161|290112-290161|290112-290161|290112-290161|290112-290161|290112-290161|290112-290161|290112-290161|290112-290161|290112-290161|290112-290161|290112-290161|290112-290161	NC_019902.2_382120-382071|NC_019902.2_424945-424896|NC_019902.2_1267334-1267285|NC_019902.2_2789595-2789546|NC_019902.2_2791290-2791241|NC_019902.2_701086-701135|NC_019902.2_724713-724762|NC_019902.2_995487-995536|NC_019902.2_1068607-1068656|NC_019902.2_2614171-2614220|NC_019902.2_61674-61625|NC_019902.2_426640-426591|NC_019902.2_1299886-1299837|NC_019902.2_2154232-2154183|NC_019902.2_536984-537033|NC_019902.2_1154931-1154980|NC_019902.2_3905444-3905493|NC_019902.2_2620852-2620803|NC_019902.2_11172-11221	NA	1	1	Orphan	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	NA|241aa|up_9|NC_019902.2_282317_283040_+,NA|190aa|up_7|NC_019902.2_284586_285156_+,NA|237aa|up_4|NC_019902.2_286589_287300_-,NA	NA|241aa|up_9|NC_019902.2_282317_283040_+	NA	NA|348aa|up_8|NC_019902.2_283282_284326_+	cd01902, Ntn_CGH, Choloylglycine hydrolase (CGH) is a bile salt-modifying enzyme that hydrolyzes non-peptide carbon-nitrogen bonds in choloylglycine and choloyltaurine, both of which are present in bile	NA|190aa|up_7|NC_019902.2_284586_285156_+	NA	NA|277aa|up_6|NC_019902.2_285202_286033_-	cd01638, CysQ, CysQ, a 3'-Phosphoadenosine-5'-phosphosulfate (PAPS) 3'-phosphatase, is a bacterial member of the inositol monophosphatase family	NA|185aa|up_5|NC_019902.2_286029_286584_-	PRK11762, nudE, adenosine nucleotide hydrolase NudE; Provisional	NA|237aa|up_4|NC_019902.2_286589_287300_-	NA	NA|247aa|up_3|NC_019902.2_287336_288077_+	PRK14988, PRK14988, GMP/IMP nucleotidase; Provisional	NA|104aa|up_2|NC_019902.2_288194_288506_+	pfam08755, YccV-like, Hemimethylated DNA-binding protein YccV like	NA|222aa|up_1|NC_019902.2_288634_289300_+	pfam13386, DsbD_2, Cytochrome C biogenesis protein transmembrane region	NA|151aa|up_0|NC_019902.2_289311_289764_-	pfam14467, DUF4426, Domain of unknown function (DUF4426)	NA|186aa|down_0|NC_019902.2_290525_291083_-	pfam02325, YGGT, YGGT family	NA|273aa|down_1|NC_019902.2_291219_292038_-	PRK11880, PRK11880, pyrroline-5-carboxylate reductase; Reviewed	NA|238aa|down_2|NC_019902.2_292034_292748_-	cd06824, PLPDE_III_Yggs_like, Pyridoxal 5-phosphate (PLP)-binding TIM barrel domain of Type III PLP-Dependent Enzymes, Yggs-like proteins	NA|345aa|down_3|NC_019902.2_292855_293890_+	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|391aa|down_4|NC_019902.2_293951_295124_+	COG5008, PilU, Tfp pilus assembly protein, ATPase PilU [Cell motility and secretion / Intracellular trafficking and secretion]	NA|377aa|down_5|NC_019902.2_295131_296262_+	COG5008, PilU, Tfp pilus assembly protein, ATPase PilU [Cell motility and secretion / Intracellular trafficking and secretion]	NA|431aa|down_6|NC_019902.2_296352_297645_-	PRK09357, pyrC, dihydroorotase; Validated	NA|336aa|down_7|NC_019902.2_297641_298649_-	PRK00856, pyrB, aspartate carbamoyltransferase catalytic subunit	NA|169aa|down_8|NC_019902.2_298645_299152_-	PRK05205, PRK05205, bifunctional pyr operon transcriptional regulator/uracil phosphoribosyltransferase PyrR	NA|134aa|down_9|NC_019902.2_299144_299546_-	PRK00109, PRK00109, Holliday junction resolvase RuvX
GCF_000321415.2_ASM32141v2	NC_019902	Thioalkalivibrio nitratireducens DSM 14787, complete sequence	3	1443767-1445908	1,3,1	PILER-CR,CRISPRCasFinder,CRT	no	cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	 Type III-B?,Type III-A,Type III-C,Type III-D,Type III-B	GTCAGAACGACTTCCCTGATGAAGAAGGGATTAAGAC,GTCAGAACGACTTCCCTGATGAAGAAGGGATTAAGAC,GTCAGAACGACTTCCCTGATGAAGAAGGGATTAAGAC	37,37,37	0	0	NA	NA	NA:NA:NA	30,30,30	30	TypeIII-B?,TypeIII-A,TypeIII-C,TypeIII-D,TypeIII-B	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	NA,NA|175aa|down_0|NC_019902.2_1448605_1449130_+	NA|164aa|up_9|NC_019902.2_1431180_1431672_+	PRK00117, recX, recombination regulator RecX; Reviewed	NA|872aa|up_8|NC_019902.2_1431737_1434353_+	PRK00252, alaS, alanyl-tRNA synthetase; Reviewed	NA|414aa|up_7|NC_019902.2_1434484_1435726_+	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|74aa|up_6|NC_019902.2_1435793_1436015_+	PRK01712, PRK01712, carbon storage regulator CsrA	NA|85aa|up_5|NC_019902.2_1436465_1436720_-	pfam02599, CsrA, Global regulator protein family	cas10|899aa|up_4|NC_019902.2_1436985_1439682_+	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csm2gr11|167aa|up_3|NC_019902.2_1439693_1440194_+	pfam03750, Csm2_III-A, Csm2 Type III-A	csm3gr7|237aa|up_2|NC_019902.2_1440215_1440926_+	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm4gr5|325aa|up_1|NC_019902.2_1440939_1441914_+	TIGR01903, Hypothetical_protein	csm5gr7|548aa|up_0|NC_019902.2_1441910_1443554_+	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	NA|175aa|down_0|NC_019902.2_1448605_1449130_+	NA	cas2|95aa|down_1|NC_019902.2_1449208_1449493_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|285aa|down_2|NC_019902.2_1449579_1450434_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|126aa|down_3|NC_019902.2_1450426_1450804_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	csx1|373aa|down_4|NC_019902.2_1451059_1452178_+	cd09741, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	cmr1gr7|419aa|down_5|NC_019902.2_1452360_1453617_+	COG1367, COG1367, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	cas10|625aa|down_6|NC_019902.2_1453616_1455491_+	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr3gr5|444aa|down_7|NC_019902.2_1455490_1456822_+	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cmr4gr7|294aa|down_8|NC_019902.2_1456853_1457735_+	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr5gr11|144aa|down_9|NC_019902.2_1457731_1458163_+	pfam09701, Cas_Cmr5, CRISPR-associated protein (Cas_Cmr5)
GCF_000321415.2_ASM32141v2	NC_019902	Thioalkalivibrio nitratireducens DSM 14787, complete sequence	4	1466440-1467532	2,4,2,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,csx1,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	Type III-B,Type III-A,Type III-C,Type III-D	GTCAGAACGACTTCCCTGATGAAGAAGGGATTAAGACCCCG,GTCAGAACGACTTCCCTGATGAAGAAGGGATTAAGAC,GTCAGAACGACTTCCCTGATGAAGAAGGGATTAAGAC,GTCAGAACGACTTCCCTGATGAAGAAGGGATTAAGAC	41,37,37,37	0	0	NA	NA	NA:NA:NA:NA	10,15,15,10	15	TypeIII-A,TypeIII-C,TypeIII-D,TypeIII-B	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	NA,NA	cmr5gr11|144aa|up_9|NC_019902.2_1457731_1458163_+	pfam09701, Cas_Cmr5, CRISPR-associated protein (Cas_Cmr5)	cmr6gr7|387aa|up_8|NC_019902.2_1458159_1459320_+	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	csx1|402aa|up_7|NC_019902.2_1459481_1460687_+	pfam09002, DUF1887, Domain of unknown function (DUF1887)	csx1|504aa|up_6|NC_019902.2_1460868_1462380_+	pfam09652, Cas_VVA1548, Putative CRISPR-associated protein (Cas_VVA1548)	NA|126aa|up_5|NC_019902.2_1462473_1462851_-	COG1569, COG1569, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	cas6|319aa|up_4|NC_019902.2_1462864_1463821_+	cd09760, Cas6_III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	cas2|95aa|up_3|NC_019902.2_1463908_1464193_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|330aa|up_2|NC_019902.2_1464198_1465188_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|271aa|up_1|NC_019902.2_1465184_1465997_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|110aa|up_0|NC_019902.2_1465987_1466317_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|415aa|down_0|NC_019902.2_1471245_1472490_-	TIGR02679, conserved_hypothetical_protein, TIGR02679 family protein	NA|418aa|down_1|NC_019902.2_1476831_1478085_-	pfam09661, DUF2398, Protein of unknown function (DUF2398)	NA|515aa|down_2|NC_019902.2_1478084_1479629_-	pfam09660, DUF2397, Protein of unknown function (DUF2397)	NA|836aa|down_3|NC_019902.2_1479767_1482275_-	cd01948, EAL, EAL domain	NA|105aa|down_4|NC_019902.2_1482450_1482765_-	pfam13591, MerR_2, MerR HTH family regulatory protein	NA|318aa|down_5|NC_019902.2_1482767_1483721_-	TIGR02349, Chaperone_protein_DnaJ, chaperone protein DnaJ	NA|146aa|down_6|NC_019902.2_1483882_1484320_+	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|132aa|down_7|NC_019902.2_1484337_1484733_+	cd06464, ACD_sHsps-like, Alpha-crystallin domain (ACD) of alpha-crystallin-type small(s) heat shock proteins (Hsps)	NA|133aa|down_8|NC_019902.2_1484900_1485299_+	COG1734, DksA, DnaK suppressor protein [Signal transduction mechanisms]	NA|556aa|down_9|NC_019902.2_1485479_1487147_-	PRK11819, PRK11819, putative ABC transporter ATP-binding protein; Reviewed
GCF_000321415.2_ASM32141v2	NC_019902	Thioalkalivibrio nitratireducens DSM 14787, complete sequence	5	1469893-1471052	3,4,5,5	CRT,PILER-CR,CRISPRCasFinder,PILER-CR	no	cas2,csx1,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas6,cas1	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	Type III-B,Type III-A,Type III-C,Type III-D	GTCAGAACGACTTCCCTGATGAAGAAGGGATTAAGAC,GTCAGAACGACTTCCCTGATGAAGAAGGGATTAAGAC,GTCAGAACGACTTCCCTGATGAAGAAGGGATTAAGAC,TGCGTCTTCTTCGTGGTCAGAACGACTTCCCTGATGAAGAAGGGATTAAGACT	37,37,37,53	0	0	NA	NA	NA:NA:NA:NA	16,15,15,15	16	TypeIII-A,TypeIII-C,TypeIII-D,TypeIII-B	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	NA,NA	cmr5gr11|144aa|up_9|NC_019902.2_1457731_1458163_+	pfam09701, Cas_Cmr5, CRISPR-associated protein (Cas_Cmr5)	cmr6gr7|387aa|up_8|NC_019902.2_1458159_1459320_+	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	csx1|402aa|up_7|NC_019902.2_1459481_1460687_+	pfam09002, DUF1887, Domain of unknown function (DUF1887)	csx1|504aa|up_6|NC_019902.2_1460868_1462380_+	pfam09652, Cas_VVA1548, Putative CRISPR-associated protein (Cas_VVA1548)	NA|126aa|up_5|NC_019902.2_1462473_1462851_-	COG1569, COG1569, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	cas6|319aa|up_4|NC_019902.2_1462864_1463821_+	cd09760, Cas6_III, CRISPR/Cas system-associated RAMP superfamily protein Cas6	cas2|95aa|up_3|NC_019902.2_1463908_1464193_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|330aa|up_2|NC_019902.2_1464198_1465188_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|271aa|up_1|NC_019902.2_1465184_1465997_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|110aa|up_0|NC_019902.2_1465987_1466317_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|415aa|down_0|NC_019902.2_1471245_1472490_-	TIGR02679, conserved_hypothetical_protein, TIGR02679 family protein	NA|418aa|down_1|NC_019902.2_1476831_1478085_-	pfam09661, DUF2398, Protein of unknown function (DUF2398)	NA|515aa|down_2|NC_019902.2_1478084_1479629_-	pfam09660, DUF2397, Protein of unknown function (DUF2397)	NA|836aa|down_3|NC_019902.2_1479767_1482275_-	cd01948, EAL, EAL domain	NA|105aa|down_4|NC_019902.2_1482450_1482765_-	pfam13591, MerR_2, MerR HTH family regulatory protein	NA|318aa|down_5|NC_019902.2_1482767_1483721_-	TIGR02349, Chaperone_protein_DnaJ, chaperone protein DnaJ	NA|146aa|down_6|NC_019902.2_1483882_1484320_+	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|132aa|down_7|NC_019902.2_1484337_1484733_+	cd06464, ACD_sHsps-like, Alpha-crystallin domain (ACD) of alpha-crystallin-type small(s) heat shock proteins (Hsps)	NA|133aa|down_8|NC_019902.2_1484900_1485299_+	COG1734, DksA, DnaK suppressor protein [Signal transduction mechanisms]	NA|556aa|down_9|NC_019902.2_1485479_1487147_-	PRK11819, PRK11819, putative ABC transporter ATP-binding protein; Reviewed
GCF_000321415.2_ASM32141v2	NC_019902	Thioalkalivibrio nitratireducens DSM 14787, complete sequence	6	2346451-2346549	6	CRISPRCasFinder	no		cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	Orphan	GCGTTGCTCAGGTCGGCCCCGGCA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	NA|410aa|up_9|NC_019902.2_2331733_2332963_+,NA|138aa|up_7|NC_019902.2_2334939_2335353_-,NA|120aa|up_3|NC_019902.2_2338982_2339342_+,NA|273aa|up_1|NC_019902.2_2341150_2341969_+,NA|1112aa|down_0|NC_019902.2_2346922_2350258_-,NA|95aa|down_4|NC_019902.2_2355017_2355302_+,NA|143aa|down_7|NC_019902.2_2357008_2357437_+	NA|410aa|up_9|NC_019902.2_2331733_2332963_+	NA	NA|417aa|up_8|NC_019902.2_2333597_2334848_-	COG3177, COG3177, Fic family protein [Function unknown]	NA|138aa|up_7|NC_019902.2_2334939_2335353_-	NA	NA|140aa|up_6|NC_019902.2_2335768_2336188_-	cd18735, PIN_HiVapC1-like, VapC-like PIN domain of Haemophilus influenzae VapC1 and related proteins	NA|88aa|up_5|NC_019902.2_2336184_2336448_-	COG4456, VagC, Virulence-associated protein and related proteins [Function unknown]	NA|432aa|up_4|NC_019902.2_2337151_2338447_+	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|120aa|up_3|NC_019902.2_2338982_2339342_+	NA	NA|148aa|up_2|NC_019902.2_2339785_2340229_-	cd02215, cupin_QDO_N_C, quercetinase, N- and C-terminal cupin domains	NA|273aa|up_1|NC_019902.2_2341150_2341969_+	NA	NA|197aa|up_0|NC_019902.2_2344011_2344602_-	pfam02872, 5_nucleotid_C, 5'-nucleotidase, C-terminal domain	NA|1112aa|down_0|NC_019902.2_2346922_2350258_-	NA	NA|804aa|down_1|NC_019902.2_2350382_2352794_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|220aa|down_2|NC_019902.2_2353273_2353933_-	cd03379, beta_CA_cladeD, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|274aa|down_3|NC_019902.2_2354192_2355014_+	pfam14236, DUF4338, Domain of unknown function (DUF4338)	NA|95aa|down_4|NC_019902.2_2355017_2355302_+	NA	NA|233aa|down_5|NC_019902.2_2355229_2355928_-	pfam13523, Acetyltransf_8, Acetyltransferase (GNAT) domain	NA|111aa|down_6|NC_019902.2_2356133_2356466_-	COG0262, FolA, Dihydrofolate reductase [Coenzyme metabolism]	NA|143aa|down_7|NC_019902.2_2357008_2357437_+	NA	NA|310aa|down_8|NC_019902.2_2357436_2358366_+	cd19095, AKR_PA4992-like, Pseudomona aeruginosa PA4992 and similar proteins	NA|228aa|down_9|NC_019902.2_2358373_2359057_+	PRK02983, lysS, bifunctional lysylphosphatidylglycerol synthetase/lysine--tRNA ligase LysX
GCF_000321415.2_ASM32141v2	NC_019902	Thioalkalivibrio nitratireducens DSM 14787, complete sequence	7	2520190-2523886	6,7,4,7,8	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	Type I-F	ATTTCTGAGCTGCCTGCGCGGCAGCGAAC,TTTCTGAGCTGCCTGCGCGGCAGCGAAC,TTTCTGAGCTGCCTGCGCGGCAGCGAAC,TTTCTGAGCTGCCTGCGCGGCAGCGAAC,TTTCTGAGCTGCCTGCGCGGCAGCGAAC	29,28,28,28,28	0	0	NA	NA	I-F:I-F:I-F:I-F:I-F	59,61,61,59,59	61	TypeI-F	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	NA|371aa|up_3|NC_019902.2_2513065_2514178_+,NA|88aa|down_6|NC_019902.2_2532785_2533049_+	NA|844aa|up_9|NC_019902.2_2503876_2506408_-	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|241aa|up_8|NC_019902.2_2506445_2507168_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|455aa|up_7|NC_019902.2_2507775_2509140_+	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|454aa|up_6|NC_019902.2_2509153_2510515_+	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|363aa|up_5|NC_019902.2_2510705_2511794_-	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|142aa|up_4|NC_019902.2_2511914_2512340_+	cd04785, HTH_CadR-PbrR-like, Helix-Turn-Helix DNA binding domain of the CadR- and PbrR-like transcription regulators	NA|371aa|up_3|NC_019902.2_2513065_2514178_+	NA	NA|772aa|up_2|NC_019902.2_2514744_2517060_+	pfam14236, DUF4338, Domain of unknown function (DUF4338)	NA|478aa|up_1|NC_019902.2_2517940_2519374_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|222aa|up_0|NC_019902.2_2519389_2520055_-	COG0338, Dam, Site-specific DNA methylase [DNA replication, recombination, and repair]	cas6f|187aa|down_0|NC_019902.2_2524018_2524579_-	pfam09618, Cas_Csy4, CRISPR-associated protein (Cas_Csy4)	cas7f|347aa|down_1|NC_019902.2_2524582_2525623_-	pfam09615, Cas_Csy3, CRISPR-associated protein (Cas_Csy3)	cas5f|332aa|down_2|NC_019902.2_2525640_2526636_-	pfam09614, Cas_Csy2, CRISPR-associated protein (Cas_Csy2)	cas8f|422aa|down_3|NC_019902.2_2526628_2527894_-	pfam09611, Cas_Csy1, CRISPR-associated protein (Cas_Csy1)	cas3-cas2|1122aa|down_4|NC_019902.2_2528166_2531532_-	TIGR02562, conserved_hypothetical_protein, CRISPR-associated helicase Cas3, subtype I-F/YPEST	cas1|326aa|down_5|NC_019902.2_2531528_2532506_-	TIGR03637, cas1_YPEST, CRISPR-associated endonuclease Cas1, subtype I-F/YPEST	NA|88aa|down_6|NC_019902.2_2532785_2533049_+	NA	NA|156aa|down_7|NC_019902.2_2533059_2533527_-	COG3150, COG3150, Predicted esterase [General function prediction only]	NA|98aa|down_8|NC_019902.2_2533523_2533817_-	cd16331, YjgA-like, uncharacterized proteins similar to Escherichia coli YjgA	NA|397aa|down_9|NC_019902.2_2534818_2536009_+	pfam03417, AAT, Acyl-coenzyme A:6-aminopenicillanic acid acyl-transferase
GCF_000321415.2_ASM32141v2	NC_019902	Thioalkalivibrio nitratireducens DSM 14787, complete sequence	8	2926723-2927422	5,9	CRT,PILER-CR	no	WYL	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	Unclear	CGGTTCATCCCCGCGCGGGCGGGGAACGC,CGGTTCATCCCCGCGCGGGCGGGGAACGC	29,29	0	0	NA	NA	I-E:I-E	11,9	11	Orphan	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	NA|259aa|up_4|NC_019902.2_2919686_2920463_+,NA|101aa|up_2|NC_019902.2_2924850_2925153_-,NA|96aa|up_0|NC_019902.2_2925788_2926076_-,NA|242aa|down_5|NC_019902.2_2937390_2938116_+,NA|160aa|down_8|NC_019902.2_2940708_2941188_-	NA|342aa|up_9|NC_019902.2_2912625_2913651_+	cd16282, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	WYL|323aa|up_8|NC_019902.2_2913754_2914723_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|483aa|up_7|NC_019902.2_2915184_2916633_+	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|497aa|up_6|NC_019902.2_2916636_2918127_+	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|261aa|up_5|NC_019902.2_2918209_2918992_-	cd01400, 6PGL, 6PGL: 6-Phosphogluconolactonase (6PGL) subfamily; 6PGL catalyzes the second step of the oxidative phase of the pentose phosphate pathway, the hydrolyzation of 6-phosphoglucono-1,5-lactone (delta form) to 6-phosphogluconate	NA|259aa|up_4|NC_019902.2_2919686_2920463_+	NA	NA|1371aa|up_3|NC_019902.2_2920550_2924663_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|101aa|up_2|NC_019902.2_2924850_2925153_-	NA	NA|95aa|up_1|NC_019902.2_2925504_2925789_-	COG3668, ParE, Plasmid stabilization system protein [General function prediction only]	NA|96aa|up_0|NC_019902.2_2925788_2926076_-	NA	NA|293aa|down_0|NC_019902.2_2928715_2929594_+	pfam14236, DUF4338, Domain of unknown function (DUF4338)	NA|461aa|down_1|NC_019902.2_2929590_2930973_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|76aa|down_2|NC_019902.2_2935485_2935713_+	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|111aa|down_3|NC_019902.2_2935712_2936045_+	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|91aa|down_4|NC_019902.2_2936304_2936577_-	pfam15738, YafQ_toxin, Bacterial toxin of type II toxin-antitoxin system, YafQ	NA|242aa|down_5|NC_019902.2_2937390_2938116_+	NA	NA|324aa|down_6|NC_019902.2_2938115_2939087_+	cd00685, Trans_IPPS_HT, Trans-Isoprenyl Diphosphate Synthases, head-to-tail	NA|247aa|down_7|NC_019902.2_2939078_2939819_-	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|160aa|down_8|NC_019902.2_2940708_2941188_-	NA	NA|209aa|down_9|NC_019902.2_2942014_2942641_+	COG2119, COG2119, Predicted membrane protein [Function unknown]
GCF_000321415.2_ASM32141v2	NC_019902	Thioalkalivibrio nitratireducens DSM 14787, complete sequence	9	2928155-2928603	6,10,8	CRT,PILER-CR,CRISPRCasFinder	no	WYL	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	Unclear	TTTCTGAGCTGCCTGCGCGGCAGCGAAC,TTTCTGAGCTGCCTGCGCGGCAGCGAAC,TTTCTGAGCTGCCTGCGCGGCAGCGAAC	28,28,28	0	0	NA	NA	I-F:I-F:I-F	7,6,6	7	Orphan	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	NA|259aa|up_4|NC_019902.2_2919686_2920463_+,NA|101aa|up_2|NC_019902.2_2924850_2925153_-,NA|96aa|up_0|NC_019902.2_2925788_2926076_-,NA|242aa|down_5|NC_019902.2_2937390_2938116_+,NA|160aa|down_8|NC_019902.2_2940708_2941188_-	NA|342aa|up_9|NC_019902.2_2912625_2913651_+	cd16282, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	WYL|323aa|up_8|NC_019902.2_2913754_2914723_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|483aa|up_7|NC_019902.2_2915184_2916633_+	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|497aa|up_6|NC_019902.2_2916636_2918127_+	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|261aa|up_5|NC_019902.2_2918209_2918992_-	cd01400, 6PGL, 6PGL: 6-Phosphogluconolactonase (6PGL) subfamily; 6PGL catalyzes the second step of the oxidative phase of the pentose phosphate pathway, the hydrolyzation of 6-phosphoglucono-1,5-lactone (delta form) to 6-phosphogluconate	NA|259aa|up_4|NC_019902.2_2919686_2920463_+	NA	NA|1371aa|up_3|NC_019902.2_2920550_2924663_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|101aa|up_2|NC_019902.2_2924850_2925153_-	NA	NA|95aa|up_1|NC_019902.2_2925504_2925789_-	COG3668, ParE, Plasmid stabilization system protein [General function prediction only]	NA|96aa|up_0|NC_019902.2_2925788_2926076_-	NA	NA|293aa|down_0|NC_019902.2_2928715_2929594_+	pfam14236, DUF4338, Domain of unknown function (DUF4338)	NA|461aa|down_1|NC_019902.2_2929590_2930973_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|76aa|down_2|NC_019902.2_2935485_2935713_+	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|111aa|down_3|NC_019902.2_2935712_2936045_+	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|91aa|down_4|NC_019902.2_2936304_2936577_-	pfam15738, YafQ_toxin, Bacterial toxin of type II toxin-antitoxin system, YafQ	NA|242aa|down_5|NC_019902.2_2937390_2938116_+	NA	NA|324aa|down_6|NC_019902.2_2938115_2939087_+	cd00685, Trans_IPPS_HT, Trans-Isoprenyl Diphosphate Synthases, head-to-tail	NA|247aa|down_7|NC_019902.2_2939078_2939819_-	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|160aa|down_8|NC_019902.2_2940708_2941188_-	NA	NA|209aa|down_9|NC_019902.2_2942014_2942641_+	COG2119, COG2119, Predicted membrane protein [Function unknown]
GCF_000321415.2_ASM32141v2	NC_019902	Thioalkalivibrio nitratireducens DSM 14787, complete sequence	10	2931055-2932044	11,9,7	PILER-CR,CRISPRCasFinder,CRT	no	WYL	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	Unclear	TTTCTGAGCTGCCTGCGCGGCAGCGAAC,TTTCTGAGCTGCCTGCGCGGCAGCGAAC,TTTCTGAGCTGCCTGCGCGGCAGCGAAC	28,28,28	0	0	NA	NA	I-F:I-F:I-F	16,16,16	16	Orphan	cas3,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,cas2,cas1,csx1,cmr1gr7,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,csa3,RT,cas6f,cas7f,cas5f,cas8f,cas3-cas2,DEDDh,WYL,DinG,PrimPol	NA|259aa|up_6|NC_019902.2_2919686_2920463_+,NA|101aa|up_4|NC_019902.2_2924850_2925153_-,NA|96aa|up_2|NC_019902.2_2925788_2926076_-,NA|242aa|down_3|NC_019902.2_2937390_2938116_+,NA|160aa|down_6|NC_019902.2_2940708_2941188_-	NA|483aa|up_9|NC_019902.2_2915184_2916633_+	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|497aa|up_8|NC_019902.2_2916636_2918127_+	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|261aa|up_7|NC_019902.2_2918209_2918992_-	cd01400, 6PGL, 6PGL: 6-Phosphogluconolactonase (6PGL) subfamily; 6PGL catalyzes the second step of the oxidative phase of the pentose phosphate pathway, the hydrolyzation of 6-phosphoglucono-1,5-lactone (delta form) to 6-phosphogluconate	NA|259aa|up_6|NC_019902.2_2919686_2920463_+	NA	NA|1371aa|up_5|NC_019902.2_2920550_2924663_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|101aa|up_4|NC_019902.2_2924850_2925153_-	NA	NA|95aa|up_3|NC_019902.2_2925504_2925789_-	COG3668, ParE, Plasmid stabilization system protein [General function prediction only]	NA|96aa|up_2|NC_019902.2_2925788_2926076_-	NA	NA|293aa|up_1|NC_019902.2_2928715_2929594_+	pfam14236, DUF4338, Domain of unknown function (DUF4338)	NA|461aa|up_0|NC_019902.2_2929590_2930973_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|76aa|down_0|NC_019902.2_2935485_2935713_+	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|111aa|down_1|NC_019902.2_2935712_2936045_+	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|91aa|down_2|NC_019902.2_2936304_2936577_-	pfam15738, YafQ_toxin, Bacterial toxin of type II toxin-antitoxin system, YafQ	NA|242aa|down_3|NC_019902.2_2937390_2938116_+	NA	NA|324aa|down_4|NC_019902.2_2938115_2939087_+	cd00685, Trans_IPPS_HT, Trans-Isoprenyl Diphosphate Synthases, head-to-tail	NA|247aa|down_5|NC_019902.2_2939078_2939819_-	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|160aa|down_6|NC_019902.2_2940708_2941188_-	NA	NA|209aa|down_7|NC_019902.2_2942014_2942641_+	COG2119, COG2119, Predicted membrane protein [Function unknown]	NA|321aa|down_8|NC_019902.2_2942757_2943720_+	TIGR01188, drrA, daunorubicin resistance ABC transporter ATP-binding subunit	NA|255aa|down_9|NC_019902.2_2943719_2944484_+	TIGR00025, Mtu_efflux, ABC transporter efflux protein, DrrB family
