assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000012725.1_ASM1272v1	NC_007406	Nitrobacter winogradskyi Nb-255, complete sequence	1	520959-521044	1	CRISPRCasFinder	no		DEDDh,cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	Orphan	AGCATGATCCCGAAAAGTGGGCACCGGTT	29	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	NA|102aa|up_0|NC_007406.1_520652_520958_+,NA	NA|91aa|up_9|NC_007406.1_511398_511671_-	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|193aa|up_8|NC_007406.1_511865_512444_-	COG3753, COG3753, Uncharacterized protein conserved in bacteria [Function unknown]	NA|256aa|up_7|NC_007406.1_512812_513580_-	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]	NA|401aa|up_6|NC_007406.1_513666_514869_-	cd13553, PBP2_NrtA_CpmA_like, Substrate binding domain of ABC-type nitrate/bicarbonate transporters, a member of the type 2 periplasmic binding fold superfamily	NA|271aa|up_5|NC_007406.1_514924_515737_-	COG1116, TauB, ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|375aa|up_4|NC_007406.1_515733_516858_-	cd01156, IVD, Isovaleryl-CoA dehydrogenase	NA|191aa|up_3|NC_007406.1_516854_517427_-	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|349aa|up_2|NC_007406.1_517749_518796_+	TIGR04122, hypothetical_protein, putative exonuclease, DNA ligase-associated	NA|595aa|up_1|NC_007406.1_518792_520577_+	PRK09247, PRK09247, ATP-dependent DNA ligase; Validated	NA|102aa|up_0|NC_007406.1_520652_520958_+	NA	NA|365aa|down_0|NC_007406.1_521078_522173_-	PRK05286, PRK05286, quinone-dependent dihydroorotate dehydrogenase	NA|115aa|down_1|NC_007406.1_522169_522514_-	COG3502, COG3502, Uncharacterized protein conserved in bacteria [Function unknown]	NA|209aa|down_2|NC_007406.1_522739_523366_+	COG2932, COG2932, Predicted transcriptional regulator [Transcription]	NA|653aa|down_3|NC_007406.1_523435_525394_-	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|138aa|down_4|NC_007406.1_525554_525968_-	pfam01878, EVE, EVE domain	NA|327aa|down_5|NC_007406.1_525970_526951_-	PRK00094, gpsA, NAD(P)H-dependent glycerol-3-phosphate dehydrogenase	NA|358aa|down_6|NC_007406.1_526965_528039_-	PRK09604, PRK09604, tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex transferase subunit TsaD	NA|317aa|down_7|NC_007406.1_528129_529080_+	PRK00072, hemC, porphobilinogen deaminase; Reviewed	NA|249aa|down_8|NC_007406.1_529084_529831_+	cd06578, HemD, Uroporphyrinogen-III synthase (HemD) catalyzes the asymmetrical cyclization of tetrapyrrole (linear) to uroporphyrinogen-III, the fourth step in the biosynthesis of heme	NA|506aa|down_9|NC_007406.1_529893_531411_+	COG4223, COG4223, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000012725.1_ASM1272v1	NC_007406	Nitrobacter winogradskyi Nb-255, complete sequence	2	1546861-1546962	2	CRISPRCasFinder	no		DEDDh,cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	Orphan	ACCGCTTCGCGTGAAGAAAACGCGTCAAAACAATAAT	37	1	1	1546898-1546925	NC_007406.1_453839-453866	NA	1	1	Orphan	DEDDh,cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	NA,NA	NA|394aa|up_9|NC_007406.1_1534193_1535375_+	COG3268, COG3268, Uncharacterized conserved protein [Function unknown]	NA|421aa|up_8|NC_007406.1_1535425_1536688_-	pfam04932, Wzy_C, O-Antigen ligase	NA|511aa|up_7|NC_007406.1_1536721_1538254_-	TIGR03023, Sugar_transferase	NA|378aa|up_6|NC_007406.1_1538361_1539495_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|235aa|up_5|NC_007406.1_1539498_1540203_-	COG1596, Wza, Periplasmic protein involved in polysaccharide export, contains    SLBB domain of b-grasp fold [Cell wall/membrane/envelope biogenesis]	NA|716aa|up_4|NC_007406.1_1540450_1542598_+	COG3206, GumC, Uncharacterized protein involved in exopolysaccharide biosynthesis [Cell envelope biogenesis, outer membrane]	NA|398aa|up_3|NC_007406.1_1542620_1543814_-	COG5653, COG5653, Protein involved in cellulose biosynthesis (CelD) [Cell envelope biogenesis, outer membrane]	NA|69aa|up_2|NC_007406.1_1543935_1544142_-	pfam11003, DUF2842, Protein of unknown function (DUF2842)	NA|361aa|up_1|NC_007406.1_1544239_1545322_+	pfam02628, COX15-CtaA, Cytochrome oxidase assembly protein	NA|419aa|up_0|NC_007406.1_1545581_1546838_+	COG2223, NarK, Nitrate/nitrite transporter [Inorganic ion transport and metabolism]	NA|426aa|down_0|NC_007406.1_1547179_1548457_-	PRK05994, PRK05994, O-acetylhomoserine aminocarboxypropyltransferase; Validated	NA|155aa|down_1|NC_007406.1_1548834_1549299_+	PRK09216, rplM, 50S ribosomal protein L13; Reviewed	NA|159aa|down_2|NC_007406.1_1549301_1549778_+	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|381aa|down_3|NC_007406.1_1549905_1551048_+	cd00254, LT-like, lytic transglycosylase(LT)-like domain	NA|161aa|down_4|NC_007406.1_1551131_1551614_+	cd09281, UPF0066, Escherichia coli YaeB and related proteins	NA|414aa|down_5|NC_007406.1_1551628_1552870_-	pfam04339, FemAB_like, Peptidogalycan biosysnthesis/recognition	NA|251aa|down_6|NC_007406.1_1553004_1553757_-	cd08585, GDPD_like_3, Glycerophosphodiester phosphodiesterase domain of uncharacterized bacterial glycerophosphodiester phosphodiesterases	NA|156aa|down_7|NC_007406.1_1553763_1554231_-	cd02199, YjgF_YER057c_UK114_like_1, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function	NA|296aa|down_8|NC_007406.1_1554565_1555453_+	pfam08904, DUF1849, Domain of unknown function (DUF1849)	NA|430aa|down_9|NC_007406.1_1555503_1556793_-	PRK02794, PRK02794, DNA polymerase IV; Provisional
GCF_000012725.1_ASM1272v1	NC_007406	Nitrobacter winogradskyi Nb-255, complete sequence	3	1937700-1937773	3	CRISPRCasFinder	no		DEDDh,cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	Orphan	TTCATGGTACGGCTATCAGCGCCG	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	NA,NA	NA|226aa|up_9|NC_007406.1_1926935_1927613_+	PRK00301, aat, leucyl/phenylalanyl-tRNA--protein transferase; Reviewed	NA|310aa|up_8|NC_007406.1_1927794_1928724_-	pfam09923, DUF2155, Uncharacterized protein conserved in bacteria (DUF2155)	NA|593aa|up_7|NC_007406.1_1929152_1930931_+	pfam04773, FecR, FecR protein	NA|137aa|up_6|NC_007406.1_1930980_1931391_-	PRK08183, PRK08183, NADH:ubiquinone oxidoreductase subunit NDUFA12	NA|94aa|up_5|NC_007406.1_1931923_1932205_+	pfam14542, Acetyltransf_CG, GCN5-related N-acetyl-transferase	NA|125aa|up_4|NC_007406.1_1932336_1932711_-	pfam12200, DUF3597, Domain of unknown function (DUF3597)	NA|310aa|up_3|NC_007406.1_1933054_1933984_+	COG5473, COG5473, Predicted integral membrane protein [Function unknown]	NA|478aa|up_2|NC_007406.1_1934175_1935609_+	PRK05335, PRK05335, tRNA (uracil-5-)-methyltransferase Gid; Reviewed	NA|367aa|up_1|NC_007406.1_1935636_1936737_+	PRK09188, PRK09188, serine/threonine protein kinase; Provisional	NA|287aa|up_0|NC_007406.1_1936743_1937604_-	pfam00494, SQS_PSY, Squalene/phytoene synthase	NA|353aa|down_0|NC_007406.1_1938133_1939192_-	PRK14726, PRK14726, protein translocase subunit SecDF	NA|533aa|down_1|NC_007406.1_1939208_1940807_-	PRK14726, PRK14726, protein translocase subunit SecDF	NA|134aa|down_2|NC_007406.1_1940840_1941242_-	pfam02699, YajC, Preprotein translocase subunit	NA|325aa|down_3|NC_007406.1_1941429_1942404_+	pfam05673, DUF815, Protein of unknown function (DUF815)	NA|454aa|down_4|NC_007406.1_1942543_1943905_-	COG0739, NlpD, Membrane proteins related to metalloendopeptidases [Cell envelope biogenesis, outer membrane]	NA|218aa|down_5|NC_007406.1_1944053_1944707_-	PRK00312, pcm, protein-L-isoaspartate(D-aspartate) O-methyltransferase	NA|127aa|down_6|NC_007406.1_1944967_1945348_+	cd17586, REC_PFxFATGY, phosphoacceptor receiver (REC) domain of PFxFATGY motif single-domain (stand-alone) response regulators	NA|256aa|down_7|NC_007406.1_1945413_1946181_-	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional	NA|498aa|down_8|NC_007406.1_1946504_1947998_-	COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal structure and biogenesis]	NA|270aa|down_9|NC_007406.1_1948105_1948915_-	COG0805, TatC, Sec-independent protein secretion pathway component TatC [Intracellular trafficking and secretion]
GCF_000012725.1_ASM1272v1	NC_007406	Nitrobacter winogradskyi Nb-255, complete sequence	4	2121506-2122660	1,4,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	DEDDh,cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	Type I-U,Type I-C, Type I-U?	GTTTCGACCCACGCCCCCGCGAAGGGGGCGAC,GTTTCGACCCACGCCCCCGCGAAGGGGGCGAC,GTTTCGACCCACGCCCCCGCGAAGGGGGCGAC	32,32,32	0	0	NA	NA	NA:NA:NA	17,17,17	17	TypeI-U?,TypeI-C,TypeI-U	DEDDh,cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	NA|90aa|up_6|NC_007406.1_2111613_2111883_-,NA|137aa|down_8|NC_007406.1_2131535_2131946_-	NA|131aa|up_9|NC_007406.1_2109511_2109904_-	PRK00392, rpoZ, DNA-directed RNA polymerase subunit omega; Reviewed	NA|206aa|up_8|NC_007406.1_2110248_2110866_+	cd10911, PIN_LabA, PIN domain of Synechococcus elongatus LabA (low-amplitude and bright) and related proteins	NA|226aa|up_7|NC_007406.1_2110843_2111521_+	cd10031, UDG-F5_TTUDGB_like, Uracil DNA glycosylase family 5, includes Thermotoga maritima TTUDGB and similar proteins	NA|90aa|up_6|NC_007406.1_2111613_2111883_-	NA	NA|158aa|up_5|NC_007406.1_2111973_2112447_-	PRK05422, smpB, SsrA-binding protein SmpB	NA|140aa|up_4|NC_007406.1_2112511_2112931_-	PRK13952, mscL, large conductance mechanosensitive channel protein MscL	NA|297aa|up_3|NC_007406.1_2113110_2114001_-	PRK03170, PRK03170, dihydrodipicolinate synthase; Provisional	NA|736aa|up_2|NC_007406.1_2114361_2116569_+	cd13401, Slt70-like, 70kDa soluble lytic transglycosylase (Slt70) and similar proteins	NA|451aa|up_1|NC_007406.1_2116943_2118295_+	pfam00665, rve, Integrase core domain	NA|523aa|up_0|NC_007406.1_2118831_2120400_-	pfam02530, Porin_2, Porin subfamily	NA|253aa|down_0|NC_007406.1_2122883_2123641_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	cas2|97aa|down_1|NC_007406.1_2123743_2124034_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|345aa|down_2|NC_007406.1_2124040_2125075_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|198aa|down_3|NC_007406.1_2125071_2125665_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas7|317aa|down_4|NC_007406.1_2125726_2126677_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|594aa|down_5|NC_007406.1_2126676_2128458_-	cd09757, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas5|231aa|down_6|NC_007406.1_2128454_2129147_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|745aa|down_7|NC_007406.1_2129210_2131445_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	NA|137aa|down_8|NC_007406.1_2131535_2131946_-	NA	NA|444aa|down_9|NC_007406.1_2131948_2133280_-	smart00857, Resolvase, Resolvase, N terminal domain
GCF_000012725.1_ASM1272v1	NC_007406	Nitrobacter winogradskyi Nb-255, complete sequence	5	2756399-2756487	5	CRISPRCasFinder	no		DEDDh,cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	Orphan	AACCGGTGCCCACTTCGCTCGAAAAC	26	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5	NA|152aa|up_6|NC_007406.1_2750356_2750812_-,NA|97aa|up_5|NC_007406.1_2750978_2751269_+,NA|243aa|down_7|NC_007406.1_2765810_2766539_+	NA|207aa|up_9|NC_007406.1_2747652_2748273_+	COG3637, COG3637, Opacity protein and related surface antigens [Cell envelope biogenesis, outer membrane]	NA|285aa|up_8|NC_007406.1_2748373_2749228_-	COG2961, ComJ, Protein involved in catabolism of external DNA [General function prediction only]	NA|230aa|up_7|NC_007406.1_2749325_2750015_-	cd01062, RNase_T2_prok, Ribonuclease T2 (RNase T2) is a widespread family of secreted RNases found in every organism examined thus far	NA|152aa|up_6|NC_007406.1_2750356_2750812_-	NA	NA|97aa|up_5|NC_007406.1_2750978_2751269_+	NA	NA|177aa|up_4|NC_007406.1_2751295_2751826_+	pfam12276, DUF3617, Protein of unknown function (DUF3617)	NA|523aa|up_3|NC_007406.1_2751883_2753452_-	COG0124, HisS, Histidyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]	NA|115aa|up_2|NC_007406.1_2753684_2754029_+	COG2852, COG2852, Very-short-patch-repair endonuclease [Replication, recombination,    and repair]	NA|537aa|up_1|NC_007406.1_2754060_2755671_-	COG4655, COG4655, Predicted membrane protein [Function unknown]	NA|147aa|up_0|NC_007406.1_2755736_2756177_-	COG4961, TadG, Flp pilus assembly protein TadG [Intracellular trafficking and secretion]	NA|446aa|down_0|NC_007406.1_2756654_2757992_+	PRK09221, PRK09221, beta alanine--pyruvate transaminase; Provisional	NA|480aa|down_1|NC_007406.1_2758111_2759551_+	pfam00563, EAL, EAL domain	NA|285aa|down_2|NC_007406.1_2759566_2760421_+	cd07525, HAD_like, uncharacterized family of the haloacid dehalogenase-like (HAD) hydrolase superfamily	NA|132aa|down_3|NC_007406.1_2760478_2760874_-	cd19923, REC_CheY_CheY3, phosphoacceptor receiver (REC) domain of chemotaxis response regulator CheY3 and similar CheY family proteins	NA|328aa|down_4|NC_007406.1_2761141_2762125_+	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|1000aa|down_5|NC_007406.1_2762247_2765247_+	PRK13804, ileS, isoleucyl-tRNA synthetase; Provisional	NA|166aa|down_6|NC_007406.1_2765243_2765741_+	PRK14796, PRK14796, lipoprotein signal peptidase; Provisional	NA|243aa|down_7|NC_007406.1_2765810_2766539_+	NA	NA|465aa|down_8|NC_007406.1_2767122_2768517_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|465aa|down_9|NC_007406.1_2768513_2769908_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]
