assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000619905.2_ASM61990v2	NZ_CP012371	Nitrosospira briensis C-128, complete genome	1	230773-230864	1	CRISPRCasFinder	no	DEDDh	DEDDh,DinG,RT,cas3,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	Unclear	AGGTCGATCAAAACGTCATTATTTTGATTTTTC	33	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,RT,cas3,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	NA|90aa|up_8|NZ_CP012371.1_225180_225450_+,NA|110aa|up_6|NZ_CP012371.1_226033_226363_+,NA|67aa|up_3|NZ_CP012371.1_228236_228437_+,NA|125aa|up_2|NZ_CP012371.1_228411_228786_-,NA|199aa|up_1|NZ_CP012371.1_228797_229394_+,NA|402aa|down_0|NZ_CP012371.1_231534_232740_+	NA|289aa|up_9|NZ_CP012371.1_223992_224859_+	pfam04383, KilA-N, KilA-N domain	NA|90aa|up_8|NZ_CP012371.1_225180_225450_+	NA	NA|161aa|up_7|NZ_CP012371.1_225446_225929_+	COG4226, HicB, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|110aa|up_6|NZ_CP012371.1_226033_226363_+	NA	NA|132aa|up_5|NZ_CP012371.1_226715_227111_-	pfam15919, HicB_lk_antitox, HicB_like antitoxin of bacterial toxin-antitoxin system	NA|163aa|up_4|NZ_CP012371.1_227448_227937_+	pfam10386, DUF2441, Protein of unknown function (DUF2441)	NA|67aa|up_3|NZ_CP012371.1_228236_228437_+	NA	NA|125aa|up_2|NZ_CP012371.1_228411_228786_-	NA	NA|199aa|up_1|NZ_CP012371.1_228797_229394_+	NA	NA|116aa|up_0|NZ_CP012371.1_229468_229816_+	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|402aa|down_0|NZ_CP012371.1_231534_232740_+	NA	NA|1052aa|down_1|NZ_CP012371.1_232888_236044_-	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|106aa|down_2|NZ_CP012371.1_236787_237105_+	PRK02909, PRK02909, flagellar transcriptional regulator FlhD	NA|183aa|down_3|NZ_CP012371.1_237190_237739_+	PRK12722, PRK12722, flagellar transcriptional regulator FlhC	NA|287aa|down_4|NZ_CP012371.1_238121_238982_+	PRK09110, PRK09110, flagellar motor stator protein MotA	NA|310aa|down_5|NZ_CP012371.1_238994_239924_+	PRK09041, motB, motility protein MotB	NA|394aa|down_6|NZ_CP012371.1_240209_241391_+	PRK05702, flhB, flagellar type III secretion system protein FlhB	NA|693aa|down_7|NZ_CP012371.1_241387_243466_+	PRK06012, flhA, flagellar type III secretion system protein FlhA	NA|545aa|down_8|NZ_CP012371.1_243462_245097_+	PRK06995, flhF, flagellar biosynthesis protein FlhF	NA|298aa|down_9|NZ_CP012371.1_245089_245983_+	cd02038, FlhG-like, MinD-like ATPase FlhG
GCF_000619905.2_ASM61990v2	NZ_CP012371	Nitrosospira briensis C-128, complete genome	2	839193-839296	2	CRISPRCasFinder	no		DEDDh,DinG,RT,cas3,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	Orphan	TGTAGATATAGTCGATCACGTCGTGCCGATCG	32	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,RT,cas3,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	NA|76aa|up_9|NZ_CP012371.1_827237_827465_-,NA	NA|76aa|up_9|NZ_CP012371.1_827237_827465_-	NA	NA|410aa|up_8|NZ_CP012371.1_827622_828852_-	pfam03747, ADP_ribosyl_GH, ADP-ribosylglycohydrolase	NA|405aa|up_7|NZ_CP012371.1_829090_830305_+	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|94aa|up_6|NZ_CP012371.1_830439_830721_+	COG3549, HigB, Plasmid maintenance system killer protein [General function prediction only]	NA|101aa|up_5|NZ_CP012371.1_830759_831062_+	TIGR02607, Virulence-associated_protein_I, addiction module antidote protein, HigA family	NA|235aa|up_4|NZ_CP012371.1_831645_832350_+	COG1136, SalX, ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms]	NA|856aa|up_3|NZ_CP012371.1_832337_834905_+	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|355aa|up_2|NZ_CP012371.1_835003_836068_+	pfam07143, CrtC, CrtC N-terminal lipocalin domain	NA|487aa|up_1|NZ_CP012371.1_836102_837563_-	PRK05777, PRK05777, NADH-quinone oxidoreductase subunit NuoN	NA|502aa|up_0|NZ_CP012371.1_837559_839065_-	PRK05846, PRK05846, NADH:ubiquinone oxidoreductase subunit M; Reviewed	NA|103aa|down_0|NZ_CP012371.1_841036_841345_-	PRK05715, PRK05715, NADH-quinone oxidoreductase subunit NuoK	NA|175aa|down_1|NZ_CP012371.1_841349_841874_-	PRK06638, PRK06638, NADH-quinone oxidoreductase subunit J	NA|172aa|down_2|NZ_CP012371.1_841906_842422_-	PRK05888, PRK05888, NADH-quinone oxidoreductase subunit NuoI	NA|319aa|down_3|NZ_CP012371.1_842459_843416_-	PRK06076, PRK06076, NADH-quinone oxidoreductase subunit NuoH	NA|919aa|down_4|NZ_CP012371.1_843412_846169_-	PRK08166, PRK08166, NADH-quinone oxidoreductase subunit NuoG	NA|424aa|down_5|NZ_CP012371.1_846171_847443_-	TIGR01959, NADH-quinone_oxidoreductase_subunit_F, NADH-quinone oxidoreductase, F subunit	NA|177aa|down_6|NZ_CP012371.1_847420_847951_-	PRK07539, PRK07539, NADH-quinone oxidoreductase subunit NuoE	NA|584aa|down_7|NZ_CP012371.1_847958_849710_-	PRK11742, PRK11742, bifunctional NADH:ubiquinone oxidoreductase subunit C/D; Provisional	NA|199aa|down_8|NZ_CP012371.1_849740_850337_-	PRK06411, PRK06411, NADH-quinone oxidoreductase subunit NuoB	NA|119aa|down_9|NZ_CP012371.1_850426_850783_-	PRK06602, PRK06602, NADH:ubiquinone oxidoreductase subunit A; Validated
GCF_000619905.2_ASM61990v2	NZ_CP012371	Nitrosospira briensis C-128, complete genome	3	1925661-1926348	3,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	DEDDh,DinG,RT,cas3,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	Type I-F	TTTCTGAGCTGCCTATGCGGCAGTGAAC,TTTCTGAGCTGCCTATGCGGCAGTGAAC,TTTCTGAGCTGCCTATGCGGCAGTGAAC	28,28,28	0	0	NA	NA	I-F:I-F:I-F	11,11,8	11	TypeI-F	DEDDh,DinG,RT,cas3,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	NA|196aa|up_6|NZ_CP012371.1_1920220_1920808_-,NA|330aa|up_1|NZ_CP012371.1_1924178_1925168_-,NA	NA|305aa|up_9|NZ_CP012371.1_1915279_1916194_-	cd08441, PBP2_MetR, The C-terminal substrate binding domain of LysR-type transcriptional regulator metR, which regulates the expression of methionine biosynthetic genes, contains type 2 periplasmic binding fold	NA|774aa|up_8|NZ_CP012371.1_1916321_1918643_+	PRK05222, PRK05222, 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase; Provisional	NA|96aa|up_7|NZ_CP012371.1_1919213_1919501_-	cd13831, HU, histone-like DNA-binding protein HU	NA|196aa|up_6|NZ_CP012371.1_1920220_1920808_-	NA	NA|542aa|up_5|NZ_CP012371.1_1920818_1922444_-	pfam07938, Fungal_lectin, Fungal fucose-specific lectin	NA|117aa|up_4|NZ_CP012371.1_1923138_1923489_-	COG2510, COG2510, Predicted membrane protein [Function unknown]	NA|139aa|up_3|NZ_CP012371.1_1923485_1923902_-	PRK02971, PRK02971, 4-amino-4-deoxy-L-arabinose-phosphoundecaprenol flippase subunit ArnF; Provisional	NA|48aa|up_2|NZ_CP012371.1_1923939_1924083_-	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|330aa|up_1|NZ_CP012371.1_1924178_1925168_-	NA	NA|94aa|up_0|NZ_CP012371.1_1925314_1925596_+	cd10456, GIY-YIG_UPF0213, The GIY-YIG domain of uncharacterized protein family UPF0213 related to structure-specific endonuclease SLX1	cas6f|188aa|down_0|NZ_CP012371.1_1926477_1927041_-	pfam09618, Cas_Csy4, CRISPR-associated protein (Cas_Csy4)	cas7f|351aa|down_1|NZ_CP012371.1_1927044_1928097_-	pfam09615, Cas_Csy3, CRISPR-associated protein (Cas_Csy3)	cas5f|355aa|down_2|NZ_CP012371.1_1928032_1929097_-	pfam09614, Cas_Csy2, CRISPR-associated protein (Cas_Csy2)	cas8f|435aa|down_3|NZ_CP012371.1_1929089_1930394_-	pfam09611, Cas_Csy1, CRISPR-associated protein (Cas_Csy1)	cas3-cas2|1127aa|down_4|NZ_CP012371.1_1930552_1933933_-	TIGR02562, conserved_hypothetical_protein, CRISPR-associated helicase Cas3, subtype I-F/YPEST	cas1|333aa|down_5|NZ_CP012371.1_1933929_1934928_-	TIGR03637, cas1_YPEST, CRISPR-associated endonuclease Cas1, subtype I-F/YPEST	NA|95aa|down_6|NZ_CP012371.1_1935377_1935662_+	PRK11235, PRK11235, type II toxin-antitoxin system RelB/DinJ family antitoxin	NA|103aa|down_7|NZ_CP012371.1_1935646_1935955_+	pfam05016, ParE_toxin, ParE toxin of type II toxin-antitoxin system, parDE	NA|882aa|down_8|NZ_CP012371.1_1935997_1938643_-	PRK03059, PRK03059, PII uridylyl-transferase; Provisional	NA|267aa|down_9|NZ_CP012371.1_1938649_1939450_-	PRK05716, PRK05716, methionine aminopeptidase; Validated
GCF_000619905.2_ASM61990v2	NZ_CP012371	Nitrosospira briensis C-128, complete genome	4	2689566-2689677	4	CRISPRCasFinder	no		DEDDh,DinG,RT,cas3,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	Orphan	CCGCTGCGCAGGCGGACGAACCAGCGGCCGAAGCGCAACCCGA	43	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,RT,cas3,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	NA|81aa|up_7|NZ_CP012371.1_2684914_2685157_+,NA|72aa|up_3|NZ_CP012371.1_2686776_2686992_+,NA|83aa|up_2|NZ_CP012371.1_2687009_2687258_+,NA|114aa|up_1|NZ_CP012371.1_2687275_2687617_+,NA|99aa|up_0|NZ_CP012371.1_2687606_2687903_+,NA|92aa|down_1|NZ_CP012371.1_2690330_2690606_+,NA|157aa|down_2|NZ_CP012371.1_2690624_2691095_+,NA|297aa|down_3|NZ_CP012371.1_2691091_2691982_+,NA|89aa|down_4|NZ_CP012371.1_2691978_2692245_+,NA|145aa|down_9|NZ_CP012371.1_2696603_2697038_+	NA|66aa|up_9|NZ_CP012371.1_2684109_2684307_-	pfam14549, P22_Cro, DNA-binding transcriptional regulator Cro	NA|183aa|up_8|NZ_CP012371.1_2684369_2684918_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|81aa|up_7|NZ_CP012371.1_2684914_2685157_+	NA	NA|138aa|up_6|NZ_CP012371.1_2685205_2685619_+	COG2913, OlmA, Outer membrane lipoprotein OmlA (small protein A) [Cell envelope biogenesis, outer membrane]	NA|134aa|up_5|NZ_CP012371.1_2685707_2686109_-	pfam15919, HicB_lk_antitox, HicB_like antitoxin of bacterial toxin-antitoxin system	NA|59aa|up_4|NZ_CP012371.1_2686118_2686295_-	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|72aa|up_3|NZ_CP012371.1_2686776_2686992_+	NA	NA|83aa|up_2|NZ_CP012371.1_2687009_2687258_+	NA	NA|114aa|up_1|NZ_CP012371.1_2687275_2687617_+	NA	NA|99aa|up_0|NZ_CP012371.1_2687606_2687903_+	NA	NA|193aa|down_0|NZ_CP012371.1_2689773_2690352_+	PHA02575, 1, deoxynucleoside monophosphate kinase; Provisional	NA|92aa|down_1|NZ_CP012371.1_2690330_2690606_+	NA	NA|157aa|down_2|NZ_CP012371.1_2690624_2691095_+	NA	NA|297aa|down_3|NZ_CP012371.1_2691091_2691982_+	NA	NA|89aa|down_4|NZ_CP012371.1_2691978_2692245_+	NA	NA|78aa|down_5|NZ_CP012371.1_2692244_2692478_+	pfam13986, DUF4224, Domain of unknown function (DUF4224)	NA|350aa|down_6|NZ_CP012371.1_2692487_2693537_+	cd00800, INT_Lambda_C, C-terminal catalytic domain of Lambda integrase, a tyrosine-based site-specific recombinase	NA|140aa|down_7|NZ_CP012371.1_2693898_2694318_-	pfam14328, DUF4385, Domain of unknown function (DUF4385)	NA|57aa|down_8|NZ_CP012371.1_2696223_2696394_+	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|145aa|down_9|NZ_CP012371.1_2696603_2697038_+	NA
GCF_000619905.2_ASM61990v2	NZ_CP012371	Nitrosospira briensis C-128, complete genome	5	3153782-3153884	5	CRISPRCasFinder	no		DEDDh,DinG,RT,cas3,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	Orphan	TGTCGGGCGGAGCCCGACCTACC	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,RT,cas3,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	NA,NA|48aa|down_6|NZ_CP012371.1_3164174_3164318_+,NA|168aa|down_7|NZ_CP012371.1_3164305_3164809_+,NA|94aa|down_8|NZ_CP012371.1_3165227_3165509_-	NA|309aa|up_9|NZ_CP012371.1_3137661_3138588_+	cd11599, HDAC_classII_2, Histone deacetylases and histone-like deacetylases, classII	NA|422aa|up_8|NZ_CP012371.1_3138824_3140090_+	pfam13433, Peripla_BP_5, Periplasmic binding protein domain	NA|1393aa|up_7|NZ_CP012371.1_3140086_3144265_+	PRK10841, PRK10841, two-component system sensor histidine kinase RcsC	NA|410aa|up_6|NZ_CP012371.1_3144718_3145948_+	cd01948, EAL, EAL domain	NA|199aa|up_5|NZ_CP012371.1_3146193_3146790_+	pfam16925, TetR_C_13, Bacterial transcriptional repressor C-terminal	NA|586aa|up_4|NZ_CP012371.1_3146857_3148615_+	sd00006, TPR, Tetratricopeptide repeat	NA|130aa|up_3|NZ_CP012371.1_3148870_3149260_-	cd05560, Xcc1710_like, Xcc1710_like family, specific to proteobacteria	NA|409aa|up_2|NZ_CP012371.1_3149465_3150692_+	PRK09265, PRK09265, aminotransferase AlaT; Validated	NA|439aa|up_1|NZ_CP012371.1_3150816_3152133_+	PRK06349, PRK06349, homoserine dehydrogenase; Provisional	NA|472aa|up_0|NZ_CP012371.1_3152291_3153707_+	PRK09225, PRK09225, threonine synthase; Validated	NA|258aa|down_0|NZ_CP012371.1_3153918_3154692_-	COG2071, COG2071, Predicted glutamine amidotransferases [General function prediction only]	NA|372aa|down_1|NZ_CP012371.1_3154887_3156003_-	PRK13516, PRK13516, gamma-glutamyl:cysteine ligase; Provisional	NA|403aa|down_2|NZ_CP012371.1_3155999_3157208_-	pfam00999, Na_H_Exchanger, Sodium/hydrogen exchanger family	NA|455aa|down_3|NZ_CP012371.1_3157988_3159353_+	PRK01642, cls, cardiolipin synthetase; Reviewed	NA|485aa|down_4|NZ_CP012371.1_3159703_3161158_+	cd11375, Peptidase_M54, Peptidase family M54, also called archaemetzincins or archaelysins	NA|481aa|down_5|NZ_CP012371.1_3161913_3163356_-	cd01286, deoxycytidylate_deaminase, Deoxycytidylate deaminase domain	NA|48aa|down_6|NZ_CP012371.1_3164174_3164318_+	NA	NA|168aa|down_7|NZ_CP012371.1_3164305_3164809_+	NA	NA|94aa|down_8|NZ_CP012371.1_3165227_3165509_-	NA	NA|204aa|down_9|NZ_CP012371.1_3165512_3166124_-	PRK05327, rpsD, 30S ribosomal protein S4; Validated
