assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000730165.1_ASM73016v2	NZ_CP022110	Nitrospirillum amazonense CBAmc chromosome 1, complete sequence	1	2502012-2502136	1	PILER-CR	no		DEDDh,DinG,cas3,csa3,WYL,RT	Orphan	GCGCCCGAAGCGTTCGAACGCTTCGG	26	0	0	NA	NA	NA	2	2	Orphan	DEDDh,DinG,cas3,csa3,WYL,RT	NA|328aa|up_9|NZ_CP022110.1_2496789_2497773_+,NA|238aa|up_8|NZ_CP022110.1_2497759_2498473_+,NA|160aa|up_7|NZ_CP022110.1_2498479_2498959_+,NA|102aa|up_6|NZ_CP022110.1_2498955_2499261_+,NA|82aa|up_5|NZ_CP022110.1_2499260_2499506_+,NA|109aa|up_4|NZ_CP022110.1_2499553_2499880_-,NA|107aa|up_3|NZ_CP022110.1_2500036_2500357_-,NA|77aa|up_0|NZ_CP022110.1_2501487_2501718_+,NA|243aa|down_0|NZ_CP022110.1_2502226_2502955_+,NA|215aa|down_1|NZ_CP022110.1_2503165_2503810_+,NA|231aa|down_2|NZ_CP022110.1_2503868_2504561_-,NA|222aa|down_3|NZ_CP022110.1_2505114_2505780_+	NA|328aa|up_9|NZ_CP022110.1_2496789_2497773_+	NA	NA|238aa|up_8|NZ_CP022110.1_2497759_2498473_+	NA	NA|160aa|up_7|NZ_CP022110.1_2498479_2498959_+	NA	NA|102aa|up_6|NZ_CP022110.1_2498955_2499261_+	NA	NA|82aa|up_5|NZ_CP022110.1_2499260_2499506_+	NA	NA|109aa|up_4|NZ_CP022110.1_2499553_2499880_-	NA	NA|107aa|up_3|NZ_CP022110.1_2500036_2500357_-	NA	NA|100aa|up_2|NZ_CP022110.1_2500558_2500858_-	TIGR02684, conserved_hypothetical_protein, probable addiction module antidote protein	NA|93aa|up_1|NZ_CP022110.1_2501035_2501314_-	TIGR02683, Uncharacterized_protein_HI_1419, putative addiction module killer protein	NA|77aa|up_0|NZ_CP022110.1_2501487_2501718_+	NA	NA|243aa|down_0|NZ_CP022110.1_2502226_2502955_+	NA	NA|215aa|down_1|NZ_CP022110.1_2503165_2503810_+	NA	NA|231aa|down_2|NZ_CP022110.1_2503868_2504561_-	NA	NA|222aa|down_3|NZ_CP022110.1_2505114_2505780_+	NA	NA|724aa|down_4|NZ_CP022110.1_2505754_2507926_+	pfam05876, Terminase_GpA, Phage terminase large subunit (GpA)	NA|66aa|down_5|NZ_CP022110.1_2507933_2508131_+	PRK14823, PRK14823, putative deoxyribonucleoside-triphosphatase; Provisional	NA|512aa|down_6|NZ_CP022110.1_2508130_2509666_+	pfam05136, Phage_portal_2, Phage portal protein, lambda family	NA|505aa|down_7|NZ_CP022110.1_2509662_2511177_+	cd07022, S49_Sppa_36K_type, Signal peptide peptidase A (SppA) 36K type, a serine protease, has catalytic Ser-Lys dyad	NA|124aa|down_8|NZ_CP022110.1_2511192_2511564_+	pfam02924, HDPD, Bacteriophage lambda head decoration protein D	NA|353aa|down_9|NZ_CP022110.1_2511591_2512650_+	pfam03864, Phage_cap_E, Phage major capsid protein E
GCF_000730165.1_ASM73016v2	NZ_CP022112	Nitrospirillum amazonense CBAmc chromosome 3, complete sequence	1	473792-473888	1	CRISPRCasFinder	no		csa3,DEDDh	Orphan	CTCGGCACGCTGGTCAACAGCGGC	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,csa3,WYL,RT	NA|51aa|up_1|NZ_CP022112.1_471470_471623_+,NA|99aa|down_4|NZ_CP022112.1_496822_497119_+	NA|264aa|up_9|NZ_CP022112.1_458831_459623_+	PRK05653, fabG, 3-oxoacyl-ACP reductase FabG	NA|474aa|up_8|NZ_CP022112.1_459672_461094_+	COG0044, PyrC, Dihydroorotase and related cyclic amidohydrolases [Nucleotide transport and metabolism]	NA|322aa|up_7|NZ_CP022112.1_461155_462121_-	COG0627, COG0627, Predicted esterase [General function prediction only]	NA|384aa|up_6|NZ_CP022112.1_462408_463560_+	pfam12625, Arabinose_bd, Arabinose-binding domain of AraC transcription regulator, N-term	NA|767aa|up_5|NZ_CP022112.1_463717_466018_+	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|681aa|up_4|NZ_CP022112.1_466171_468214_+	PRK03584, PRK03584, acetoacetate--CoA ligase	NA|458aa|up_3|NZ_CP022112.1_468264_469638_+	cd17369, MFS_ShiA_like, Shikimate transporter and similar proteins of the Major Facilitator Superfamily	NA|561aa|up_2|NZ_CP022112.1_469634_471317_+	pfam07519, Tannase, Tannase and feruloyl esterase	NA|51aa|up_1|NZ_CP022112.1_471470_471623_+	NA	NA|90aa|up_0|NZ_CP022112.1_472052_472322_+	TIGR03793, TOMM_pelo, NHLP leader peptide domain	NA|116aa|down_0|NZ_CP022112.1_494452_494800_-	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|138aa|down_1|NZ_CP022112.1_494796_495210_-	pfam01527, HTH_Tnp_1, Transposase	NA|60aa|down_2|NZ_CP022112.1_496058_496238_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|132aa|down_3|NZ_CP022112.1_496316_496712_+	cd04056, Peptidases_S53, Peptidase domain in the S53 family	NA|99aa|down_4|NZ_CP022112.1_496822_497119_+	NA	NA|470aa|down_5|NZ_CP022112.1_497131_498541_+	COG1541, PaaK, Coenzyme F390 synthetase [Coenzyme metabolism]	NA|337aa|down_6|NZ_CP022112.1_498537_499548_+	PRK06462, PRK06462, asparagine synthetase A; Reviewed	NA|329aa|down_7|NZ_CP022112.1_499556_500543_+	cd07062, Peptidase_S66_mccF_like, Microcin C7 self-immunity protein determines resistance to exogenous microcin C7	NA|304aa|down_8|NZ_CP022112.1_500557_501469_+	cd00250, CAS_like, Clavaminic acid synthetase (CAS) -like;  CAS is a trifunctional Fe(II)/ 2-oxoglutarate (2OG) oxygenase carrying out three reactions in the biosynthesis of clavulanic acid, an inhibitor of class A serine beta-lactamases	NA|512aa|down_9|NZ_CP022112.1_501471_503007_+	COG0318, CaiC, Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II [Lipid metabolism / Secondary metabolites biosynthesis, transport, and catabolism]
GCF_000730165.1_ASM73016v2	NZ_CP022112	Nitrospirillum amazonense CBAmc chromosome 3, complete sequence	2	473957-474280	2	CRISPRCasFinder	no		csa3,DEDDh	Orphan	CTCGGCACGCTGGTCAACAGCGGC	24	0	0	NA	NA	NA	4	4	Orphan	DEDDh,DinG,cas3,csa3,WYL,RT	NA|51aa|up_1|NZ_CP022112.1_471470_471623_+,NA|99aa|down_4|NZ_CP022112.1_496822_497119_+	NA|264aa|up_9|NZ_CP022112.1_458831_459623_+	PRK05653, fabG, 3-oxoacyl-ACP reductase FabG	NA|474aa|up_8|NZ_CP022112.1_459672_461094_+	COG0044, PyrC, Dihydroorotase and related cyclic amidohydrolases [Nucleotide transport and metabolism]	NA|322aa|up_7|NZ_CP022112.1_461155_462121_-	COG0627, COG0627, Predicted esterase [General function prediction only]	NA|384aa|up_6|NZ_CP022112.1_462408_463560_+	pfam12625, Arabinose_bd, Arabinose-binding domain of AraC transcription regulator, N-term	NA|767aa|up_5|NZ_CP022112.1_463717_466018_+	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|681aa|up_4|NZ_CP022112.1_466171_468214_+	PRK03584, PRK03584, acetoacetate--CoA ligase	NA|458aa|up_3|NZ_CP022112.1_468264_469638_+	cd17369, MFS_ShiA_like, Shikimate transporter and similar proteins of the Major Facilitator Superfamily	NA|561aa|up_2|NZ_CP022112.1_469634_471317_+	pfam07519, Tannase, Tannase and feruloyl esterase	NA|51aa|up_1|NZ_CP022112.1_471470_471623_+	NA	NA|90aa|up_0|NZ_CP022112.1_472052_472322_+	TIGR03793, TOMM_pelo, NHLP leader peptide domain	NA|116aa|down_0|NZ_CP022112.1_494452_494800_-	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|138aa|down_1|NZ_CP022112.1_494796_495210_-	pfam01527, HTH_Tnp_1, Transposase	NA|60aa|down_2|NZ_CP022112.1_496058_496238_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|132aa|down_3|NZ_CP022112.1_496316_496712_+	cd04056, Peptidases_S53, Peptidase domain in the S53 family	NA|99aa|down_4|NZ_CP022112.1_496822_497119_+	NA	NA|470aa|down_5|NZ_CP022112.1_497131_498541_+	COG1541, PaaK, Coenzyme F390 synthetase [Coenzyme metabolism]	NA|337aa|down_6|NZ_CP022112.1_498537_499548_+	PRK06462, PRK06462, asparagine synthetase A; Reviewed	NA|329aa|down_7|NZ_CP022112.1_499556_500543_+	cd07062, Peptidase_S66_mccF_like, Microcin C7 self-immunity protein determines resistance to exogenous microcin C7	NA|304aa|down_8|NZ_CP022112.1_500557_501469_+	cd00250, CAS_like, Clavaminic acid synthetase (CAS) -like;  CAS is a trifunctional Fe(II)/ 2-oxoglutarate (2OG) oxygenase carrying out three reactions in the biosynthesis of clavulanic acid, an inhibitor of class A serine beta-lactamases	NA|512aa|down_9|NZ_CP022112.1_501471_503007_+	COG0318, CaiC, Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II [Lipid metabolism / Secondary metabolites biosynthesis, transport, and catabolism]
GCF_000730165.1_ASM73016v2	NZ_CP022113	Nitrospirillum amazonense CBAmc chromosome 4, complete sequence	1	395085-395336	1	CRISPRCasFinder	no		csa3	Orphan	CATCGGCACGCTGATCAACACCGG	24	0	0	NA	NA	NA	3	3	Orphan	DEDDh,DinG,cas3,csa3,WYL,RT	NA|163aa|up_3|NZ_CP022113.1_387672_388161_+,NA|317aa|up_2|NZ_CP022113.1_388201_389152_+,NA|146aa|up_1|NZ_CP022113.1_389148_389586_+,NA|155aa|up_0|NZ_CP022113.1_389596_390061_+,NA|66aa|down_3|NZ_CP022113.1_423669_423867_-	NA|275aa|up_9|NZ_CP022113.1_379818_380643_+	COG4455, ImpE, Protein of avirulence locus involved in temperature-dependent protein secretion [General function prediction only]	NA|164aa|up_8|NZ_CP022113.1_380646_381138_+	TIGR03357, hypothetical_protein_Atu4338, type VI secretion system lysozyme-like protein	NA|612aa|up_7|NZ_CP022113.1_381130_382966_+	pfam05947, T6SS_TssF, Type VI secretion system, TssF	NA|362aa|up_6|NZ_CP022113.1_382962_384048_+	pfam06996, T6SS_TssG, Type VI secretion, TssG	NA|883aa|up_5|NZ_CP022113.1_384257_386906_+	TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family	NA|159aa|up_4|NZ_CP022113.1_387051_387528_+	pfam05638, T6SS_HCP, Type VI secretion system effector, Hcp	NA|163aa|up_3|NZ_CP022113.1_387672_388161_+	NA	NA|317aa|up_2|NZ_CP022113.1_388201_389152_+	NA	NA|146aa|up_1|NZ_CP022113.1_389148_389586_+	NA	NA|155aa|up_0|NZ_CP022113.1_389596_390061_+	NA	NA|116aa|down_0|NZ_CP022113.1_420662_421010_+	cd06150, YjgF_YER057c_UK114_like_2, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function	NA|447aa|down_1|NZ_CP022113.1_421130_422471_+	PRK05318, PRK05318, deoxyguanosinetriphosphate triphosphohydrolase-like protein; Provisional	NA|394aa|down_2|NZ_CP022113.1_422491_423673_-	cd03401, SPFH_prohibitin, Prohibitin family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|66aa|down_3|NZ_CP022113.1_423669_423867_-	NA	NA|204aa|down_4|NZ_CP022113.1_423984_424596_-	COG3597, COG3597, Uncharacterized protein/domain associated with GTPases [Function unknown]	NA|256aa|down_5|NZ_CP022113.1_424740_425508_-	COG3208, GrsT, Predicted thioesterase involved in non-ribosomal peptide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|571aa|down_6|NZ_CP022113.1_425504_427217_-	COG4615, PvdE, ABC-type siderophore export system, fused ATPase and permease components [Secondary metabolites biosynthesis, transport, and catabolism / Inorganic ion transport and metabolism]	NA|555aa|down_7|NZ_CP022113.1_427213_428878_-	COG4615, PvdE, ABC-type siderophore export system, fused ATPase and permease components [Secondary metabolites biosynthesis, transport, and catabolism / Inorganic ion transport and metabolism]	NA|334aa|down_8|NZ_CP022113.1_428874_429876_-	cd00250, CAS_like, Clavaminic acid synthetase (CAS) -like;  CAS is a trifunctional Fe(II)/ 2-oxoglutarate (2OG) oxygenase carrying out three reactions in the biosynthesis of clavulanic acid, an inhibitor of class A serine beta-lactamases	NA|5261aa|down_9|NZ_CP022113.1_429925_445708_-	PRK12467, PRK12467, peptide synthase; Provisional
GCF_000730165.1_ASM73016v2	NZ_CP022113	Nitrospirillum amazonense CBAmc chromosome 4, complete sequence	2	626386-626486	2	CRISPRCasFinder	no		csa3	Orphan	TCCTGGGAGAAGCTGCGCTTCTCGTC	26	0	0	NA	NA	NA	1	1	Orphan	DEDDh,DinG,cas3,csa3,WYL,RT	NA|112aa|up_5|NZ_CP022113.1_615955_616291_-,NA	NA|264aa|up_9|NZ_CP022113.1_610873_611665_+	cd05233, SDR_c, classical (c) SDRs	NA|430aa|up_8|NZ_CP022113.1_611726_613016_-	cd01831, Endoglucanase_E_like, Endoglucanase E-like members of the SGNH hydrolase family; Endoglucanase E catalyzes the endohydrolysis of 1,4-beta-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans	NA|774aa|up_7|NZ_CP022113.1_613033_615355_+	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]	NA|133aa|up_6|NZ_CP022113.1_615506_615905_+	COG4566, TtrR, Response regulator [Signal transduction mechanisms]	NA|112aa|up_5|NZ_CP022113.1_615955_616291_-	NA	NA|298aa|up_4|NZ_CP022113.1_616561_617455_-	COG3546, COG3546, Mn-containing catalase [Inorganic ion transport and metabolism]	NA|500aa|up_3|NZ_CP022113.1_618601_620101_+	PRK09302, PRK09302, circadian clock protein KaiC; Reviewed	NA|535aa|up_2|NZ_CP022113.1_620964_622569_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|410aa|up_1|NZ_CP022113.1_622674_623904_+	pfam10009, DUF2252, Uncharacterized protein conserved in bacteria (DUF2252)	NA|651aa|up_0|NZ_CP022113.1_623911_625864_-	PRK13557, PRK13557, histidine kinase; Provisional	NA|170aa|down_0|NZ_CP022113.1_626950_627460_+	cd07909, YciF, YciF bacterial stress response protein, ferritin-like iron-binding domain	NA|357aa|down_1|NZ_CP022113.1_627511_628582_-	COG2382, Fes, Enterochelin esterase and related enzymes [Inorganic ion transport and metabolism]	NA|656aa|down_2|NZ_CP022113.1_628620_630588_-	COG3534, AbfA, Alpha-L-arabinofuranosidase [Carbohydrate transport and metabolism]	NA|558aa|down_3|NZ_CP022113.1_631046_632720_+	pfam01229, Glyco_hydro_39, Glycosyl hydrolases family 39	NA|897aa|down_4|NZ_CP022113.1_632778_635469_+	PLN03080, PLN03080, Probable beta-xylosidase; Provisional	NA|246aa|down_5|NZ_CP022113.1_635509_636247_-	COG2186, FadR, Transcriptional regulators [Transcription]	NA|77aa|down_6|NZ_CP022113.1_636354_636585_-	PRK01271, PRK01271, tautomerase PptA	NA|974aa|down_7|NZ_CP022113.1_637281_640203_+	TIGR01782, TonB-dependent_receptor, TonB-dependent receptor	NA|512aa|down_8|NZ_CP022113.1_640377_641913_+	pfam04820, Trp_halogenase, Tryptophan halogenase	NA|236aa|down_9|NZ_CP022113.1_641928_642636_+	pfam07277, SapC, SapC
