assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_010983895.1_ASM1098389v1	NZ_CP048836	Azoarcus sp. M9-3-2 chromosome, complete genome	1	812444-812508	1	CRISPRCasFinder	no		RT,csa3,DinG,DEDDh,Cas9_archaeal,WYL	Orphan	TCCGGCGGCTCGGGTGGCTCCAC	23	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,DinG,DEDDh,Cas9_archaeal,WYL	NA|142aa|up_1|NZ_CP048836.1_808361_808787_-,NA|85aa|down_0|NZ_CP048836.1_813993_814248_+,NA|164aa|down_7|NZ_CP048836.1_821348_821840_-	NA|201aa|up_9|NZ_CP048836.1_799600_800203_+	cd16352, CheD, chemotaxis protein CheD stimulates methylation of methyl-accepting chemotaxis proteins	NA|677aa|up_8|NZ_CP048836.1_800300_802331_+	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|122aa|up_7|NZ_CP048836.1_802327_802693_+	cd17537, REC_FixJ, phosphoacceptor receiver (REC) domain of FixJ family response regulators	NA|200aa|up_6|NZ_CP048836.1_803256_803856_-	COG3243, PhaC, Poly(3-hydroxyalkanoate) synthetase [Lipid metabolism]	NA|209aa|up_5|NZ_CP048836.1_803962_804589_-	COG4566, TtrR, Response regulator [Signal transduction mechanisms]	NA|473aa|up_4|NZ_CP048836.1_804585_806004_-	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|281aa|up_3|NZ_CP048836.1_806069_806912_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|468aa|up_2|NZ_CP048836.1_806929_808333_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|142aa|up_1|NZ_CP048836.1_808361_808787_-	NA	NA|735aa|up_0|NZ_CP048836.1_809328_811533_+	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional	NA|85aa|down_0|NZ_CP048836.1_813993_814248_+	NA	NA|227aa|down_1|NZ_CP048836.1_814261_814942_+	cd02423, Peptidase_C39G, A sub-family of peptidase family C39	NA|290aa|down_2|NZ_CP048836.1_815009_815879_+	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional	NA|417aa|down_3|NZ_CP048836.1_816004_817255_-	pfam04102, SlyX, SlyX	NA|445aa|down_4|NZ_CP048836.1_817354_818689_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|569aa|down_5|NZ_CP048836.1_819046_820753_-	COG2989, COG2989, Uncharacterized protein conserved in bacteria [Function unknown]	NA|188aa|down_6|NZ_CP048836.1_820763_821327_+	pfam05951, Peptidase_M15_2, Bacterial protein of unknown function (DUF882)	NA|164aa|down_7|NZ_CP048836.1_821348_821840_-	NA	NA|256aa|down_8|NZ_CP048836.1_822040_822808_+	TIGR02427, b-ketoadipate_enol-lactone_hydrolase, 3-oxoadipate enol-lactonase	NA|226aa|down_9|NZ_CP048836.1_822876_823554_-	cd10433, YccA_like, YccA-like proteins
GCF_010983895.1_ASM1098389v1	NZ_CP048836	Azoarcus sp. M9-3-2 chromosome, complete genome	2	1230314-1230424	2	CRISPRCasFinder	no		RT,csa3,DinG,DEDDh,Cas9_archaeal,WYL	Orphan	CGGGGGGCAGCGAGCACAGCGAGCGTGGGGGCC	33	1	2	1230347-1230391|1230347-1230391	NZ_CP048836.1_1230034-1230078|NZ_CP048836.1_1230190-1230234	NA	1	1	Orphan	RT,csa3,DinG,DEDDh,Cas9_archaeal,WYL	NA,NA	NA|238aa|up_9|NZ_CP048836.1_1218033_1218747_-	pfam12695, Abhydrolase_5, Alpha/beta hydrolase family	NA|254aa|up_8|NZ_CP048836.1_1218743_1219505_-	cd09086, ExoIII-like_AP-endo, Escherichia coli exonuclease III (ExoIII) and Neisseria meningitides NExo-like subfamily of the ExoIII family purinic/apyrimidinic (AP) endonucleases	NA|604aa|up_7|NZ_CP048836.1_1219664_1221476_+	cd01948, EAL, EAL domain	NA|690aa|up_6|NZ_CP048836.1_1221443_1223513_-	cd06456, M3A_DCP, Peptidase family M3, dipeptidyl carboxypeptidase (DCP)	NA|258aa|up_5|NZ_CP048836.1_1223681_1224455_-	COG5473, COG5473, Predicted integral membrane protein [Function unknown]	NA|312aa|up_4|NZ_CP048836.1_1224583_1225519_-	PRK13961, PRK13961, phosphoribosylaminoimidazole-succinocarboxamide synthase; Provisional	NA|346aa|up_3|NZ_CP048836.1_1225576_1226614_-	cd01825, SGNH_hydrolase_peri1, SGNH_peri1; putative periplasmic member of the SGNH-family of hydrolases, a diverse family of lipases and esterases	NA|493aa|up_2|NZ_CP048836.1_1226622_1228101_-	COG1696, DltB, Predicted membrane protein involved in D-alanine export [Cell envelope biogenesis, outer membrane]	NA|430aa|up_1|NZ_CP048836.1_1228198_1229488_-	pfam11902, DUF3422, Protein of unknown function (DUF3422)	NA|137aa|up_0|NZ_CP048836.1_1229564_1229975_-	cd06661, GGCT_like, GGCT-like domains, also called AIG2-like family	NA|355aa|down_0|NZ_CP048836.1_1230437_1231502_-	PRK09196, PRK09196, fructose-bisphosphate aldolase class II	NA|250aa|down_1|NZ_CP048836.1_1231604_1232354_-	cd02145, BluB, 5,6-dimethylbenzimidazole synthase	NA|482aa|down_2|NZ_CP048836.1_1232614_1234060_-	PRK05826, PRK05826, pyruvate kinase; Provisional	NA|384aa|down_3|NZ_CP048836.1_1234162_1235314_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|395aa|down_4|NZ_CP048836.1_1235464_1236649_-	PRK00073, pgk, phosphoglycerate kinase; Provisional	NA|343aa|down_5|NZ_CP048836.1_1236696_1237725_-	TIGR01532, D-erythrose-4-phosphate_dehydrogenase_E4PDH	NA|343aa|down_6|NZ_CP048836.1_1237785_1238814_-	COG0057, GapA, Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase [Carbohydrate transport and metabolism]	NA|677aa|down_7|NZ_CP048836.1_1238864_1240895_-	PRK12753, PRK12753, transketolase; Reviewed	NA|292aa|down_8|NZ_CP048836.1_1241048_1241924_-	PRK15453, PRK15453, phosphoribulokinase; Provisional	NA|270aa|down_9|NZ_CP048836.1_1241979_1242789_-	cd01637, IMPase_like, Inositol-monophosphatase-like domains
GCF_010983895.1_ASM1098389v1	NZ_CP048836	Azoarcus sp. M9-3-2 chromosome, complete genome	3	1324641-1324732	3	CRISPRCasFinder	no	csa3	RT,csa3,DinG,DEDDh,Cas9_archaeal,WYL	Type I-A	TCCTGCATGGCGGCGGCCCAGTCGTC	26	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,DinG,DEDDh,Cas9_archaeal,WYL	NA|81aa|up_1|NZ_CP048836.1_1322272_1322515_-,NA	NA|263aa|up_9|NZ_CP048836.1_1316080_1316869_-	pfam09836, DUF2063, Putative DNA-binding domain	NA|283aa|up_8|NZ_CP048836.1_1316861_1317710_-	pfam05114, DUF692, Protein of unknown function (DUF692)	NA|88aa|up_7|NZ_CP048836.1_1317753_1318017_-	pfam10048, DUF2282, Predicted integral membrane protein (DUF2282)	NA|186aa|up_6|NZ_CP048836.1_1318170_1318728_-	pfam07209, DUF1415, Protein of unknown function (DUF1415)	NA|446aa|up_5|NZ_CP048836.1_1318724_1320062_-	COG3264, COG3264, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	NA|268aa|up_4|NZ_CP048836.1_1320086_1320890_-	pfam03649, UPF0014, Uncharacterized protein family (UPF0014)	NA|211aa|up_3|NZ_CP048836.1_1320886_1321519_-	cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding cassette component of cobalt transport system	csa3|228aa|up_2|NZ_CP048836.1_1321588_1322272_+	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|81aa|up_1|NZ_CP048836.1_1322272_1322515_-	NA	NA|465aa|up_0|NZ_CP048836.1_1322654_1324049_-	pfam05787, DUF839, Bacterial protein of unknown function (DUF839)	NA|337aa|down_0|NZ_CP048836.1_1324763_1325774_-	PRK06666, fliM, flagellar motor switch protein FliM; Validated	NA|188aa|down_1|NZ_CP048836.1_1325797_1326361_-	PRK07021, fliL, flagellar basal body-associated protein FliL; Reviewed	NA|443aa|down_2|NZ_CP048836.1_1326456_1327785_-	cd17470, T3SS_Flik_C, C-terminal domain of flagellar hook-length control protein FliK and similar domains	NA|151aa|down_3|NZ_CP048836.1_1327843_1328296_-	PRK05689, fliJ, flagella biosynthesis chaperone FliJ	NA|468aa|down_4|NZ_CP048836.1_1328394_1329798_-	TIGR03496, FliI_clade1, flagellar protein export ATPase FliI	NA|228aa|down_5|NZ_CP048836.1_1329799_1330483_-	PRK05687, fliH, flagellar assembly protein FliH	NA|333aa|down_6|NZ_CP048836.1_1330501_1331500_-	PRK05686, fliG, flagellar motor switch protein G; Validated	NA|559aa|down_7|NZ_CP048836.1_1331496_1333173_-	PRK06007, fliF, flagellar basal body M-ring protein FliF	NA|378aa|down_8|NZ_CP048836.1_1333346_1334480_+	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|460aa|down_9|NZ_CP048836.1_1334472_1335852_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]
GCF_010983895.1_ASM1098389v1	NZ_CP048836	Azoarcus sp. M9-3-2 chromosome, complete genome	4	3263157-3263305	4	CRISPRCasFinder	no		RT,csa3,DinG,DEDDh,Cas9_archaeal,WYL	Orphan	CTGCAGTTCCTCGTTGGAGGCCT	23	2	4	3263180-3263198|3263264-3263282|3263264-3263282|3263264-3263282	NZ_CP048836.1_2682391-2682409|NZ_CP048836.1_497546-497528|NZ_CP048836.1_1235578-1235596|NZ_CP048836.1_1713563-1713545	NA	3	3	Orphan	RT,csa3,DinG,DEDDh,Cas9_archaeal,WYL	NA,NA|293aa|down_1|NZ_CP048836.1_3266123_3267002_-,NA|199aa|down_6|NZ_CP048836.1_3269860_3270457_-	NA|786aa|up_9|NZ_CP048836.1_3246874_3249232_-	COG4774, Fiu, Outer membrane receptor for monomeric catechols [Inorganic ion transport and metabolism]	NA|463aa|up_8|NZ_CP048836.1_3249576_3250965_+	PRK12597, PRK12597, F0F1 ATP synthase subunit beta; Provisional	NA|132aa|up_7|NZ_CP048836.1_3250961_3251357_+	cd12152, F1-ATPase_delta, mitochondrial ATP synthase delta subunit	NA|99aa|up_6|NZ_CP048836.1_3251353_3251650_+	pfam09527, ATPase_gene1, Putative F0F1-ATPase subunit Ca2+/Mg2+ transporter	NA|232aa|up_5|NZ_CP048836.1_3251646_3252342_+	PRK13420, PRK13420, F0F1 ATP synthase subunit A; Provisional	NA|89aa|up_4|NZ_CP048836.1_3252338_3252605_+	PRK13468, PRK13468, F0F1 ATP synthase subunit C; Provisional	NA|250aa|up_3|NZ_CP048836.1_3252868_3253618_+	TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B	NA|482aa|up_2|NZ_CP048836.1_3253607_3255053_+	PRK13343, PRK13343, F0F1 ATP synthase subunit alpha; Provisional	NA|311aa|up_1|NZ_CP048836.1_3255049_3255982_+	cd12151, F1-ATPase_gamma, mitochondrial ATP synthase gamma subunit	NA|1282aa|up_0|NZ_CP048836.1_3256305_3260151_+	cd01406, SIR2-like, Sir2-like: Prokaryotic group of uncharacterized Sir2-like proteins which lack certain key catalytic residues and conserved zinc binding cysteines; and are members of the SIR2 superfamily of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation	NA|220aa|down_0|NZ_CP048836.1_3265413_3266073_-	cd03024, DsbA_FrnE, DsbA family, FrnE subfamily; FrnE is a DsbA-like protein containing a CXXC motif	NA|293aa|down_1|NZ_CP048836.1_3266123_3267002_-	NA	NA|163aa|down_2|NZ_CP048836.1_3267121_3267610_+	PRK05208, PRK05208, hypothetical protein; Provisional	NA|76aa|down_3|NZ_CP048836.1_3267609_3267837_+	pfam09791, Oxidored-like, Oxidoreductase-like protein, N-terminal	NA|460aa|down_4|NZ_CP048836.1_3267820_3269200_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|163aa|down_5|NZ_CP048836.1_3269359_3269848_+	sd00006, TPR, Tetratricopeptide repeat	NA|199aa|down_6|NZ_CP048836.1_3269860_3270457_-	NA	NA|260aa|down_7|NZ_CP048836.1_3270467_3271247_-	PRK07533, PRK07533, enoyl-[acyl-carrier-protein] reductase FabI	NA|175aa|down_8|NZ_CP048836.1_3271413_3271938_+	pfam18143, HAD_SAK_2, HAD domain in Swiss Army Knife RNA repair proteins	NA|524aa|down_9|NZ_CP048836.1_3271983_3273555_-	PRK15317, PRK15317, alkyl hydroperoxide reductase subunit F; Provisional
GCF_010983895.1_ASM1098389v1	NZ_CP048836	Azoarcus sp. M9-3-2 chromosome, complete genome	5	4036856-4036959	5	CRISPRCasFinder	no	DEDDh	RT,csa3,DinG,DEDDh,Cas9_archaeal,WYL	Unclear	CAGCGAGCATCGCGAGCGTGGGGGCC	26	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,DinG,DEDDh,Cas9_archaeal,WYL	NA|145aa|up_1|NZ_CP048836.1_4035038_4035473_+,NA|137aa|down_5|NZ_CP048836.1_4040546_4040957_-,NA|115aa|down_6|NZ_CP048836.1_4040960_4041305_-	NA|848aa|up_9|NZ_CP048836.1_4026294_4028838_-	PRK13532, PRK13532, nitrate reductase catalytic subunit NapA	NA|87aa|up_8|NZ_CP048836.1_4028839_4029100_-	pfam03927, NapD, NapD protein	NA|168aa|up_7|NZ_CP048836.1_4029096_4029600_-	cd10564, NapF_like, NapF, iron-sulfur subunit of periplasmic nitrate reductase	DEDDh|470aa|up_6|NZ_CP048836.1_4029743_4031153_-	PRK07883, PRK07883, DEDD exonuclease domain-containing protein	NA|63aa|up_5|NZ_CP048836.1_4031213_4031402_-	pfam11943, DUF3460, Protein of unknown function (DUF3460)	NA|737aa|up_4|NZ_CP048836.1_4031471_4033682_-	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|72aa|up_3|NZ_CP048836.1_4033719_4033935_-	PRK00392, rpoZ, DNA-directed RNA polymerase subunit omega; Reviewed	NA|205aa|up_2|NZ_CP048836.1_4033941_4034556_-	PRK00300, gmk, guanylate kinase; Provisional	NA|145aa|up_1|NZ_CP048836.1_4035038_4035473_+	NA	NA|403aa|up_0|NZ_CP048836.1_4035534_4036743_-	TIGR03402, Cysteine_desulfurase_NifS, cysteine desulfurase NifS	NA|298aa|down_0|NZ_CP048836.1_4036972_4037866_-	TIGR02000, Nitrogen_fixation_protein_NifU, Fe-S cluster assembly protein NifU	NA|108aa|down_1|NZ_CP048836.1_4037878_4038202_-	TIGR00049, Uncharacterized_protein_in_nifU_5'region, Iron-sulfur cluster assembly accessory protein	NA|304aa|down_2|NZ_CP048836.1_4038394_4039306_-	cd08471, PBP2_CrgA_like_2, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator CrgA-like, contains the type 2 periplasmic binding fold	NA|181aa|down_3|NZ_CP048836.1_4039412_4039955_+	TIGR04025, hypothetical_protein, PPOX class probable FMN-dependent enzyme, DR_2398 family	NA|198aa|down_4|NZ_CP048836.1_4039951_4040545_+	cd03206, GST_C_7, C-terminal, alpha helical domain of an unknown subfamily 7 of Glutathione S-transferases	NA|137aa|down_5|NZ_CP048836.1_4040546_4040957_-	NA	NA|115aa|down_6|NZ_CP048836.1_4040960_4041305_-	NA	NA|90aa|down_7|NZ_CP048836.1_4041304_4041574_-	TIGR02936, fdxN_nitrog, ferredoxin III, nif-specific	NA|75aa|down_8|NZ_CP048836.1_4041570_4041795_-	pfam05082, Rop-like, Rop-like	NA|154aa|down_9|NZ_CP048836.1_4041806_4042268_-	TIGR02935, UPF0460_protein_in_nifX_3'region, probable nitrogen fixation protein
