assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001548015.1_ASM154801v1	NZ_AP014704	Methylobacterium aquaticum strain MA-22A	1	412719-412963	1	CRISPRCasFinder	no		DEDDh,csa3,cas3,cas5,cas8c,cas7,cas1,cas2,DinG	Orphan	AGCGTGCCGTCGTTGTCCGGGTCGGC	26	0	0	NA	NA	NA	3	3	Orphan	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas1,cas2,DinG,c2c9_V-U4,Cas14u_CAS-V	NA|71aa|up_5|NZ_AP014704.1_407626_407839_+,NA|78aa|up_0|NZ_AP014704.1_412317_412551_+,NA|99aa|down_7|NZ_AP014704.1_423689_423986_-	NA|222aa|up_9|NZ_AP014704.1_404105_404771_+	pfam06776, IalB, Invasion associated locus B (IalB) protein	NA|66aa|up_8|NZ_AP014704.1_404814_405012_+	pfam14070, YjfB_motility, Putative motility protein	NA|421aa|up_7|NZ_AP014704.1_405412_406675_+	COG2223, NarK, Nitrate/nitrite transporter [Inorganic ion transport and metabolism]	NA|254aa|up_6|NZ_AP014704.1_406676_407438_-	COG1040, ComFC, Predicted amidophosphoribosyltransferases [General function prediction only]	NA|71aa|up_5|NZ_AP014704.1_407626_407839_+	NA	NA|249aa|up_4|NZ_AP014704.1_407949_408696_-	pfam04471, Mrr_cat, Restriction endonuclease	NA|265aa|up_3|NZ_AP014704.1_408962_409757_+	pfam06707, DUF1194, Protein of unknown function (DUF1194)	NA|144aa|up_2|NZ_AP014704.1_409814_410246_-	COG4957, COG4957, Predicted transcriptional regulator [Transcription]	NA|349aa|up_1|NZ_AP014704.1_410955_412002_-	cd07302, CHD, cyclase homology domain	NA|78aa|up_0|NZ_AP014704.1_412317_412551_+	NA	NA|441aa|down_0|NZ_AP014704.1_413183_414506_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|534aa|down_1|NZ_AP014704.1_414794_416396_+	PRK00881, purH, bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase; Provisional	NA|428aa|down_2|NZ_AP014704.1_416505_417789_+	pfam11902, DUF3422, Protein of unknown function (DUF3422)	NA|328aa|down_3|NZ_AP014704.1_417810_418794_-	pfam10094, DUF2332, Uncharacterized protein conserved in bacteria (DUF2332)	NA|563aa|down_4|NZ_AP014704.1_419197_420886_-	PRK09395, actP, cation/acetate symporter ActP	NA|103aa|down_5|NZ_AP014704.1_420882_421191_-	pfam04341, DUF485, Protein of unknown function, DUF485	NA|650aa|down_6|NZ_AP014704.1_421585_423535_-	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|99aa|down_7|NZ_AP014704.1_423689_423986_-	NA	NA|160aa|down_8|NZ_AP014704.1_424102_424582_-	cd01285, nucleoside_deaminase, Nucleoside deaminases include adenosine, guanine and cytosine deaminases	NA|297aa|down_9|NZ_AP014704.1_424681_425572_-	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain
GCF_001548015.1_ASM154801v1	NZ_AP014704	Methylobacterium aquaticum strain MA-22A	2	2204377-2204500	2	CRISPRCasFinder	no		DEDDh,csa3,cas3,cas5,cas8c,cas7,cas1,cas2,DinG	Orphan	GTAGCCGTAGGACGGGTAGCCATAGCCGTAG	31	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas1,cas2,DinG,c2c9_V-U4,Cas14u_CAS-V	NA|75aa|up_8|NZ_AP014704.1_2198779_2199004_-,NA|53aa|up_3|NZ_AP014704.1_2202416_2202575_-,NA|101aa|up_2|NZ_AP014704.1_2202839_2203142_-,NA|215aa|down_2|NZ_AP014704.1_2210576_2211221_-	NA|306aa|up_9|NZ_AP014704.1_2197832_2198750_+	PRK11139, PRK11139, DNA-binding transcriptional activator GcvA; Provisional	NA|75aa|up_8|NZ_AP014704.1_2198779_2199004_-	NA	NA|309aa|up_7|NZ_AP014704.1_2199158_2200085_+	PRK06197, PRK06197, short chain dehydrogenase; Provisional	NA|137aa|up_6|NZ_AP014704.1_2200108_2200519_-	cd04706, PLA2_plant, PLA2_plant: Plant-specific sub-family of  Phospholipase A2, a super-family of secretory and cytosolic enzymes; the latter are either Ca dependent or Ca independent	NA|74aa|up_5|NZ_AP014704.1_2200804_2201026_+	pfam18557, NepR, Anti-sigma factor NepR	NA|156aa|up_4|NZ_AP014704.1_2201182_2201650_+	pfam10011, DUF2254, Predicted membrane protein (DUF2254)	NA|53aa|up_3|NZ_AP014704.1_2202416_2202575_-	NA	NA|101aa|up_2|NZ_AP014704.1_2202839_2203142_-	NA	NA|176aa|up_1|NZ_AP014704.1_2203292_2203820_-	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|89aa|up_0|NZ_AP014704.1_2203909_2204176_-	cd00291, SirA_YedF_YeeD, SirA, YedF, and YeeD	NA|456aa|down_0|NZ_AP014704.1_2205024_2206392_-	PLN02611, PLN02611, glutamate--cysteine ligase	NA|1179aa|down_1|NZ_AP014704.1_2206918_2210455_+	PRK05673, dnaE, DNA polymerase III subunit alpha; Validated	NA|215aa|down_2|NZ_AP014704.1_2210576_2211221_-	NA	NA|355aa|down_3|NZ_AP014704.1_2211223_2212288_-	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|389aa|down_4|NZ_AP014704.1_2212570_2213737_-	COG3214, COG3214, Uncharacterized protein conserved in bacteria [Function unknown]	NA|610aa|down_5|NZ_AP014704.1_2213741_2215571_-	PRK12448, PRK12448, dihydroxy-acid dehydratase; Provisional	NA|89aa|down_6|NZ_AP014704.1_2215664_2215931_-	PRK14959, PRK14959, DNA polymerase III subunits gamma and tau; Provisional	NA|618aa|down_7|NZ_AP014704.1_2216193_2218047_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|255aa|down_8|NZ_AP014704.1_2218168_2218933_-	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|302aa|down_9|NZ_AP014704.1_2219205_2220111_-	COG2084, MmsB, 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases [Lipid metabolism]
GCF_001548015.1_ASM154801v1	NZ_AP014704	Methylobacterium aquaticum strain MA-22A	3	2896846-2897330	1,3,1	CRT,CRISPRCasFinder,PILER-CR	no	cas5,cas8c,cas7,cas1,cas2	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas1,cas2,DinG	Type I-U,Type I-C, Type I-U?	GTCGCTCCCTCACGGGNGGCGCGGATCGAAAC,GTCGCTCCCTCACGGGGGCGCGGATCGAAAC,GTCGCTCCCTCACGGGGGCGCGGATCGAAAC	32,31,31	0	0	NA	NA	NA:NA:NA	7,5,3	7	TypeI-U,TypeI-C,TypeI-U?	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas1,cas2,DinG,c2c9_V-U4,Cas14u_CAS-V	NA|110aa|up_5|NZ_AP014704.1_2889946_2890276_-,NA|133aa|down_0|NZ_AP014704.1_2899946_2900345_-	NA|789aa|up_9|NZ_AP014704.1_2882226_2884593_+	PRK08255, PRK08255, bifunctional salicylyl-CoA 5-hydroxylase/oxidoreductase	NA|562aa|up_8|NZ_AP014704.1_2884766_2886452_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|524aa|up_7|NZ_AP014704.1_2887100_2888672_+	PRK00915, PRK00915, 2-isopropylmalate synthase; Validated	NA|237aa|up_6|NZ_AP014704.1_2889249_2889960_+	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|110aa|up_5|NZ_AP014704.1_2889946_2890276_-	NA	cas5|266aa|up_4|NZ_AP014704.1_2890828_2891626_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|609aa|up_3|NZ_AP014704.1_2891622_2893449_+	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas7|312aa|up_2|NZ_AP014704.1_2893438_2894374_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas1|180aa|up_1|NZ_AP014704.1_2894286_2894826_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_0|NZ_AP014704.1_2894835_2895126_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|133aa|down_0|NZ_AP014704.1_2899946_2900345_-	NA	NA|74aa|down_1|NZ_AP014704.1_2900277_2900499_+	pfam00872, Transposase_mut, Transposase, Mutator family	NA|161aa|down_2|NZ_AP014704.1_2901170_2901653_+	cd07313, terB_like_2, tellurium resistance terB-like protein, subgroup 2	NA|250aa|down_3|NZ_AP014704.1_2901652_2902402_+	PRK06490, PRK06490, glutamine amidotransferase; Provisional	NA|124aa|down_4|NZ_AP014704.1_2902398_2902770_+	pfam03788, LrgA, LrgA family	NA|239aa|down_5|NZ_AP014704.1_2902766_2903483_+	pfam04172, LrgB, LrgB-like family	NA|411aa|down_6|NZ_AP014704.1_2903633_2904866_-	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|160aa|down_7|NZ_AP014704.1_2905273_2905753_+	COG1522, Lrp, Transcriptional regulators [Transcription]	NA|106aa|down_8|NZ_AP014704.1_2906024_2906342_+	pfam14417, MEDS, MEDS: MEthanogen/methylotroph, DcmR Sensory domain	NA|368aa|down_9|NZ_AP014704.1_2906352_2907456_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase
GCF_001548015.1_ASM154801v1	NZ_AP014704	Methylobacterium aquaticum strain MA-22A	4	4910997-4911147	2	PILER-CR	no		DEDDh,csa3,cas3,cas5,cas8c,cas7,cas1,cas2,DinG	Orphan	GCGATCCCGATAGTCGCGGTCGCGATAGCCGTAGTCG	37	0	0	NA	NA	NA	2	2	Orphan	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas1,cas2,DinG,c2c9_V-U4,Cas14u_CAS-V	NA|95aa|up_8|NZ_AP014704.1_4902500_4902785_-,NA|101aa|up_7|NZ_AP014704.1_4902781_4903084_-,NA|85aa|up_5|NZ_AP014704.1_4903459_4903714_-,NA|148aa|up_3|NZ_AP014704.1_4904687_4905131_-,NA	NA|140aa|up_9|NZ_AP014704.1_4902081_4902501_-	pfam11367, DUF3168, Protein of unknown function (DUF3168)	NA|95aa|up_8|NZ_AP014704.1_4902500_4902785_-	NA	NA|101aa|up_7|NZ_AP014704.1_4902781_4903084_-	NA	NA|125aa|up_6|NZ_AP014704.1_4903083_4903458_-	pfam13262, DUF4054, Protein of unknown function (DUF4054)	NA|85aa|up_5|NZ_AP014704.1_4903459_4903714_-	NA	NA|316aa|up_4|NZ_AP014704.1_4903726_4904674_-	COG4834, COG4834, Uncharacterized protein conserved in bacteria [Function unknown]	NA|148aa|up_3|NZ_AP014704.1_4904687_4905131_-	NA	NA|339aa|up_2|NZ_AP014704.1_4905130_4906147_-	pfam09979, DUF2213, Uncharacterized protein conserved in bacteria (DUF2213)	NA|422aa|up_1|NZ_AP014704.1_4906488_4907754_-	pfam06381, DUF1073, Protein of unknown function (DUF1073)	NA|456aa|up_0|NZ_AP014704.1_4907753_4909121_-	COG5362, COG5362, Phage-related terminase [General function prediction only]	NA|707aa|down_0|NZ_AP014704.1_4911669_4913790_-	TIGR01783, Ferrienterobactin_receptor, TonB-dependent siderophore receptor	NA|333aa|down_1|NZ_AP014704.1_4914434_4915433_+	TIGR03945, cysteine_synthase, 2,3-diaminopropionate biosynthesis protein SbnA	NA|337aa|down_2|NZ_AP014704.1_4915477_4916488_+	TIGR03944, ornithine_cyclodeaminase, 2,3-diaminopropionate biosynthesis protein SbnB	NA|607aa|down_3|NZ_AP014704.1_4916500_4918321_+	COG4264, RhbC, Siderophore synthetase component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|399aa|down_4|NZ_AP014704.1_4918317_4919514_+	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|1151aa|down_5|NZ_AP014704.1_4919510_4922963_+	COG4264, RhbC, Siderophore synthetase component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|261aa|down_6|NZ_AP014704.1_4922959_4923742_+	COG3836, HpcH, 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase [Carbohydrate transport and metabolism]	NA|403aa|down_7|NZ_AP014704.1_4923738_4924947_+	cd06843, PLPDE_III_PvsE_like, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme PvsE	NA|165aa|down_8|NZ_AP014704.1_4924949_4925444_+	cd16400, ParB_Srx_like_nuclease, ParB/Srx_like nuclease and putative transcriptional regulators related to SbnI	NA|674aa|down_9|NZ_AP014704.1_4931239_4933261_-	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional
GCF_001548015.1_ASM154801v1	NZ_AP014704	Methylobacterium aquaticum strain MA-22A	5	4981495-4981950	3,4,2	PILER-CR,CRISPRCasFinder,CRT	no		DEDDh,csa3,cas3,cas5,cas8c,cas7,cas1,cas2,DinG	Orphan	GGCTCCCCCGCACTCGCGGGGATCGACCC,GGCTCCCCCGCACTCGCGGGGATCGACCC,GGCTCCCCCGCACTCGCGGGGATCGACCC	29,29,29	0	0	NA	NA	NA:NA:NA	5,7,7	7	Orphan	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas1,cas2,DinG,c2c9_V-U4,Cas14u_CAS-V	NA|86aa|up_8|NZ_AP014704.1_4968189_4968447_+,NA	NA|244aa|up_9|NZ_AP014704.1_4967257_4967989_+	pfam01695, IstB_IS21, IstB-like ATP binding protein	NA|86aa|up_8|NZ_AP014704.1_4968189_4968447_+	NA	NA|512aa|up_7|NZ_AP014704.1_4969824_4971360_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|466aa|up_6|NZ_AP014704.1_4971356_4972754_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|443aa|up_5|NZ_AP014704.1_4973838_4975167_+	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|364aa|up_4|NZ_AP014704.1_4975175_4976267_+	cd07402, MPP_GpdQ, Enterobacter aerogenes GpdQ and related proteins, metallophosphatase domain	NA|424aa|up_3|NZ_AP014704.1_4976523_4977795_+	cd01118, ArsB_permease, Anion permease ArsB	NA|286aa|up_2|NZ_AP014704.1_4977908_4978766_+	pfam04657, DMT_YdcZ, Putative inner membrane exporter, YdcZ	NA|448aa|up_1|NZ_AP014704.1_4978811_4980155_-	PRK11274, glcF, glycolate oxidase subunit GlcF	NA|323aa|up_0|NZ_AP014704.1_4980311_4981280_-	cd01045, Ferritin_like_AB, Uncharacterized family of ferritin-like proteins found in archaea and bacteria	NA|249aa|down_0|NZ_AP014704.1_4982145_4982892_+	cd07983, LPLAT_DUF374-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: DUF374	NA|445aa|down_1|NZ_AP014704.1_4982888_4984223_+	PRK05749, PRK05749, 3-deoxy-D-manno-octulosonic-acid transferase; Reviewed	NA|331aa|down_2|NZ_AP014704.1_4984219_4985212_+	PRK00652, lpxK, tetraacyldisaccharide 4'-kinase; Reviewed	NA|74aa|down_3|NZ_AP014704.1_4985197_4985419_-	COG3908, COG3908, Uncharacterized protein conserved in bacteria [Function unknown]	NA|133aa|down_4|NZ_AP014704.1_4985547_4985946_+	TIGR01354, Cytidine_deaminase, cytidine deaminase, homotetrameric	NA|277aa|down_5|NZ_AP014704.1_4985942_4986773_+	PRK08202, PRK08202, purine nucleoside phosphorylase; Provisional	NA|438aa|down_6|NZ_AP014704.1_4986796_4988110_+	PRK05820, deoA, thymidine phosphorylase; Reviewed	NA|136aa|down_7|NZ_AP014704.1_4988267_4988675_-	TIGR02473, conserved_hypothetical_protein, flagellar export protein FliJ	NA|468aa|down_8|NZ_AP014704.1_4988764_4990168_-	PRK08927, fliI, flagellar protein export ATPase FliI	NA|234aa|down_9|NZ_AP014704.1_4990537_4991239_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]
GCF_001548015.1_ASM154801v1	NZ_AP014705	Methylobacterium aquaticum strain MA-22A plasmid pMaq22A_1p, complete sequence	1	251279-251436	1	CRISPRCasFinder	no	csa3	csa3,c2c9_V-U4,Cas14u_CAS-V	Type I-A	CACGCCCACCACGATCACGGCCA	23	0	0	NA	NA	NA	3	3	Orphan	DEDDh,csa3,cas3,cas5,cas8c,cas7,cas1,cas2,DinG,c2c9_V-U4,Cas14u_CAS-V	NA|142aa|up_8|NZ_AP014705.1_243674_244100_+,NA|169aa|down_3|NZ_AP014705.1_259310_259817_-	NA|1251aa|up_9|NZ_AP014705.1_239487_243240_+	PRK09490, metH, B12-dependent methionine synthase; Provisional	NA|142aa|up_8|NZ_AP014705.1_243674_244100_+	NA	NA|71aa|up_7|NZ_AP014705.1_244119_244332_-	pfam03299, TF_AP-2, Transcription factor AP-2	NA|182aa|up_6|NZ_AP014705.1_244477_245023_+	pfam08714, Fae, Formaldehyde-activating enzyme (Fae)	NA|285aa|up_5|NZ_AP014705.1_245146_246001_+	pfam01391, Collagen, Collagen triple helix repeat (20 copies)	NA|415aa|up_4|NZ_AP014705.1_246085_247330_-	TIGR03862, flavo_PP4765, uncharacterized flavoprotein, PP_4765 family	NA|351aa|up_3|NZ_AP014705.1_247549_248602_+	TIGR03718, R_switched_Alx, integral membrane protein, TerC family	NA|252aa|up_2|NZ_AP014705.1_248639_249395_-	COG3023, ampD, N-acetyl-anhydromuramyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]	NA|247aa|up_1|NZ_AP014705.1_249399_250140_-	PRK09430, djlA, co-chaperone DjlA	NA|90aa|up_0|NZ_AP014705.1_250304_250574_-	pfam05818, TraT, Enterobacterial TraT complement resistance protein	NA|784aa|down_0|NZ_AP014705.1_251849_254201_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|344aa|down_1|NZ_AP014705.1_254340_255372_+	PRK11815, PRK11815, tRNA dihydrouridine(20/20a) synthase DusA	NA|539aa|down_2|NZ_AP014705.1_257599_259216_-	cd05936, FC-FACS_FadD_like, Prokaryotic long-chain fatty acid CoA synthetases similar to Escherichia coli FadD	NA|169aa|down_3|NZ_AP014705.1_259310_259817_-	NA	NA|159aa|down_4|NZ_AP014705.1_259956_260433_-	PRK00226, greA, transcription elongation factor GreA; Reviewed	NA|1158aa|down_5|NZ_AP014705.1_260557_264031_-	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|261aa|down_6|NZ_AP014705.1_264196_264979_+	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|138aa|down_7|NZ_AP014705.1_265173_265587_+	pfam04138, GtrA, GtrA-like protein	NA|1184aa|down_8|NZ_AP014705.1_265945_269497_-	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|184aa|down_9|NZ_AP014705.1_269493_270045_-	COG2080, CoxS, Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]
