assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_012769535.1_ASM1276953v1	NZ_CP046565	Methylococcus sp. IM1 chromosome, complete genome	1	1035970-1043440	1,1,1,2,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1	Cas9_archaeal,DEDDh,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1,cas2,cas4,cas7b,cas8c,cas5,cas3,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx16,csa3	Type I-F	TTTCTGAGCTGCCTATGCGGCAGTGAAG,TTTCTGAGCTGCCTATGCGGCAGTGAAG,TTTCTGAGCTGCCTATGCGGCAGTGAAG,TTTCTGAGCTGCCTATGCGGCAGTGAAG,TTTCTGAGCTGCCTATGCGGCAGTGAAG	28,28,28,28,28	0	0	NA	NA	I-F:I-F:I-F:I-F:I-F	119,124,124,119,119	124	TypeI-F	Cas9_archaeal,DEDDh,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1,cas2,cas4,cas7b,cas8c,cas5,cas3,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx16,csa3	NA|109aa|up_3|NZ_CP046565.1_1031386_1031713_-,NA	NA|266aa|up_9|NZ_CP046565.1_1025384_1026182_-	PRK05406, PRK05406, 5-oxoprolinase subunit PxpA	NA|330aa|up_8|NZ_CP046565.1_1026190_1027180_-	smart00797, AHS2, Allophanate hydrolase subunit 2	NA|233aa|up_7|NZ_CP046565.1_1027176_1027875_-	pfam02682, CT_C_D, Carboxyltransferase domain, subdomain C and D	NA|380aa|up_6|NZ_CP046565.1_1028430_1029570_+	cd01298, ATZ_TRZ_like, TRZ/ATZ family contains enzymes from the atrazine degradation pathway and related hydrolases	NA|284aa|up_5|NZ_CP046565.1_1029790_1030642_-	cd19071, AKR_AKR1-5-like, AKR1/2/3/4/5 family of aldo-keto reductase (AKR) and similar proteins	NA|205aa|up_4|NZ_CP046565.1_1030652_1031267_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|109aa|up_3|NZ_CP046565.1_1031386_1031713_-	NA	NA|432aa|up_2|NZ_CP046565.1_1031732_1033028_-	COG0004, AmtB, Ammonia permease [Inorganic ion transport and metabolism]	NA|417aa|up_1|NZ_CP046565.1_1033296_1034547_+	pfam07642, BBP2, Putative beta-barrel porin-2, OmpL-like	NA|413aa|up_0|NZ_CP046565.1_1034655_1035893_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas6f|188aa|down_0|NZ_CP046565.1_1043570_1044134_-	pfam09618, Cas_Csy4, CRISPR-associated protein (Cas_Csy4)	cas7f|350aa|down_1|NZ_CP046565.1_1044137_1045187_-	pfam09615, Cas_Csy3, CRISPR-associated protein (Cas_Csy3)	cas5f|328aa|down_2|NZ_CP046565.1_1045190_1046174_-	pfam09614, Cas_Csy2, CRISPR-associated protein (Cas_Csy2)	cas8f|450aa|down_3|NZ_CP046565.1_1046166_1047516_-	cd09735, Csy1_I-F, CRISPR/Cas system-associated protein Csy1	NA|107aa|down_4|NZ_CP046565.1_1047588_1047909_-	COG2944, COG2944, Predicted transcriptional regulator [Transcription]	cas3-cas2|1105aa|down_5|NZ_CP046565.1_1048338_1051653_-	TIGR02562, conserved_hypothetical_protein, CRISPR-associated helicase Cas3, subtype I-F/YPEST	cas1|326aa|down_6|NZ_CP046565.1_1051649_1052627_-	TIGR03637, cas1_YPEST, CRISPR-associated endonuclease Cas1, subtype I-F/YPEST	NA|355aa|down_7|NZ_CP046565.1_1052789_1053854_-	PRK00892, lpxD, UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; Provisional	NA|267aa|down_8|NZ_CP046565.1_1053899_1054700_-	PRK00278, trpC, indole-3-glycerol phosphate synthase TrpC	NA|342aa|down_9|NZ_CP046565.1_1054705_1055731_-	PRK00188, trpD, anthranilate phosphoribosyltransferase; Provisional
GCF_012769535.1_ASM1276953v1	NZ_CP046565	Methylococcus sp. IM1 chromosome, complete genome	2	1217202-1220898	4,2,2	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas7b,cas8c,cas5,cas3,WYL	Cas9_archaeal,DEDDh,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1,cas2,cas4,cas7b,cas8c,cas5,cas3,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx16,csa3	Type I-U, Type I-U?,Type I-C	GTTTCAATCCACTCCCGGCTATTGAGCCGGGAGATAC,GTTTCAATCCACTCCCGGCTATTGAGCCGGGAGATAC,GTTTCAATCCACTCCCGGCTATTGAGCCGGGAGATAC	37,37,37	1	1	1217882-1217917	NZ_CP046565.1_929219-929184	I-C:I-C:I-C	51,51,51	51	TypeI-U,TypeI-U?,TypeI-C	Cas9_archaeal,DEDDh,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1,cas2,cas4,cas7b,cas8c,cas5,cas3,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx16,csa3	NA|199aa|up_0|NZ_CP046565.1_1216532_1217129_+,NA	NA|375aa|up_9|NZ_CP046565.1_1208798_1209923_+	cd03811, GT4_GT28_WabH-like, family 4 and family 28 glycosyltransferases similar to Klebsiella WabH	NA|361aa|up_8|NZ_CP046565.1_1209930_1211013_-	cd03794, GT4_WbuB-like, Escherichia coli WbuB and similar proteins	NA|280aa|up_7|NZ_CP046565.1_1211009_1211849_-	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|262aa|up_6|NZ_CP046565.1_1211845_1212631_-	pfam13230, GATase_4, Glutamine amidotransferases class-II	NA|309aa|up_5|NZ_CP046565.1_1212837_1213764_-	pfam02424, ApbE, ApbE family	NA|484aa|up_4|NZ_CP046565.1_1213771_1215223_-	pfam12094, DUF3570, Protein of unknown function (DUF3570)	NA|71aa|up_3|NZ_CP046565.1_1215209_1215422_-	pfam14086, DUF4266, Domain of unknown function (DUF4266)	NA|161aa|up_2|NZ_CP046565.1_1215487_1215970_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|129aa|up_1|NZ_CP046565.1_1216084_1216471_+	pfam04241, DUF423, Protein of unknown function (DUF423)	NA|199aa|up_0|NZ_CP046565.1_1216532_1217129_+	NA	cas2|97aa|down_0|NZ_CP046565.1_1221085_1221376_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|345aa|down_1|NZ_CP046565.1_1221385_1222420_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|217aa|down_2|NZ_CP046565.1_1222416_1223067_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	NA|63aa|down_3|NZ_CP046565.1_1223095_1223284_-	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	cas7b|319aa|down_4|NZ_CP046565.1_1223341_1224298_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|658aa|down_5|NZ_CP046565.1_1224314_1226288_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas5|280aa|down_6|NZ_CP046565.1_1226290_1227130_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|766aa|down_7|NZ_CP046565.1_1227126_1229424_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	WYL|324aa|down_8|NZ_CP046565.1_1229822_1230794_-	pfam13280, WYL, WYL domain	NA|92aa|down_9|NZ_CP046565.1_1230833_1231109_-	cd02227, cupin_TM1112-like, Thermotoga maritima TM1112 and related proteins, cupin domain
GCF_012769535.1_ASM1276953v1	NZ_CP046565	Methylococcus sp. IM1 chromosome, complete genome	3	2377157-2378721	5,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx16,cas2,cas1	Cas9_archaeal,DEDDh,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1,cas2,cas4,cas7b,cas8c,cas5,cas3,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx16,csa3	Type III-C,Type III-A,Type III-B,Type III-D	GTAGTAATCGAGACCTGATGAAGAAGGGATTAAGAC,GTAGTAATCGAGACCTGATGAAGAAGGGATTAAGAC,GTAGTAATCGAGACCTGATGAAGAAGGGATTAAGAC	36,36,36	0	0	NA	NA	NA:NA:NA	21,22,22	22	TypeIII-C,TypeIII-A,TypeIII-B,TypeIII-D	Cas9_archaeal,DEDDh,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1,cas2,cas4,cas7b,cas8c,cas5,cas3,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx16,csa3	NA,NA	csm2gr11|166aa|up_9|NZ_CP046565.1_2368885_2369383_+	pfam03750, Csm2_III-A, Csm2 Type III-A	csm3gr7|242aa|up_8|NZ_CP046565.1_2369399_2370125_+	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm4gr5|325aa|up_7|NZ_CP046565.1_2370245_2371220_+	TIGR01903, Hypothetical_protein	csm5gr7|569aa|up_6|NZ_CP046565.1_2371216_2372923_+	COG0330, HflC, Membrane protease subunits, stomatin/prohibitin homologs [Posttranslational modification, protein turnover, chaperones]	csx16|100aa|up_5|NZ_CP046565.1_2372983_2373283_+	pfam09652, Cas_VVA1548, Putative CRISPR-associated protein (Cas_VVA1548)	csx1|391aa|up_4|NZ_CP046565.1_2373355_2374528_+	TIGR02221, CRISPR-associated_protein_Csx1_2, CRISPR-associated protein, TM1812 family	cas2|95aa|up_3|NZ_CP046565.1_2374627_2374912_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|325aa|up_2|NZ_CP046565.1_2374919_2375894_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|258aa|up_1|NZ_CP046565.1_2375890_2376664_+	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	cas2|106aa|up_0|NZ_CP046565.1_2376667_2376985_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|475aa|down_0|NZ_CP046565.1_2378891_2380316_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|112aa|down_1|NZ_CP046565.1_2380317_2380653_+	TIGR03030, Cellulose_synthase_UDP-forming, cellulose synthase catalytic subunit (UDP-forming)	NA|784aa|down_2|NZ_CP046565.1_2380654_2383006_-	pfam06934, CTI, Fatty acid cis/trans isomerase (CTI)	NA|1130aa|down_3|NZ_CP046565.1_2383321_2386711_+	COG3459, COG3459, Cellobiose phosphorylase [Carbohydrate transport and metabolism]	NA|812aa|down_4|NZ_CP046565.1_2386866_2389302_+	PRK05261, PRK05261, phosphoketolase	NA|394aa|down_5|NZ_CP046565.1_2389308_2390490_+	PRK00180, PRK00180, acetate kinase A/propionate kinase 2; Reviewed	NA|179aa|down_6|NZ_CP046565.1_2390540_2391077_+	pfam09831, DUF2058, Uncharacterized protein conserved in bacteria (DUF2058)	NA|550aa|down_7|NZ_CP046565.1_2391306_2392956_+	PRK07994, PRK07994, DNA polymerase III subunits gamma and tau; Validated	NA|108aa|down_8|NZ_CP046565.1_2392960_2393284_+	PRK00153, PRK00153, YbaB/EbfC family nucleoid-associated protein	NA|199aa|down_9|NZ_CP046565.1_2393290_2393887_+	PRK00076, recR, recombination protein RecR; Reviewed
GCF_012769535.1_ASM1276953v1	NZ_CP046565	Methylococcus sp. IM1 chromosome, complete genome	4	3293382-3293484	4	CRISPRCasFinder	no		Cas9_archaeal,DEDDh,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1,cas2,cas4,cas7b,cas8c,cas5,cas3,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx16,csa3	Orphan	CTCGCAGTACTCGCCCCGCGAGTGGCG	27	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,DEDDh,WYL,cas6f,cas7f,cas5f,cas8f,cas3-cas2,cas1,cas2,cas4,cas7b,cas8c,cas5,cas3,DinG,cas6,csx1,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx16,csa3	NA|85aa|up_8|NZ_CP046565.1_3286022_3286277_+,NA|67aa|up_5|NZ_CP046565.1_3288110_3288311_+,NA|106aa|up_2|NZ_CP046565.1_3291226_3291544_+,NA|159aa|up_1|NZ_CP046565.1_3291932_3292409_+,NA|218aa|up_0|NZ_CP046565.1_3292726_3293380_+,NA|100aa|down_0|NZ_CP046565.1_3293862_3294162_+,NA|86aa|down_1|NZ_CP046565.1_3294255_3294513_+	NA|423aa|up_9|NZ_CP046565.1_3284761_3286030_+	pfam13795, HupE_UreJ_2, HupE / UreJ protein	NA|85aa|up_8|NZ_CP046565.1_3286022_3286277_+	NA	NA|279aa|up_7|NZ_CP046565.1_3286284_3287121_-	cd02237, cupin_DAD_ChrR, 2,4'-Dihydroxyacetophenone dioxygenase (DAD) and anti-sigma factor ChrR, and similar proteins; cupin domain	NA|215aa|up_6|NZ_CP046565.1_3287125_3287770_-	PRK09646, PRK09646, ECF RNA polymerase sigma factor SigK	NA|67aa|up_5|NZ_CP046565.1_3288110_3288311_+	NA	NA|77aa|up_4|NZ_CP046565.1_3288508_3288739_+	COG3311, AlpA, Predicted transcriptional regulator [Transcription]	NA|834aa|up_3|NZ_CP046565.1_3288735_3291237_+	pfam13148, DUF3987, Protein of unknown function (DUF3987)	NA|106aa|up_2|NZ_CP046565.1_3291226_3291544_+	NA	NA|159aa|up_1|NZ_CP046565.1_3291932_3292409_+	NA	NA|218aa|up_0|NZ_CP046565.1_3292726_3293380_+	NA	NA|100aa|down_0|NZ_CP046565.1_3293862_3294162_+	NA	NA|86aa|down_1|NZ_CP046565.1_3294255_3294513_+	NA	NA|401aa|down_2|NZ_CP046565.1_3294493_3295696_+	pfam13481, AAA_25, AAA domain	NA|139aa|down_3|NZ_CP046565.1_3296128_3296545_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|192aa|down_4|NZ_CP046565.1_3296624_3297200_+	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|409aa|down_5|NZ_CP046565.1_3297207_3298434_-	PRK09692, PRK09692, integrase; Provisional	NA|43aa|down_6|NZ_CP046565.1_3300446_3300575_+	pfam13333, rve_2, Integrase core domain	NA|147aa|down_7|NZ_CP046565.1_3300620_3301061_-	pfam13701, DDE_Tnp_1_4, Transposase DDE domain group 1	NA|256aa|down_8|NZ_CP046565.1_3302233_3303001_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|419aa|down_9|NZ_CP046565.1_3302993_3304250_-	cd03811, GT4_GT28_WabH-like, family 4 and family 28 glycosyltransferases similar to Klebsiella WabH
