assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000022685.1_ASM2268v1	NC_012808	Methylorubrum extorquens AM1, complete sequence	1	703389-703475	1	CRISPRCasFinder	no		RT,DEDDh,csa3,cas3,WYL	Orphan	TATGGCGGTTATTACGGCTACGGC	24	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,cas3,WYL,c2c9_V-U4,csf3gr5,DinG,csf2gr7,cas14j	NA|167aa|up_6|NC_012808.1_696901_697402_+,NA|63aa|up_5|NC_012808.1_697522_697711_-,NA|226aa|down_6|NC_012808.1_707087_707765_-	NA|361aa|up_9|NC_012808.1_693839_694922_+	COG1463, Ttg2C, ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|208aa|up_8|NC_012808.1_695042_695666_+	COG3218, COG3218, ABC-type uncharacterized transport system, auxiliary component [General function prediction only]	NA|273aa|up_7|NC_012808.1_695717_696536_-	COG0483, SuhB, Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family [Carbohydrate transport and metabolism]	NA|167aa|up_6|NC_012808.1_696901_697402_+	NA	NA|63aa|up_5|NC_012808.1_697522_697711_-	NA	NA|317aa|up_4|NC_012808.1_697754_698705_-	PRK12878, ubiA, 4-hydroxybenzoate octaprenyltransferase	NA|201aa|up_3|NC_012808.1_698741_699344_-	COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein]	NA|251aa|up_2|NC_012808.1_699651_700404_+	PRK11713, PRK11713, 16S ribosomal RNA methyltransferase RsmE; Provisional	NA|328aa|up_1|NC_012808.1_700433_701417_+	cd08964, L-asparaginase_II, Type II (periplasmic) bacterial L-asparaginase	NA|457aa|up_0|NC_012808.1_701603_702974_+	PLN02611, PLN02611, glutamate--cysteine ligase	NA|89aa|down_0|NC_012808.1_703569_703836_+	cd00291, SirA_YedF_YeeD, SirA, YedF, and YeeD	NA|189aa|down_1|NC_012808.1_703875_704442_+	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|133aa|down_2|NC_012808.1_704898_705297_+	pfam12616, DUF3775, Protein of unknown function (DUF3775)	NA|147aa|down_3|NC_012808.1_705445_705886_-	cd04706, PLA2_plant, PLA2_plant: Plant-specific sub-family of  Phospholipase A2, a super-family of secretory and cytosolic enzymes; the latter are either Ca dependent or Ca independent	NA|231aa|down_4|NC_012808.1_706131_706824_-	COG3577, COG3577, Predicted aspartyl protease [General function prediction only]	NA|71aa|down_5|NC_012808.1_706820_707033_-	pfam06945, DUF1289, Protein of unknown function (DUF1289)	NA|226aa|down_6|NC_012808.1_707087_707765_-	NA	NA|276aa|down_7|NC_012808.1_707779_708607_-	PRK00235, cobS, cobalamin synthase; Reviewed	NA|346aa|down_8|NC_012808.1_708774_709812_+	PRK00105, cobT, nicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase; Reviewed	NA|514aa|down_9|NC_012808.1_709818_711360_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]
GCF_000022685.1_ASM2268v1	NC_012808	Methylorubrum extorquens AM1, complete sequence	2	1858141-1858385	2	CRISPRCasFinder	no		RT,DEDDh,csa3,cas3,WYL	Orphan	ACCCGGACAACGACGGCACCCTCGAC	26	0	0	NA	NA	NA	3	3	Orphan	RT,DEDDh,csa3,cas3,WYL,c2c9_V-U4,csf3gr5,DinG,csf2gr7,cas14j	NA|93aa|up_4|NC_012808.1_1852098_1852377_+,NA|210aa|up_1|NC_012808.1_1854023_1854653_+,NA|163aa|down_7|NC_012808.1_1868542_1869031_+	NA|237aa|up_9|NC_012808.1_1847616_1848327_-	PRK02227, PRK02227, (5-formylfuran-3-yl)methyl phosphate synthase	NA|178aa|up_8|NC_012808.1_1848372_1848906_-	pfam11684, DUF3280, Protein of unknown function (DUF2380)	NA|423aa|up_7|NC_012808.1_1849137_1850406_+	TIGR03863, PQQ_ABC_bind, ABC transporter, substrate binding protein, PQQ-dependent alcohol dehydrogenase system	NA|329aa|up_6|NC_012808.1_1850572_1851559_+	TIGR03866, PQQ_ABC_repeats, PQQ-dependent catabolism-associated beta-propeller protein	NA|145aa|up_5|NC_012808.1_1851650_1852085_+	cd04210, Cupredoxin_like_1, Uncharacterized Cupredoxin-like subfamily	NA|93aa|up_4|NC_012808.1_1852098_1852377_+	NA	NA|250aa|up_3|NC_012808.1_1852373_1853123_+	TIGR03864, PQQ_ABC_ATP, ABC transporter, ATP-binding subunit, PQQ-dependent alcohol dehydrogenase system	NA|298aa|up_2|NC_012808.1_1853119_1854013_+	TIGR03861, ABC_efflux_transporter_permease_protein, alcohol ABC transporter, permease protein	NA|210aa|up_1|NC_012808.1_1854023_1854653_+	NA	NA|840aa|up_0|NC_012808.1_1855200_1857720_+	pfam00593, TonB_dep_Rec, TonB dependent receptor	NA|120aa|down_0|NC_012808.1_1858641_1859001_+	PRK00068, PRK00068, hypothetical protein; Validated	NA|519aa|down_1|NC_012808.1_1859104_1860661_-	TIGR03023, Sugar_transferase	NA|420aa|down_2|NC_012808.1_1860792_1862052_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|193aa|down_3|NC_012808.1_1862224_1862803_-	COG1596, Wza, Periplasmic protein involved in polysaccharide export, contains    SLBB domain of b-grasp fold [Cell wall/membrane/envelope biogenesis]	NA|805aa|down_4|NC_012808.1_1863024_1865439_+	COG3206, GumC, Uncharacterized protein involved in exopolysaccharide biosynthesis [Cell envelope biogenesis, outer membrane]	NA|108aa|down_5|NC_012808.1_1865992_1866316_-	PRK00153, PRK00153, YbaB/EbfC family nucleoid-associated protein	NA|627aa|down_6|NC_012808.1_1866405_1868286_-	PRK09111, PRK09111, DNA polymerase III subunits gamma and tau; Validated	NA|163aa|down_7|NC_012808.1_1868542_1869031_+	NA	NA|87aa|down_8|NC_012808.1_1869065_1869326_-	pfam13670, PepSY_2, Peptidase propeptide and YPEB domain	NA|513aa|down_9|NC_012808.1_1869475_1871014_+	COG0606, COG0606, Predicted ATPase with chaperone activity [Posttranslational modification, protein turnover, chaperones]
GCF_000022685.1_ASM2268v1	NC_012808	Methylorubrum extorquens AM1, complete sequence	3	2521865-2522136	3	CRISPRCasFinder	no		RT,DEDDh,csa3,cas3,WYL	Orphan	GGCAACAACGTCGTGCTCGGCGG	23	0	0	NA	NA	NA	4	4	Orphan	RT,DEDDh,csa3,cas3,WYL,c2c9_V-U4,csf3gr5,DinG,csf2gr7,cas14j	NA|292aa|up_4|NC_012808.1_2482429_2483305_+,NA|101aa|up_1|NC_012808.1_2485952_2486255_+,NA|79aa|down_9|NC_012808.1_2548152_2548389_-	NA|629aa|up_9|NC_012808.1_2475648_2477535_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|99aa|up_8|NC_012808.1_2477624_2477921_-	pfam11950, DUF3467, Protein of unknown function (DUF3467)	NA|351aa|up_7|NC_012808.1_2478307_2479360_+	pfam00891, Methyltransf_2, O-methyltransferase	NA|220aa|up_6|NC_012808.1_2479356_2480016_+	PRK08233, PRK08233, hypothetical protein; Provisional	NA|796aa|up_5|NC_012808.1_2480045_2482433_+	pfam04820, Trp_halogenase, Tryptophan halogenase	NA|292aa|up_4|NC_012808.1_2482429_2483305_+	NA	NA|311aa|up_3|NC_012808.1_2483322_2484255_+	COG0535, COG0535, Predicted Fe-S oxidoreductases [General function prediction only]	NA|310aa|up_2|NC_012808.1_2484386_2485316_-	PRK09685, PRK09685, DNA-binding transcriptional activator FeaR; Provisional	NA|101aa|up_1|NC_012808.1_2485952_2486255_+	NA	NA|267aa|up_0|NC_012808.1_2486251_2487052_+	pfam07277, SapC, SapC	NA|272aa|down_0|NC_012808.1_2534897_2535713_+	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	NA|264aa|down_1|NC_012808.1_2535717_2536509_+	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|460aa|down_2|NC_012808.1_2536647_2538027_+	COG1032, COG1032, Fe-S oxidoreductase [Energy production and conversion]	NA|438aa|down_3|NC_012808.1_2538023_2539337_+	COG1032, COG1032, Fe-S oxidoreductase [Energy production and conversion]	NA|480aa|down_4|NC_012808.1_2539343_2540783_-	COG2132, SufI, Putative multicopper oxidases [Secondary metabolites biosynthesis, transport, and catabolism]	NA|479aa|down_5|NC_012808.1_2541151_2542588_+	PRK01490, tig, trigger factor; Provisional	NA|209aa|down_6|NC_012808.1_2542719_2543346_+	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|424aa|down_7|NC_012808.1_2543597_2544869_+	PRK05342, clpX, ATP-dependent Clp protease ATP-binding subunit ClpX	NA|807aa|down_8|NC_012808.1_2545136_2547557_+	COG0466, Lon, ATP-dependent Lon protease, bacterial type [Posttranslational modification, protein turnover, chaperones]	NA|79aa|down_9|NC_012808.1_2548152_2548389_-	NA
GCF_000022685.1_ASM2268v1	NC_012808	Methylorubrum extorquens AM1, complete sequence	4	4675268-4675354	4	CRISPRCasFinder	no		RT,DEDDh,csa3,cas3,WYL	Orphan	CGGCTTTCGGGAAAAAGATGATGC	24	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,csa3,cas3,WYL,c2c9_V-U4,csf3gr5,DinG,csf2gr7,cas14j	NA|75aa|up_9|NC_012808.1_4663416_4663641_-,NA|123aa|up_4|NC_012808.1_4670880_4671249_+,NA|66aa|up_3|NC_012808.1_4671427_4671625_+,NA|114aa|up_2|NC_012808.1_4671691_4672033_-,NA|76aa|down_2|NC_012808.1_4676094_4676322_-,NA|65aa|down_4|NC_012808.1_4678141_4678336_-	NA|75aa|up_9|NC_012808.1_4663416_4663641_-	NA	NA|730aa|up_8|NC_012808.1_4663710_4665900_-	PRK10917, PRK10917, ATP-dependent DNA helicase RecG; Provisional	NA|99aa|up_7|NC_012808.1_4666091_4666388_+	COG2938, COG2938, Uncharacterized conserved protein [Function unknown]	NA|1197aa|up_6|NC_012808.1_4666601_4670192_+	COG1197, Mfd, Transcription-repair coupling factor (superfamily II helicase) [DNA replication, recombination, and repair / Transcription]	NA|70aa|up_5|NC_012808.1_4670568_4670778_+	COG1278, CspC, Cold shock proteins [Transcription]	NA|123aa|up_4|NC_012808.1_4670880_4671249_+	NA	NA|66aa|up_3|NC_012808.1_4671427_4671625_+	NA	NA|114aa|up_2|NC_012808.1_4671691_4672033_-	NA	NA|467aa|up_1|NC_012808.1_4672291_4673692_-	COG0318, CaiC, Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II [Lipid metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|418aa|up_0|NC_012808.1_4673989_4675243_+	PRK13557, PRK13557, histidine kinase; Provisional	NA|121aa|down_0|NC_012808.1_4675443_4675806_+	cd18161, REC_hyHK_blue-like, phosphoacceptor receiver (REC) domain of hybrid sensor histidine kinase/response regulators similar to Pseudomonas savastanoi blue-light-activated histidine kinase	NA|99aa|down_1|NC_012808.1_4675819_4676116_-	pfam05016, ParE_toxin, ParE toxin of type II toxin-antitoxin system, parDE	NA|76aa|down_2|NC_012808.1_4676094_4676322_-	NA	NA|546aa|down_3|NC_012808.1_4676365_4678003_-	PRK08162, PRK08162, acyl-CoA synthetase; Validated	NA|65aa|down_4|NC_012808.1_4678141_4678336_-	NA	NA|599aa|down_5|NC_012808.1_4678585_4680382_+	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|253aa|down_6|NC_012808.1_4680378_4681137_+	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily	NA|110aa|down_7|NC_012808.1_4681141_4681471_+	COG3952, COG3952, Predicted membrane protein [Function unknown]	NA|155aa|down_8|NC_012808.1_4681486_4681951_+	pfam11026, DUF2721, Protein of unknown function (DUF2721)	NA|504aa|down_9|NC_012808.1_4681954_4683466_-	COG2072, TrkA, Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism]
GCF_000022685.1_ASM2268v1	NC_012811	Methylorubrum extorquens AM1 megaplasmid, complete sequence	1	577825-578338	1,1,2	CRT,CRISPRCasFinder,CRISPRCasFinder	no	csf3gr5,DinG,csf2gr7,c2c9_V-U4	csa3,c2c9_V-U4,DEDDh,csf3gr5,DinG,csf2gr7,cas14j	Type IV-A	NCGGTTNACCCGCGCGNG,CGGTTCACCCGCGCGTGCGCGGAGGAGACAG,CGGTTCACCCGCGCGTGCGCGGAGGAGACAG	18,31,31	0	0	NA	NA	NA:NA:NA	8,2,2	8	TypeIV-A	RT,DEDDh,csa3,cas3,WYL,c2c9_V-U4,csf3gr5,DinG,csf2gr7,cas14j	NA|222aa|up_9|NC_012811.1_567554_568220_-,NA|141aa|up_8|NC_012811.1_568230_568653_-,NA|206aa|up_6|NC_012811.1_569683_570301_+,NA|433aa|up_5|NC_012811.1_570316_571615_-,NA|277aa|up_4|NC_012811.1_571635_572466_-,NA|619aa|up_3|NC_012811.1_572724_574581_+,NA|163aa|up_2|NC_012811.1_574774_575263_+,NA|268aa|up_1|NC_012811.1_575272_576076_-,csf3gr5|249aa|down_0|NC_012811.1_578458_579205_-,NA|265aa|down_4|NC_012811.1_582585_583380_-,NA|223aa|down_5|NC_012811.1_583366_584035_-,NA|244aa|down_6|NC_012811.1_584309_585041_+,NA|246aa|down_7|NC_012811.1_585052_585790_+,NA|644aa|down_9|NC_012811.1_587390_589322_-	NA|222aa|up_9|NC_012811.1_567554_568220_-	NA	NA|141aa|up_8|NC_012811.1_568230_568653_-	NA	NA|226aa|up_7|NC_012811.1_568674_569352_-	PRK13973, PRK13973, thymidylate kinase; Provisional	NA|206aa|up_6|NC_012811.1_569683_570301_+	NA	NA|433aa|up_5|NC_012811.1_570316_571615_-	NA	NA|277aa|up_4|NC_012811.1_571635_572466_-	NA	NA|619aa|up_3|NC_012811.1_572724_574581_+	NA	NA|163aa|up_2|NC_012811.1_574774_575263_+	NA	NA|268aa|up_1|NC_012811.1_575272_576076_-	NA	NA|257aa|up_0|NC_012811.1_576228_576999_+	PRK09039, PRK09039, peptidoglycan -binding protein	csf3gr5|249aa|down_0|NC_012811.1_578458_579205_-	NA	DinG|581aa|down_1|NC_012811.1_579201_580944_-	COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair]	DinG|171aa|down_2|NC_012811.1_580940_581453_-	COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair]	csf2gr7|373aa|down_3|NC_012811.1_581452_582571_-	cd09706, Csf2_U, CRISPR/Cas system-associated RAMP superfamily protein Csf2	NA|265aa|down_4|NC_012811.1_582585_583380_-	NA	NA|223aa|down_5|NC_012811.1_583366_584035_-	NA	NA|244aa|down_6|NC_012811.1_584309_585041_+	NA	NA|246aa|down_7|NC_012811.1_585052_585790_+	NA	c2c9_V-U4|403aa|down_8|NC_012811.1_586010_587219_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|644aa|down_9|NC_012811.1_587390_589322_-	NA
GCF_000022685.1_ASM2268v1	NC_012811	Methylorubrum extorquens AM1 megaplasmid, complete sequence	2	620384-620470	3	CRISPRCasFinder	no	c2c9_V-U4	csa3,c2c9_V-U4,DEDDh,csf3gr5,DinG,csf2gr7,cas14j	Type V-U4	CCGTTCCCCGCGCGAGCGGGGATGCT	26	0	0	NA	NA	I-E	1	1	TypeV-U4	RT,DEDDh,csa3,cas3,WYL,c2c9_V-U4,csf3gr5,DinG,csf2gr7,cas14j	NA|75aa|up_9|NC_012811.1_605963_606188_+,NA|82aa|up_7|NC_012811.1_612633_612879_-,NA|62aa|up_4|NC_012811.1_614843_615029_-,NA|277aa|up_2|NC_012811.1_617056_617887_+,NA|304aa|up_1|NC_012811.1_618072_618984_+,NA|287aa|up_0|NC_012811.1_619384_620245_+,NA|139aa|down_1|NC_012811.1_624144_624561_-,NA|156aa|down_5|NC_012811.1_628617_629085_+,NA|323aa|down_7|NC_012811.1_630380_631349_-	NA|75aa|up_9|NC_012811.1_605963_606188_+	NA	NA|2032aa|up_8|NC_012811.1_606456_612552_+	COG5184, ATS1, Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton]	NA|82aa|up_7|NC_012811.1_612633_612879_-	NA	NA|418aa|up_6|NC_012811.1_612890_614144_-	pfam07804, HipA_C, HipA-like C-terminal domain	NA|107aa|up_5|NC_012811.1_614127_614448_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|62aa|up_4|NC_012811.1_614843_615029_-	NA	NA|494aa|up_3|NC_012811.1_615557_617039_+	TIGR03743, SXT_TraD, conjugative coupling factor TraD, SXT/TOL subfamily	NA|277aa|up_2|NC_012811.1_617056_617887_+	NA	NA|304aa|up_1|NC_012811.1_618072_618984_+	NA	NA|287aa|up_0|NC_012811.1_619384_620245_+	NA	NA|1193aa|down_0|NC_012811.1_620562_624141_+	TIGR04346, DotA_TraY, conjugal transfer/type IV secretion protein DotA/TraY	NA|139aa|down_1|NC_012811.1_624144_624561_-	NA	NA|115aa|down_2|NC_012811.1_624922_625267_-	COG0727, COG0727, Predicted Fe-S-cluster oxidoreductase [General function prediction only]	NA|281aa|down_3|NC_012811.1_625565_626408_-	cd10440, GIY-YIG_COG3680, GIY-YIG domain of uncharacterized proteins from bacteria and their eukaryotic homologs	NA|602aa|down_4|NC_012811.1_626539_628345_-	COG4469, CoiA, Competence protein CoiA-like family, contains a predicted nuclease    domain [General function prediction only]	NA|156aa|down_5|NC_012811.1_628617_629085_+	NA	c2c9_V-U4|376aa|down_6|NC_012811.1_629202_630330_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|323aa|down_7|NC_012811.1_630380_631349_-	NA	NA|397aa|down_8|NC_012811.1_631417_632608_-	cd09256, AP_MuD_MHD, Mu-homology domain (MHD) of a adaptor protein (AP) encoded by mu-2 related death-inducing gene, MuD (also known as MUDENG)	NA|294aa|down_9|NC_012811.1_633318_634200_+	COG3637, COG3637, Opacity protein and related surface antigens [Cell envelope biogenesis, outer membrane]
