assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_900234795.1_TK0001_PRJEB23178_v1	NZ_LT962688	Methylorubrum extorquens strain TK 0001 chromosome TK0001	1	461520-461611	1	CRISPRCasFinder	no		DEDDh,csa3,cas3,WYL	Orphan	GCCTCATCGAACTGGTCGAAGAC	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,cas3,WYL	NA|119aa|up_9|NZ_LT962688.1_450710_451067_-,NA|139aa|up_8|NZ_LT962688.1_451071_451488_-,NA|89aa|up_7|NZ_LT962688.1_451484_451751_-,NA|512aa|up_3|NZ_LT962688.1_454469_456005_-,NA|120aa|up_2|NZ_LT962688.1_456016_456376_-,NA|129aa|up_1|NZ_LT962688.1_456388_456775_-,NA|409aa|up_0|NZ_LT962688.1_456780_458007_-,NA|240aa|down_1|NZ_LT962688.1_463723_464443_-,NA|474aa|down_2|NZ_LT962688.1_464570_465992_-,NA|258aa|down_3|NZ_LT962688.1_465996_466770_-,NA|173aa|down_4|NZ_LT962688.1_467007_467526_-,NA|183aa|down_5|NZ_LT962688.1_467529_468078_-,NA|151aa|down_6|NZ_LT962688.1_468081_468534_-	NA|119aa|up_9|NZ_LT962688.1_450710_451067_-	NA	NA|139aa|up_8|NZ_LT962688.1_451071_451488_-	NA	NA|89aa|up_7|NZ_LT962688.1_451484_451751_-	NA	NA|278aa|up_6|NZ_LT962688.1_451753_452587_-	pfam11860, Muraidase, N-acetylmuramidase	NA|119aa|up_5|NZ_LT962688.1_452601_452958_-	COG0797, RlpA, Lipoproteins [Cell envelope biogenesis, outer membrane]	NA|427aa|up_4|NZ_LT962688.1_453180_454461_+	pfam00754, F5_F8_type_C, F5/8 type C domain	NA|512aa|up_3|NZ_LT962688.1_454469_456005_-	NA	NA|120aa|up_2|NZ_LT962688.1_456016_456376_-	NA	NA|129aa|up_1|NZ_LT962688.1_456388_456775_-	NA	NA|409aa|up_0|NZ_LT962688.1_456780_458007_-	NA	NA|698aa|down_0|NZ_LT962688.1_461625_463719_-	cd00736, lambda_lys-like, Bacteriophage lambda lysozyme and similar proteins	NA|240aa|down_1|NZ_LT962688.1_463723_464443_-	NA	NA|474aa|down_2|NZ_LT962688.1_464570_465992_-	NA	NA|258aa|down_3|NZ_LT962688.1_465996_466770_-	NA	NA|173aa|down_4|NZ_LT962688.1_467007_467526_-	NA	NA|183aa|down_5|NZ_LT962688.1_467529_468078_-	NA	NA|151aa|down_6|NZ_LT962688.1_468081_468534_-	NA	NA|199aa|down_7|NZ_LT962688.1_468535_469132_-	COG1250, FadB, 3-hydroxyacyl-CoA dehydrogenase [Lipid metabolism]	NA|316aa|down_8|NZ_LT962688.1_471569_472516_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|714aa|down_9|NZ_LT962688.1_472637_474779_-	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella
GCF_900234795.1_TK0001_PRJEB23178_v1	NZ_LT962688	Methylorubrum extorquens strain TK 0001 chromosome TK0001	2	666539-666671	2	CRISPRCasFinder	no		DEDDh,csa3,cas3,WYL	Orphan	CGCCGATGAGCGTGTCGTCGCCCGC	25	0	0	NA	NA	NA	2	2	Orphan	DEDDh,csa3,cas3,WYL	NA|91aa|up_5|NZ_LT962688.1_658547_658820_-,NA|163aa|down_6|NZ_LT962688.1_673709_674198_+	NA|275aa|up_9|NZ_LT962688.1_652582_653407_-	COG3637, COG3637, Opacity protein and related surface antigens [Cell envelope biogenesis, outer membrane]	NA|912aa|up_8|NZ_LT962688.1_653627_656363_-	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|200aa|up_7|NZ_LT962688.1_656687_657287_-	PRK00317, mobA, molybdopterin-guanine dinucleotide biosynthesis protein MobA; Reviewed	NA|376aa|up_6|NZ_LT962688.1_657328_658456_-	pfam10609, ParA, NUBPL iron-transfer P-loop NTPase	NA|91aa|up_5|NZ_LT962688.1_658547_658820_-	NA	NA|168aa|up_4|NZ_LT962688.1_658969_659473_+	pfam16242, Pyrid_ox_like, Pyridoxamine 5'-phosphate oxidase like	NA|433aa|up_3|NZ_LT962688.1_659632_660931_+	PRK05321, PRK05321, nicotinate phosphoribosyltransferase; Provisional	NA|261aa|up_2|NZ_LT962688.1_660941_661724_+	cd10787, LamB_YcsF_like, LamB/YcsF family of  lactam utilization protein	NA|680aa|up_1|NZ_LT962688.1_661953_663993_+	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|264aa|up_0|NZ_LT962688.1_664048_664840_+	cd01641, Bacterial_IMPase_like_1, Predominantly bacterial family of Mg++ dependend phosphatases, related to inositol monophosphatases	NA|331aa|down_0|NZ_LT962688.1_668059_669052_-	COG2267, PldB, Lysophospholipase [Lipid metabolism]	NA|157aa|down_1|NZ_LT962688.1_669234_669705_+	PRK10743, PRK10743, heat shock chaperone IbpA	NA|138aa|down_2|NZ_LT962688.1_670122_670536_-	COG4101, COG4101, Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]	NA|290aa|down_3|NZ_LT962688.1_671205_672075_+	pfam11004, Kdo_hydroxy, 3-deoxy-D-manno-oct-2-ulosonic acid (Kdo) hydroxylase	NA|360aa|down_4|NZ_LT962688.1_672079_673159_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|99aa|down_5|NZ_LT962688.1_673201_673498_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|163aa|down_6|NZ_LT962688.1_673709_674198_+	NA	NA|99aa|down_7|NZ_LT962688.1_674403_674700_+	PRK06508, PRK06508, acyl carrier protein; Provisional	NA|155aa|down_8|NZ_LT962688.1_674728_675193_+	COG0764, FabA, 3-hydroxymyristoyl/3-hydroxydecanoyl-(acyl carrier protein) dehydratases [Lipid metabolism]	NA|382aa|down_9|NZ_LT962688.1_675189_676335_+	PRK06519, PRK06519, beta-ketoacyl-ACP synthase
GCF_900234795.1_TK0001_PRJEB23178_v1	NZ_LT962688	Methylorubrum extorquens strain TK 0001 chromosome TK0001	3	2566422-2566495	3	CRISPRCasFinder	no		DEDDh,csa3,cas3,WYL	Orphan	TGGCCGTGATGGTGATGGCGGTG	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,cas3,WYL	NA,NA|130aa|down_2|NZ_LT962688.1_2568940_2569330_-,NA|359aa|down_4|NZ_LT962688.1_2571779_2572856_+	NA|159aa|up_9|NZ_LT962688.1_2557873_2558350_+	cd19923, REC_CheY_CheY3, phosphoacceptor receiver (REC) domain of chemotaxis response regulator CheY3 and similar CheY family proteins	NA|316aa|up_8|NZ_LT962688.1_2558365_2559313_-	COG1409, Icc, Predicted phosphohydrolases [General function prediction only]	NA|250aa|up_7|NZ_LT962688.1_2559550_2560300_+	cd07983, LPLAT_DUF374-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: DUF374	NA|435aa|up_6|NZ_LT962688.1_2560296_2561601_+	PRK05749, PRK05749, 3-deoxy-D-manno-octulosonic-acid transferase; Reviewed	NA|328aa|up_5|NZ_LT962688.1_2561608_2562592_+	PRK00652, lpxK, tetraacyldisaccharide 4'-kinase; Reviewed	NA|75aa|up_4|NZ_LT962688.1_2562674_2562899_-	pfam09866, DUF2093, Uncharacterized protein conserved in bacteria (DUF2093)	NA|138aa|up_3|NZ_LT962688.1_2563043_2563457_+	TIGR01354, Cytidine_deaminase, cytidine deaminase, homotetrameric	NA|271aa|up_2|NZ_LT962688.1_2563446_2564259_+	PRK08202, PRK08202, purine nucleoside phosphorylase; Provisional	NA|258aa|up_1|NZ_LT962688.1_2564268_2565042_+	PRK05283, PRK05283, deoxyribose-phosphate aldolase; Provisional	NA|437aa|up_0|NZ_LT962688.1_2565038_2566349_+	PRK05820, deoA, thymidine phosphorylase; Reviewed	NA|219aa|down_0|NZ_LT962688.1_2566966_2567623_-	COG3963, COG3963, Phospholipid N-methyltransferase [Lipid metabolism]	NA|386aa|down_1|NZ_LT962688.1_2567781_2568939_-	PRK10767, PRK10767, chaperone protein DnaJ; Provisional	NA|130aa|down_2|NZ_LT962688.1_2568940_2569330_-	NA	NA|640aa|down_3|NZ_LT962688.1_2569514_2571434_-	PRK00290, dnaK, molecular chaperone DnaK; Provisional	NA|359aa|down_4|NZ_LT962688.1_2571779_2572856_+	NA	NA|91aa|down_5|NZ_LT962688.1_2576141_2576414_+	COG4298, COG4298, Uncharacterized protein conserved in bacteria [Function unknown]	NA|98aa|down_6|NZ_LT962688.1_2576497_2576791_+	PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated	NA|567aa|down_7|NZ_LT962688.1_2576787_2578488_-	COG0497, RecN, ATPase involved in DNA repair [DNA replication, recombination, and repair]	NA|329aa|down_8|NZ_LT962688.1_2578643_2579630_+	pfam13704, Glyco_tranf_2_4, Glycosyl transferase family 2	NA|292aa|down_9|NZ_LT962688.1_2579909_2580785_-	TIGR03302, OM_YfiO, outer membrane assembly lipoprotein YfiO
GCF_900234795.1_TK0001_PRJEB23178_v1	NZ_LT962688	Methylorubrum extorquens strain TK 0001 chromosome TK0001	4	3041823-3041975	1	PILER-CR	no		DEDDh,csa3,cas3,WYL	Orphan	GGCAACGACACGATCTACGGTCAGGACGG	29	0	0	NA	NA	NA	2	2	Orphan	DEDDh,csa3,cas3,WYL	NA|97aa|up_9|NZ_LT962688.1_3032456_3032747_+,NA|48aa|up_3|NZ_LT962688.1_3038390_3038534_-,NA|95aa|up_2|NZ_LT962688.1_3038743_3039028_-,NA|79aa|down_0|NZ_LT962688.1_3042922_3043159_-	NA|97aa|up_9|NZ_LT962688.1_3032456_3032747_+	NA	NA|142aa|up_8|NZ_LT962688.1_3032753_3033179_-	PRK13952, mscL, large conductance mechanosensitive channel protein MscL	NA|236aa|up_7|NZ_LT962688.1_3033619_3034327_+	PRK12547, PRK12547, RNA polymerase sigma factor; Provisional	NA|816aa|up_6|NZ_LT962688.1_3034390_3036838_-	PRK14939, gyrB, DNA gyrase subunit B; Provisional	NA|215aa|up_5|NZ_LT962688.1_3037076_3037721_-	cd07737, YcbL-like_MBL-fold, Salmonella enterica serovar typhimurium YcbL and related proteins; MBL-fold metallo hydrolase domain	NA|140aa|up_4|NZ_LT962688.1_3037885_3038305_-	COG1607, COG1607, Acyl-CoA hydrolase [Lipid metabolism]	NA|48aa|up_3|NZ_LT962688.1_3038390_3038534_-	NA	NA|95aa|up_2|NZ_LT962688.1_3038743_3039028_-	NA	NA|62aa|up_1|NZ_LT962688.1_3039154_3039340_-	COG3422, COG3422, Uncharacterized conserved protein [Function unknown]	NA|511aa|up_0|NZ_LT962688.1_3039514_3041047_-	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|79aa|down_0|NZ_LT962688.1_3042922_3043159_-	NA	NA|283aa|down_1|NZ_LT962688.1_3043431_3044280_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|571aa|down_2|NZ_LT962688.1_3044467_3046180_+	COG5360, COG5360, Uncharacterized protein conserved in bacteria [Function unknown]	NA|533aa|down_3|NZ_LT962688.1_3046387_3047986_+	PRK00881, purH, bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase; Provisional	NA|579aa|down_4|NZ_LT962688.1_3048198_3049935_-	cd17369, MFS_ShiA_like, Shikimate transporter and similar proteins of the Major Facilitator Superfamily	NA|562aa|down_5|NZ_LT962688.1_3050193_3051879_-	PRK09395, actP, cation/acetate symporter ActP	NA|129aa|down_6|NZ_LT962688.1_3051875_3052262_-	pfam04341, DUF485, Protein of unknown function, DUF485	NA|650aa|down_7|NZ_LT962688.1_3052473_3054423_-	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|847aa|down_8|NZ_LT962688.1_3054682_3057223_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|159aa|down_9|NZ_LT962688.1_3057464_3057941_+	TIGR01985, kDa_protein, phasin
