assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_003072605.1_ASM307260v1	NZ_CP026736	Bacillus megaterium strain YC4-R4 chromosome, complete genome	1	2778610-2778747	1	CRT	no		DEDDh,cas3,DinG,csa3,WYL	Orphan	AGTAGAACGACGAGTAGT	18	2	13	2778628-2778645|2778628-2778645|2778628-2778645|2778628-2778645|2778628-2778645|2778628-2778645|2778712-2778729|2778712-2778729|2778712-2778729|2778712-2778729|2778712-2778729|2778712-2778729|2778712-2778729	NZ_CP026736.1_2981641-2981658|NZ_CP026736.1_2981473-2981490|NZ_CP026736.1_2981494-2981511|NZ_CP026736.1_2981515-2981532|NZ_CP026736.1_2981536-2981553|NZ_CP026736.1_2981662-2981679|NZ_CP026736.1_2981557-2981574|NZ_CP026736.1_2981599-2981616|NZ_CP026736.1_2981473-2981490|NZ_CP026736.1_2981494-2981511|NZ_CP026736.1_2981515-2981532|NZ_CP026736.1_2981536-2981553|NZ_CP026736.1_2981662-2981679	NA	3	3	Orphan	DEDDh,cas3,DinG,csa3,WYL	NA|81aa|up_2|NZ_CP026736.1_2775203_2775446_+,NA|65aa|down_2|NZ_CP026736.1_2780985_2781180_+	NA|149aa|up_9|NZ_CP026736.1_2768705_2769152_+	pfam10710, DUF2512, Protein of unknown function (DUF2512)	NA|184aa|up_8|NZ_CP026736.1_2769428_2769980_-	TIGR02227, Inactive_signal_peptidase_IA	NA|112aa|up_7|NZ_CP026736.1_2770170_2770506_-	COG5416, COG5416, Uncharacterized integral membrane protein [Function unknown]	NA|251aa|up_6|NZ_CP026736.1_2770796_2771549_-	pfam01863, DUF45, Protein of unknown function DUF45	NA|415aa|up_5|NZ_CP026736.1_2771810_2773055_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|169aa|up_4|NZ_CP026736.1_2773261_2773768_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|385aa|up_3|NZ_CP026736.1_2773815_2774970_+	cd05669, M20_Acy1_YxeP-like, M20 Peptidase aminoacyclase-1 YxeP-like proteins, including YxeP, YtnL, YjiB and HipO2	NA|81aa|up_2|NZ_CP026736.1_2775203_2775446_+	NA	NA|538aa|up_1|NZ_CP026736.1_2775912_2777526_+	cd05654, M20_ArgE_RocB, M20 Peptidase arginine utilization protein, RocB	NA|266aa|up_0|NZ_CP026736.1_2777544_2778342_-	pfam13468, Glyoxalase_3, Glyoxalase-like domain	NA|149aa|down_0|NZ_CP026736.1_2778962_2779409_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|390aa|down_1|NZ_CP026736.1_2779641_2780811_-	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|65aa|down_2|NZ_CP026736.1_2780985_2781180_+	NA	NA|434aa|down_3|NZ_CP026736.1_2781211_2782513_-	COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide transport and metabolism]	NA|317aa|down_4|NZ_CP026736.1_2782732_2783683_-	COG3290, CitA, Signal transduction histidine kinase regulating citrate/malate metabolism [Signal transduction mechanisms]	NA|233aa|down_5|NZ_CP026736.1_2783700_2784399_-	COG3279, LytT, Response regulator of the LytR/AlgR family [Transcription / Signal transduction mechanisms]	NA|246aa|down_6|NZ_CP026736.1_2784574_2785312_+	cd03266, ABC_NatA_sodium_exporter, ATP-binding cassette domain of the Na+ transporter	NA|389aa|down_7|NZ_CP026736.1_2785313_2786480_+	COG1668, NatB, ABC-type Na+ efflux pump, permease component [Energy production and conversion / Inorganic ion transport and metabolism]	NA|261aa|down_8|NZ_CP026736.1_2786794_2787577_+	pfam01636, APH, Phosphotransferase enzyme family	NA|167aa|down_9|NZ_CP026736.1_2787713_2788214_+	PRK13182, racA, chromosome-anchoring protein RacA
GCF_003072605.1_ASM307260v1	NZ_CP026736	Bacillus megaterium strain YC4-R4 chromosome, complete genome	2	4442242-4442391	1	CRISPRCasFinder	no		DEDDh,cas3,DinG,csa3,WYL	Orphan	CTTTTGCCGTTTTTGACGTACTACT	25	0	0	NA	NA	NA	2	2	Orphan	DEDDh,cas3,DinG,csa3,WYL	NA|48aa|up_6|NZ_CP026736.1_4434197_4434341_+,NA|488aa|up_0|NZ_CP026736.1_4440689_4442153_+,NA|341aa|down_0|NZ_CP026736.1_4443122_4444145_+,NA|140aa|down_4|NZ_CP026736.1_4447379_4447799_-,NA|164aa|down_5|NZ_CP026736.1_4448206_4448698_+,NA|279aa|down_6|NZ_CP026736.1_4448943_4449780_-	NA|277aa|up_9|NZ_CP026736.1_4431695_4432526_-	COG1284, COG1284, Uncharacterized conserved protein [Function unknown]	NA|187aa|up_8|NZ_CP026736.1_4432725_4433286_-	sd00006, TPR, Tetratricopeptide repeat	NA|185aa|up_7|NZ_CP026736.1_4433483_4434038_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|48aa|up_6|NZ_CP026736.1_4434197_4434341_+	NA	NA|275aa|up_5|NZ_CP026736.1_4434407_4435232_-	smart00318, SNc, Staphylococcal nuclease homologues	NA|220aa|up_4|NZ_CP026736.1_4435632_4436292_-	PLN00052, PLN00052, prolyl 4-hydroxylase; Provisional	NA|501aa|up_3|NZ_CP026736.1_4436543_4438046_-	pfam06039, Mqo, Malate:quinone oxidoreductase (Mqo)	NA|275aa|up_2|NZ_CP026736.1_4438166_4438991_-	pfam03618, Kinase-PPPase, Kinase/pyrophosphorylase	NA|294aa|up_1|NZ_CP026736.1_4439494_4440376_-	COG0561, Cof, Predicted hydrolases of the HAD superfamily [General function prediction only]	NA|488aa|up_0|NZ_CP026736.1_4440689_4442153_+	NA	NA|341aa|down_0|NZ_CP026736.1_4443122_4444145_+	NA	NA|199aa|down_1|NZ_CP026736.1_4444988_4445585_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|193aa|down_2|NZ_CP026736.1_4445587_4446166_-	cd07523, HAD_YsbA-like, uncharacterized family of the haloacid dehalogenase-like superfamily, similar to the uncharacterized Lactococcus lactis YsbA	NA|338aa|down_3|NZ_CP026736.1_4446293_4447307_-	COG4278, COG4278, Uncharacterized conserved protein [Function unknown]	NA|140aa|down_4|NZ_CP026736.1_4447379_4447799_-	NA	NA|164aa|down_5|NZ_CP026736.1_4448206_4448698_+	NA	NA|279aa|down_6|NZ_CP026736.1_4448943_4449780_-	NA	NA|411aa|down_7|NZ_CP026736.1_4449899_4451132_-	pfam02073, Peptidase_M29, Thermophilic metalloprotease (M29)	NA|265aa|down_8|NZ_CP026736.1_4451275_4452070_-	COG1414, IclR, Transcriptional regulator [Transcription]	NA|259aa|down_9|NZ_CP026736.1_4452122_4452899_-	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]
GCF_003072605.1_ASM307260v1	NZ_CP026736	Bacillus megaterium strain YC4-R4 chromosome, complete genome	3	4442584-4442670	2	CRISPRCasFinder	no		DEDDh,cas3,DinG,csa3,WYL	Orphan	CTTTTGCCGTTTTTGACGTACTACT	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,DinG,csa3,WYL	NA|48aa|up_6|NZ_CP026736.1_4434197_4434341_+,NA|488aa|up_0|NZ_CP026736.1_4440689_4442153_+,NA|341aa|down_0|NZ_CP026736.1_4443122_4444145_+,NA|140aa|down_4|NZ_CP026736.1_4447379_4447799_-,NA|164aa|down_5|NZ_CP026736.1_4448206_4448698_+,NA|279aa|down_6|NZ_CP026736.1_4448943_4449780_-	NA|277aa|up_9|NZ_CP026736.1_4431695_4432526_-	COG1284, COG1284, Uncharacterized conserved protein [Function unknown]	NA|187aa|up_8|NZ_CP026736.1_4432725_4433286_-	sd00006, TPR, Tetratricopeptide repeat	NA|185aa|up_7|NZ_CP026736.1_4433483_4434038_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|48aa|up_6|NZ_CP026736.1_4434197_4434341_+	NA	NA|275aa|up_5|NZ_CP026736.1_4434407_4435232_-	smart00318, SNc, Staphylococcal nuclease homologues	NA|220aa|up_4|NZ_CP026736.1_4435632_4436292_-	PLN00052, PLN00052, prolyl 4-hydroxylase; Provisional	NA|501aa|up_3|NZ_CP026736.1_4436543_4438046_-	pfam06039, Mqo, Malate:quinone oxidoreductase (Mqo)	NA|275aa|up_2|NZ_CP026736.1_4438166_4438991_-	pfam03618, Kinase-PPPase, Kinase/pyrophosphorylase	NA|294aa|up_1|NZ_CP026736.1_4439494_4440376_-	COG0561, Cof, Predicted hydrolases of the HAD superfamily [General function prediction only]	NA|488aa|up_0|NZ_CP026736.1_4440689_4442153_+	NA	NA|341aa|down_0|NZ_CP026736.1_4443122_4444145_+	NA	NA|199aa|down_1|NZ_CP026736.1_4444988_4445585_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|193aa|down_2|NZ_CP026736.1_4445587_4446166_-	cd07523, HAD_YsbA-like, uncharacterized family of the haloacid dehalogenase-like superfamily, similar to the uncharacterized Lactococcus lactis YsbA	NA|338aa|down_3|NZ_CP026736.1_4446293_4447307_-	COG4278, COG4278, Uncharacterized conserved protein [Function unknown]	NA|140aa|down_4|NZ_CP026736.1_4447379_4447799_-	NA	NA|164aa|down_5|NZ_CP026736.1_4448206_4448698_+	NA	NA|279aa|down_6|NZ_CP026736.1_4448943_4449780_-	NA	NA|411aa|down_7|NZ_CP026736.1_4449899_4451132_-	pfam02073, Peptidase_M29, Thermophilic metalloprotease (M29)	NA|265aa|down_8|NZ_CP026736.1_4451275_4452070_-	COG1414, IclR, Transcriptional regulator [Transcription]	NA|259aa|down_9|NZ_CP026736.1_4452122_4452899_-	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]
GCF_003072605.1_ASM307260v1	NZ_CP026740	Bacillus megaterium strain YC4-R4 plasmid unnamed4	1	29210-29325	1	CRISPRCasFinder	no			Orphan	TGAGGCCCCTGAGGTCCCTGAGGCCCTTGAGATCCTTGAGGCCC	44	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,DinG,csa3,WYL	NA,NA|159aa|down_0|NZ_CP026740.1_30106_30583_+,NA|171aa|down_1|NZ_CP026740.1_31011_31524_+,NA|187aa|down_2|NZ_CP026740.1_31724_32285_-,NA|50aa|down_7|NZ_CP026740.1_36863_37013_-	NA|192aa|up_9|NZ_CP026740.1_5260_5836_-	cd04764, HTH_MlrA-like_sg1, Helix-Turn-Helix DNA binding domain of putative MlrA-like transcription regulators	NA|311aa|up_8|NZ_CP026740.1_6871_7804_-	PRK00236, xerC, site-specific tyrosine recombinase XerC; Reviewed	NA|463aa|up_7|NZ_CP026740.1_7982_9371_-	pfam01051, Rep_3, Initiator Replication protein	NA|298aa|up_6|NZ_CP026740.1_10363_11257_+	pfam13730, HTH_36, Helix-turn-helix domain	NA|4604aa|up_5|NZ_CP026740.1_11433_25245_+	NF012211, tand_rpt_95, tandem-95 repeat protein	NA|176aa|up_4|NZ_CP026740.1_25397_25925_+	PRK08118, PRK08118, DNA topology modulation protein	NA|163aa|up_3|NZ_CP026740.1_26345_26834_-	pfam07552, Coat_X, Spore Coat Protein X and V domain	NA|234aa|up_2|NZ_CP026740.1_26942_27644_-	pfam08795, DUF1796, Putative papain-like cysteine peptidase (DUF1796)	NA|135aa|up_1|NZ_CP026740.1_27845_28250_-	pfam13799, DUF4183, Domain of unknown function (DUF4183)	NA|95aa|up_0|NZ_CP026740.1_28568_28853_+	pfam13799, DUF4183, Domain of unknown function (DUF4183)	NA|159aa|down_0|NZ_CP026740.1_30106_30583_+	NA	NA|171aa|down_1|NZ_CP026740.1_31011_31524_+	NA	NA|187aa|down_2|NZ_CP026740.1_31724_32285_-	NA	NA|352aa|down_3|NZ_CP026740.1_32425_33481_-	pfam02958, EcKinase, Ecdysteroid kinase	NA|392aa|down_4|NZ_CP026740.1_33486_34662_-	COG3919, COG3919, Predicted ATP-grasp enzyme [General function prediction only]	NA|435aa|down_5|NZ_CP026740.1_34665_35970_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|219aa|down_6|NZ_CP026740.1_35966_36623_-	COG2120, COG2120, Uncharacterized proteins, LmbE homologs [Function unknown]	NA|50aa|down_7|NZ_CP026740.1_36863_37013_-	NA	NA|174aa|down_8|NZ_CP026740.1_37404_37926_+	pfam00069, Pkinase, Protein kinase domain	NA|274aa|down_9|NZ_CP026740.1_38068_38890_-	COG3391, COG3391, Uncharacterized conserved protein [Function unknown]
