assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_006385935.1_ASM638593v1	NZ_CP041066	Bacillus megaterium strain KNU-01 chromosome, complete genome	1	885939-886098	1	CRISPRCasFinder	no		DEDDh,cas3,DinG,csa3,WYL,RT	Orphan	CAAAATGCTCAAGCATCTAAAAACAACTTCGGTACTGAATTTGGTAGCGAAACAA	55	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,DinG,csa3,WYL,RT	NA|75aa|up_1|NZ_CP041066.1_884693_884918_-,NA|131aa|down_8|NZ_CP041066.1_895221_895614_+	NA|301aa|up_9|NZ_CP041066.1_878492_879395_-	TIGR01777, Hypothetical_UPF0105_protein_Rv2216/MT2273/Mb2239	NA|270aa|up_8|NZ_CP041066.1_879491_880301_+	PRK14135, recX, recombination regulator RecX; Provisional	NA|218aa|up_7|NZ_CP041066.1_880591_881245_-	cd08939, KDSR-like_SDR_c, 3-ketodihydrosphingosine reductase (KDSR) and related proteins, classical (c) SDR	NA|105aa|up_6|NZ_CP041066.1_881327_881642_+	pfam08838, DUF1811, Protein of unknown function (DUF1811)	NA|52aa|up_5|NZ_CP041066.1_881720_881876_-	pfam08176, SspK, Small acid-soluble spore protein K family	NA|90aa|up_4|NZ_CP041066.1_882011_882281_+	pfam14043, WVELL, WVELL protein	NA|327aa|up_3|NZ_CP041066.1_882344_883325_-	pfam04307, YdjM, LexA-binding, inner membrane-associated putative hydrolase	NA|365aa|up_2|NZ_CP041066.1_883564_884659_+	COG1194, MutY, A/G-specific DNA glycosylase [DNA replication, recombination, and repair]	NA|75aa|up_1|NZ_CP041066.1_884693_884918_-	NA	NA|250aa|up_0|NZ_CP041066.1_885034_885784_+	PRK08063, PRK08063, enoyl-[acyl-carrier-protein] reductase FabL	NA|87aa|down_0|NZ_CP041066.1_886364_886625_+	pfam14182, YgaB, YgaB-like protein	NA|179aa|down_1|NZ_CP041066.1_886820_887357_+	PRK13662, PRK13662, hypothetical protein; Provisional	NA|584aa|down_2|NZ_CP041066.1_887526_889278_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|363aa|down_3|NZ_CP041066.1_889313_890402_-	COG4129, COG4129, Predicted membrane protein [Function unknown]	NA|433aa|down_4|NZ_CP041066.1_891035_892334_-	PRK12389, PRK12389, glutamate-1-semialdehyde aminotransferase; Provisional	NA|338aa|down_5|NZ_CP041066.1_892536_893550_+	cd03267, ABC_NatA_like, ATP-binding cassette domain of an uncharacterized transporter similar in sequence to NatA	NA|264aa|down_6|NZ_CP041066.1_893542_894334_+	COG4587, COG4587, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|262aa|down_7|NZ_CP041066.1_894343_895129_+	COG3694, COG3694, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|131aa|down_8|NZ_CP041066.1_895221_895614_+	NA	NA|157aa|down_9|NZ_CP041066.1_895683_896154_+	COG1225, Bcp, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]
GCF_006385935.1_ASM638593v1	NZ_CP041066	Bacillus megaterium strain KNU-01 chromosome, complete genome	2	910839-911161	1,1	CRT,PILER-CR	no		DEDDh,cas3,DinG,csa3,WYL,RT	Orphan	NNCNNTATATTATGAAGN,TATATTATGAAGAAACTTT	18,19	0	0	NA	NA	NA:NA	5,3	5	Orphan	DEDDh,cas3,DinG,csa3,WYL,RT	NA|131aa|up_9|NZ_CP041066.1_895221_895614_+,NA|163aa|up_2|NZ_CP041066.1_907220_907709_+,NA|315aa|down_0|NZ_CP041066.1_912140_913085_+,NA|111aa|down_1|NZ_CP041066.1_913086_913419_+,NA|91aa|down_3|NZ_CP041066.1_913739_914012_+,NA|246aa|down_4|NZ_CP041066.1_914045_914783_+,NA|164aa|down_5|NZ_CP041066.1_914782_915274_+,NA|101aa|down_8|NZ_CP041066.1_916550_916853_+,NA|194aa|down_9|NZ_CP041066.1_917012_917594_+	NA|131aa|up_9|NZ_CP041066.1_895221_895614_+	NA	NA|157aa|up_8|NZ_CP041066.1_895683_896154_+	COG1225, Bcp, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]	NA|145aa|up_7|NZ_CP041066.1_896408_896843_+	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|120aa|up_6|NZ_CP041066.1_896865_897225_-	pfam11023, DUF2614, Zinc-ribbon containing domain	NA|289aa|up_5|NZ_CP041066.1_897446_898313_+	pfam14540, NTF-like, Nucleotidyltransferase-like	NA|326aa|up_4|NZ_CP041066.1_905237_906215_+	COG4974, XerD, Site-specific recombinase XerD [DNA replication, recombination, and repair]	NA|266aa|up_3|NZ_CP041066.1_906392_907190_+	pfam13730, HTH_36, Helix-turn-helix domain	NA|163aa|up_2|NZ_CP041066.1_907220_907709_+	NA	NA|281aa|up_1|NZ_CP041066.1_908865_909708_-	cd10443, GIY-YIG_HE_Tlr8p_PBC-V_like, GIY-YIG domain of uncharacterized hypothetical protein found in phycodnavirus PBCV-1 DNA virus, T	NA|88aa|up_0|NZ_CP041066.1_909731_909995_-	COG2944, COG2944, Predicted transcriptional regulator [Transcription]	NA|315aa|down_0|NZ_CP041066.1_912140_913085_+	NA	NA|111aa|down_1|NZ_CP041066.1_913086_913419_+	NA	NA|82aa|down_2|NZ_CP041066.1_913504_913750_+	TIGR03830, transcriptional_regulator_XRE_family, putative zinc finger/helix-turn-helix protein, YgiT family	NA|91aa|down_3|NZ_CP041066.1_913739_914012_+	NA	NA|246aa|down_4|NZ_CP041066.1_914045_914783_+	NA	NA|164aa|down_5|NZ_CP041066.1_914782_915274_+	NA	NA|147aa|down_6|NZ_CP041066.1_915600_916041_+	pfam13022, HTH_Tnp_1_2, Helix-turn-helix of insertion element transposase	NA|127aa|down_7|NZ_CP041066.1_916109_916490_+	pfam06199, Phage_tail_2, Phage tail tube protein	NA|101aa|down_8|NZ_CP041066.1_916550_916853_+	NA	NA|194aa|down_9|NZ_CP041066.1_917012_917594_+	NA
GCF_006385935.1_ASM638593v1	NZ_CP041066	Bacillus megaterium strain KNU-01 chromosome, complete genome	3	1492011-1492106	2	CRISPRCasFinder	no		DEDDh,cas3,DinG,csa3,WYL,RT	Orphan	TGTAACTATAATGGTTACAACAA	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,DinG,csa3,WYL,RT	NA,NA|60aa|down_4|NZ_CP041066.1_1495783_1495963_+	NA|319aa|up_9|NZ_CP041066.1_1476940_1477897_+	cd07938, DRE_TIM_HMGL, 3-hydroxy-3-methylglutaryl-CoA lyase, catalytic TIM barrel domain	NA|483aa|up_8|NZ_CP041066.1_1478047_1479496_+	cd07149, ALDH_y4uC, Uncharacterized ALDH (y4uC) with similarity to Tortula ruralis aldehyde dehydrogenase ALDH21A1	NA|292aa|up_7|NZ_CP041066.1_1479587_1480463_+	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]	NA|584aa|up_6|NZ_CP041066.1_1480715_1482467_-	PRK08186, PRK08186, allophanate hydrolase; Provisional	NA|1204aa|up_5|NZ_CP041066.1_1482481_1486093_-	TIGR02712, Includes:_Allophanate_hydrolase, urea carboxylase	NA|218aa|up_4|NZ_CP041066.1_1486109_1486763_-	TIGR03424, urea_degr_1, urea carboxylase-associated protein 1	NA|246aa|up_3|NZ_CP041066.1_1486783_1487521_-	TIGR03425, urea_degr_2, urea carboxylase-associated protein 2	NA|450aa|up_2|NZ_CP041066.1_1487797_1489147_-	TIGR00909, putative_amino_acid_transporter, amino acid transporter	NA|443aa|up_1|NZ_CP041066.1_1490130_1491459_+	cd01293, Bact_CD, Bacterial cytosine deaminase and related metal-dependent hydrolases	NA|66aa|up_0|NZ_CP041066.1_1491506_1491704_-	COG1278, CspC, Cold shock proteins [Transcription]	NA|211aa|down_0|NZ_CP041066.1_1492127_1492760_+	COG2910, COG2910, Putative NADH-flavin reductase [General function prediction only]	NA|329aa|down_1|NZ_CP041066.1_1492932_1493919_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|181aa|down_2|NZ_CP041066.1_1494531_1495074_+	cd03015, PRX_Typ2cys, Peroxiredoxin (PRX) family, Typical 2-Cys PRX subfamily; PRXs are thiol-specific antioxidant (TSA) proteins, which confer a protective role in cells through its peroxidase activity by reducing hydrogen peroxide, peroxynitrite, and organic hydroperoxides	NA|148aa|down_3|NZ_CP041066.1_1495191_1495635_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|60aa|down_4|NZ_CP041066.1_1495783_1495963_+	NA	NA|204aa|down_5|NZ_CP041066.1_1496160_1496772_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|49aa|down_6|NZ_CP041066.1_1496764_1496911_+	pfam12841, YvrJ, YvrJ protein family	NA|76aa|down_7|NZ_CP041066.1_1496915_1497143_+	PRK14082, PRK14082, hypothetical protein; Provisional	NA|475aa|down_8|NZ_CP041066.1_1497624_1499049_+	COG3706, PleD, Response regulator containing a CheY-like receiver domain and a GGDEF domain [Signal transduction mechanisms]	NA|58aa|down_9|NZ_CP041066.1_1499077_1499251_-	PRK14861, tatA, twin arginine translocase protein A; Provisional
GCF_006385935.1_ASM638593v1	NZ_CP041066	Bacillus megaterium strain KNU-01 chromosome, complete genome	4	1592713-1592793	3	CRISPRCasFinder	no	csa3	DEDDh,cas3,DinG,csa3,WYL,RT	Type I-A	ATAATGAATGATAGTCATTCATT	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,DinG,csa3,WYL,RT	NA|73aa|up_6|NZ_CP041066.1_1584420_1584639_+,NA|79aa|up_5|NZ_CP041066.1_1584635_1584872_+,NA|93aa|up_1|NZ_CP041066.1_1589425_1589704_-,NA	csa3|88aa|up_9|NZ_CP041066.1_1582054_1582318_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|213aa|up_8|NZ_CP041066.1_1582318_1582957_+	COG5658, COG5658, Predicted integral membrane protein [Function unknown]	NA|331aa|up_7|NZ_CP041066.1_1583105_1584098_-	pfam11553, DUF3231, Protein of unknown function (DUF3231)	NA|73aa|up_6|NZ_CP041066.1_1584420_1584639_+	NA	NA|79aa|up_5|NZ_CP041066.1_1584635_1584872_+	NA	NA|844aa|up_4|NZ_CP041066.1_1585177_1587709_+	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|215aa|up_3|NZ_CP041066.1_1587766_1588411_+	PRK09347, folE, GTP cyclohydrolase I; Provisional	NA|310aa|up_2|NZ_CP041066.1_1588459_1589389_+	COG0492, TrxB, Thioredoxin reductase [Posttranslational modification, protein turnover, chaperones]	NA|93aa|up_1|NZ_CP041066.1_1589425_1589704_-	NA	NA|815aa|up_0|NZ_CP041066.1_1589910_1592355_+	cd01948, EAL, EAL domain	NA|460aa|down_0|NZ_CP041066.1_1592860_1594240_+	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|291aa|down_1|NZ_CP041066.1_1594292_1595165_+	cd07573, CPA, N-carbamoylputrescine amidohydrolase (CPA) (class 11 nitrilases)	NA|388aa|down_2|NZ_CP041066.1_1595161_1596325_+	cd09996, HDAC_classII_1, Histone deacetylases and histone-like deacetylases, classII	NA|243aa|down_3|NZ_CP041066.1_1596891_1597620_-	PRK12804, PRK12804, flagellin; Provisional	NA|214aa|down_4|NZ_CP041066.1_1597849_1598491_-	PRK13197, PRK13197, pyrrolidone-carboxylate peptidase; Provisional	NA|322aa|down_5|NZ_CP041066.1_1598505_1599471_-	pfam06166, DUF979, Protein of unknown function (DUF979)	NA|235aa|down_6|NZ_CP041066.1_1599471_1600176_-	pfam06149, DUF969, Protein of unknown function (DUF969)	NA|304aa|down_7|NZ_CP041066.1_1600552_1601464_+	pfam12773, DZR, Double zinc ribbon	NA|490aa|down_8|NZ_CP041066.1_1601528_1602998_+	COG4640, COG4640, Predicted membrane protein [Function unknown]	NA|186aa|down_9|NZ_CP041066.1_1603105_1603663_-	pfam04982, HPP, HPP family
GCF_006385935.1_ASM638593v1	NZ_CP041066	Bacillus megaterium strain KNU-01 chromosome, complete genome	5	1922750-1923160	2	CRT	no	DinG,cas3	DEDDh,cas3,DinG,csa3,WYL,RT	Unclear	CNNCGAGNAGAACGGCGA	18	1	1	1922957-1922986	NZ_CP041066.1_1719876-1719905	NA	9	9	Unclear	DEDDh,cas3,DinG,csa3,WYL,RT	NA|167aa|up_7|NZ_CP041066.1_1914805_1915306_+,NA|85aa|up_3|NZ_CP041066.1_1920260_1920515_+,NA|384aa|down_1|NZ_CP041066.1_1924653_1925805_+,NA|221aa|down_2|NZ_CP041066.1_1925869_1926532_+,NA|89aa|down_5|NZ_CP041066.1_1929234_1929501_-	NA|236aa|up_9|NZ_CP041066.1_1913299_1914007_+	COG3935, DnaD, Putative primosome component and related proteins [DNA replication, recombination, and repair]	NA|224aa|up_8|NZ_CP041066.1_1914147_1914819_+	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|167aa|up_7|NZ_CP041066.1_1914805_1915306_+	NA	NA|949aa|up_6|NZ_CP041066.1_1915341_1918188_-	TIGR02074, Includes:_Penicillin-insensitive_transglycosylase, penicillin-binding protein, 1A family	NA|200aa|up_5|NZ_CP041066.1_1918251_1918851_-	PRK02234, recU, Holliday junction-specific endonuclease; Reviewed	NA|332aa|up_4|NZ_CP041066.1_1918924_1919920_+	pfam10720, DUF2515, Protein of unknown function (DUF2515)	NA|85aa|up_3|NZ_CP041066.1_1920260_1920515_+	NA	NA|117aa|up_2|NZ_CP041066.1_1920560_1920911_+	pfam08807, DUF1798, Bacterial domain of unknown function (DUF1798)	NA|65aa|up_1|NZ_CP041066.1_1921563_1921758_-	pfam14178, YppF, YppF-like protein	NA|138aa|up_0|NZ_CP041066.1_1921987_1922401_+	pfam14179, YppG, YppG-like protein	NA|362aa|down_0|NZ_CP041066.1_1923437_1924523_+	COG5337, CotH, Spore coat assembly protein [Cell envelope biogenesis, outer membrane]	NA|384aa|down_1|NZ_CP041066.1_1924653_1925805_+	NA	NA|221aa|down_2|NZ_CP041066.1_1925869_1926532_+	NA	NA|382aa|down_3|NZ_CP041066.1_1926888_1928034_+	pfam01053, Cys_Met_Meta_PP, Cys/Met metabolism PLP-dependent enzyme	NA|394aa|down_4|NZ_CP041066.1_1928017_1929199_+	pfam01053, Cys_Met_Meta_PP, Cys/Met metabolism PLP-dependent enzyme	NA|89aa|down_5|NZ_CP041066.1_1929234_1929501_-	NA	NA|220aa|down_6|NZ_CP041066.1_1929563_1930223_-	COG2968, COG2968, Uncharacterized conserved protein [Function unknown]	NA|493aa|down_7|NZ_CP041066.1_1930455_1931934_-	TIGR02121, Osmoregulated_proline_transporter, sodium/proline symporter	NA|182aa|down_8|NZ_CP041066.1_1932226_1932772_-	TIGR02898, Uncharacterized_lipoprotein_YlaJ, sporulation lipoprotein, YhcN/YlaJ family	NA|68aa|down_9|NZ_CP041066.1_1933337_1933541_+	COG1278, CspC, Cold shock proteins [Transcription]
GCF_006385935.1_ASM638593v1	NZ_CP041066	Bacillus megaterium strain KNU-01 chromosome, complete genome	6	2634324-2634413	4	CRISPRCasFinder	no		DEDDh,cas3,DinG,csa3,WYL,RT	Orphan	TCAAATGCAGCCGCCAAGCGGTTC	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,DinG,csa3,WYL,RT	NA|89aa|up_9|NZ_CP041066.1_2625362_2625629_-,NA|159aa|up_1|NZ_CP041066.1_2631266_2631743_-,NA	NA|89aa|up_9|NZ_CP041066.1_2625362_2625629_-	NA	NA|475aa|up_8|NZ_CP041066.1_2625801_2627226_+	cd09111, PLDc_ymdC_like_1, Putative catalytic domain, repeat 1, of Escherichia coli uncharacterized protein ymdC and similar proteins	NA|177aa|up_7|NZ_CP041066.1_2627322_2627853_+	COG3981, COG3981, Predicted acetyltransferase [General function prediction only]	NA|94aa|up_6|NZ_CP041066.1_2627889_2628171_-	cd04706, PLA2_plant, PLA2_plant: Plant-specific sub-family of  Phospholipase A2, a super-family of secretory and cytosolic enzymes; the latter are either Ca dependent or Ca independent	NA|144aa|up_5|NZ_CP041066.1_2628277_2628709_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|135aa|up_4|NZ_CP041066.1_2628902_2629307_+	PRK01658, PRK01658, CidA/LrgA family holin-like protein	NA|231aa|up_3|NZ_CP041066.1_2629276_2629969_+	pfam04172, LrgB, LrgB-like family	NA|291aa|up_2|NZ_CP041066.1_2630240_2631113_+	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|159aa|up_1|NZ_CP041066.1_2631266_2631743_-	NA	NA|191aa|up_0|NZ_CP041066.1_2631811_2632384_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|322aa|down_0|NZ_CP041066.1_2634910_2635876_+	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|296aa|down_1|NZ_CP041066.1_2636268_2637156_+	COG1210, GalU, UDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane]	NA|225aa|down_2|NZ_CP041066.1_2637300_2637975_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|460aa|down_3|NZ_CP041066.1_2637976_2639356_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|55aa|down_4|NZ_CP041066.1_2639728_2639893_+	pfam10055, DUF2292, Uncharacterized small protein (DUF2292)	NA|51aa|down_5|NZ_CP041066.1_2640235_2640388_+	pfam13076, Fur_reg_FbpA, Fur-regulated basic protein A	NA|355aa|down_6|NZ_CP041066.1_2640640_2641705_+	COG1613, Sbp, ABC-type sulfate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|279aa|down_7|NZ_CP041066.1_2641799_2642636_+	TIGR02139, permease_CysT, sulfate ABC transporter, permease protein CysT	NA|289aa|down_8|NZ_CP041066.1_2642648_2643515_+	COG4208, CysW, ABC-type sulfate transport system, permease component [Inorganic ion transport and metabolism]	NA|358aa|down_9|NZ_CP041066.1_2643534_2644608_+	COG1118, CysA, ABC-type sulfate/molybdate transport systems, ATPase component [Inorganic ion transport and metabolism]
