assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002003345.1_ASM200334v1	NZ_CP016090	Clostridium beijerinckii strain BAS/B3/I/124, complete genome	1	1266437-1266519	1	CRISPRCasFinder	no		cas3,DEDDh,csa3,DinG,WYL,c2c10_CAS-V-U3	Orphan	AATGACGGAGGTACATGGTACTA	23	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,csa3,DinG,WYL,c2c10_CAS-V-U3	NA|230aa|up_1|NZ_CP016090.1_1264068_1264758_+,NA|436aa|up_0|NZ_CP016090.1_1264813_1266121_+,NA	NA|533aa|up_9|NZ_CP016090.1_1253530_1255129_+	COG1696, DltB, Predicted membrane protein involved in D-alanine export [Cell envelope biogenesis, outer membrane]	NA|455aa|up_8|NZ_CP016090.1_1255115_1256480_+	pfam00657, Lipase_GDSL, GDSL-like Lipase/Acylhydrolase	NA|504aa|up_7|NZ_CP016090.1_1256522_1258034_+	cd05930, A_NRPS, The adenylation domain of nonribosomal peptide synthetases (NRPS)	NA|76aa|up_6|NZ_CP016090.1_1258379_1258607_+	COG0236, AcpP, Acyl carrier protein [Lipid metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|174aa|up_5|NZ_CP016090.1_1258785_1259307_+	COG5652, COG5652, Predicted integral membrane protein [Function unknown]	NA|293aa|up_4|NZ_CP016090.1_1259618_1260497_+	COG1210, GalU, UDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane]	NA|660aa|up_3|NZ_CP016090.1_1260535_1262515_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|323aa|up_2|NZ_CP016090.1_1262801_1263770_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|230aa|up_1|NZ_CP016090.1_1264068_1264758_+	NA	NA|436aa|up_0|NZ_CP016090.1_1264813_1266121_+	NA	NA|415aa|down_0|NZ_CP016090.1_1268047_1269292_+	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|151aa|down_1|NZ_CP016090.1_1269414_1269867_+	TIGR00738, Putative_HTH-type_transcriptional_regulator, Rrf2 family protein	NA|394aa|down_2|NZ_CP016090.1_1269869_1271051_+	TIGR03402, Cysteine_desulfurase_NifS, cysteine desulfurase NifS	NA|146aa|down_3|NZ_CP016090.1_1271052_1271490_+	TIGR03419, NifU_clost, FeS cluster assembly scaffold protein NifU, Clostridium type	NA|359aa|down_4|NZ_CP016090.1_1271501_1272578_+	PRK00143, mnmA, tRNA-specific 2-thiouridylase MnmA; Reviewed	NA|99aa|down_5|NZ_CP016090.1_1272982_1273279_+	PTZ00395, PTZ00395, Sec24-related protein; Provisional	NA|167aa|down_6|NZ_CP016090.1_1274076_1274577_+	COG3881, COG3881, PRC-barrel domain containing protein [General function prediction only]	NA|343aa|down_7|NZ_CP016090.1_1274557_1275586_+	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|880aa|down_8|NZ_CP016090.1_1276147_1278787_+	PRK00252, alaS, alanyl-tRNA synthetase; Reviewed	NA|86aa|down_9|NZ_CP016090.1_1278923_1279181_+	PRK05473, IreB-like, IreB family regulatory phosphoprotein
GCF_002003345.1_ASM200334v1	NZ_CP016090	Clostridium beijerinckii strain BAS/B3/I/124, complete genome	2	4796880-4796976	2	CRISPRCasFinder	no		cas3,DEDDh,csa3,DinG,WYL,c2c10_CAS-V-U3	Orphan	CCATACTTGTATGGTAGTTATATTT	25	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,csa3,DinG,WYL,c2c10_CAS-V-U3	NA,NA|99aa|down_3|NZ_CP016090.1_4802644_4802941_+,NA|104aa|down_6|NZ_CP016090.1_4805274_4805586_-	NA|306aa|up_9|NZ_CP016090.1_4782792_4783710_-	cd01174, ribokinase, Ribokinase catalyses the phosphorylation of ribose to ribose-5-phosphate using ATP	NA|566aa|up_8|NZ_CP016090.1_4783875_4785573_-	COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and secretion / Signal transduction mechanisms]	NA|488aa|up_7|NZ_CP016090.1_4785655_4787119_-	cd01536, PBP1_ABC_sugar_binding-like, periplasmic sugar-binding domain of active transport systems that are members of the type 1 periplasmic binding protein (PBP1) superfamily	NA|186aa|up_6|NZ_CP016090.1_4787696_4788254_-	COG3963, COG3963, Phospholipid N-methyltransferase [Lipid metabolism]	NA|141aa|up_5|NZ_CP016090.1_4789142_4789565_-	COG0537, Hit, Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases [Nucleotide transport and metabolism / Carbohydrate transport and metabolism / General function prediction only]	NA|502aa|up_4|NZ_CP016090.1_4789848_4791354_-	COG2211, MelB, Na+/melibiose symporter and related transporters [Carbohydrate transport and metabolism]	NA|471aa|up_3|NZ_CP016090.1_4791392_4792805_-	pfam02614, UxaC, Glucuronate isomerase	NA|538aa|up_2|NZ_CP016090.1_4793231_4794845_-	COG0246, MtlD, Mannitol-1-phosphate/altronate dehydrogenases [Carbohydrate transport and metabolism]	NA|354aa|up_1|NZ_CP016090.1_4794943_4796005_-	PRK03906, PRK03906, mannonate dehydratase; Provisional	NA|225aa|up_0|NZ_CP016090.1_4796174_4796849_-	COG1802, GntR, Transcriptional regulators [Transcription]	NA|560aa|down_0|NZ_CP016090.1_4797195_4798875_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|392aa|down_1|NZ_CP016090.1_4798956_4800132_+	COG4134, COG4134, ABC-type uncharacterized transport system, periplasmic component [General function prediction only]	NA|621aa|down_2|NZ_CP016090.1_4800211_4802074_-	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|99aa|down_3|NZ_CP016090.1_4802644_4802941_+	NA	NA|148aa|down_4|NZ_CP016090.1_4803077_4803521_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|354aa|down_5|NZ_CP016090.1_4804117_4805179_-	cd08174, G1PDH-like, Glycerol-1-phosphate dehydrogenase-like	NA|104aa|down_6|NZ_CP016090.1_4805274_4805586_-	NA	NA|64aa|down_7|NZ_CP016090.1_4805669_4805861_-	pfam04024, PspC, PspC domain	NA|572aa|down_8|NZ_CP016090.1_4806267_4807983_-	pfam00920, ILVD_EDD, Dehydratase family	NA|243aa|down_9|NZ_CP016090.1_4808213_4808942_-	COG2186, FadR, Transcriptional regulators [Transcription]
GCF_002003345.1_ASM200334v1	NZ_CP016090	Clostridium beijerinckii strain BAS/B3/I/124, complete genome	3	5333614-5333697	3	CRISPRCasFinder	no		cas3,DEDDh,csa3,DinG,WYL,c2c10_CAS-V-U3	Orphan	ATAATGGTACTTGGTACTTCTTAA	24	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,csa3,DinG,WYL,c2c10_CAS-V-U3	NA,NA	NA|519aa|up_9|NZ_CP016090.1_5321751_5323308_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|230aa|up_8|NZ_CP016090.1_5323304_5323994_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|95aa|up_7|NZ_CP016090.1_5324703_5324988_+	COG1359, COG1359, Uncharacterized conserved protein [Function unknown]	NA|293aa|up_6|NZ_CP016090.1_5325442_5326321_-	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|227aa|up_5|NZ_CP016090.1_5326322_5327003_-	TIGR00719, Probable_L-serine_dehydratase_beta_chain, L-serine dehydratase, iron-sulfur-dependent, beta subunit	NA|246aa|up_4|NZ_CP016090.1_5327277_5328015_-	COG2819, COG2819, Predicted hydrolase of the alpha/beta superfamily [General function prediction only]	NA|268aa|up_3|NZ_CP016090.1_5328021_5328825_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|365aa|up_2|NZ_CP016090.1_5329008_5330103_+	pfam01032, FecCD, FecCD transport family	NA|264aa|up_1|NZ_CP016090.1_5330112_5330904_+	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|349aa|up_0|NZ_CP016090.1_5330963_5332010_+	cd01148, TroA_a, Metal binding protein TroA_a	NA|118aa|down_0|NZ_CP016090.1_5333984_5334338_-	COG3862, COG3862, Uncharacterized protein with conserved CXXC pairs [Function unknown]	NA|418aa|down_1|NZ_CP016090.1_5334339_5335593_-	COG0492, TrxB, Thioredoxin reductase [Posttranslational modification, protein turnover, chaperones]	NA|476aa|down_2|NZ_CP016090.1_5335610_5337038_-	COG0579, COG0579, Predicted dehydrogenase [General function prediction only]	NA|500aa|down_3|NZ_CP016090.1_5337491_5338991_-	PRK00047, glpK, glycerol kinase GlpK	NA|186aa|down_4|NZ_CP016090.1_5339283_5339841_-	pfam04309, G3P_antiterm, Glycerol-3-phosphate responsive antiterminator	NA|238aa|down_5|NZ_CP016090.1_5339926_5340640_-	COG0580, GlpF, Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) [Carbohydrate transport and metabolism]	NA|552aa|down_6|NZ_CP016090.1_5341292_5342948_-	cd02028, UMPK_like, Uridine monophosphate kinase_like (UMPK_like) is a family of proteins highly similar to the uridine monophosphate kinase (UMPK, EC 2	NA|233aa|down_7|NZ_CP016090.1_5343259_5343958_+	COG2323, COG2323, Predicted membrane protein [Function unknown]	NA|127aa|down_8|NZ_CP016090.1_5343954_5344335_+	pfam14276, DUF4363, Domain of unknown function (DUF4363)	NA|408aa|down_9|NZ_CP016090.1_5344676_5345900_+	PRK00509, PRK00509, argininosuccinate synthase; Provisional
GCF_002003345.1_ASM200334v1	NZ_CP016090	Clostridium beijerinckii strain BAS/B3/I/124, complete genome	4	5681531-5681617	4	CRISPRCasFinder	no		cas3,DEDDh,csa3,DinG,WYL,c2c10_CAS-V-U3	Orphan	CATAGCACCTGATCCATTTAAGTAGTA	27	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,csa3,DinG,WYL,c2c10_CAS-V-U3	NA,NA|89aa|down_6|NZ_CP016090.1_5693804_5694071_+,NA|74aa|down_7|NZ_CP016090.1_5694083_5694305_+,NA|84aa|down_8|NZ_CP016090.1_5694403_5694655_+,NA|48aa|down_9|NZ_CP016090.1_5694700_5694844_+	NA|514aa|up_9|NZ_CP016090.1_5664601_5666143_-	pfam14264, Glucos_trans_II, Glucosyl transferase GtrII	NA|1033aa|up_8|NZ_CP016090.1_5666184_5669283_-	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|388aa|up_7|NZ_CP016090.1_5669306_5670470_-	pfam13578, Methyltransf_24, Methyltransferase domain	NA|420aa|up_6|NZ_CP016090.1_5670498_5671758_-	cd03220, ABC_KpsT_Wzt, ATP-binding cassette component of polysaccharide transport system	NA|268aa|up_5|NZ_CP016090.1_5671770_5672574_-	COG1682, TagG, ABC-type polysaccharide/polyol phosphate export systems, permease component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|458aa|up_4|NZ_CP016090.1_5672703_5674077_-	COG5610, COG5610, Predicted hydrolase (HAD superfamily) [General function prediction only]	NA|346aa|up_3|NZ_CP016090.1_5674028_5675066_-	PRK10073, PRK10073, putative glycosyl transferase; Provisional	NA|475aa|up_2|NZ_CP016090.1_5675455_5676880_+	TIGR03023, Sugar_transferase	NA|379aa|up_1|NZ_CP016090.1_5676876_5678013_+	cd03808, GT4_CapM-like, capsular polysaccharide biosynthesis glycosyltransferase CapM and similar proteins	NA|611aa|up_0|NZ_CP016090.1_5678309_5680142_+	pfam04932, Wzy_C, O-Antigen ligase	NA|406aa|down_0|NZ_CP016090.1_5684120_5685338_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|415aa|down_1|NZ_CP016090.1_5685482_5686727_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|516aa|down_2|NZ_CP016090.1_5686777_5688325_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|426aa|down_3|NZ_CP016090.1_5688554_5689832_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|434aa|down_4|NZ_CP016090.1_5689997_5691299_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|576aa|down_5|NZ_CP016090.1_5691843_5693571_-	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|89aa|down_6|NZ_CP016090.1_5693804_5694071_+	NA	NA|74aa|down_7|NZ_CP016090.1_5694083_5694305_+	NA	NA|84aa|down_8|NZ_CP016090.1_5694403_5694655_+	NA	NA|48aa|down_9|NZ_CP016090.1_5694700_5694844_+	NA
