assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000145275.1_ASM14527v1	NC_014393	Clostridium cellulovorans 743B, complete genome	1	1128578-1128666	1	CRISPRCasFinder	no	DEDDh	csa3,DinG,DEDDh,cas3,cas5,cas7,cas8a2,RT,WYL,PD-DExK	Unclear	TTTAAATTAATATAAATATATAT	23	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,DEDDh,cas3,cas5,cas7,cas8a2,RT,WYL,PD-DExK	NA|271aa|up_4|NC_014393.1_1123698_1124511_+,NA	NA|277aa|up_9|NC_014393.1_1116762_1117593_-	pfam07705, CARDB, CARDB	NA|293aa|up_8|NC_014393.1_1117847_1118726_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|690aa|up_7|NC_014393.1_1119189_1121259_+	COG1501, COG1501, Alpha-glucosidases, family 31 of glycosyl hydrolases [Carbohydrate transport and metabolism]	NA|219aa|up_6|NC_014393.1_1121497_1122154_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|476aa|up_5|NC_014393.1_1122159_1123587_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|271aa|up_4|NC_014393.1_1123698_1124511_+	NA	NA|111aa|up_3|NC_014393.1_1124877_1125210_+	TIGR03308, phn_thr-fam, phosphonate metabolism protein, transferase hexapeptide repeat family	NA|116aa|up_2|NC_014393.1_1125175_1125523_+	cd03349, LbH_XAT, Xenobiotic acyltransferase (XAT): The XAT class of hexapeptide acyltransferases is composed of a large number of microbial enzymes that catalyze the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others	NA|194aa|up_1|NC_014393.1_1125570_1126152_+	cd03135, GATase1_DJ-1, Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1	NA|358aa|up_0|NC_014393.1_1127235_1128309_-	cd01539, PBP1_GGBP, periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species	NA|405aa|down_0|NC_014393.1_1128695_1129910_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|221aa|down_1|NC_014393.1_1129893_1130556_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|277aa|down_2|NC_014393.1_1130637_1131468_+	PRK06940, PRK06940, short chain dehydrogenase; Provisional	NA|267aa|down_3|NC_014393.1_1131775_1132576_+	cd05233, SDR_c, classical (c) SDRs	NA|255aa|down_4|NC_014393.1_1132731_1133496_-	pfam06161, DUF975, Protein of unknown function (DUF975)	NA|765aa|down_5|NC_014393.1_1133974_1136269_+	pfam00759, Glyco_hydro_9, Glycosyl hydrolase family 9	NA|145aa|down_6|NC_014393.1_1136580_1137015_+	cd16387, ParB_N_Srx, ParB N-terminal domain and sulfiredoxin protein-related families	NA|664aa|down_7|NC_014393.1_1137608_1139600_+	PRK12268, PRK12268, methionyl-tRNA synthetase; Reviewed	NA|158aa|down_8|NC_014393.1_1139995_1140469_+	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|116aa|down_9|NC_014393.1_1140718_1141066_+	COG1695, COG1695, Predicted transcriptional regulators [Transcription]
GCF_000145275.1_ASM14527v1	NC_014393	Clostridium cellulovorans 743B, complete genome	2	1581437-1581528	2	CRISPRCasFinder	no		csa3,DinG,DEDDh,cas3,cas5,cas7,cas8a2,RT,WYL,PD-DExK	Orphan	TTAAATAATTATCTCTAATATAAAAGT	27	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,DEDDh,cas3,cas5,cas7,cas8a2,RT,WYL,PD-DExK	NA|86aa|up_7|NC_014393.1_1572633_1572891_+,NA|362aa|up_5|NC_014393.1_1574077_1575163_-,NA|137aa|up_3|NC_014393.1_1576450_1576861_-,NA|208aa|down_6|NC_014393.1_1589640_1590264_+	NA|448aa|up_9|NC_014393.1_1568421_1569765_+	PRK09414, PRK09414, NADP-specific glutamate dehydrogenase	NA|809aa|up_8|NC_014393.1_1569947_1572374_-	cd04300, GT35_Glycogen_Phosphorylase, glycogen phosphorylase and similar proteins	NA|86aa|up_7|NC_014393.1_1572633_1572891_+	NA	NA|281aa|up_6|NC_014393.1_1573074_1573917_+	TIGR00762, DegV, EDD domain protein, DegV family	NA|362aa|up_5|NC_014393.1_1574077_1575163_-	NA	NA|348aa|up_4|NC_014393.1_1575172_1576216_-	COG4632, EpsL, Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase [Carbohydrate transport and metabolism]	NA|137aa|up_3|NC_014393.1_1576450_1576861_-	NA	NA|151aa|up_2|NC_014393.1_1577130_1577583_+	PRK14900, valS, valyl-tRNA synthetase; Provisional	NA|152aa|up_1|NC_014393.1_1577750_1578206_-	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|578aa|up_0|NC_014393.1_1578572_1580306_+	pfam07693, KAP_NTPase, KAP family P-loop domain	NA|455aa|down_0|NC_014393.1_1581611_1582976_-	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|457aa|down_1|NC_014393.1_1583086_1584457_-	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|229aa|down_2|NC_014393.1_1585002_1585689_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|227aa|down_3|NC_014393.1_1585698_1586379_-	COG3944, COG3944, Capsular polysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane]	NA|246aa|down_4|NC_014393.1_1586980_1587718_+	PRK12434, PRK12434, tRNA pseudouridine(38-40) synthase TruA	NA|343aa|down_5|NC_014393.1_1588090_1589119_+	PRK09261, PRK09261, phospho-2-dehydro-3-deoxyheptonate aldolase; Validated	NA|208aa|down_6|NC_014393.1_1589640_1590264_+	NA	NA|185aa|down_7|NC_014393.1_1590275_1590830_+	COG3945, COG3945, Uncharacterized conserved protein [Function unknown]	NA|341aa|down_8|NC_014393.1_1590892_1591915_-	COG1316, LytR, Transcriptional regulator [Transcription]	NA|402aa|down_9|NC_014393.1_1591928_1593134_-	PRK13902, alaS, alanyl-tRNA synthetase; Provisional
GCF_000145275.1_ASM14527v1	NC_014393	Clostridium cellulovorans 743B, complete genome	3	2598493-2598666	1	PILER-CR	no		csa3,DinG,DEDDh,cas3,cas5,cas7,cas8a2,RT,WYL,PD-DExK	Orphan	AATAATAATAAAGTTTGGTCAT	22	0	0	NA	NA	NA	2	2	Orphan	csa3,DinG,DEDDh,cas3,cas5,cas7,cas8a2,RT,WYL,PD-DExK	NA|312aa|up_6|NC_014393.1_2593340_2594276_-,NA|248aa|up_1|NC_014393.1_2597321_2598065_-,NA|53aa|up_0|NC_014393.1_2598135_2598294_-,NA|142aa|down_3|NC_014393.1_2602890_2603316_+	NA|139aa|up_9|NC_014393.1_2591486_2591903_-	pfam10990, DUF2809, Protein of unknown function (DUF2809)	NA|101aa|up_8|NC_014393.1_2591940_2592243_-	smart00886, Dabb, Stress responsive A/B Barrel Domain	NA|267aa|up_7|NC_014393.1_2592517_2593318_-	pfam01987, AIM24, Mitochondrial biogenesis AIM24	NA|312aa|up_6|NC_014393.1_2593340_2594276_-	NA	NA|165aa|up_5|NC_014393.1_2594417_2594912_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|286aa|up_4|NC_014393.1_2595021_2595879_-	pfam04439, Adenyl_transf, Streptomycin adenylyltransferase	NA|113aa|up_3|NC_014393.1_2595886_2596225_-	pfam10694, DUF2500, Protein of unknown function (DUF2500)	NA|113aa|up_2|NC_014393.1_2596292_2596631_-	COG4551, COG4551, Predicted protein tyrosine phosphatase [General function prediction only]	NA|248aa|up_1|NC_014393.1_2597321_2598065_-	NA	NA|53aa|up_0|NC_014393.1_2598135_2598294_-	NA	NA|333aa|down_0|NC_014393.1_2599031_2600030_-	pfam14057, GGGtGRT, GGGtGRT protein	NA|231aa|down_1|NC_014393.1_2600050_2600743_-	COG0822, IscU, NifU homolog involved in Fe-S cluster formation [Energy production and conversion]	NA|136aa|down_2|NC_014393.1_2602080_2602488_-	pfam06355, Aegerolysin, Aegerolysin	NA|142aa|down_3|NC_014393.1_2602890_2603316_+	NA	NA|320aa|down_4|NC_014393.1_2603378_2604338_-	cd08547, Type_II_cohesin, Type II cohesin domain, interaction partner of dockerin	NA|478aa|down_5|NC_014393.1_2604415_2605849_-	COG5279, CYK3, Uncharacterized protein involved in cytokinesis, contains TGc (transglutaminase/protease-like) domain [Cell division and chromosome partitioning]	NA|806aa|down_6|NC_014393.1_2606304_2608722_+	cd08547, Type_II_cohesin, Type II cohesin domain, interaction partner of dockerin	NA|256aa|down_7|NC_014393.1_2609160_2609928_-	cd02513, CMP-NeuAc_Synthase, CMP-NeuAc_Synthase activates N-acetylneuraminic acid by adding CMP moiety	NA|207aa|down_8|NC_014393.1_2609924_2610545_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|90aa|down_9|NC_014393.1_2610573_2610843_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]
GCF_000145275.1_ASM14527v1	NC_014393	Clostridium cellulovorans 743B, complete genome	4	2748102-2750748	2,3,1,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas3,cas5,cas7,cas8a2	csa3,DinG,DEDDh,cas3,cas5,cas7,cas8a2,RT,WYL,PD-DExK	Type I-A	CTTTAAATAATACAGTATTCAATATTAAC,CTTTAAATAATACAGTATTCAATATTAAC,CTTTAAATAATACAGTATTCAATATTAAC,CTTTAAATAATACAGTATTCAATATTAAC	29,29,29,29	1	1	2748326-2748361	NC_014393.1_2748066-2748101	NA:NA:NA:NA	35,40,40,35	40	TypeI-A	csa3,DinG,DEDDh,cas3,cas5,cas7,cas8a2,RT,WYL,PD-DExK	NA|168aa|up_9|NC_014393.1_2741074_2741578_-,NA|140aa|up_8|NC_014393.1_2742698_2743118_-,NA|98aa|up_7|NC_014393.1_2743184_2743478_-,NA|88aa|up_6|NC_014393.1_2743556_2743820_-,NA|69aa|up_5|NC_014393.1_2744365_2744572_-,NA|62aa|up_3|NC_014393.1_2745108_2745294_-,NA|387aa|up_0|NC_014393.1_2746583_2747744_-,cas5|213aa|down_1|NC_014393.1_2753065_2753704_-,cas8a2|446aa|down_3|NC_014393.1_2754637_2755975_-,NA|108aa|down_5|NC_014393.1_2756539_2756863_-,NA|166aa|down_6|NC_014393.1_2756962_2757460_-,NA|172aa|down_8|NC_014393.1_2758296_2758812_-,NA|140aa|down_9|NC_014393.1_2758837_2759257_-	NA|168aa|up_9|NC_014393.1_2741074_2741578_-	NA	NA|140aa|up_8|NC_014393.1_2742698_2743118_-	NA	NA|98aa|up_7|NC_014393.1_2743184_2743478_-	NA	NA|88aa|up_6|NC_014393.1_2743556_2743820_-	NA	NA|69aa|up_5|NC_014393.1_2744365_2744572_-	NA	NA|182aa|up_4|NC_014393.1_2744541_2745087_-	pfam14094, DUF4272, Domain of unknown function (DUF4272)	NA|62aa|up_3|NC_014393.1_2745108_2745294_-	NA	NA|162aa|up_2|NC_014393.1_2745779_2746265_-	pfam01134, GIDA, Glucose inhibited division protein A	NA|89aa|up_1|NC_014393.1_2746283_2746550_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|387aa|up_0|NC_014393.1_2746583_2747744_-	NA	cas3|706aa|down_0|NC_014393.1_2750939_2753057_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|213aa|down_1|NC_014393.1_2753065_2753704_-	NA	cas7|307aa|down_2|NC_014393.1_2753717_2754638_-	pfam01905, DevR, CRISPR-associated negative auto-regulator DevR/Csa2	cas8a2|446aa|down_3|NC_014393.1_2754637_2755975_-	NA	NA|160aa|down_4|NC_014393.1_2756073_2756553_-	pfam01478, Peptidase_A24, Type IV leader peptidase family	NA|108aa|down_5|NC_014393.1_2756539_2756863_-	NA	NA|166aa|down_6|NC_014393.1_2756962_2757460_-	NA	NA|175aa|down_7|NC_014393.1_2757566_2758091_-	PLN02501, PLN02501, digalactosyldiacylglycerol synthase	NA|172aa|down_8|NC_014393.1_2758296_2758812_-	NA	NA|140aa|down_9|NC_014393.1_2758837_2759257_-	NA
GCF_000145275.1_ASM14527v1	NC_014393	Clostridium cellulovorans 743B, complete genome	5	4819249-4819389	4	CRISPRCasFinder	no		csa3,DinG,DEDDh,cas3,cas5,cas7,cas8a2,RT,WYL,PD-DExK	Orphan	CTGACCCAAGGATAGTGAGAGGCAATGAAGCCACAAAAAACTTCCTGAAAAAAA	54	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,DEDDh,cas3,cas5,cas7,cas8a2,RT,WYL,PD-DExK	NA|77aa|up_4|NC_014393.1_4814235_4814466_-,NA|83aa|down_0|NC_014393.1_4819615_4819864_+,NA|135aa|down_1|NC_014393.1_4819866_4820271_+	NA|419aa|up_9|NC_014393.1_4810610_4811867_-	cd16403, ParB_N_like_MT, ParB N-terminal-like domain, some attached to C-terminal S-adenosylmethionine-dependent methyltransferase	NA|261aa|up_8|NC_014393.1_4811876_4812659_-	COG0192, MetK, S-adenosylmethionine synthetase [Coenzyme metabolism]	NA|180aa|up_7|NC_014393.1_4812660_4813200_-	pfam05119, Terminase_4, Phage terminase, small subunit	NA|123aa|up_6|NC_014393.1_4813315_4813684_-	pfam01844, HNH, HNH endonuclease	NA|136aa|up_5|NC_014393.1_4813825_4814233_-	cd06171, Sigma70_r4, Sigma70, region (SR) 4 refers to the most C-terminal of four conserved domains found in Escherichia coli (Ec) sigma70, the main housekeeping sigma, and related sigma-factors (SFs)	NA|77aa|up_4|NC_014393.1_4814235_4814466_-	NA	NA|457aa|up_3|NC_014393.1_4814545_4815916_-	cd18013, DEXQc_bact_SNF2, DEXQ-box helicase domain of bacterial SNF2 family proteins	NA|93aa|up_2|NC_014393.1_4815899_4816178_-	smart00990, VRR_NUC, This model contains proteins with the VRR-NUC domain	NA|756aa|up_1|NC_014393.1_4816394_4818662_-	TIGR01613, putative_primase, phage/plasmid primase, P4 family, C-terminal domain	NA|142aa|up_0|NC_014393.1_4818652_4819078_-	pfam14359, DUF4406, Domain of unknown function (DUF4406)	NA|83aa|down_0|NC_014393.1_4819615_4819864_+	NA	NA|135aa|down_1|NC_014393.1_4819866_4820271_+	NA	NA|381aa|down_2|NC_014393.1_4820263_4821406_+	pfam10926, DUF2800, Protein of unknown function (DUF2800)	NA|188aa|down_3|NC_014393.1_4821398_4821962_+	pfam10991, DUF2815, Protein of unknown function (DUF2815)	NA|664aa|down_4|NC_014393.1_4822058_4824050_+	cd08642, DNA_pol_A_pol_I_A, Polymerase I functions primarily to fill DNA gaps that arise during DNA repair, recombination and replication	NA|172aa|down_5|NC_014393.1_4824083_4824599_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|209aa|down_6|NC_014393.1_4824922_4825549_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|636aa|down_7|NC_014393.1_4825717_4827625_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|491aa|down_8|NC_014393.1_4827662_4829135_-	pfam09563, RE_LlaJI, LlaJI restriction endonuclease	NA|858aa|down_9|NC_014393.1_4829137_4831711_-	COG4127, COG4127, Uncharacterized conserved protein [Function unknown]
