assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000723465.1_Rb803	NZ_HF545617	Ruminococcus bicirculans strain 80/3 chromosome II	1	253727-262985	1,1,1,2,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,cas13d,WYL	 Type I-U?,Type I-C,Type I-U	GTCACGCTCCACGTGAGCGTGTGAGTTGAAAT,GTCACGCTCCACGTGAGCGTGTGAGTTGAAAT,GTCACGCTCCACGTGAGCGTGTGAGTTGAAAT,GTCACGCTCCACGTGAGCGTGTGAGTTGAAAT,GTCACGCTCCACGTGAGCGTGTGAGTTGAAAT	32,32,32,32,32	1	1	261408-261441	NZ_HF545616.1_1905205-1905172	NA:NA:NA:NA:NA	131,140,140,131,131	140	TypeI-U?,TypeI-C,TypeI-U	cas3,WYL,RT,csa3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,cas13d	NA,NA|158aa|down_0|NZ_HF545617.1_263017_263491_+,NA|51aa|down_1|NZ_HF545617.1_263665_263818_+	NA|135aa|up_9|NZ_HF545617.1_240848_241253_-	pfam12646, DUF3783, Domain of unknown function (DUF3783)	NA|781aa|up_8|NZ_HF545617.1_241410_243753_-	PRK10658, PRK10658, putative alpha-glucosidase; Provisional	NA|399aa|up_7|NZ_HF545617.1_244100_245297_-	pfam09587, PGA_cap, Bacterial capsule synthesis protein PGA_cap	cas3|735aa|up_6|NZ_HF545617.1_245611_247816_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|240aa|up_5|NZ_HF545617.1_248122_248842_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|617aa|up_4|NZ_HF545617.1_248835_250686_+	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas7|289aa|up_3|NZ_HF545617.1_250703_251570_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas4|227aa|up_2|NZ_HF545617.1_251569_252250_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|341aa|up_1|NZ_HF545617.1_252242_253265_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_0|NZ_HF545617.1_253276_253567_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|158aa|down_0|NZ_HF545617.1_263017_263491_+	NA	NA|51aa|down_1|NZ_HF545617.1_263665_263818_+	NA	NA|311aa|down_2|NZ_HF545617.1_264402_265335_-	pfam00665, rve, Integrase core domain	NA|418aa|down_3|NZ_HF545617.1_265549_266803_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|313aa|down_4|NZ_HF545617.1_266771_267710_+	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|387aa|down_5|NZ_HF545617.1_267723_268884_+	pfam01882, DUF58, Protein of unknown function DUF58	NA|737aa|down_6|NZ_HF545617.1_268896_271107_+	pfam01841, Transglut_core, Transglutaminase-like superfamily	NA|253aa|down_7|NZ_HF545617.1_271251_272010_-	PRK00048, PRK00048, dihydrodipicolinate reductase; Provisional	NA|297aa|down_8|NZ_HF545617.1_272146_273037_-	PRK03170, PRK03170, dihydrodipicolinate synthase; Provisional	NA|361aa|down_9|NZ_HF545617.1_273073_274156_-	PRK08664, PRK08664, aspartate-semialdehyde dehydrogenase; Reviewed
GCF_000723465.1_Rb803	NZ_HF545617	Ruminococcus bicirculans strain 80/3 chromosome II	2	263506-263603	2	CRISPRCasFinder	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,cas13d,WYL	 Type I-U?,Type I-C,Type I-U	GTCACGCTCCACGTGAGCGTGTGAGTTGAAAT	32	0	0	NA	NA	NA	1	1	TypeI-U?,TypeI-C,TypeI-U	cas3,WYL,RT,csa3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,cas13d	NA|158aa|up_0|NZ_HF545617.1_263017_263491_+,NA|51aa|down_0|NZ_HF545617.1_263665_263818_+	NA|781aa|up_9|NZ_HF545617.1_241410_243753_-	PRK10658, PRK10658, putative alpha-glucosidase; Provisional	NA|399aa|up_8|NZ_HF545617.1_244100_245297_-	pfam09587, PGA_cap, Bacterial capsule synthesis protein PGA_cap	cas3|735aa|up_7|NZ_HF545617.1_245611_247816_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|240aa|up_6|NZ_HF545617.1_248122_248842_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|617aa|up_5|NZ_HF545617.1_248835_250686_+	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas7|289aa|up_4|NZ_HF545617.1_250703_251570_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas4|227aa|up_3|NZ_HF545617.1_251569_252250_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|341aa|up_2|NZ_HF545617.1_252242_253265_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_1|NZ_HF545617.1_253276_253567_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|158aa|up_0|NZ_HF545617.1_263017_263491_+	NA	NA|51aa|down_0|NZ_HF545617.1_263665_263818_+	NA	NA|311aa|down_1|NZ_HF545617.1_264402_265335_-	pfam00665, rve, Integrase core domain	NA|418aa|down_2|NZ_HF545617.1_265549_266803_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|313aa|down_3|NZ_HF545617.1_266771_267710_+	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|387aa|down_4|NZ_HF545617.1_267723_268884_+	pfam01882, DUF58, Protein of unknown function DUF58	NA|737aa|down_5|NZ_HF545617.1_268896_271107_+	pfam01841, Transglut_core, Transglutaminase-like superfamily	NA|253aa|down_6|NZ_HF545617.1_271251_272010_-	PRK00048, PRK00048, dihydrodipicolinate reductase; Provisional	NA|297aa|down_7|NZ_HF545617.1_272146_273037_-	PRK03170, PRK03170, dihydrodipicolinate synthase; Provisional	NA|361aa|down_8|NZ_HF545617.1_273073_274156_-	PRK08664, PRK08664, aspartate-semialdehyde dehydrogenase; Reviewed	NA|345aa|down_9|NZ_HF545617.1_274752_275787_+	PRK00147, queA, S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional
GCF_000723465.1_Rb803	NZ_HF545617	Ruminococcus bicirculans strain 80/3 chromosome II	3	263849-263947	3	CRISPRCasFinder	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,cas13d,WYL	 Type I-U?,Type I-C,Type I-U	GTCACGCTCCACGTGAGCGTGTGAGTTGAAAT	32	0	0	NA	NA	NA	1	1	TypeI-U?,TypeI-C,TypeI-U	cas3,WYL,RT,csa3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,cas13d	NA|158aa|up_1|NZ_HF545617.1_263017_263491_+,NA|51aa|up_0|NZ_HF545617.1_263665_263818_+,NA	NA|399aa|up_9|NZ_HF545617.1_244100_245297_-	pfam09587, PGA_cap, Bacterial capsule synthesis protein PGA_cap	cas3|735aa|up_8|NZ_HF545617.1_245611_247816_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|240aa|up_7|NZ_HF545617.1_248122_248842_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|617aa|up_6|NZ_HF545617.1_248835_250686_+	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas7|289aa|up_5|NZ_HF545617.1_250703_251570_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas4|227aa|up_4|NZ_HF545617.1_251569_252250_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|341aa|up_3|NZ_HF545617.1_252242_253265_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_2|NZ_HF545617.1_253276_253567_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|158aa|up_1|NZ_HF545617.1_263017_263491_+	NA	NA|51aa|up_0|NZ_HF545617.1_263665_263818_+	NA	NA|311aa|down_0|NZ_HF545617.1_264402_265335_-	pfam00665, rve, Integrase core domain	NA|418aa|down_1|NZ_HF545617.1_265549_266803_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|313aa|down_2|NZ_HF545617.1_266771_267710_+	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|387aa|down_3|NZ_HF545617.1_267723_268884_+	pfam01882, DUF58, Protein of unknown function DUF58	NA|737aa|down_4|NZ_HF545617.1_268896_271107_+	pfam01841, Transglut_core, Transglutaminase-like superfamily	NA|253aa|down_5|NZ_HF545617.1_271251_272010_-	PRK00048, PRK00048, dihydrodipicolinate reductase; Provisional	NA|297aa|down_6|NZ_HF545617.1_272146_273037_-	PRK03170, PRK03170, dihydrodipicolinate synthase; Provisional	NA|361aa|down_7|NZ_HF545617.1_273073_274156_-	PRK08664, PRK08664, aspartate-semialdehyde dehydrogenase; Reviewed	NA|345aa|down_8|NZ_HF545617.1_274752_275787_+	PRK00147, queA, S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional	NA|374aa|down_9|NZ_HF545617.1_275856_276978_+	COG3594, NolL, Fucose 4-O-acetylase and related acetyltransferases [Carbohydrate transport and metabolism]
GCF_000723465.1_Rb803	NZ_HF545617	Ruminococcus bicirculans strain 80/3 chromosome II	4	526942-527175	4,4,2	PILER-CR,CRISPRCasFinder,CRT	no	cas13d,WYL	DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,cas13d,WYL	Unclear	CTACTACACTGGTGCGAATTTGCACTAGTCTAAAAC,CTACTACACTGGTGCGAATTTGCACTAGTCTAAAAC,CTACTACACTGGTGCGAATTTGCACTAGTCTAAAA	36,36,35	0	0	NA	NA	NA:NA:NA	2,3,3	3	TypeVI-D	cas3,WYL,RT,csa3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,cas13d	NA|286aa|up_4|NZ_HF545617.1_520980_521838_+,NA|393aa|up_0|NZ_HF545617.1_525271_526450_-,cas13d|919aa|down_0|NZ_HF545617.1_527255_530012_+,NA|116aa|down_1|NZ_HF545617.1_530063_530411_-,NA|518aa|down_5|NZ_HF545617.1_537000_538554_+,NA|63aa|down_8|NZ_HF545617.1_540489_540678_-,NA|257aa|down_9|NZ_HF545617.1_541955_542726_+	NA|374aa|up_9|NZ_HF545617.1_515957_517079_-	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|113aa|up_8|NZ_HF545617.1_517097_517436_-	COG0347, GlnK, Nitrogen regulatory protein PII [Amino acid transport and metabolism]	NA|118aa|up_7|NZ_HF545617.1_517732_518086_+	pfam07364, DUF1485, Metallopeptidase family M81	NA|384aa|up_6|NZ_HF545617.1_518075_519227_+	COG5476, COG5476, Uncharacterized conserved protein [Function unknown]	NA|341aa|up_5|NZ_HF545617.1_519536_520559_+	pfam00150, Cellulase, Cellulase (glycosyl hydrolase family 5)	NA|286aa|up_4|NZ_HF545617.1_520980_521838_+	NA	NA|477aa|up_3|NZ_HF545617.1_521849_523280_+	COG1653, UgpB, ABC-type sugar transport system, periplasmic component [Carbohydrate transport and metabolism]	NA|254aa|up_2|NZ_HF545617.1_523399_524161_+	TIGR04094, AraC_family_transcriptional_regulator, YSIRK-targeted surface antigen transcriptional regulator	NA|256aa|up_1|NZ_HF545617.1_524275_525043_+	cd02175, GH16_lichenase, lichenase, member of glycosyl hydrolase family 16	NA|393aa|up_0|NZ_HF545617.1_525271_526450_-	NA	cas13d|919aa|down_0|NZ_HF545617.1_527255_530012_+	NA	NA|116aa|down_1|NZ_HF545617.1_530063_530411_-	NA	NA|1316aa|down_2|NZ_HF545617.1_531290_535238_+	pfam06873, SerH, Cell surface immobilisation antigen SerH	NA|104aa|down_3|NZ_HF545617.1_535322_535634_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	WYL|325aa|down_4|NZ_HF545617.1_535969_536944_+	pfam13280, WYL, WYL domain	NA|518aa|down_5|NZ_HF545617.1_537000_538554_+	NA	NA|408aa|down_6|NZ_HF545617.1_538573_539797_+	PTZ00121, PTZ00121, MAEBL; Provisional	NA|48aa|down_7|NZ_HF545617.1_540323_540467_+	pfam01476, LysM, LysM domain	NA|63aa|down_8|NZ_HF545617.1_540489_540678_-	NA	NA|257aa|down_9|NZ_HF545617.1_541955_542726_+	NA
GCF_000723465.1_Rb803	NZ_HF545616	Ruminococcus bicirculans strain 80/3 chromosome I	1	102377-102518	1	CRISPRCasFinder	no		cas3,WYL,RT,csa3,DEDDh	Orphan	AAAAAATCCGCTGCCACGAATGATAGCAGATATGGGTGCAGGGGCACGC	49	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,RT,csa3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,cas13d	NA|75aa|up_8|NZ_HF545616.1_89038_89263_+,NA|233aa|up_7|NZ_HF545616.1_89511_90210_+,NA|588aa|up_3|NZ_HF545616.1_94713_96477_+,NA|403aa|up_1|NZ_HF545616.1_99633_100842_+,NA|281aa|down_6|NZ_HF545616.1_110986_111829_-,NA|168aa|down_8|NZ_HF545616.1_112718_113222_-	NA|437aa|up_9|NZ_HF545616.1_87422_88733_-	pfam13472, Lipase_GDSL_2, GDSL-like Lipase/Acylhydrolase family	NA|75aa|up_8|NZ_HF545616.1_89038_89263_+	NA	NA|233aa|up_7|NZ_HF545616.1_89511_90210_+	NA	NA|624aa|up_6|NZ_HF545616.1_90358_92230_+	cd06267, PBP1_LacI_sugar_binding-like, ligand binding domain of the LacI transcriptional regulator family belonging to the type 1 periplasmic-binding fold protein superfamily	NA|154aa|up_5|NZ_HF545616.1_92670_93132_+	pfam04138, GtrA, GtrA-like protein	NA|521aa|up_4|NZ_HF545616.1_93149_94712_+	PRK07208, PRK07208, hypothetical protein; Provisional	NA|588aa|up_3|NZ_HF545616.1_94713_96477_+	NA	NA|911aa|up_2|NZ_HF545616.1_96604_99337_-	pfam00150, Cellulase, Cellulase (glycosyl hydrolase family 5)	NA|403aa|up_1|NZ_HF545616.1_99633_100842_+	NA	NA|422aa|up_0|NZ_HF545616.1_101062_102328_+	COG0475, KefB, Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]	NA|442aa|down_0|NZ_HF545616.1_102524_103850_-	PRK05474, PRK05474, xylose isomerase; Provisional	NA|91aa|down_1|NZ_HF545616.1_104946_105219_+	PRK00364, groES, co-chaperonin GroES; Reviewed	NA|548aa|down_2|NZ_HF545616.1_105262_106906_+	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|235aa|down_3|NZ_HF545616.1_107287_107992_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|474aa|down_4|NZ_HF545616.1_107984_109406_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|437aa|down_5|NZ_HF545616.1_109415_110726_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|281aa|down_6|NZ_HF545616.1_110986_111829_-	NA	NA|149aa|down_7|NZ_HF545616.1_112118_112565_+	TIGR02838, stage_V_sporulation_protein_AC, stage V sporulation protein AC	NA|168aa|down_8|NZ_HF545616.1_112718_113222_-	NA	NA|451aa|down_9|NZ_HF545616.1_113401_114754_-	cd13138, MATE_yoeA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Bacillus subtilis yoeA
GCF_000723465.1_Rb803	NZ_HF545616	Ruminococcus bicirculans strain 80/3 chromosome I	2	1784406-1784515	2	CRISPRCasFinder	no	csa3	cas3,WYL,RT,csa3,DEDDh	Type I-A	TTTTAATAGGGGTTGCGAATCCGTCTAAAATCGCTTTAATAG	42	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,RT,csa3,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2,cas13d	NA,NA	NA|113aa|up_9|NZ_HF545616.1_1776891_1777230_-	pfam09858, DUF2085, Predicted membrane protein (DUF2085)	NA|77aa|up_8|NZ_HF545616.1_1777268_1777499_-	PRK05582, PRK05582, type I DNA topoisomerase	NA|479aa|up_7|NZ_HF545616.1_1777603_1779040_-	cd03302, Adenylsuccinate_lyase_2, Adenylsuccinate lyase (ASL)_subgroup 2	NA|466aa|up_6|NZ_HF545616.1_1779271_1780669_+	pfam04932, Wzy_C, O-Antigen ligase	NA|104aa|up_5|NZ_HF545616.1_1780821_1781133_+	pfam00829, Ribosomal_L21p, Ribosomal prokaryotic L21 protein	NA|107aa|up_4|NZ_HF545616.1_1781132_1781453_+	pfam04327, Peptidase_Prp, Cysteine protease Prp	NA|94aa|up_3|NZ_HF545616.1_1781488_1781770_+	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|426aa|up_2|NZ_HF545616.1_1781867_1783145_+	PRK12297, obgE, GTPase CgtA; Reviewed	NA|130aa|up_1|NZ_HF545616.1_1783174_1783564_+	COG1939, COG1939, Ribonuclease III family protein [Replication, recombination, and    repair]	NA|248aa|up_0|NZ_HF545616.1_1783587_1784331_+	PRK00110, PRK00110, YebC/PmpR family DNA-binding transcriptional regulator	NA|176aa|down_0|NZ_HF545616.1_1784572_1785100_+	pfam13958, ToxN_toxin, Toxin ToxN, type III toxin-antitoxin system	NA|437aa|down_1|NZ_HF545616.1_1785353_1786664_+	pfam14903, WG_beta_rep, WG containing repeat	NA|479aa|down_2|NZ_HF545616.1_1786884_1788321_-	COG1316, LytR, Transcriptional regulator [Transcription]	NA|778aa|down_3|NZ_HF545616.1_1788520_1790854_-	cd07548, P-type_ATPase-Cd_Zn_Co_like, P-type heavy metal-transporting ATPase, similar to Bacillus subtilis CadA which appears to transport cadmium, zinc and cobalt but not copper out of the cell	csa3|122aa|down_4|NZ_HF545616.1_1790893_1791259_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|131aa|down_5|NZ_HF545616.1_1791618_1792011_-	pfam18810, PBECR2, phage-Barnase-EndoU-ColicinE5/D-RelE like nuclease2	NA|224aa|down_6|NZ_HF545616.1_1792238_1792910_+	COG1272, COG1272, Predicted membrane protein, hemolysin III homolog [General function prediction only]	NA|493aa|down_7|NZ_HF545616.1_1793097_1794576_+	PRK14508, PRK14508, 4-alpha-glucanotransferase; Provisional	NA|794aa|down_8|NZ_HF545616.1_1795266_1797648_+	pfam00343, Phosphorylase, Carbohydrate phosphorylase	NA|185aa|down_9|NZ_HF545616.1_1797864_1798419_+	cd03392, PAP2_like_2, PAP2_like_2 proteins
