assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001564455.1_ASM156445v1	NZ_CP011391	Faecalibaculum rodentium strain ALO17 chromosome, complete genome	1	137126-139197	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	RT,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1	RT,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,DEDDh,csa3,cas3,WYL,c2c9_V-U4,PrimPol,cas6e,cas5,cas7,cse2gr11,cas8e	Type III-C,Type III-B,Type III-A,Type III-D	GGTAAATATTGTTCTTATTAAGGATTAACGC,GGTAAATATTGTTCTTATTAAGGATTAACGC,GGTAAATATTGTTCTTATTAAGGATTAACGC	31,31,31	0	0	NA	NA	NA:NA:NA	29,30,30	30	TypeIII-C,TypeIII-B,TypeIII-A,TypeIII-D	RT,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,DEDDh,csa3,cas3,WYL,c2c9_V-U4,PrimPol,cas6e,cas5,cas7,cse2gr11,cas8e	NA|219aa|up_9|NZ_CP011391.1_127105_127762_+,NA|143aa|up_6|NZ_CP011391.1_130115_130544_+,NA|239aa|down_3|NZ_CP011391.1_141233_141950_-	NA|219aa|up_9|NZ_CP011391.1_127105_127762_+	NA	RT|438aa|up_8|NZ_CP011391.1_127896_129210_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|68aa|up_7|NZ_CP011391.1_129915_130119_+	COG1476, COG1476, Predicted transcriptional regulators [Transcription]	NA|143aa|up_6|NZ_CP011391.1_130115_130544_+	NA	NA|461aa|up_5|NZ_CP011391.1_130678_132061_-	cd13143, MATE_MepA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Streptococcus aureus MepA	NA|131aa|up_4|NZ_CP011391.1_132342_132735_-	COG0295, Cdd, Cytidine deaminase [Nucleotide transport and metabolism]	NA|263aa|up_3|NZ_CP011391.1_132727_133516_-	cd17767, UP_EcUdp-like, uridine phosphorylases similar to Escherichia coli Udp and related phosphorylases	NA|235aa|up_2|NZ_CP011391.1_133638_134343_+	cd09006, PNP_EcPNPI-like, purine nucleoside phosphorylases similar to Escherichia coli PNP-I (DeoD) and Trichomonas vaginalis PNP	NA|399aa|up_1|NZ_CP011391.1_134339_135536_+	PRK05362, PRK05362, phosphopentomutase; Provisional	NA|365aa|up_0|NZ_CP011391.1_135838_136933_+	TIGR01995, beta-glucosides_PTS_EIIBCA, PTS system, beta-glucoside-specific IIABC component	cas2|87aa|down_0|NZ_CP011391.1_139290_139551_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|330aa|down_1|NZ_CP011391.1_139553_140543_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas6|239aa|down_2|NZ_CP011391.1_140539_141256_-	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|239aa|down_3|NZ_CP011391.1_141233_141950_-	NA	csm5gr7|386aa|down_4|NZ_CP011391.1_141946_143104_-	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm4gr5|321aa|down_5|NZ_CP011391.1_143103_144066_-	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm3gr7|211aa|down_6|NZ_CP011391.1_144062_144695_-	TIGR02582, CRISPR_type_III-associated_RAMP_protein_Csm3, CRISPR type III-A/MTUBE-associated RAMP protein Csm3	csm2gr11|169aa|down_7|NZ_CP011391.1_144694_145201_-	pfam03750, Csm2_III-A, Csm2 Type III-A	cas10|780aa|down_8|NZ_CP011391.1_145204_147544_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csx1|413aa|down_9|NZ_CP011391.1_147556_148795_-	cd09732, Csx1_III-U, CRISPR/Cas system-associated protein Csx1
GCF_001564455.1_ASM156445v1	NZ_CP011391	Faecalibaculum rodentium strain ALO17 chromosome, complete genome	2	149152-149859	2,2	CRISPRCasFinder,CRT	no	cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1	RT,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,DEDDh,csa3,cas3,WYL,c2c9_V-U4,PrimPol,cas6e,cas5,cas7,cse2gr11,cas8e	Type III-C,Type III-B,Type III-A,Type III-D	GGTAAATATTGTTCTTATTAAGGATTAACGC,GGTAAATATTGTTCTTATTAAGGATTAACGC	31,31	0	0	NA	NA	NA:NA	10,10	10	TypeIII-C,TypeIII-B,TypeIII-A,TypeIII-D	RT,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,DEDDh,csa3,cas3,WYL,c2c9_V-U4,PrimPol,cas6e,cas5,cas7,cse2gr11,cas8e	NA|239aa|up_6|NZ_CP011391.1_141233_141950_-,NA|61aa|down_6|NZ_CP011391.1_156210_156393_-	cas2|87aa|up_9|NZ_CP011391.1_139290_139551_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|330aa|up_8|NZ_CP011391.1_139553_140543_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas6|239aa|up_7|NZ_CP011391.1_140539_141256_-	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|239aa|up_6|NZ_CP011391.1_141233_141950_-	NA	csm5gr7|386aa|up_5|NZ_CP011391.1_141946_143104_-	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm4gr5|321aa|up_4|NZ_CP011391.1_143103_144066_-	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm3gr7|211aa|up_3|NZ_CP011391.1_144062_144695_-	TIGR02582, CRISPR_type_III-associated_RAMP_protein_Csm3, CRISPR type III-A/MTUBE-associated RAMP protein Csm3	csm2gr11|169aa|up_2|NZ_CP011391.1_144694_145201_-	pfam03750, Csm2_III-A, Csm2 Type III-A	cas10|780aa|up_1|NZ_CP011391.1_145204_147544_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csx1|413aa|up_0|NZ_CP011391.1_147556_148795_-	cd09732, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	NA|201aa|down_0|NZ_CP011391.1_150306_150909_-	PRK08293, PRK08293, 3-hydroxyacyl-CoA dehydrogenase	NA|110aa|down_1|NZ_CP011391.1_151116_151446_-	cd02136, PnbA_NfnB-like, nitroreductase similar to Mycobacterium smegmatis NfnB	NA|140aa|down_2|NZ_CP011391.1_151611_152031_-	cd09834, CBS_pair_bac, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains present in bacteria	NA|100aa|down_3|NZ_CP011391.1_152297_152597_+	pfam01527, HTH_Tnp_1, Transposase	NA|296aa|down_4|NZ_CP011391.1_152593_153481_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|588aa|down_5|NZ_CP011391.1_153896_155660_+	pfam13173, AAA_14, AAA domain	NA|61aa|down_6|NZ_CP011391.1_156210_156393_-	NA	NA|324aa|down_7|NZ_CP011391.1_157176_158148_+	PRK00856, pyrB, aspartate carbamoyltransferase catalytic subunit	NA|433aa|down_8|NZ_CP011391.1_158144_159443_+	PRK09357, pyrC, dihydroorotase; Validated	NA|361aa|down_9|NZ_CP011391.1_159443_160526_+	PRK12564, PRK12564, carbamoyl-phosphate synthase small subunit
GCF_001564455.1_ASM156445v1	NZ_CP011391	Faecalibaculum rodentium strain ALO17 chromosome, complete genome	3	1831513-1831606	3	CRISPRCasFinder	no	RT	RT,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,DEDDh,csa3,cas3,WYL,c2c9_V-U4,PrimPol,cas6e,cas5,cas7,cse2gr11,cas8e	Unclear	CAAACATTGGTGGTGACGCCCGTA	24	0	0	NA	NA	NA	1	1	Orphan	RT,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,DEDDh,csa3,cas3,WYL,c2c9_V-U4,PrimPol,cas6e,cas5,cas7,cse2gr11,cas8e	NA|231aa|up_2|NZ_CP011391.1_1827502_1828195_-,NA|373aa|down_0|NZ_CP011391.1_1831648_1832767_+,NA|91aa|down_4|NZ_CP011391.1_1837424_1837697_+	NA|320aa|up_9|NZ_CP011391.1_1820025_1820985_-	COG1481, COG1481, Uncharacterized protein conserved in bacteria [Function unknown]	NA|317aa|up_8|NZ_CP011391.1_1820981_1821932_-	cd07187, YvcK_like, family of mostly uncharacterized proteins similar to B	NA|288aa|up_7|NZ_CP011391.1_1821928_1822792_-	PRK05416, PRK05416, RNase adapter RapZ	NA|225aa|up_6|NZ_CP011391.1_1823054_1823729_-	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]	NA|438aa|up_5|NZ_CP011391.1_1823771_1825085_-	TIGR00933, Trk_system_potassium_uptake_protein_trkH	NA|170aa|up_4|NZ_CP011391.1_1825189_1825699_+	pfam18143, HAD_SAK_2, HAD domain in Swiss Army Knife RNA repair proteins	NA|559aa|up_3|NZ_CP011391.1_1825735_1827412_+	pfam13200, DUF4015, Putative glycosyl hydrolase domain	NA|231aa|up_2|NZ_CP011391.1_1827502_1828195_-	NA	NA|334aa|up_1|NZ_CP011391.1_1828244_1829246_-	pfam12760, Zn_Tnp_IS1595, Transposase zinc-ribbon domain	RT|450aa|up_0|NZ_CP011391.1_1829361_1830711_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|373aa|down_0|NZ_CP011391.1_1831648_1832767_+	NA	NA|258aa|down_1|NZ_CP011391.1_1832929_1833703_-	COG1011, COG1011, Predicted hydrolase (HAD superfamily) [General function prediction only]	NA|526aa|down_2|NZ_CP011391.1_1833867_1835445_-	COG1283, NptA, Na+/phosphate symporter [Inorganic ion transport and metabolism]	NA|529aa|down_3|NZ_CP011391.1_1835724_1837311_-	cd16010, iPGM, 2 3 bisphosphoglycerate independent phosphoglycerate mutase iPGM	NA|91aa|down_4|NZ_CP011391.1_1837424_1837697_+	NA	NA|247aa|down_5|NZ_CP011391.1_1837957_1838698_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|240aa|down_6|NZ_CP011391.1_1838707_1839427_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|279aa|down_7|NZ_CP011391.1_1839430_1840267_-	cd13627, PBP2_AA_binding_like_2, Substrate-binding domain of putative amino acid-binding protein; the type 2 periplasmic-binding protein fold	NA|222aa|down_8|NZ_CP011391.1_1840592_1841258_-	cd05636, LbH_G1P_TT_C_like, Putative glucose-1-phosphate thymidylyltransferase, C-terminal Left-handed parallel beta-Helix (LbH) domain: Proteins in this family show simlarity to glucose-1-phosphate adenylyltransferases in that they contain N-terminal catalytic domains that resemble a dinucleotide-binding Rossmann fold and C-terminal LbH fold domains	NA|128aa|down_9|NZ_CP011391.1_1841257_1841641_-	pfam10825, DUF2752, Protein of unknown function (DUF2752)
GCF_001564455.1_ASM156445v1	NZ_CP011391	Faecalibaculum rodentium strain ALO17 chromosome, complete genome	4	2001214-2001374	4	CRISPRCasFinder	no	RT	RT,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,DEDDh,csa3,cas3,WYL,c2c9_V-U4,PrimPol,cas6e,cas5,cas7,cse2gr11,cas8e	Unclear	ACCCCAAAACGGCGGGCAGGATGCCAA	27	0	0	NA	NA	NA	2	2	Orphan	RT,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,DEDDh,csa3,cas3,WYL,c2c9_V-U4,PrimPol,cas6e,cas5,cas7,cse2gr11,cas8e	NA,NA	NA|207aa|up_9|NZ_CP011391.1_1984921_1985542_-	cd03357, LbH_MAT_GAT, Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates	NA|883aa|up_8|NZ_CP011391.1_1985777_1988426_-	cd18622, GH32_Inu-like, glycoside hydrolase family 32 protein such as Aspergillus ficuum endo-inulinase (Inu2)	NA|882aa|up_7|NZ_CP011391.1_1988909_1991555_-	cd18622, GH32_Inu-like, glycoside hydrolase family 32 protein such as Aspergillus ficuum endo-inulinase (Inu2)	NA|200aa|up_6|NZ_CP011391.1_1992077_1992677_-	cd03357, LbH_MAT_GAT, Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates	NA|347aa|up_5|NZ_CP011391.1_1992979_1994020_-	PRK09614, nrdF, ribonucleotide-diphosphate reductase subunit beta; Reviewed	NA|841aa|up_4|NZ_CP011391.1_1994057_1996580_-	PRK12364, PRK12364, ribonucleoside-diphosphate reductase subunit alpha	NA|431aa|up_3|NZ_CP011391.1_1997111_1998404_-	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|251aa|up_2|NZ_CP011391.1_1998715_1999468_-	PRK01130, PRK01130, putative N-acetylmannosamine-6-phosphate 2-epimerase	NA|247aa|up_1|NZ_CP011391.1_1999464_2000205_-	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]	NA|304aa|up_0|NZ_CP011391.1_2000206_2001118_-	TIGR02494, PFLE_PFLC, glycyl-radical enzyme activating protein	NA|755aa|down_0|NZ_CP011391.1_2001412_2003677_-	pfam02901, PFL-like, Pyruvate formate lyase-like	NA|331aa|down_1|NZ_CP011391.1_2003868_2004861_-	COG0449, GlmS, Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains [Cell envelope biogenesis, outer membrane]	NA|277aa|down_2|NZ_CP011391.1_2005010_2005841_+	pfam08282, Hydrolase_3, haloacid dehalogenase-like hydrolase	NA|280aa|down_3|NZ_CP011391.1_2005842_2006682_+	cd07572, nit, Nit1, Nit 2, and related proteins, and the Nit1-like domain of NitFhit (class 10 nitrilases)	NA|325aa|down_4|NZ_CP011391.1_2006818_2007793_+	cd02252, nylC_like, nylC-like family; composed of proteins with similarity to Flavobacterium endo-type 6-aminohexanoate-oligomer hydrolase (EIII), the product of the nylon oligomer degradation gene, nylC	NA|134aa|down_5|NZ_CP011391.1_2007789_2008191_+	pfam07179, SseB, SseB protein N-terminal domain	NA|166aa|down_6|NZ_CP011391.1_2008456_2008954_-	PRK00854, rocD, ornithine--oxo-acid transaminase; Reviewed	NA|84aa|down_7|NZ_CP011391.1_2008999_2009251_+	PRK05326, PRK05326, potassium/proton antiporter	NA|477aa|down_8|NZ_CP011391.1_2009371_2010802_-	pfam03577, Peptidase_C69, Peptidase family C69	NA|481aa|down_9|NZ_CP011391.1_2011006_2012449_-	pfam03577, Peptidase_C69, Peptidase family C69
GCF_001564455.1_ASM156445v1	NZ_CP011391	Faecalibaculum rodentium strain ALO17 chromosome, complete genome	5	2168106-2170635	5,3,2	CRISPRCasFinder,CRT,PILER-CR	no	RT,DEDDh,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	RT,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,DEDDh,csa3,cas3,WYL,c2c9_V-U4,PrimPol,cas6e,cas5,cas7,cse2gr11,cas8e	Type I-E	GGATCACCCCCGCGGGTGCGGGGACATG,GGATCACCCCCGCGGGTGCGGGGACATG,GGATCACCCCCGCGGGTGCGGGGACATG	28,28,28	0	0	NA	NA	NA:NA:NA	41,41,29	41	TypeI-E	RT,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,csx1,DEDDh,csa3,cas3,WYL,c2c9_V-U4,PrimPol,cas6e,cas5,cas7,cse2gr11,cas8e	NA|201aa|up_2|NZ_CP011391.1_2164948_2165551_-,NA|224aa|up_1|NZ_CP011391.1_2165579_2166251_+,NA	NA|557aa|up_9|NZ_CP011391.1_2155613_2157284_-	TIGR00118, Probable_acetolactate_synthase_large_subunit, acetolactate synthase, large subunit, biosynthetic type	RT|450aa|up_8|NZ_CP011391.1_2157468_2158818_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|478aa|up_7|NZ_CP011391.1_2159432_2160866_-	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|528aa|up_6|NZ_CP011391.1_2160865_2162449_-	smart01073, CDC48_N, Cell division protein 48 (CDC48) N-terminal domain	NA|214aa|up_5|NZ_CP011391.1_2162441_2163083_-	pfam05013, FGase, N-formylglutamate amidohydrolase	NA|376aa|up_4|NZ_CP011391.1_2163063_2164191_-	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|224aa|up_3|NZ_CP011391.1_2164231_2164903_+	PRK05254, PRK05254, uracil-DNA glycosylase; Provisional	NA|201aa|up_2|NZ_CP011391.1_2164948_2165551_-	NA	NA|224aa|up_1|NZ_CP011391.1_2165579_2166251_+	NA	NA|474aa|up_0|NZ_CP011391.1_2166368_2167790_-	pfam01548, DEDD_Tnp_IS110, Transposase	DEDDh|301aa|down_0|NZ_CP011391.1_2170649_2171552_-	cd06127, DEDDh, DEDDh 3'-5' exonuclease domain family	cas1|323aa|down_1|NZ_CP011391.1_2171551_2172520_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|239aa|down_2|NZ_CP011391.1_2172512_2173229_-	pfam08798, CRISPR_assoc, CRISPR associated protein	cas5|237aa|down_3|NZ_CP011391.1_2173134_2173845_-	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|358aa|down_4|NZ_CP011391.1_2173844_2174918_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|201aa|down_5|NZ_CP011391.1_2174920_2175523_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas8e|550aa|down_6|NZ_CP011391.1_2175519_2177169_-	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cas3|921aa|down_7|NZ_CP011391.1_2177460_2180223_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|121aa|down_8|NZ_CP011391.1_2180767_2181130_-	cd17792, CtkA, serine/threonine-protein kinase CtkA and similar proteins	NA|588aa|down_9|NZ_CP011391.1_2181580_2183344_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]
