assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002586945.1_ASM258694v1	NZ_CP023819	Faecalibacterium prausnitzii strain Indica chromosome, complete genome	1	393720-393811	1	CRISPRCasFinder	no		DEDDh,PD-DExK,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csa3,RT,DinG	Orphan	CCGGCGAGAACATCCCGGTGGAC	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,PD-DExK,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csa3,RT,DinG	NA|221aa|up_7|NZ_CP023819.1_388239_388902_-,NA|94aa|down_4|NZ_CP023819.1_400258_400540_-	NA|278aa|up_9|NZ_CP023819.1_386672_387506_-	COG1187, RsuA, 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases [Translation, ribosomal structure and biogenesis]	NA|133aa|up_8|NZ_CP023819.1_387725_388124_-	TIGR02874, conserved_hypothetical_protein, sporulation protein YtfJ	NA|221aa|up_7|NZ_CP023819.1_388239_388902_-	NA	NA|177aa|up_6|NZ_CP023819.1_388907_389438_-	pfam04079, SMC_ScpB, Segregation and condensation complex subunit ScpB	NA|242aa|up_5|NZ_CP023819.1_389434_390160_-	COG1354, scpA, Rec8/ScpA/Scc1-like protein (kleisin family) [Replication,    recombination, and repair]	NA|225aa|up_4|NZ_CP023819.1_390160_390835_-	cd06158, S2P-M50_like_1, Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|63aa|up_3|NZ_CP023819.1_391022_391211_+	PRK00359, rpmB, 50S ribosomal protein L28; Reviewed	NA|59aa|up_2|NZ_CP023819.1_391278_391455_+	pfam14056, DUF4250, Domain of unknown function (DUF4250)	NA|251aa|up_1|NZ_CP023819.1_391473_392226_+	COG4947, COG4947, Uncharacterized protein conserved in bacteria [Function unknown]	NA|144aa|up_0|NZ_CP023819.1_392452_392884_+	cd10156, FpFrmR-Cterm-like_DUF156, C-terminal domain of Faecalibacterium prausnitzii A2-165 FrmR , and related domains; this domain family was previously known as part of DUF156	NA|401aa|down_0|NZ_CP023819.1_395574_396777_-	cd03817, GT4_UGDG-like, UDP-Glc:1,2-diacylglycerol 3-a-glucosyltransferase and similar proteins	NA|476aa|down_1|NZ_CP023819.1_396878_398306_-	COG1982, LdcC, Arginine/lysine/ornithine decarboxylases [Amino acid transport and metabolism]	NA|488aa|down_2|NZ_CP023819.1_398306_399770_-	TIGR03974, six-Cys-in-45_modification_radical_SAM_protein, SCIFF radical SAM maturase	NA|49aa|down_3|NZ_CP023819.1_399826_399973_-	TIGR03973, conserved_hypothetical_protein, six-cysteine peptide SCIFF	NA|94aa|down_4|NZ_CP023819.1_400258_400540_-	NA	NA|785aa|down_5|NZ_CP023819.1_400951_403306_-	COG1472, BglX, Beta-glucosidase-related glycosidases [Carbohydrate transport and metabolism]	NA|591aa|down_6|NZ_CP023819.1_403578_405351_-	COG0737, UshA, 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases [Nucleotide transport and metabolism]	NA|591aa|down_7|NZ_CP023819.1_405678_407451_-	PRK13977, PRK13977, myosin-cross-reactive antigen; Provisional	NA|706aa|down_8|NZ_CP023819.1_407840_409958_-	TIGR02063, Ribonuclease_R, ribonuclease R	NA|83aa|down_9|NZ_CP023819.1_410073_410322_-	pfam03840, SecG, Preprotein translocase SecG subunit
GCF_002586945.1_ASM258694v1	NZ_CP023819	Faecalibacterium prausnitzii strain Indica chromosome, complete genome	2	1333854-1335407	2,1,1	CRISPRCasFinder,CRT,PILER-CR	no	DEDDh,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	DEDDh,PD-DExK,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csa3,RT,DinG	Type I-E	GGATCACCCCCGCGTGGGCGGGGAAAAG,GGATCACCCCCGCGTNGGCGGGGAAAAG,GGATCACCCCCGCGTAGGCGGGGAAAAG	28,28,28	0	0	NA	NA	I-C,I-E,II-B:I-C,I-E,II-B:I-C,I-E,II-B	25,25,10	25	TypeI-E	DEDDh,PD-DExK,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csa3,RT,DinG	NA|83aa|up_4|NZ_CP023819.1_1328477_1328726_-,NA|75aa|down_6|NZ_CP023819.1_1343341_1343566_-,NA|173aa|down_7|NZ_CP023819.1_1343576_1344095_-,NA|319aa|down_9|NZ_CP023819.1_1346526_1347483_-	NA|172aa|up_9|NZ_CP023819.1_1323021_1323537_-	pfam09512, ThiW, Thiamine-precursor transporter protein (ThiW)	NA|430aa|up_8|NZ_CP023819.1_1324016_1325306_+	cd03408, SPFH_like_u1, Uncharacterized family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|367aa|up_7|NZ_CP023819.1_1325320_1326421_+	pfam03966, Trm112p, Trm112p-like protein	NA|265aa|up_6|NZ_CP023819.1_1326433_1327228_+	pfam04536, TPM_phosphatase, TPM domain	NA|347aa|up_5|NZ_CP023819.1_1327323_1328364_+	COG2855, COG2855, Predicted membrane protein [Function unknown]	NA|83aa|up_4|NZ_CP023819.1_1328477_1328726_-	NA	NA|448aa|up_3|NZ_CP023819.1_1329068_1330412_-	cd13138, MATE_yoeA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Bacillus subtilis yoeA	NA|449aa|up_2|NZ_CP023819.1_1330600_1331947_-	pfam00375, SDF, Sodium:dicarboxylate symporter family	NA|176aa|up_1|NZ_CP023819.1_1332118_1332646_-	pfam00358, PTS_EIIA_1, phosphoenolpyruvate-dependent sugar phosphotransferase system, EIIA 1	NA|341aa|up_0|NZ_CP023819.1_1332760_1333783_-	COG2008, GLY1, Threonine aldolase [Amino acid transport and metabolism]	cas6e|214aa|down_0|NZ_CP023819.1_1335416_1336058_-	pfam08798, CRISPR_assoc, CRISPR associated protein	cas5|238aa|down_1|NZ_CP023819.1_1336064_1336778_-	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas7|358aa|down_2|NZ_CP023819.1_1336780_1337854_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|208aa|down_3|NZ_CP023819.1_1337855_1338479_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas8e|542aa|down_4|NZ_CP023819.1_1338475_1340101_-	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cas3|922aa|down_5|NZ_CP023819.1_1340087_1342853_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|75aa|down_6|NZ_CP023819.1_1343341_1343566_-	NA	NA|173aa|down_7|NZ_CP023819.1_1343576_1344095_-	NA	NA|540aa|down_8|NZ_CP023819.1_1344601_1346221_-	COG4690, PepD, Dipeptidase [Amino acid transport and metabolism]	NA|319aa|down_9|NZ_CP023819.1_1346526_1347483_-	NA
GCF_002586945.1_ASM258694v1	NZ_CP023819	Faecalibacterium prausnitzii strain Indica chromosome, complete genome	3	1631554-1631667	3	CRISPRCasFinder	no		DEDDh,PD-DExK,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csa3,RT,DinG	Orphan	TGCGGCGCTTGCGAGGGCGCTTGCCCC	27	0	0	NA	NA	NA	1	1	Orphan	DEDDh,PD-DExK,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csa3,RT,DinG	NA|178aa|up_3|NZ_CP023819.1_1624075_1624609_-,NA	NA|432aa|up_9|NZ_CP023819.1_1615872_1617168_+	PRK01490, tig, trigger factor; Provisional	NA|260aa|up_8|NZ_CP023819.1_1617529_1618309_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|145aa|up_7|NZ_CP023819.1_1618310_1618745_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|781aa|up_6|NZ_CP023819.1_1618934_1621277_-	COG0370, FeoB, Fe2+ transport system protein B [Inorganic ion transport and metabolism]	NA|342aa|up_5|NZ_CP023819.1_1621493_1622519_-	PLN02240, PLN02240, UDP-glucose 4-epimerase	NA|358aa|up_4|NZ_CP023819.1_1622778_1623852_-	cd09019, galactose_mutarotase_like, galactose mutarotase_like	NA|178aa|up_3|NZ_CP023819.1_1624075_1624609_-	NA	NA|275aa|up_2|NZ_CP023819.1_1624831_1625656_-	TIGR01784, Uncharacterized_protein_pSLT051, conserved hypothetical protein (putative transposase or invertase)	NA|1309aa|up_1|NZ_CP023819.1_1625777_1629704_-	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|313aa|up_0|NZ_CP023819.1_1630291_1631230_+	COG1774, COG1774, Uncharacterized homolog of PSP1 [Function unknown]	NA|225aa|down_0|NZ_CP023819.1_1631824_1632499_+	COG4123, COG4123, Predicted O-methyltransferase [General function prediction only]	NA|280aa|down_1|NZ_CP023819.1_1632636_1633476_+	COG0313, COG0313, Predicted methyltransferases [General function prediction only]	NA|163aa|down_2|NZ_CP023819.1_1633692_1634181_+	COG0622, COG0622, Predicted phosphoesterase [General function prediction only]	NA|256aa|down_3|NZ_CP023819.1_1634177_1634945_+	COG0340, BirA, Biotin-(acetyl-CoA carboxylase) ligase [Coenzyme metabolism]	NA|179aa|down_4|NZ_CP023819.1_1635216_1635753_-	cd04645, LbH_gamma_CA_like, Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E	NA|430aa|down_5|NZ_CP023819.1_1635858_1637148_+	COG1686, DacC, D-alanyl-D-alanine carboxypeptidase [Cell envelope biogenesis, outer membrane]	NA|322aa|down_6|NZ_CP023819.1_1637313_1638279_-	TIGR00678, DNA_polymerase_III_subunit_delta', DNA polymerase III, delta' subunit	NA|213aa|down_7|NZ_CP023819.1_1638275_1638914_-	pfam01694, Rhomboid, Rhomboid family	NA|524aa|down_8|NZ_CP023819.1_1639510_1641082_-	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|1182aa|down_9|NZ_CP023819.1_1642017_1645563_-	TIGR02176, pyruvate_flavodoxin/ferrodoxin_oxidoreductase, pyruvate:ferredoxin (flavodoxin) oxidoreductase, homodimeric
GCF_002586945.1_ASM258694v1	NZ_CP023819	Faecalibacterium prausnitzii strain Indica chromosome, complete genome	4	2820562-2820666	4	CRISPRCasFinder	no		DEDDh,PD-DExK,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csa3,RT,DinG	Orphan	CAAGTGCATCGGCTGCGGTGCCTG	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,PD-DExK,cas3,cas6e,cas5,cas7,cse2gr11,cas8e,WYL,csa3,RT,DinG	NA|246aa|up_9|NZ_CP023819.1_2808271_2809009_-,NA|81aa|up_6|NZ_CP023819.1_2812358_2812601_+,NA	NA|246aa|up_9|NZ_CP023819.1_2808271_2809009_-	NA	NA|585aa|up_8|NZ_CP023819.1_2809217_2810972_-	cd01454, vWA_norD_type, norD type: Denitrifying bacteria contain both membrane bound and periplasmic nitrate reductases	NA|307aa|up_7|NZ_CP023819.1_2810971_2811892_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|81aa|up_6|NZ_CP023819.1_2812358_2812601_+	NA	NA|185aa|up_5|NZ_CP023819.1_2812828_2813383_+	PRK13661, PRK13661, ECF-type riboflavin transporter substrate-binding protein	NA|574aa|up_4|NZ_CP023819.1_2813468_2815190_+	cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding cassette component of cobalt transport system	NA|277aa|up_3|NZ_CP023819.1_2815179_2816010_+	COG0619, CbiQ, ABC-type cobalt transport system, permease component CbiQ and related transporters [Inorganic ion transport and metabolism]	NA|664aa|up_2|NZ_CP023819.1_2816113_2818105_-	cd08579, GDPD_memb_like, Glycerophosphodiester phosphodiesterase domain of uncharacterized bacterial glycerophosphodiester phosphodiesterases	NA|246aa|up_1|NZ_CP023819.1_2818247_2818985_+	pfam02634, FdhD-NarQ, FdhD/NarQ family	NA|304aa|up_0|NZ_CP023819.1_2819175_2820087_+	pfam04205, FMN_bind, FMN-binding domain	NA|719aa|down_0|NZ_CP023819.1_2820766_2822923_+	PRK09849, PRK09849, putative oxidoreductase; Provisional	NA|306aa|down_1|NZ_CP023819.1_2822949_2823867_+	PRK09947, PRK09947, YdhW family putative oxidoreductase system protein	NA|413aa|down_2|NZ_CP023819.1_2824295_2825534_+	cd00887, MoeA, MoeA family	NA|173aa|down_3|NZ_CP023819.1_2825523_2826042_+	cd03116, MobB, molybdopterin-guanine dinucleotide biosynthesis protein B	NA|340aa|down_4|NZ_CP023819.1_2826060_2827080_+	cd03522, MoeA_like, MoeA_like	NA|166aa|down_5|NZ_CP023819.1_2827151_2827649_+	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|327aa|down_6|NZ_CP023819.1_2827641_2828622_+	TIGR02666, Cyclic_pyranopterin_monophosphate_synthase, molybdenum cofactor biosynthesis protein A, bacterial	NA|311aa|down_7|NZ_CP023819.1_2828626_2829559_+	cd00886, MogA_MoaB, MogA_MoaB family	NA|296aa|down_8|NZ_CP023819.1_2829647_2830535_+	cd13537, PBP2_YvgL_like, Substrate binding domain of putative molybdate-binding protein YvgL and similar proteins;the type 2 periplasmic binding protein fold	NA|225aa|down_9|NZ_CP023819.1_2830555_2831230_+	COG4149, ModC, ABC-type molybdate transport system, permease component [Inorganic ion transport and metabolism]
