assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001682215.1_ASM168221v1	NZ_CP012706	Bacteroides fragilis strain S14 chromosome, complete genome	1	234152-234297	1	PILER-CR	no		RT,DEDDh,cas6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas1,cas2,PrimPol,WYL,cas3,cas9	Orphan	GAAATTCCCAATATATTGTGAATTTGA	27	0	0	NA	NA	NA	2	2	Orphan	RT,DEDDh,cas6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas1,cas2,PrimPol,WYL,cas3,cas9	NA,NA|77aa|down_2|NZ_CP012706.1_236102_236333_-,NA|64aa|down_3|NZ_CP012706.1_236346_236538_+	NA|363aa|up_9|NZ_CP012706.1_225365_226454_+	pfam07610, DUF1573, Protein of unknown function (DUF1573)	NA|364aa|up_8|NZ_CP012706.1_226462_227554_+	PRK09435, PRK09435, methylmalonyl Co-A mutase-associated GTPase MeaB	NA|303aa|up_7|NZ_CP012706.1_227585_228494_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|276aa|up_6|NZ_CP012706.1_228587_229415_+	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|339aa|up_5|NZ_CP012706.1_229436_230453_+	pfam14491, DUF4435, Protein of unknown function (DUF4435)	NA|416aa|up_4|NZ_CP012706.1_230424_231672_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|298aa|up_3|NZ_CP012706.1_231713_232607_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|280aa|up_2|NZ_CP012706.1_232609_233449_-	pfam07804, HipA_C, HipA-like C-terminal domain	NA|110aa|up_1|NZ_CP012706.1_233612_233942_-	TIGR03071, couple_hipA, HipA N-terminal domain	NA|71aa|up_0|NZ_CP012706.1_233938_234151_-	TIGR03070, couple_hipB, transcriptional regulator, y4mF family	NA|291aa|down_0|NZ_CP012706.1_234640_235513_-	pfam14297, DUF4373, Domain of unknown function (DUF4373)	NA|116aa|down_1|NZ_CP012706.1_235655_236003_-	pfam10902, WYL_2, WYL_2, Sm-like SH3 beta-barrel fold	NA|77aa|down_2|NZ_CP012706.1_236102_236333_-	NA	NA|64aa|down_3|NZ_CP012706.1_236346_236538_+	NA	NA|179aa|down_4|NZ_CP012706.1_237049_237586_+	cd09895, NGN_SP_UpxY, N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP), UpxY	NA|163aa|down_5|NZ_CP012706.1_237605_238094_+	pfam06603, UpxZ, UpxZ family of transcription anti-terminator antagonists	NA|297aa|down_6|NZ_CP012706.1_238258_239149_+	TIGR01207, Glucose-1-phosphate_thymidylyltransferase_1, glucose-1-phosphate thymidylyltransferase, short form	NA|446aa|down_7|NZ_CP012706.1_239839_241177_+	PRK15057, PRK15057, UDP-glucose 6-dehydrogenase; Provisional	NA|353aa|down_8|NZ_CP012706.1_241181_242240_+	cd05253, UDP_GE_SDE_e, UDP glucuronic acid epimerase, extended (e) SDRs	NA|175aa|down_9|NZ_CP012706.1_242704_243229_+	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]
GCF_001682215.1_ASM168221v1	NZ_CP012706	Bacteroides fragilis strain S14 chromosome, complete genome	2	1984987-1986628	2,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas1,cas2	RT,DEDDh,cas6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas1,cas2,PrimPol,WYL,cas3,cas9	Type III-B,Type III-C,Type III-A,Type III-D	ATGTAGATGTATTCCAGTATAATAAGGATTAAGAC,ATGTAGATGTATTCCAGTATAATAAGGATTAAGAC,ATGTAGATGTATTCCAGTATAATAAGGATTAAGAC	35,35,35	0	0	NA	NA	NA:NA:NA	22,22,23	23	TypeIII-B,TypeIII-C,TypeIII-A,TypeIII-D	RT,DEDDh,cas6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas1,cas2,PrimPol,WYL,cas3,cas9	NA|160aa|up_8|NZ_CP012706.1_1974894_1975374_+,cmr1gr7|470aa|up_7|NZ_CP012706.1_1975387_1976797_+,cmr5gr11|137aa|up_3|NZ_CP012706.1_1980607_1981018_+,NA|52aa|down_0|NZ_CP012706.1_1987312_1987468_+	NA|492aa|up_9|NZ_CP012706.1_1973422_1974898_+	cd12822, TmCorA-like, Thermotoga maritima CorA-like family	NA|160aa|up_8|NZ_CP012706.1_1974894_1975374_+	NA	cmr1gr7|470aa|up_7|NZ_CP012706.1_1975387_1976797_+	NA	cas10|601aa|up_6|NZ_CP012706.1_1976801_1978604_+	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr3gr5|380aa|up_5|NZ_CP012706.1_1978596_1979736_+	TIGR01888, Hypothetical_protein_SSO1730, CRISPR type III-B/RAMP module-associated protein Cmr3	cmr4gr7|280aa|up_4|NZ_CP012706.1_1979755_1980595_+	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	cmr5gr11|137aa|up_3|NZ_CP012706.1_1980607_1981018_+	NA	cmr6gr7|315aa|up_2|NZ_CP012706.1_1981020_1981965_+	TIGR01898, repair_system, CRISPR type III-B/RAMP module RAMP protein Cmr6	cas1|757aa|up_1|NZ_CP012706.1_1981984_1984255_+	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	cas2|97aa|up_0|NZ_CP012706.1_1984248_1984539_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|52aa|down_0|NZ_CP012706.1_1987312_1987468_+	NA	NA|395aa|down_1|NZ_CP012706.1_1987523_1988708_-	cd06454, KBL_like, KBL_like; this family belongs to the pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I)	NA|348aa|down_2|NZ_CP012706.1_1988942_1989986_+	COG1597, LCB5, Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only]	NA|586aa|down_3|NZ_CP012706.1_1990067_1991825_+	PRK00476, aspS, aspartyl-tRNA synthetase; Validated	NA|128aa|down_4|NZ_CP012706.1_1991825_1992209_+	pfam04138, GtrA, GtrA-like protein	NA|295aa|down_5|NZ_CP012706.1_1992288_1993173_-	cd07573, CPA, N-carbamoylputrescine amidohydrolase (CPA) (class 11 nitrilases)	NA|372aa|down_6|NZ_CP012706.1_1993184_1994300_-	pfam04371, PAD_porph, Porphyromonas-type peptidyl-arginine deiminase	NA|176aa|down_7|NZ_CP012706.1_1994380_1994908_-	COG4739, COG4739, Uncharacterized protein containing a ferredoxin domain [Function unknown]	NA|207aa|down_8|NZ_CP012706.1_1995002_1995623_+	cd03257, ABC_NikE_OppD_transporters, ATP-binding cassette domain of nickel/oligopeptides specific transporters	NA|265aa|down_9|NZ_CP012706.1_1995626_1996421_+	pfam03649, UPF0014, Uncharacterized protein family (UPF0014)
GCF_001682215.1_ASM168221v1	NZ_CP012706	Bacteroides fragilis strain S14 chromosome, complete genome	3	2133309-2133851	2,2,3	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas6	RT,DEDDh,cas6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas1,cas2,PrimPol,WYL,cas3,cas9	Unclear	ATTTCAATTCCATAAGGTACAATTAATAC,ATTTCAATTCCATAAGGTACAATTAATAC,ATTTCAATTCCATAAGGTACAATTAATAC	29,29,29	0	0	NA	NA	NA:NA:NA	8,8,7	8	Unclear	RT,DEDDh,cas6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas1,cas2,PrimPol,WYL,cas3,cas9	NA|91aa|up_7|NZ_CP012706.1_2123171_2123444_-,NA|524aa|up_6|NZ_CP012706.1_2123650_2125222_-,NA|135aa|up_1|NZ_CP012706.1_2132266_2132671_-,NA|145aa|up_0|NZ_CP012706.1_2132693_2133128_-,NA	NA|351aa|up_9|NZ_CP012706.1_2121176_2122229_-	pfam12987, DUF3871, Domain of unknown function, B	NA|297aa|up_8|NZ_CP012706.1_2122191_2123082_-	pfam13479, AAA_24, AAA domain	NA|91aa|up_7|NZ_CP012706.1_2123171_2123444_-	NA	NA|524aa|up_6|NZ_CP012706.1_2123650_2125222_-	NA	NA|403aa|up_5|NZ_CP012706.1_2125383_2126592_-	cd01185, INTN1_C_like, Integrase IntN1 of Bacteroides mobilizable transposon NBU1 and similar proteins, C-terminal catalytic domain	NA|907aa|up_4|NZ_CP012706.1_2126770_2129491_-	PRK09279, PRK09279, pyruvate phosphate dikinase; Provisional	NA|473aa|up_3|NZ_CP012706.1_2129757_2131176_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|308aa|up_2|NZ_CP012706.1_2131180_2132104_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|135aa|up_1|NZ_CP012706.1_2132266_2132671_-	NA	NA|145aa|up_0|NZ_CP012706.1_2132693_2133128_-	NA	cas2|88aa|down_0|NZ_CP012706.1_2134063_2134327_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas6|224aa|down_1|NZ_CP012706.1_2134953_2135625_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	NA|285aa|down_2|NZ_CP012706.1_2136445_2137300_+	cd01086, MetAP1, Methionine Aminopeptidase 1	NA|409aa|down_3|NZ_CP012706.1_2137300_2138527_+	COG1322, COG1322, Predicted nuclease of restriction endonuclease-like fold, RmuC family [General function prediction only]	NA|249aa|down_4|NZ_CP012706.1_2138554_2139301_+	pfam02596, DUF169, Uncharacterized ArCR, COG2043	NA|438aa|down_5|NZ_CP012706.1_2139500_2140814_-	pfam06965, Na_H_antiport_1, Na+/H+ antiporter 1	NA|393aa|down_6|NZ_CP012706.1_2140858_2142037_-	pfam00999, Na_H_Exchanger, Sodium/hydrogen exchanger family	NA|594aa|down_7|NZ_CP012706.1_2142182_2143964_-	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|67aa|down_8|NZ_CP012706.1_2144089_2144290_-	pfam10771, DUF2582, Winged helix-turn-helix domain (DUF2582)	NA|155aa|down_9|NZ_CP012706.1_2144436_2144901_-	pfam09719, C_GCAxxG_C_C, Putative redox-active protein (C_GCAxxG_C_C)
GCF_001682215.1_ASM168221v1	NZ_CP012706	Bacteroides fragilis strain S14 chromosome, complete genome	4	2381913-2382006	3	CRISPRCasFinder	no	PrimPol	RT,DEDDh,cas6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas1,cas2,PrimPol,WYL,cas3,cas9	Unclear	CACAGATGGAGAATTGAATTTTCCCCGGAATC	32	0	0	NA	NA	NA	1	1	Orphan	RT,DEDDh,cas6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas1,cas2,PrimPol,WYL,cas3,cas9	NA,NA|66aa|down_1|NZ_CP012706.1_2384575_2384773_-,NA|390aa|down_7|NZ_CP012706.1_2389805_2390975_+,NA|343aa|down_8|NZ_CP012706.1_2391082_2392111_+	NA|167aa|up_9|NZ_CP012706.1_2371941_2372442_-	PRK00522, tpx, thiol peroxidase	NA|194aa|up_8|NZ_CP012706.1_2372535_2373117_+	COG3247, HdeD, Uncharacterized conserved protein [Function unknown]	NA|198aa|up_7|NZ_CP012706.1_2373255_2373849_-	PRK13413, mpi, master DNA invertase Mpi family serine-type recombinase	NA|380aa|up_6|NZ_CP012706.1_2373864_2375004_+	cd01185, INTN1_C_like, Integrase IntN1 of Bacteroides mobilizable transposon NBU1 and similar proteins, C-terminal catalytic domain	NA|468aa|up_5|NZ_CP012706.1_2375429_2376833_+	TIGR03023, Sugar_transferase	NA|263aa|up_4|NZ_CP012706.1_2376989_2377778_+	COG1596, Wza, Periplasmic protein involved in polysaccharide export, contains    SLBB domain of b-grasp fold [Cell wall/membrane/envelope biogenesis]	NA|802aa|up_3|NZ_CP012706.1_2377791_2380197_+	TIGR01007, Tyrosine-protein_kinase_CpsD, capsular exopolysaccharide family	NA|158aa|up_2|NZ_CP012706.1_2380326_2380800_-	PHA00447, PHA00447, lysozyme	NA|151aa|up_1|NZ_CP012706.1_2380994_2381447_-	pfam18291, HU-HIG, HU domain fused to wHTH, Ig, or Glycine-rich motif	NA|83aa|up_0|NZ_CP012706.1_2381638_2381887_+	pfam14053, DUF4248, Domain of unknown function (DUF4248)	PrimPol|769aa|down_0|NZ_CP012706.1_2382203_2384510_-	pfam13148, DUF3987, Protein of unknown function (DUF3987)	NA|66aa|down_1|NZ_CP012706.1_2384575_2384773_-	NA	NA|173aa|down_2|NZ_CP012706.1_2385239_2385758_+	cd09895, NGN_SP_UpxY, N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP), UpxY	NA|331aa|down_3|NZ_CP012706.1_2385813_2386806_+	COG3594, NolL, Fucose 4-O-acetylase and related acetyltransferases [Carbohydrate transport and metabolism]	NA|340aa|down_4|NZ_CP012706.1_2386810_2387830_+	pfam01757, Acyl_transf_3, Acyltransferase family	NA|310aa|down_5|NZ_CP012706.1_2387868_2388798_+	cd04194, GT8_A4GalT_like, A4GalT_like proteins catalyze the addition of galactose or glucose residues to the lipooligosaccharide (LOS) or lipopolysaccharide (LPS) of the bacterial cell surface	NA|311aa|down_6|NZ_CP012706.1_2388822_2389755_+	pfam04230, PS_pyruv_trans, Polysaccharide pyruvyl transferase	NA|390aa|down_7|NZ_CP012706.1_2389805_2390975_+	NA	NA|343aa|down_8|NZ_CP012706.1_2391082_2392111_+	NA	NA|511aa|down_9|NZ_CP012706.1_2392118_2393651_+	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins
GCF_001682215.1_ASM168221v1	NZ_CP012706	Bacteroides fragilis strain S14 chromosome, complete genome	5	3758696-3760579	4,4	CRISPRCasFinder,PILER-CR	no	cas2,cas1,cas9	RT,DEDDh,cas6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas1,cas2,PrimPol,WYL,cas3,cas9	Type II-B, Type II-B,Type II-C, or Type II-C?,Type II-A	GTTGTGATTTGCTTTCAAATTAGTATCTTTGAACCATTGGAAACAGC,GTTGTGATTTGCTTTCAAATTAGTATCTTTGAACCATTGGAAACAGC	47,47	0	0	NA	NA	NA:NA	24,21	24	TypeII-B,TypeII-B,TypeII-C,orTypeII-C?,TypeII-A	RT,DEDDh,cas6,cmr1gr7,cas10,cmr3gr5,cmr4gr7,cmr5gr11,cmr6gr7,cas1,cas2,PrimPol,WYL,cas3,cas9	NA|272aa|up_9|NZ_CP012706.1_3743791_3744607_-,NA|148aa|up_1|NZ_CP012706.1_3751265_3751709_-,NA|76aa|up_0|NZ_CP012706.1_3757574_3757802_-,NA|71aa|down_4|NZ_CP012706.1_3767699_3767912_+,NA|87aa|down_5|NZ_CP012706.1_3767923_3768184_-,NA|98aa|down_9|NZ_CP012706.1_3772104_3772398_-	NA|272aa|up_9|NZ_CP012706.1_3743791_3744607_-	NA	NA|287aa|up_8|NZ_CP012706.1_3744603_3745464_-	cd03230, ABC_DR_subfamily_A, ATP-binding cassette domain of the drug resistance transporter and related proteins, subfamily A	NA|478aa|up_7|NZ_CP012706.1_3745769_3747203_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|316aa|up_6|NZ_CP012706.1_3747222_3748170_-	pfam12849, PBP_like_2, PBP superfamily domain	NA|272aa|up_5|NZ_CP012706.1_3748172_3748988_-	pfam03544, TonB_C, Gram-negative bacterial TonB protein C-terminal	NA|217aa|up_4|NZ_CP012706.1_3749017_3749668_-	pfam02472, ExbD, Biopolymer transport protein ExbD/TolR	NA|202aa|up_3|NZ_CP012706.1_3749680_3750286_-	pfam02472, ExbD, Biopolymer transport protein ExbD/TolR	NA|271aa|up_2|NZ_CP012706.1_3750313_3751126_-	pfam01618, MotA_ExbB, MotA/TolQ/ExbB proton channel family	NA|148aa|up_1|NZ_CP012706.1_3751265_3751709_-	NA	NA|76aa|up_0|NZ_CP012706.1_3757574_3757802_-	NA	cas2|111aa|down_0|NZ_CP012706.1_3760687_3761020_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|311aa|down_1|NZ_CP012706.1_3761023_3761956_-	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas9|1437aa|down_2|NZ_CP012706.1_3762990_3767301_-	pfam18541, RuvC_III, RuvC endonuclease subdomain 3	NA|127aa|down_3|NZ_CP012706.1_3767378_3767759_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|71aa|down_4|NZ_CP012706.1_3767699_3767912_+	NA	NA|87aa|down_5|NZ_CP012706.1_3767923_3768184_-	NA	NA|496aa|down_6|NZ_CP012706.1_3768257_3769745_-	COG1696, DltB, Predicted membrane protein involved in D-alanine export [Cell envelope biogenesis, outer membrane]	NA|310aa|down_7|NZ_CP012706.1_3769731_3770661_-	cd01825, SGNH_hydrolase_peri1, SGNH_peri1; putative periplasmic member of the SGNH-family of hydrolases, a diverse family of lipases and esterases	NA|458aa|down_8|NZ_CP012706.1_3770614_3771988_-	cd01825, SGNH_hydrolase_peri1, SGNH_peri1; putative periplasmic member of the SGNH-family of hydrolases, a diverse family of lipases and esterases	NA|98aa|down_9|NZ_CP012706.1_3772104_3772398_-	NA
