assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_005706655.1_ASM570665v1	NZ_CP036555	Bacteroides fragilis strain CCUG4856T chromosome, complete genome	1	392239-394276	1,1	PILER-CR,CRISPRCasFinder	no	cas2,cas1,cas9	cas3,cas2,cas1,cas9,RT,DEDDh,cas6,PrimPol,WYL	 Type II-B,Type II-A,Type II-C, or Type II-C?,Type II-B	GTTGTGATTTGCTTTCAAATTAGTATCTTTGAACCATTGGAAACAGC,GTTGTGATTTGCTTTCAAATTAGTATCTTTGAACCATTGGAAACAGC	47,47	0	0	NA	NA	NA:NA	26,26	26	TypeII-B,TypeII-A,TypeII-C,orTypeII-C?,TypeII-B	cas3,cas2,cas1,cas9,RT,DEDDh,cas6,PrimPol,WYL	NA|272aa|up_9|NZ_CP036555.1_377344_378160_-,NA|148aa|up_1|NZ_CP036555.1_384818_385262_-,NA|76aa|up_0|NZ_CP036555.1_391117_391345_-,NA|71aa|down_4|NZ_CP036555.1_401396_401609_+,NA|87aa|down_5|NZ_CP036555.1_401620_401881_-,NA|98aa|down_9|NZ_CP036555.1_405801_406095_-	NA|272aa|up_9|NZ_CP036555.1_377344_378160_-	NA	NA|285aa|up_8|NZ_CP036555.1_378156_379011_-	cd03230, ABC_DR_subfamily_A, ATP-binding cassette domain of the drug resistance transporter and related proteins, subfamily A	NA|478aa|up_7|NZ_CP036555.1_379322_380756_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|316aa|up_6|NZ_CP036555.1_380775_381723_-	pfam12849, PBP_like_2, PBP superfamily domain	NA|272aa|up_5|NZ_CP036555.1_381725_382541_-	pfam03544, TonB_C, Gram-negative bacterial TonB protein C-terminal	NA|217aa|up_4|NZ_CP036555.1_382570_383221_-	pfam02472, ExbD, Biopolymer transport protein ExbD/TolR	NA|202aa|up_3|NZ_CP036555.1_383233_383839_-	pfam02472, ExbD, Biopolymer transport protein ExbD/TolR	NA|271aa|up_2|NZ_CP036555.1_383866_384679_-	pfam01618, MotA_ExbB, MotA/TolQ/ExbB proton channel family	NA|148aa|up_1|NZ_CP036555.1_384818_385262_-	NA	NA|76aa|up_0|NZ_CP036555.1_391117_391345_-	NA	cas2|111aa|down_0|NZ_CP036555.1_394384_394717_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|311aa|down_1|NZ_CP036555.1_394720_395653_-	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas9|1437aa|down_2|NZ_CP036555.1_396687_400998_-	pfam18541, RuvC_III, RuvC endonuclease subdomain 3	NA|127aa|down_3|NZ_CP036555.1_401075_401456_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|71aa|down_4|NZ_CP036555.1_401396_401609_+	NA	NA|87aa|down_5|NZ_CP036555.1_401620_401881_-	NA	NA|496aa|down_6|NZ_CP036555.1_401954_403442_-	COG1696, DltB, Predicted membrane protein involved in D-alanine export [Cell envelope biogenesis, outer membrane]	NA|305aa|down_7|NZ_CP036555.1_403428_404343_-	cd01825, SGNH_hydrolase_peri1, SGNH_peri1; putative periplasmic member of the SGNH-family of hydrolases, a diverse family of lipases and esterases	NA|458aa|down_8|NZ_CP036555.1_404311_405685_-	cd01825, SGNH_hydrolase_peri1, SGNH_peri1; putative periplasmic member of the SGNH-family of hydrolases, a diverse family of lipases and esterases	NA|98aa|down_9|NZ_CP036555.1_405801_406095_-	NA
GCF_005706655.1_ASM570665v1	NZ_CP036555	Bacteroides fragilis strain CCUG4856T chromosome, complete genome	2	553311-553410	2	CRISPRCasFinder	no		cas3,cas2,cas1,cas9,RT,DEDDh,cas6,PrimPol,WYL	Orphan	GACGAAGGTAATGGTACATCAACAAGACTAC	31	0	0	NA	NA	NA	1	1	Orphan	cas3,cas2,cas1,cas9,RT,DEDDh,cas6,PrimPol,WYL	NA|58aa|up_3|NZ_CP036555.1_546861_547035_+,NA|291aa|up_0|NZ_CP036555.1_551053_551926_-,NA|327aa|down_1|NZ_CP036555.1_556749_557730_-,NA|338aa|down_2|NZ_CP036555.1_557762_558776_-,NA|403aa|down_3|NZ_CP036555.1_558792_560001_-,NA|171aa|down_9|NZ_CP036555.1_568541_569054_+	NA|192aa|up_9|NZ_CP036555.1_537906_538482_-	pfam14289, DUF4369, Domain of unknown function (DUF4369)	NA|325aa|up_8|NZ_CP036555.1_538572_539547_-	pfam16961, OmpA_like, Putative OmpA-OmpF-like porin family	NA|808aa|up_7|NZ_CP036555.1_539571_541995_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|311aa|up_6|NZ_CP036555.1_542440_543373_-	cd01185, INTN1_C_like, Integrase IntN1 of Bacteroides mobilizable transposon NBU1 and similar proteins, C-terminal catalytic domain	NA|713aa|up_5|NZ_CP036555.1_543548_545687_+	pfam11175, DUF2961, Protein of unknown function (DUF2961)	NA|224aa|up_4|NZ_CP036555.1_545905_546577_-	pfam12990, DUF3874, Domain of unknonw function from B	NA|58aa|up_3|NZ_CP036555.1_546861_547035_+	NA	NA|845aa|up_2|NZ_CP036555.1_547073_549608_-	cd01827, sialate_O-acetylesterase_like1, sialate O-acetylesterase_like family of the SGNH hydrolases, a diverse family of lipases and esterases	NA|388aa|up_1|NZ_CP036555.1_549865_551029_-	COG5279, CYK3, Uncharacterized protein involved in cytokinesis, contains TGc (transglutaminase/protease-like) domain [Cell division and chromosome partitioning]	NA|291aa|up_0|NZ_CP036555.1_551053_551926_-	NA	NA|872aa|down_0|NZ_CP036555.1_554097_556713_-	pfam00041, fn3, Fibronectin type III domain	NA|327aa|down_1|NZ_CP036555.1_556749_557730_-	NA	NA|338aa|down_2|NZ_CP036555.1_557762_558776_-	NA	NA|403aa|down_3|NZ_CP036555.1_558792_560001_-	NA	NA|186aa|down_4|NZ_CP036555.1_560677_561235_+	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|252aa|down_5|NZ_CP036555.1_562538_563294_+	COG3279, LytT, Response regulator of the LytR/AlgR family [Transcription / Signal transduction mechanisms]	NA|502aa|down_6|NZ_CP036555.1_563317_564823_+	cd16373, DMSOR_beta_like, uncharacterized subfamily of DMSO Reductase beta subunit family	NA|461aa|down_7|NZ_CP036555.1_564845_566228_+	COG1453, COG1453, Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only]	NA|767aa|down_8|NZ_CP036555.1_566224_568525_+	pfam08973, TM1506, Domain of unknown function (DUF1893)	NA|171aa|down_9|NZ_CP036555.1_568541_569054_+	NA
GCF_005706655.1_ASM570665v1	NZ_CP036555	Bacteroides fragilis strain CCUG4856T chromosome, complete genome	3	1828231-1828376	2,3	PILER-CR,CRISPRCasFinder	no		cas3,cas2,cas1,cas9,RT,DEDDh,cas6,PrimPol,WYL	Orphan	GAAATTCCCAATATATTGTGAATTTGA,ATTCCCAATATATTGTGAATTTGA	27,24	0	0	NA	NA	NA:NA	2,2	2	Orphan	cas3,cas2,cas1,cas9,RT,DEDDh,cas6,PrimPol,WYL	NA,NA|77aa|down_2|NZ_CP036555.1_1830181_1830412_-	NA|363aa|up_9|NZ_CP036555.1_1819444_1820533_+	pfam07610, DUF1573, Protein of unknown function (DUF1573)	NA|364aa|up_8|NZ_CP036555.1_1820541_1821633_+	PRK09435, PRK09435, methylmalonyl Co-A mutase-associated GTPase MeaB	NA|303aa|up_7|NZ_CP036555.1_1821664_1822573_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|276aa|up_6|NZ_CP036555.1_1822666_1823494_+	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|339aa|up_5|NZ_CP036555.1_1823515_1824532_+	pfam14491, DUF4435, Protein of unknown function (DUF4435)	NA|416aa|up_4|NZ_CP036555.1_1824503_1825751_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|298aa|up_3|NZ_CP036555.1_1825792_1826686_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|337aa|up_2|NZ_CP036555.1_1826688_1827699_-	pfam07804, HipA_C, HipA-like C-terminal domain	NA|110aa|up_1|NZ_CP036555.1_1827691_1828021_-	TIGR03071, couple_hipA, HipA N-terminal domain	NA|71aa|up_0|NZ_CP036555.1_1828017_1828230_-	TIGR03070, couple_hipB, transcriptional regulator, y4mF family	NA|291aa|down_0|NZ_CP036555.1_1828719_1829592_-	pfam14297, DUF4373, Domain of unknown function (DUF4373)	NA|116aa|down_1|NZ_CP036555.1_1829734_1830082_-	pfam10902, WYL_2, WYL_2, Sm-like SH3 beta-barrel fold	NA|77aa|down_2|NZ_CP036555.1_1830181_1830412_-	NA	NA|179aa|down_3|NZ_CP036555.1_1831129_1831666_+	cd09895, NGN_SP_UpxY, N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP), UpxY	NA|163aa|down_4|NZ_CP036555.1_1831685_1832174_+	pfam06603, UpxZ, UpxZ family of transcription anti-terminator antagonists	NA|403aa|down_5|NZ_CP036555.1_1832196_1833405_+	cd05237, UDP_invert_4-6DH_SDR_e, UDP-Glcnac (UDP-linked N-acetylglucosamine) inverting 4,6-dehydratase, extended (e) SDRs	NA|378aa|down_6|NZ_CP036555.1_1833424_1834558_+	cd00616, AHBA_syn, 3-amino-5-hydroxybenzoic acid synthase family (AHBA_syn)	NA|203aa|down_7|NZ_CP036555.1_1834550_1835159_+	cd03360, LbH_AT_putative, Putative Acyltransferase (AT), Left-handed parallel beta-Helix (LbH) domain; This group is composed of mostly uncharacterized proteins containing an N-terminal helical subdomain followed by a LbH domain	NA|329aa|down_8|NZ_CP036555.1_1835288_1836275_+	cd06426, NTP_transferase_like_2, NTP_trnasferase_like_2 is a member of the nucleotidyl transferase family	NA|484aa|down_9|NZ_CP036555.1_1836342_1837794_+	cd13127, MATE_tuaB_like, Uncharacterized subfamily of the multidrug and toxic compound extrusion (MATE) proteins
GCF_005706655.1_ASM570665v1	NZ_CP036555	Bacteroides fragilis strain CCUG4856T chromosome, complete genome	4	3773677-3773842	4	CRISPRCasFinder	no		cas3,cas2,cas1,cas9,RT,DEDDh,cas6,PrimPol,WYL	Orphan	TTTTTCTTTGTCGGGGTAGCGGGATTCGAACCCACGACCCCCAGCTCCCAAAGC	54	0	0	NA	NA	NA	1	1	Orphan	cas3,cas2,cas1,cas9,RT,DEDDh,cas6,PrimPol,WYL	NA|147aa|up_8|NZ_CP036555.1_3759325_3759766_+,NA|225aa|up_7|NZ_CP036555.1_3759778_3760453_+,NA|701aa|up_6|NZ_CP036555.1_3764606_3766709_+,NA|398aa|up_5|NZ_CP036555.1_3766971_3768165_+,NA|145aa|up_2|NZ_CP036555.1_3769719_3770154_+,NA|113aa|up_1|NZ_CP036555.1_3770767_3771106_+,NA|73aa|down_0|NZ_CP036555.1_3774083_3774302_-,NA|51aa|down_1|NZ_CP036555.1_3774487_3774640_+,NA|58aa|down_3|NZ_CP036555.1_3775058_3775232_+	NA|132aa|up_9|NZ_CP036555.1_3758917_3759313_+	pfam08291, Peptidase_M15_3, Peptidase M15	NA|147aa|up_8|NZ_CP036555.1_3759325_3759766_+	NA	NA|225aa|up_7|NZ_CP036555.1_3759778_3760453_+	NA	NA|701aa|up_6|NZ_CP036555.1_3764606_3766709_+	NA	NA|398aa|up_5|NZ_CP036555.1_3766971_3768165_+	NA	NA|209aa|up_4|NZ_CP036555.1_3768262_3768889_+	cd14948, BACON, Bacteroidetes-Associated Carbohydrate-binding (putative) Often N-terminal (BACON) domain	NA|208aa|up_3|NZ_CP036555.1_3768941_3769565_-	cd14948, BACON, Bacteroidetes-Associated Carbohydrate-binding (putative) Often N-terminal (BACON) domain	NA|145aa|up_2|NZ_CP036555.1_3769719_3770154_+	NA	NA|113aa|up_1|NZ_CP036555.1_3770767_3771106_+	NA	NA|415aa|up_0|NZ_CP036555.1_3772278_3773523_-	cd01185, INTN1_C_like, Integrase IntN1 of Bacteroides mobilizable transposon NBU1 and similar proteins, C-terminal catalytic domain	NA|73aa|down_0|NZ_CP036555.1_3774083_3774302_-	NA	NA|51aa|down_1|NZ_CP036555.1_3774487_3774640_+	NA	NA|114aa|down_2|NZ_CP036555.1_3774648_3774990_+	pfam07784, DUF1622, Protein of unknown function (DUF1622)	NA|58aa|down_3|NZ_CP036555.1_3775058_3775232_+	NA	NA|426aa|down_4|NZ_CP036555.1_3775238_3776516_-	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|455aa|down_5|NZ_CP036555.1_3776512_3777877_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|491aa|down_6|NZ_CP036555.1_3778239_3779712_+	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|417aa|down_7|NZ_CP036555.1_3779750_3781001_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|808aa|down_8|NZ_CP036555.1_3781144_3783568_+	pfam12704, MacB_PCD, MacB-like periplasmic core domain	NA|782aa|down_9|NZ_CP036555.1_3783717_3786063_+	TIGR03434, ADOP, Acidobacterial duplicated orphan permease
GCF_005706655.1_ASM570665v1	NZ_CP036555	Bacteroides fragilis strain CCUG4856T chromosome, complete genome	5	3934213-3934755	5,1,3	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas6	cas3,cas2,cas1,cas9,RT,DEDDh,cas6,PrimPol,WYL	Unclear	ATTTCAATTCCATAAGGTACAATTAATAC,ATTTCAATTCCATAAGGTACAATTAATAC,ATTTCAATTCCATAAGGTACAATTAATAC	29,29,29	2	2	3934499-3934532|3934693-3934726	NZ_CP036555.1_2683610-2683643|NZ_CP036555.1_2683429-2683396	NA:NA:NA	8,8,7	8	Unclear	cas3,cas2,cas1,cas9,RT,DEDDh,cas6,PrimPol,WYL	NA|135aa|up_1|NZ_CP036555.1_3933170_3933575_-,NA|148aa|up_0|NZ_CP036555.1_3933597_3934041_-,NA	NA|375aa|up_9|NZ_CP036555.1_3923582_3924707_+	PRK09240, thiH, 2-iminoacetate synthase ThiH	NA|234aa|up_8|NZ_CP036555.1_3924729_3925431_+	cd00757, ThiF_MoeB_HesA_family, ThiF_MoeB_HesA	NA|203aa|up_7|NZ_CP036555.1_3925492_3926101_+	cd00564, TMP_TenI, Thiamine monophosphate synthase (TMP synthase)/TenI	NA|171aa|up_6|NZ_CP036555.1_3926178_3926691_-	pfam13505, OMP_b-brl, Outer membrane protein beta-barrel domain	NA|190aa|up_5|NZ_CP036555.1_3926892_3927462_-	pfam13568, OMP_b-brl_2, Outer membrane protein beta-barrel domain	NA|907aa|up_4|NZ_CP036555.1_3927674_3930395_-	PRK09279, PRK09279, pyruvate phosphate dikinase; Provisional	NA|473aa|up_3|NZ_CP036555.1_3930661_3932080_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|306aa|up_2|NZ_CP036555.1_3932090_3933008_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|135aa|up_1|NZ_CP036555.1_3933170_3933575_-	NA	NA|148aa|up_0|NZ_CP036555.1_3933597_3934041_-	NA	cas2|88aa|down_0|NZ_CP036555.1_3934967_3935231_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas6|224aa|down_1|NZ_CP036555.1_3935857_3936529_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	NA|285aa|down_2|NZ_CP036555.1_3937350_3938205_+	cd01086, MetAP1, Methionine Aminopeptidase 1	NA|409aa|down_3|NZ_CP036555.1_3938205_3939432_+	COG1322, COG1322, Predicted nuclease of restriction endonuclease-like fold, RmuC family [General function prediction only]	NA|249aa|down_4|NZ_CP036555.1_3939459_3940206_+	pfam02596, DUF169, Uncharacterized ArCR, COG2043	NA|438aa|down_5|NZ_CP036555.1_3940405_3941719_-	pfam06965, Na_H_antiport_1, Na+/H+ antiporter 1	NA|393aa|down_6|NZ_CP036555.1_3941763_3942942_-	pfam00999, Na_H_Exchanger, Sodium/hydrogen exchanger family	NA|594aa|down_7|NZ_CP036555.1_3943087_3944869_-	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|67aa|down_8|NZ_CP036555.1_3944994_3945195_-	pfam10771, DUF2582, Winged helix-turn-helix domain (DUF2582)	NA|155aa|down_9|NZ_CP036555.1_3945341_3945806_-	pfam09719, C_GCAxxG_C_C, Putative redox-active protein (C_GCAxxG_C_C)
