assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_012851305.1_ASM1285130v1	NZ_CP051672	Parabacteroides distasonis strain CBBP chromosome, complete genome	1	950697-952058	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	PrimPol,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,DEDDh,PD-DExK,csa3	Type I-U, Type I-U?,Type I-C	GTTTCAATCCACGCACCCACACGGGGTGCGAC,GTTTCAATCCACGCACCCACACGGGGTGCGAC,GTTTCAATCCACGCACCCACACGGGGTGCGAC	32,32,32	1	1	951990-952026	NZ_CP051672.1_738159-738195	I-C:I-C:I-C	20,20,20	20	TypeI-U,TypeI-U?,TypeI-C	PrimPol,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,DEDDh,PD-DExK,csa3	NA,NA	NA|270aa|up_9|NZ_CP051672.1_939729_940539_+	pfam06439, DUF1080, Domain of Unknown Function (DUF1080)	NA|434aa|up_8|NZ_CP051672.1_940645_941947_+	cd05913, PaaK, Phenylacetate-CoA ligase (also known as PaaK)	NA|142aa|up_7|NZ_CP051672.1_941967_942393_+	COG4747, COG4747, ACT domain-containing protein [General function prediction only]	NA|269aa|up_6|NZ_CP051672.1_942459_943266_+	TIGR03302, OM_YfiO, outer membrane assembly lipoprotein YfiO	NA|111aa|up_5|NZ_CP051672.1_943290_943623_+	pfam01192, RNA_pol_Rpb6, RNA polymerase Rpb6	NA|154aa|up_4|NZ_CP051672.1_943699_944161_+	pfam14126, DUF4293, Domain of unknown function (DUF4293)	NA|512aa|up_3|NZ_CP051672.1_944342_945878_+	cd16144, ARS_like, uncharacterized arylsulfatase subfamily	NA|331aa|up_2|NZ_CP051672.1_945897_946890_+	cd19151, AKR_AKR14A2, Salmonella enterica aldo-keto reductase (AKR) and similar protein	NA|960aa|up_1|NZ_CP051672.1_946963_949843_+	pfam12705, PDDEXK_1, PD-(D/E)XK nuclease superfamily	NA|159aa|up_0|NZ_CP051672.1_949994_950471_-	COG1956, COG1956, GAF domain-containing protein [Signal transduction mechanisms]	cas2|97aa|down_0|NZ_CP051672.1_952227_952518_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|341aa|down_1|NZ_CP051672.1_952527_953550_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|222aa|down_2|NZ_CP051672.1_953546_954212_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas7|293aa|down_3|NZ_CP051672.1_954223_955102_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|570aa|down_4|NZ_CP051672.1_955128_956838_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas5|227aa|down_5|NZ_CP051672.1_956834_957515_-	cd09651, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|742aa|down_6|NZ_CP051672.1_957528_959754_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|450aa|down_7|NZ_CP051672.1_959824_961174_+	cd13143, MATE_MepA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Streptococcus aureus MepA	NA|343aa|down_8|NZ_CP051672.1_961163_962192_+	pfam04371, PAD_porph, Porphyromonas-type peptidyl-arginine deiminase	NA|292aa|down_9|NZ_CP051672.1_962343_963219_+	cd07573, CPA, N-carbamoylputrescine amidohydrolase (CPA) (class 11 nitrilases)
GCF_012851305.1_ASM1285130v1	NZ_CP051672	Parabacteroides distasonis strain CBBP chromosome, complete genome	2	4206472-4206580	2	CRISPRCasFinder	no		PrimPol,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,DEDDh,PD-DExK,csa3	Orphan	ATAGCTATCAAGGGCAACTCGCCAAATGGCTGAAAAA	37	1	1	4206509-4206543	NZ_CP051672.1_627250-627284	NA	1	1	Orphan	PrimPol,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,DEDDh,PD-DExK,csa3	NA|95aa|up_3|NZ_CP051672.1_4203953_4204238_+,NA|250aa|up_0|NZ_CP051672.1_4205503_4206253_+,NA	NA|172aa|up_9|NZ_CP051672.1_4199716_4200232_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|336aa|up_8|NZ_CP051672.1_4200254_4201262_-	cd12810, Esterase_713_like-3, Uncharacterized enzymes similar to novel bacterial esterase that cleaves esters on halogenated cyclic compounds	NA|397aa|up_7|NZ_CP051672.1_4201294_4202485_-	cd19078, AKR_AKR13C1_2, AKR13C family of aldo-keto reductase (AKR)	NA|90aa|up_6|NZ_CP051672.1_4202732_4203002_+	pfam12674, Zn_ribbon_2, Putative zinc ribbon domain	NA|97aa|up_5|NZ_CP051672.1_4203008_4203299_+	COG2350, COG2350, Uncharacterized protein conserved in bacteria [Function unknown]	NA|177aa|up_4|NZ_CP051672.1_4203295_4203826_+	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|95aa|up_3|NZ_CP051672.1_4203953_4204238_+	NA	NA|192aa|up_2|NZ_CP051672.1_4204262_4204838_+	sd00010, SLR, Sel1-like repeat	NA|206aa|up_1|NZ_CP051672.1_4204873_4205491_+	sd00010, SLR, Sel1-like repeat	NA|250aa|up_0|NZ_CP051672.1_4205503_4206253_+	NA	NA|167aa|down_0|NZ_CP051672.1_4207415_4207916_-	cd04335, PrdX_deacylase, This CD includes bacterial (Agrobacterium tumefaciens and Caulobacter crescentus ProX, and Clostridium sticklandii PrdX) and eukaryotic (Plasmodium falciparum N-terminal ProRS editing domain) sequences	NA|206aa|down_1|NZ_CP051672.1_4207922_4208540_-	cd02603, HAD_sEH-N_like, N-terminal lipase phosphatase domain of human soluble epoxide hydrolase, Escherichia coli YihX/HAD4 alpha-D-glucose 1-phosphate phosphatase, and related domains, may be inactive	NA|188aa|down_2|NZ_CP051672.1_4208655_4209219_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|300aa|down_3|NZ_CP051672.1_4210666_4211566_+	pfam12833, HTH_18, Helix-turn-helix domain	NA|779aa|down_4|NZ_CP051672.1_4211668_4214005_-	pfam13715, CarbopepD_reg_2, CarboxypepD_reg-like domain	NA|247aa|down_5|NZ_CP051672.1_4214127_4214868_-	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|200aa|down_6|NZ_CP051672.1_4214902_4215502_-	pfam09347, DUF1989, Domain of unknown function (DUF1989)	NA|781aa|down_7|NZ_CP051672.1_4215562_4217905_-	pfam14905, OMP_b-brl_3, Outer membrane protein beta-barrel family	NA|405aa|down_8|NZ_CP051672.1_4217888_4219103_-	cd17320, MFS_MdfA_MDR_like, Multidrug transporter MdfA and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|183aa|down_9|NZ_CP051672.1_4219315_4219864_+	pfam13568, OMP_b-brl_2, Outer membrane protein beta-barrel domain
GCF_012851305.1_ASM1285130v1	NZ_CP051672	Parabacteroides distasonis strain CBBP chromosome, complete genome	3	5191137-5191223	3	CRISPRCasFinder	no		PrimPol,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,DEDDh,PD-DExK,csa3	Orphan	GAGACGGGGCACGCCCCGTCTCA	23	0	0	NA	NA	NA	1	1	Orphan	PrimPol,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,RT,DEDDh,PD-DExK,csa3	NA|110aa|up_5|NZ_CP051672.1_5181381_5181711_+,NA|885aa|up_4|NZ_CP051672.1_5181977_5184632_+,NA|275aa|up_3|NZ_CP051672.1_5184739_5185564_-,NA	NA|148aa|up_9|NZ_CP051672.1_5177432_5177876_+	PRK05234, mgsA, methylglyoxal synthase; Validated	NA|354aa|up_8|NZ_CP051672.1_5177891_5178953_+	PRK00772, PRK00772, 3-isopropylmalate dehydrogenase; Provisional	NA|274aa|up_7|NZ_CP051672.1_5179063_5179885_-	sd00006, TPR, Tetratricopeptide repeat	NA|282aa|up_6|NZ_CP051672.1_5180043_5180889_+	PRK10334, PRK10334, small-conductance mechanosensitive channel MscS	NA|110aa|up_5|NZ_CP051672.1_5181381_5181711_+	NA	NA|885aa|up_4|NZ_CP051672.1_5181977_5184632_+	NA	NA|275aa|up_3|NZ_CP051672.1_5184739_5185564_-	NA	NA|477aa|up_2|NZ_CP051672.1_5185652_5187083_+	TIGR02692, putative_tRNA_nucleotidyltransferase, tRNA adenylyltransferase	NA|738aa|up_1|NZ_CP051672.1_5187213_5189427_-	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|507aa|up_0|NZ_CP051672.1_5189602_5191123_+	pfam14134, DUF4301, Domain of unknown function (DUF4301)	NA|438aa|down_0|NZ_CP051672.1_5191235_5192549_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|365aa|down_1|NZ_CP051672.1_5192813_5193908_+	pfam13201, PCMD, Putative carbohydrate metabolism domain	NA|261aa|down_2|NZ_CP051672.1_5193921_5194704_+	pfam13568, OMP_b-brl_2, Outer membrane protein beta-barrel domain	NA|484aa|down_3|NZ_CP051672.1_5194854_5196306_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|335aa|down_4|NZ_CP051672.1_5196329_5197334_+	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]	NA|1080aa|down_5|NZ_CP051672.1_5197465_5200705_+	pfam05359, DUF748, Domain of Unknown Function (DUF748)	NA|87aa|down_6|NZ_CP051672.1_5200708_5200969_+	pfam14542, Acetyltransf_CG, GCN5-related N-acetyl-transferase	NA|217aa|down_7|NZ_CP051672.1_5200965_5201616_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|286aa|down_8|NZ_CP051672.1_5201763_5202621_+	pfam06445, GyrI-like, GyrI-like small molecule binding domain	NA|290aa|down_9|NZ_CP051672.1_5202737_5203607_-	pfam16242, Pyrid_ox_like, Pyridoxamine 5'-phosphate oxidase like
