assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_007012325.1_ASM701232v1	NZ_CP041395	Bacteroides ovatus strain 3725 D1 iv chromosome, complete genome	1	3888980-3889080	1	CRISPRCasFinder	no	PD-DExK	PrimPol,RT,DEDDh,WYL,PD-DExK,cas9,cas1,cas2,cas3	Unclear	ATTTCATTTGCCGGCAATTTGCCGA	25	0	0	NA	NA	NA	1	1	Orphan	PrimPol,RT,DEDDh,WYL,PD-DExK,cas9,cas1,cas2,cas3	NA|100aa|up_5|NZ_CP041395.1_3883944_3884244_+,NA|183aa|down_0|NZ_CP041395.1_3889134_3889683_-,NA|132aa|down_2|NZ_CP041395.1_3890916_3891312_-,NA|159aa|down_3|NZ_CP041395.1_3891684_3892161_-,NA|116aa|down_4|NZ_CP041395.1_3892212_3892560_-,NA|415aa|down_7|NZ_CP041395.1_3894581_3895826_-,NA|419aa|down_9|NZ_CP041395.1_3897871_3899128_+	NA|76aa|up_9|NZ_CP041395.1_3877680_3877908_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|1077aa|up_8|NZ_CP041395.1_3877904_3881135_+	COG0610, COG0610, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|554aa|up_7|NZ_CP041395.1_3881147_3882809_+	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	PD-DExK|378aa|up_6|NZ_CP041395.1_3882801_3883935_+	pfam06250, DUF1016, Protein of unknown function (DUF1016)	NA|100aa|up_5|NZ_CP041395.1_3883944_3884244_+	NA	NA|201aa|up_4|NZ_CP041395.1_3884240_3884843_+	cd17278, RMtype1_S_LdeBORF1052P-TRD2-CR2, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Lactobacillus delbrueckii subsp	NA|439aa|up_3|NZ_CP041395.1_3884766_3886083_-	cd17517, RMtype1_S_EcoKI_StySPI-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR),similar to Escherichia coli str	NA|331aa|up_2|NZ_CP041395.1_3886135_3887128_-	PRK00236, xerC, site-specific tyrosine recombinase XerC; Reviewed	NA|273aa|up_1|NZ_CP041395.1_3887385_3888204_-	pfam11185, DUF2971, Protein of unknown function (DUF2971)	NA|233aa|up_0|NZ_CP041395.1_3888206_3888905_-	cd17254, RMtype1_S_FclI-TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to S	NA|183aa|down_0|NZ_CP041395.1_3889134_3889683_-	NA	NA|398aa|down_1|NZ_CP041395.1_3889707_3890901_-	pfam03432, Relaxase, Relaxase/Mobilisation nuclease domain	NA|132aa|down_2|NZ_CP041395.1_3890916_3891312_-	NA	NA|159aa|down_3|NZ_CP041395.1_3891684_3892161_-	NA	NA|116aa|down_4|NZ_CP041395.1_3892212_3892560_-	NA	NA|483aa|down_5|NZ_CP041395.1_3892694_3894143_-	pfam05272, VirE, Virulence-associated protein E	NA|106aa|down_6|NZ_CP041395.1_3894084_3894402_-	pfam12964, DUF3853, Protein of unknown function (DUF3853)	NA|415aa|down_7|NZ_CP041395.1_3894581_3895826_-	NA	NA|498aa|down_8|NZ_CP041395.1_3895838_3897332_-	cd01185, INTN1_C_like, Integrase IntN1 of Bacteroides mobilizable transposon NBU1 and similar proteins, C-terminal catalytic domain	NA|419aa|down_9|NZ_CP041395.1_3897871_3899128_+	NA
GCF_007012325.1_ASM701232v1	NZ_CP041395	Bacteroides ovatus strain 3725 D1 iv chromosome, complete genome	2	4444209-4446782	1,2,2	PILER-CR,CRISPRCasFinder,PILER-CR	no	cas9,cas1,cas2,WYL	PrimPol,RT,DEDDh,WYL,PD-DExK,cas9,cas1,cas2,cas3	Type II-C,Type II-A,Type II-B	GTATTGTTCCCAATGGTTCAAAGATACTAATTTGAAAGCAAATCACAAC,ATTGTTCCCAATGGTTCAAAGATACTAATTTGAAAGCAAATCACAAC,ATTGTTCCCAATGGTTCAAAGATACTAATTTGAAAGCAAATCACAAC	49,47,47	0	0	NA	NA	NA:NA:NA	20,33,20	33	TypeII-C,TypeII-A,TypeII-B	PrimPol,RT,DEDDh,WYL,PD-DExK,cas9,cas1,cas2,cas3	NA,NA|131aa|down_1|NZ_CP041395.1_4447766_4448159_+,NA|167aa|down_3|NZ_CP041395.1_4450397_4450898_-,NA|339aa|down_5|NZ_CP041395.1_4452146_4453163_+,NA|290aa|down_9|NZ_CP041395.1_4456030_4456900_+	NA|336aa|up_9|NZ_CP041395.1_4430387_4431395_+	COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|322aa|up_8|NZ_CP041395.1_4431385_4432351_+	pfam14378, PAP2_3, PAP2 superfamily	NA|332aa|up_7|NZ_CP041395.1_4432441_4433437_-	cd02801, DUS_like_FMN, Dihydrouridine synthase-like (DUS-like) FMN-binding domain	NA|430aa|up_6|NZ_CP041395.1_4433561_4434851_+	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|135aa|up_5|NZ_CP041395.1_4434893_4435298_+	cd00586, 4HBT, 4-hydroxybenzoyl-CoA thioesterase (4HBT)	NA|360aa|up_4|NZ_CP041395.1_4435339_4436419_+	pfam02481, DNA_processg_A, DNA recombination-mediator protein A	cas9|1511aa|up_3|NZ_CP041395.1_4437302_4441835_+	pfam18541, RuvC_III, RuvC endonuclease subdomain 3	NA|338aa|up_2|NZ_CP041395.1_4441831_4442845_+	pfam13310, Virulence_RhuM, Virulence protein RhuM family	cas1|311aa|up_1|NZ_CP041395.1_4442837_4443770_+	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas2|111aa|up_0|NZ_CP041395.1_4443769_4444102_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|74aa|down_0|NZ_CP041395.1_4447072_4447294_+	pfam14053, DUF4248, Domain of unknown function (DUF4248)	NA|131aa|down_1|NZ_CP041395.1_4447766_4448159_+	NA	NA|592aa|down_2|NZ_CP041395.1_4448388_4450164_+	COG5016, COG5016, Pyruvate/oxaloacetate carboxyltransferase [Energy production and conversion]	NA|167aa|down_3|NZ_CP041395.1_4450397_4450898_-	NA	WYL|300aa|down_4|NZ_CP041395.1_4451187_4452087_+	pfam13280, WYL, WYL domain	NA|339aa|down_5|NZ_CP041395.1_4452146_4453163_+	NA	NA|421aa|down_6|NZ_CP041395.1_4453159_4454422_+	cd07732, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|190aa|down_7|NZ_CP041395.1_4454418_4454988_+	cd07379, MPP_239FB, Homo sapiens 239FB and related proteins, metallophosphatase domain	NA|132aa|down_8|NZ_CP041395.1_4455241_4455637_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|290aa|down_9|NZ_CP041395.1_4456030_4456900_+	NA
GCF_007012325.1_ASM701232v1	NZ_CP041395	Bacteroides ovatus strain 3725 D1 iv chromosome, complete genome	3	5026210-5026300	3	CRISPRCasFinder	no		PrimPol,RT,DEDDh,WYL,PD-DExK,cas9,cas1,cas2,cas3	Orphan	AAAGGGTTTACACTTGGGTTTACA	24	0	0	NA	NA	NA	1	1	Orphan	PrimPol,RT,DEDDh,WYL,PD-DExK,cas9,cas1,cas2,cas3	NA|300aa|up_1|NZ_CP041395.1_5021475_5022375_+,NA|71aa|up_0|NZ_CP041395.1_5025876_5026089_+,NA|71aa|down_1|NZ_CP041395.1_5027071_5027284_+,NA|65aa|down_2|NZ_CP041395.1_5027283_5027478_+,NA|67aa|down_3|NZ_CP041395.1_5027596_5027797_+,NA|74aa|down_4|NZ_CP041395.1_5027786_5028008_+,NA|133aa|down_7|NZ_CP041395.1_5030975_5031374_+,NA|79aa|down_9|NZ_CP041395.1_5032050_5032287_+	NA|353aa|up_9|NZ_CP041395.1_5010306_5011365_+	PRK00578, prfB, peptide chain release factor 2; Validated	NA|351aa|up_8|NZ_CP041395.1_5011819_5012872_+	pfam07396, Porin_O_P, Phosphate-selective porin O and P	NA|156aa|up_7|NZ_CP041395.1_5012990_5013458_+	cd07891, CYTH-like_CthTTM-like_1, CYTH-like Clostridium thermocellum TTM-like subgroup 1	NA|431aa|up_6|NZ_CP041395.1_5013516_5014809_+	pfam13194, DUF4010, Domain of unknown function (DUF4010)	NA|421aa|up_5|NZ_CP041395.1_5014955_5016218_+	COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|387aa|up_4|NZ_CP041395.1_5016561_5017722_-	pfam01223, Endonuclease_NS, DNA/RNA non-specific endonuclease	NA|342aa|up_3|NZ_CP041395.1_5017718_5018744_-	cd10283, MnuA_DNase1-like, Mycoplasma pulmonis MnuA nuclease-like	NA|843aa|up_2|NZ_CP041395.1_5018944_5021473_+	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|300aa|up_1|NZ_CP041395.1_5021475_5022375_+	NA	NA|71aa|up_0|NZ_CP041395.1_5025876_5026089_+	NA	NA|141aa|down_0|NZ_CP041395.1_5026491_5026914_-	pfam07022, Phage_CI_repr, Bacteriophage CI repressor helix-turn-helix domain	NA|71aa|down_1|NZ_CP041395.1_5027071_5027284_+	NA	NA|65aa|down_2|NZ_CP041395.1_5027283_5027478_+	NA	NA|67aa|down_3|NZ_CP041395.1_5027596_5027797_+	NA	NA|74aa|down_4|NZ_CP041395.1_5027786_5028008_+	NA	NA|684aa|down_5|NZ_CP041395.1_5028022_5030074_+	pfam00665, rve, Integrase core domain	NA|290aa|down_6|NZ_CP041395.1_5030109_5030979_+	pfam13401, AAA_22, AAA domain	NA|133aa|down_7|NZ_CP041395.1_5030975_5031374_+	NA	NA|226aa|down_8|NZ_CP041395.1_5031376_5032054_+	COG0467, RAD55, RecA-superfamily ATPases implicated in signal transduction [Signal transduction mechanisms]	NA|79aa|down_9|NZ_CP041395.1_5032050_5032287_+	NA
GCF_007012325.1_ASM701232v1	NZ_CP041396	Bacteroides ovatus strain 3725 D1 iv plasmid unnamed1, complete sequence	1	33213-33465	1	CRISPRCasFinder	no			Orphan	TAGCTGTGTATAAGTATAGGGAA	23	0	0	NA	NA	NA	4	4	Orphan	PrimPol,RT,DEDDh,WYL,PD-DExK,cas9,cas1,cas2,cas3	NA|103aa|up_8|NZ_CP041396.1_24574_24883_+,NA|67aa|up_6|NZ_CP041396.1_27498_27699_+,NA|136aa|up_2|NZ_CP041396.1_31173_31581_-,NA|71aa|up_1|NZ_CP041396.1_31743_31956_+,NA|381aa|up_0|NZ_CP041396.1_31962_33105_+,NA|70aa|down_0|NZ_CP041396.1_34180_34390_-,NA|141aa|down_1|NZ_CP041396.1_34870_35293_-,NA|308aa|down_5|NZ_CP041396.1_38393_39317_+,NA|166aa|down_7|NZ_CP041396.1_41014_41512_-,NA|76aa|down_8|NZ_CP041396.1_41610_41838_-	NA|269aa|up_9|NZ_CP041396.1_23763_24570_-	pfam00656, Peptidase_C14, Caspase domain	NA|103aa|up_8|NZ_CP041396.1_24574_24883_+	NA	NA|548aa|up_7|NZ_CP041396.1_25015_26659_+	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|67aa|up_6|NZ_CP041396.1_27498_27699_+	NA	NA|193aa|up_5|NZ_CP041396.1_27762_28341_+	pfam10543, ORF6N, ORF6N domain	NA|613aa|up_4|NZ_CP041396.1_28413_30252_+	pfam13149, Mfa_like_1, Fimbrillin-like	NA|150aa|up_3|NZ_CP041396.1_30325_30775_+	pfam14466, PLCC, PLAT/LH2 and C2-like Ca2+-binding lipoprotein	NA|136aa|up_2|NZ_CP041396.1_31173_31581_-	NA	NA|71aa|up_1|NZ_CP041396.1_31743_31956_+	NA	NA|381aa|up_0|NZ_CP041396.1_31962_33105_+	NA	NA|70aa|down_0|NZ_CP041396.1_34180_34390_-	NA	NA|141aa|down_1|NZ_CP041396.1_34870_35293_-	NA	NA|233aa|down_2|NZ_CP041396.1_35297_35996_-	cd02042, ParAB_family, partition proteins ParAB family	NA|103aa|down_3|NZ_CP041396.1_36535_36844_+	PRK13877, PRK13877, conjugal transfer transcriptional regulator TraJ	NA|514aa|down_4|NZ_CP041396.1_36840_38382_+	pfam03432, Relaxase, Relaxase/Mobilisation nuclease domain	NA|308aa|down_5|NZ_CP041396.1_38393_39317_+	NA	NA|430aa|down_6|NZ_CP041396.1_39445_40735_-	pfam13155, Toprim_2, Toprim-like	NA|166aa|down_7|NZ_CP041396.1_41014_41512_-	NA	NA|76aa|down_8|NZ_CP041396.1_41610_41838_-	NA	NA|197aa|down_9|NZ_CP041396.1_42018_42609_-	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain
