assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001688725.2_ASM168872v2	NZ_CP015401	Bacteroides caecimuris strain I48 chromosome, complete genome	1	2094376-2094528	1	CRISPRCasFinder	no		cas3,PD-DExK,RT,PrimPol,WYL,DEDDh	Orphan	TTTGTACACCCTCAGGGACTCGAACCCTGGACCCATTGATTAAGAGTCA	49	0	0	NA	NA	NA	1	1	Orphan	cas3,PD-DExK,RT,PrimPol,WYL,DEDDh	NA|397aa|up_9|NZ_CP015401.2_2082827_2084018_-,NA|293aa|up_6|NZ_CP015401.2_2085417_2086296_-,NA|83aa|down_2|NZ_CP015401.2_2097077_2097326_-,NA|151aa|down_5|NZ_CP015401.2_2098676_2099129_+,NA|129aa|down_8|NZ_CP015401.2_2102249_2102636_-	NA|397aa|up_9|NZ_CP015401.2_2082827_2084018_-	NA	NA|112aa|up_8|NZ_CP015401.2_2084032_2084368_-	pfam09851, SHOCT, Short C-terminal domain	NA|228aa|up_7|NZ_CP015401.2_2084400_2085084_-	PRK09598, PRK09598, phosphoethanolamine--lipid A transferase EptA	NA|293aa|up_6|NZ_CP015401.2_2085417_2086296_-	NA	NA|162aa|up_5|NZ_CP015401.2_2087661_2088147_-	pfam18291, HU-HIG, HU domain fused to wHTH, Ig, or Glycine-rich motif	NA|480aa|up_4|NZ_CP015401.2_2088612_2090052_-	PRK03643, PRK03643, tagaturonate reductase	NA|356aa|up_3|NZ_CP015401.2_2090078_2091146_-	cd06307, PBP1_sugar_binding, periplasmic sugar-binding domain of uncharacterized transport systems	NA|469aa|up_2|NZ_CP015401.2_2091344_2092751_+	PRK02925, PRK02925, glucuronate isomerase; Reviewed	NA|312aa|up_1|NZ_CP015401.2_2092897_2093833_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|124aa|up_0|NZ_CP015401.2_2093873_2094245_-	pfam14690, zf-ISL3, zinc-finger of transposase IS204/IS1001/IS1096/IS1165	NA|268aa|down_0|NZ_CP015401.2_2094645_2095449_-	cd07733, YycJ-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis YycJ and related proteins; MBL-fold metallo hydrolase domain	NA|463aa|down_1|NZ_CP015401.2_2095531_2096920_-	cd17346, MFS_DtpA_like, Dipeptide and tripeptide permease A (DtpA)-like subfamily of the Major Facilitator Superfamily of transporters	NA|83aa|down_2|NZ_CP015401.2_2097077_2097326_-	NA	NA|152aa|down_3|NZ_CP015401.2_2097487_2097943_+	pfam07681, DoxX, DoxX	NA|184aa|down_4|NZ_CP015401.2_2098030_2098582_+	pfam11810, DUF3332, Domain of unknown function (DUF3332)	NA|151aa|down_5|NZ_CP015401.2_2098676_2099129_+	NA	NA|770aa|down_6|NZ_CP015401.2_2099082_2101392_-	COG4775, COG4775, Outer membrane protein/protective antigen OMA87 [Cell envelope biogenesis, outer membrane]	NA|255aa|down_7|NZ_CP015401.2_2101534_2102299_+	cd18109, SpoU-like_RNA-MTase, SAM-dependent RNA methylase related to SpoU-TrmH	NA|129aa|down_8|NZ_CP015401.2_2102249_2102636_-	NA	NA|359aa|down_9|NZ_CP015401.2_2102641_2103718_-	pfam14129, DUF4296, Domain of unknown function (DUF4296)
GCF_001688725.2_ASM168872v2	NZ_CP015401	Bacteroides caecimuris strain I48 chromosome, complete genome	2	2348934-2349071	1	CRT	no	PrimPol	cas3,PD-DExK,RT,PrimPol,WYL,DEDDh	Unclear	TCTTTTATACCTTCTTTT	18	3	11	2348952-2348969|2348952-2348969|2348952-2348969|2348952-2348969|2348988-2349017|2348988-2349017|2348988-2349017|2349036-2349053|2349036-2349053|2349036-2349053|2349036-2349053	NZ_CP015401.2_2346620-2346637|NZ_CP015401.2_2350190-2350207|NZ_CP015401.2_2351368-2351385|NZ_CP015401.2_2351356-2351373|NZ_CP015401.2_2346560-2346589|NZ_CP015401.2_2350166-2350195|NZ_CP015401.2_2351392-2351421|NZ_CP015401.2_2346644-2346661|NZ_CP015401.2_2347798-2347815|NZ_CP015401.2_2350214-2350231|NZ_CP015401.2_2351416-2351433	NA	3	3	Orphan	cas3,PD-DExK,RT,PrimPol,WYL,DEDDh	NA|47aa|up_9|NZ_CP015401.2_2339600_2339741_-,NA|36aa|up_4|NZ_CP015401.2_2343729_2343837_+,NA	NA|47aa|up_9|NZ_CP015401.2_2339600_2339741_-	NA	PrimPol|208aa|up_8|NZ_CP015401.2_2339905_2340529_+	pfam08800, VirE_N, VirE N-terminal domain	NA|621aa|up_7|NZ_CP015401.2_2340549_2342412_+	pfam13148, DUF3987, Protein of unknown function (DUF3987)	NA|73aa|up_6|NZ_CP015401.2_2342539_2342758_-	pfam14053, DUF4248, Domain of unknown function (DUF4248)	NA|163aa|up_5|NZ_CP015401.2_2342979_2343468_+	pfam18291, HU-HIG, HU domain fused to wHTH, Ig, or Glycine-rich motif	NA|36aa|up_4|NZ_CP015401.2_2343729_2343837_+	NA	NA|138aa|up_3|NZ_CP015401.2_2343841_2344255_+	PHA00447, PHA00447, lysozyme	NA|564aa|up_2|NZ_CP015401.2_2344523_2346215_+	COG3666, COG3666, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|316aa|up_1|NZ_CP015401.2_2346462_2347410_-	pfam12784, PDDEXK_2, PD-(D/E)XK nuclease family transposase	NA|296aa|up_0|NZ_CP015401.2_2347676_2348564_-	pfam12784, PDDEXK_2, PD-(D/E)XK nuclease family transposase	NA|304aa|down_0|NZ_CP015401.2_2350068_2350980_-	pfam12784, PDDEXK_2, PD-(D/E)XK nuclease family transposase	NA|312aa|down_1|NZ_CP015401.2_2351246_2352182_-	pfam12784, PDDEXK_2, PD-(D/E)XK nuclease family transposase	NA|363aa|down_2|NZ_CP015401.2_2352556_2353645_-	cd06853, GT_WecA_like, This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins	NA|356aa|down_3|NZ_CP015401.2_2354146_2355214_-	cd04955, GT4-like, glycosyltransferase family 4 proteins	NA|360aa|down_4|NZ_CP015401.2_2355233_2356313_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|361aa|down_5|NZ_CP015401.2_2356327_2357410_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|378aa|down_6|NZ_CP015401.2_2357376_2358510_-	cd03820, GT4_AmsD-like, amylovoran biosynthesis glycosyltransferase AmsD and similar proteins	NA|261aa|down_7|NZ_CP015401.2_2358521_2359304_-	pfam04991, LicD, LicD family	NA|365aa|down_8|NZ_CP015401.2_2359305_2360400_-	smart00854, PGA_cap, Bacterial capsule synthesis protein PGA_cap	NA|489aa|down_9|NZ_CP015401.2_2360408_2361875_-	cd13127, MATE_tuaB_like, Uncharacterized subfamily of the multidrug and toxic compound extrusion (MATE) proteins
