assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002355195.1_ASM235519v1	NZ_AP014597	Prevotella intermedia strain OMA14 chromosome I	1	504041-504148	1	CRISPRCasFinder	no		RT,PD-DExK,cas3,WYL,cas2,cas1,DEDDh	Orphan	CAAGACATAACGGATGTTAAATGCCG	26	1	1	504067-504122	NZ_AP014597.1_504026-504081	NA	1	1	Orphan	RT,PD-DExK,cas3,WYL,cas2,cas1,DEDDh,cas13b,Csx28	NA|52aa|up_9|NZ_AP014597.1_488362_488518_+,NA|88aa|up_0|NZ_AP014597.1_503674_503938_+,NA|272aa|down_0|NZ_AP014597.1_504272_505088_+,NA|1218aa|down_1|NZ_AP014597.1_505118_508772_+,NA|1237aa|down_2|NZ_AP014597.1_509227_512938_+,NA|70aa|down_3|NZ_AP014597.1_513076_513286_+,NA|140aa|down_6|NZ_AP014597.1_517121_517541_-	NA|52aa|up_9|NZ_AP014597.1_488362_488518_+	NA	NA|160aa|up_8|NZ_AP014597.1_488601_489081_-	pfam08989, DUF1896, Domain of unknown function (DUF1896)	NA|428aa|up_7|NZ_AP014597.1_489134_490418_-	cd01185, INTN1_C_like, Integrase IntN1 of Bacteroides mobilizable transposon NBU1 and similar proteins, C-terminal catalytic domain	NA|418aa|up_6|NZ_AP014597.1_490430_491684_-	cd01185, INTN1_C_like, Integrase IntN1 of Bacteroides mobilizable transposon NBU1 and similar proteins, C-terminal catalytic domain	NA|1500aa|up_5|NZ_AP014597.1_491941_496441_-	COG4646, COG4646, DNA methylase [Transcription / DNA replication, recombination, and repair]	NA|695aa|up_4|NZ_AP014597.1_496534_498619_-	PRK07726, PRK07726, DNA topoisomerase 3	NA|468aa|up_3|NZ_AP014597.1_498640_500044_-	pfam13351, DUF4099, Protein of unknown function (DUF4099)	NA|343aa|up_2|NZ_AP014597.1_500229_501258_-	smart00421, HTH_LUXR, helix_turn_helix, Lux Regulon	NA|580aa|up_1|NZ_AP014597.1_501807_503547_+	pfam12833, HTH_18, Helix-turn-helix domain	NA|88aa|up_0|NZ_AP014597.1_503674_503938_+	NA	NA|272aa|down_0|NZ_AP014597.1_504272_505088_+	NA	NA|1218aa|down_1|NZ_AP014597.1_505118_508772_+	NA	NA|1237aa|down_2|NZ_AP014597.1_509227_512938_+	NA	NA|70aa|down_3|NZ_AP014597.1_513076_513286_+	NA	NA|668aa|down_4|NZ_AP014597.1_513746_515750_-	pfam14293, YWFCY, YWFCY protein	NA|427aa|down_5|NZ_AP014597.1_515844_517125_-	pfam03432, Relaxase, Relaxase/Mobilisation nuclease domain	NA|140aa|down_6|NZ_AP014597.1_517121_517541_-	NA	NA|268aa|down_7|NZ_AP014597.1_518124_518928_+	pfam01656, CbiA, CobQ/CobB/MinD/ParA nucleotide binding domain	NA|158aa|down_8|NZ_AP014597.1_518914_519388_+	pfam11888, DUF3408, Protein of unknown function (DUF3408)	NA|159aa|down_9|NZ_AP014597.1_519394_519871_+	pfam11888, DUF3408, Protein of unknown function (DUF3408)
GCF_002355195.1_ASM235519v1	NZ_AP014597	Prevotella intermedia strain OMA14 chromosome I	2	1860929-1863584	1,2	PILER-CR,CRISPRCasFinder	no	cas2,cas1	RT,PD-DExK,cas3,WYL,cas2,cas1,DEDDh	Unclear	GTTGTGATTAGCTTTAAAATTAGTATCTTTGCATTGGCAAATACAAC,GTTGTGATTAGCTTTAAAATTAGTATCTTTGCATTGGCAAATACAAC	47,47	1	2	1863508-1863537|1863508-1863537	NZ_AP014597.1_1109456-1109485|NZ_AP014597.1_1879098-1879069	NA:NA	34,34	34	Unclear	RT,PD-DExK,cas3,WYL,cas2,cas1,DEDDh,cas13b,Csx28	NA|479aa|up_6|NZ_AP014597.1_1851575_1853012_-,NA|450aa|up_5|NZ_AP014597.1_1853008_1854358_-,NA|80aa|down_2|NZ_AP014597.1_1866852_1867092_+,NA|124aa|down_3|NZ_AP014597.1_1868129_1868501_+,NA|674aa|down_4|NZ_AP014597.1_1869113_1871135_+,NA|83aa|down_6|NZ_AP014597.1_1872147_1872396_+,NA|208aa|down_8|NZ_AP014597.1_1873032_1873656_+,NA|57aa|down_9|NZ_AP014597.1_1873669_1873840_+	NA|194aa|up_9|NZ_AP014597.1_1848447_1849029_+	PRK00071, nadD, nicotinate-nucleotide adenylyltransferase	NA|480aa|up_8|NZ_AP014597.1_1849095_1850535_+	cd13127, MATE_tuaB_like, Uncharacterized subfamily of the multidrug and toxic compound extrusion (MATE) proteins	NA|320aa|up_7|NZ_AP014597.1_1850602_1851562_-	cd04196, GT_2_like_d, Subfamily of Glycosyltransferase Family GT2 of unknown function	NA|479aa|up_6|NZ_AP014597.1_1851575_1853012_-	NA	NA|450aa|up_5|NZ_AP014597.1_1853008_1854358_-	NA	NA|321aa|up_4|NZ_AP014597.1_1854443_1855406_-	pfam12710, HAD, haloacid dehalogenase-like hydrolase	NA|293aa|up_3|NZ_AP014597.1_1855407_1856286_-	cd13963, PT_UbiA_2, UbiA family of prenyltransferases (PTases), Unknown subgroup	NA|341aa|up_2|NZ_AP014597.1_1856290_1857313_-	COG1835, COG1835, Predicted acyltransferases [Lipid metabolism]	NA|368aa|up_1|NZ_AP014597.1_1857314_1858418_-	pfam01757, Acyl_transf_3, Acyltransferase family	NA|616aa|up_0|NZ_AP014597.1_1858480_1860328_-	cd16017, LptA, Lipooligosaccharide Phosphoethanolamine Transferase A (LptA) or Lipid A Phosphoethanolamine Transferase	cas2|99aa|down_0|NZ_AP014597.1_1863704_1864001_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|311aa|down_1|NZ_AP014597.1_1864052_1864985_-	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	NA|80aa|down_2|NZ_AP014597.1_1866852_1867092_+	NA	NA|124aa|down_3|NZ_AP014597.1_1868129_1868501_+	NA	NA|674aa|down_4|NZ_AP014597.1_1869113_1871135_+	NA	NA|292aa|down_5|NZ_AP014597.1_1871266_1872142_+	pfam13401, AAA_22, AAA domain	NA|83aa|down_6|NZ_AP014597.1_1872147_1872396_+	NA	NA|212aa|down_7|NZ_AP014597.1_1872395_1873031_+	COG1066, Sms, Predicted ATP-dependent serine protease [Posttranslational modification, protein turnover, chaperones]	NA|208aa|down_8|NZ_AP014597.1_1873032_1873656_+	NA	NA|57aa|down_9|NZ_AP014597.1_1873669_1873840_+	NA
GCF_002355195.1_ASM235519v1	NZ_AP014598	Prevotella intermedia strain OMA14 chromosome II	1	151557-152054	1,1	PILER-CR,CRT	no	cas13b,Csx28	cas13b,Csx28,DEDDh,PD-DExK	Type VI-B2,Type VI-B1	GTTGCATCTGCCTGCTGTTTGCAAGGTAAAAACAA,GTTGCATCTGCCTGCTGTTTGCAAGGTAAAAACAAC	35,36	0	0	NA	NA	VI-B2:VI-B2	6,7	7	TypeVI-B2,TypeVI-B1	RT,PD-DExK,cas3,WYL,cas2,cas1,DEDDh,cas13b,Csx28	NA|411aa|up_6|NZ_AP014598.1_136408_137641_+,NA|320aa|up_4|NZ_AP014598.1_138567_139527_+,NA|139aa|up_3|NZ_AP014598.1_145756_146173_-,NA|298aa|up_2|NZ_AP014598.1_146349_147243_+,Csx28|182aa|up_0|NZ_AP014598.1_150941_151487_+,NA|678aa|down_6|NZ_AP014598.1_165224_167258_-,NA|170aa|down_7|NZ_AP014598.1_168088_168598_-	NA|155aa|up_9|NZ_AP014598.1_131439_131904_+	cd03450, NodN, NodN (nodulation factor N) contains a single hot dog fold similar to those of the peroxisomal Hydratase-Dehydrogenase-Epimerase (HDE) protein, and the fatty acid synthase beta subunit	NA|264aa|up_8|NZ_AP014598.1_132109_132901_-	TIGR02493, PFLA, pyruvate formate-lyase 1-activating enzyme	NA|749aa|up_7|NZ_AP014598.1_133136_135383_-	cd01678, PFL1, Pyruvate formate lyase 1	NA|411aa|up_6|NZ_AP014598.1_136408_137641_+	NA	NA|303aa|up_5|NZ_AP014598.1_137646_138555_+	pfam13175, AAA_15, AAA ATPase domain	NA|320aa|up_4|NZ_AP014598.1_138567_139527_+	NA	NA|139aa|up_3|NZ_AP014598.1_145756_146173_-	NA	NA|298aa|up_2|NZ_AP014598.1_146349_147243_+	NA	cas13b|1134aa|up_1|NZ_AP014598.1_147520_150922_+	cd20477, Cas13b_Pb-like, Class 2 type VI-B CRISPR-associated RNA-guided ribonuclease Cas13b from Prevotella buccae and similar Cas13b proteins	Csx28|182aa|up_0|NZ_AP014598.1_150941_151487_+	NA	NA|1007aa|down_0|NZ_AP014598.1_152870_155891_-	TIGR04183, hypothetical_protein, Por secretion system C-terminal sorting domain	NA|267aa|down_1|NZ_AP014598.1_156622_157423_+	pfam04383, KilA-N, KilA-N domain	NA|506aa|down_2|NZ_AP014598.1_157583_159101_-	pfam14134, DUF4301, Domain of unknown function (DUF4301)	NA|733aa|down_3|NZ_AP014598.1_159253_161452_+	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|206aa|down_4|NZ_AP014598.1_161574_162192_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|763aa|down_5|NZ_AP014598.1_162328_164617_+	PRK07232, PRK07232, bifunctional malic enzyme oxidoreductase/phosphotransacetylase; Reviewed	NA|678aa|down_6|NZ_AP014598.1_165224_167258_-	NA	NA|170aa|down_7|NZ_AP014598.1_168088_168598_-	NA	NA|652aa|down_8|NZ_AP014598.1_168691_170647_-	PRK02628, nadE, NAD synthetase; Reviewed	NA|397aa|down_9|NZ_AP014598.1_170662_171853_-	pfam09640, DUF2027, Domain of unknown function (DUF2027)
GCF_002355195.1_ASM235519v1	NZ_AP014598	Prevotella intermedia strain OMA14 chromosome II	2	218294-218421	1	CRISPRCasFinder	no		cas13b,Csx28,DEDDh,PD-DExK	Orphan	TAACTACCATGTAACTGGCATGTAACTGA	29	0	0	NA	NA	NA	1	1	Orphan	RT,PD-DExK,cas3,WYL,cas2,cas1,DEDDh,cas13b,Csx28	NA|445aa|up_9|NZ_AP014598.1_202291_203626_-,NA|216aa|up_2|NZ_AP014598.1_212515_213163_+,NA|408aa|down_6|NZ_AP014598.1_223628_224852_-,NA|156aa|down_9|NZ_AP014598.1_226242_226710_-	NA|445aa|up_9|NZ_AP014598.1_202291_203626_-	NA	NA|331aa|up_8|NZ_AP014598.1_204801_205794_-	pfam00656, Peptidase_C14, Caspase domain	NA|132aa|up_7|NZ_AP014598.1_205804_206200_-	pfam08937, DUF1863, MTH538 TIR-like domain (DUF1863)	NA|444aa|up_6|NZ_AP014598.1_206363_207695_-	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|211aa|up_5|NZ_AP014598.1_208530_209163_+	pfam00239, Resolvase, Resolvase, N terminal domain	NA|91aa|up_4|NZ_AP014598.1_209331_209604_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|954aa|up_3|NZ_AP014598.1_209642_212504_+	COG0610, COG0610, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|216aa|up_2|NZ_AP014598.1_212515_213163_+	NA	NA|519aa|up_1|NZ_AP014598.1_213167_214724_+	TIGR00497, hsdM, type I restriction system adenine methylase (hsdM)	NA|702aa|up_0|NZ_AP014598.1_214742_216848_+	pfam03235, DUF262, Protein of unknown function DUF262	NA|405aa|down_0|NZ_AP014598.1_218455_219670_+	cd17275, RMtype1_S_MjaORF132P-TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit TRD-CR, similar to MjaXIP/S	NA|200aa|down_1|NZ_AP014598.1_219662_220262_-	cd17255, RMtype1_S_Fco49512ORF2615P-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Flavobacterium columnare S subunit (S	NA|171aa|down_2|NZ_AP014598.1_220260_220773_+	cd17266, RMtype1_S_Sau1132ORF3780P-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Staphylococcus aureus subsp	NA|388aa|down_3|NZ_AP014598.1_220765_221929_-	cd17517, RMtype1_S_EcoKI_StySPI-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR),similar to Escherichia coli str	NA|309aa|down_4|NZ_AP014598.1_222047_222974_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|175aa|down_5|NZ_AP014598.1_223044_223569_-	TIGR00996, Mtu_fam_mce, virulence factor Mce family protein	NA|408aa|down_6|NZ_AP014598.1_223628_224852_-	NA	NA|124aa|down_7|NZ_AP014598.1_224866_225238_-	pfam05534, HicB, HicB family	NA|144aa|down_8|NZ_AP014598.1_225614_226046_-	PRK06835, PRK06835, DNA replication protein DnaC; Validated	NA|156aa|down_9|NZ_AP014598.1_226242_226710_-	NA
GCF_002355195.1_ASM235519v1	NZ_AP014598	Prevotella intermedia strain OMA14 chromosome II	3	653617-653706	2	CRISPRCasFinder	no		cas13b,Csx28,DEDDh,PD-DExK	Orphan	TGTTTTGTTTTAGGAATGTGATGA	24	0	0	NA	NA	NA	1	1	Orphan	RT,PD-DExK,cas3,WYL,cas2,cas1,DEDDh,cas13b,Csx28	NA|189aa|up_8|NZ_AP014598.1_643535_644102_-,NA|109aa|up_5|NZ_AP014598.1_646947_647274_-,NA|281aa|up_2|NZ_AP014598.1_649651_650494_-,NA|196aa|down_0|NZ_AP014598.1_653741_654329_-	NA|247aa|up_9|NZ_AP014598.1_642607_643348_-	cd00081, Hint, Hedgehog/Intein domain, found in Hedgehog proteins as well as proteins which contain inteins and undergo protein splicing (e	NA|189aa|up_8|NZ_AP014598.1_643535_644102_-	NA	NA|129aa|up_7|NZ_AP014598.1_644113_644500_-	pfam14040, DNase_NucA_NucB, Deoxyribonuclease NucA/NucB	NA|178aa|up_6|NZ_AP014598.1_646249_646783_-	pfam14410, GH-E, HNH/ENDO VII superfamily nuclease with conserved GHE residues	NA|109aa|up_5|NZ_AP014598.1_646947_647274_-	NA	NA|230aa|up_4|NZ_AP014598.1_647270_647960_-	pfam13646, HEAT_2, HEAT repeats	NA|563aa|up_3|NZ_AP014598.1_647973_649662_-	cd00081, Hint, Hedgehog/Intein domain, found in Hedgehog proteins as well as proteins which contain inteins and undergo protein splicing (e	NA|281aa|up_2|NZ_AP014598.1_649651_650494_-	NA	NA|619aa|up_1|NZ_AP014598.1_650511_652368_-	COG3501, VgrG, Uncharacterized protein conserved in bacteria [Function unknown]	NA|133aa|up_0|NZ_AP014598.1_652587_652986_-	pfam17642, TssD, Hemolysin coregulated protein Hcp (TssD)	NA|196aa|down_0|NZ_AP014598.1_653741_654329_-	NA	NA|480aa|down_1|NZ_AP014598.1_654511_655951_-	PRK05777, PRK05777, NADH-quinone oxidoreductase subunit NuoN	NA|504aa|down_2|NZ_AP014598.1_655966_657478_-	PRK05846, PRK05846, NADH:ubiquinone oxidoreductase subunit M; Reviewed	NA|675aa|down_3|NZ_AP014598.1_657482_659507_-	PRK06590, PRK06590, NADH:ubiquinone oxidoreductase subunit L; Reviewed	NA|103aa|down_4|NZ_AP014598.1_659516_659825_-	PRK05715, PRK05715, NADH-quinone oxidoreductase subunit NuoK	NA|180aa|down_5|NZ_AP014598.1_659821_660361_-	pfam00499, Oxidored_q3, NADH-ubiquinone/plastoquinone oxidoreductase chain 6	NA|178aa|down_6|NZ_AP014598.1_660377_660911_-	PRK05888, PRK05888, NADH-quinone oxidoreductase subunit NuoI	NA|365aa|down_7|NZ_AP014598.1_660951_662046_-	pfam00146, NADHdh, NADH dehydrogenase	NA|525aa|down_8|NZ_AP014598.1_662092_663667_-	COG0649, NuoD, NADH:ubiquinone oxidoreductase 49 kD subunit 7 [Energy production and conversion]	NA|291aa|down_9|NZ_AP014598.1_663685_664558_-	PRK14816, PRK14816, NADH-quinone oxidoreductase subunit B
