assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_003019785.1_ASM301978v1	NZ_CP028109	Fusobacterium nucleatum subsp. nucleatum ATCC 23726 chromosome, complete genome	1	3856-4167	1	CRT	no		cas3,WYL,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,PD-DExK	Orphan	ANAGTTCAGCTTTTGGAN	18	0	0	NA	NA	NA	7	7	Orphan	cas3,WYL,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,PD-DExK	NA,NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|288aa|up_2|NZ_CP028109.1_0_864_-	COG1792, MreC, Cell shape-determining protein [Cell envelope biogenesis, outer membrane]	NA|392aa|up_1|NZ_CP028109.1_1027_2203_-	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|300aa|up_0|NZ_CP028109.1_2626_3526_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|244aa|down_0|NZ_CP028109.1_5245_5977_-	COG1124, DppF, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|261aa|down_1|NZ_CP028109.1_5977_6760_-	COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|268aa|down_2|NZ_CP028109.1_6756_7560_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|310aa|down_3|NZ_CP028109.1_7552_8482_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|535aa|down_4|NZ_CP028109.1_8496_10101_-	cd08489, PBP2_NikA, The substrate-binding component of an ABC-type nickel import system contains the type 2 periplasmic binding fold	NA|154aa|down_5|NZ_CP028109.1_10507_10969_+	PRK00061, ribH, 6,7-dimethyl-8-ribityllumazine synthase; Provisional	NA|370aa|down_6|NZ_CP028109.1_10970_12080_+	TIGR00326, eubact_ribD, riboflavin biosynthesis protein RibD	NA|219aa|down_7|NZ_CP028109.1_12271_12928_+	PRK09289, PRK09289, riboflavin synthase	NA|400aa|down_8|NZ_CP028109.1_12937_14137_+	PRK09311, PRK09311, bifunctional 3,4-dihydroxy-2-butanone-4-phosphate synthase/GTP cyclohydrolase II	NA|547aa|down_9|NZ_CP028109.1_15580_17221_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_003019785.1_ASM301978v1	NZ_CP028109	Fusobacterium nucleatum subsp. nucleatum ATCC 23726 chromosome, complete genome	2	174776-174875	1	CRISPRCasFinder	no	cas3	cas3,WYL,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,PD-DExK	Unclear	GATACAGACTAATTATAGAACCTGTTACTG	30	1	1	174806-174845	NZ_CP028109.1_1057219-1057258	NA	1	1	Unclear	cas3,WYL,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,PD-DExK	NA,NA|118aa|down_1|NZ_CP028109.1_175328_175682_+	NA|545aa|up_9|NZ_CP028109.1_161266_162901_-	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	NA|510aa|up_8|NZ_CP028109.1_163042_164572_+	COG2461, COG2461, Uncharacterized conserved protein [Function unknown]	NA|268aa|up_7|NZ_CP028109.1_164826_165630_-	cd07331, M48C_Oma1_like, Peptidase M48C, integral membrane endopeptidase	NA|73aa|up_6|NZ_CP028109.1_165807_166026_-	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|95aa|up_5|NZ_CP028109.1_166070_166355_-	cd00473, bS6, Bacterial ribosomal protein S6	NA|568aa|up_4|NZ_CP028109.1_166453_168157_-	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	cas3|690aa|up_3|NZ_CP028109.1_168226_170296_-	COG1200, RecG, RecG-like helicase [DNA replication, recombination, and repair / Transcription]	NA|250aa|up_2|NZ_CP028109.1_170347_171097_-	PRK00110, PRK00110, YebC/PmpR family DNA-binding transcriptional regulator	NA|485aa|up_1|NZ_CP028109.1_171276_172731_-	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|469aa|up_0|NZ_CP028109.1_172748_174155_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|124aa|down_0|NZ_CP028109.1_174947_175319_+	COG3093, VapI, Plasmid maintenance system antidote protein [General function prediction only]	NA|118aa|down_1|NZ_CP028109.1_175328_175682_+	NA	NA|401aa|down_2|NZ_CP028109.1_175727_176930_-	COG1088, RfbB, dTDP-D-glucose 4,6-dehydratase [Cell envelope biogenesis, outer membrane]	NA|466aa|down_3|NZ_CP028109.1_177032_178430_-	cd14440, AlgX_N_like_3, Uncharacterized proteins similar to putative alginate O-acetyltransferase	NA|460aa|down_4|NZ_CP028109.1_178448_179828_-	COG1696, DltB, Predicted membrane protein involved in D-alanine export [Cell envelope biogenesis, outer membrane]	NA|366aa|down_5|NZ_CP028109.1_179976_181074_-	pfam01757, Acyl_transf_3, Acyltransferase family	NA|507aa|down_6|NZ_CP028109.1_181054_182575_-	cd13123, MATE_MurJ_like, MurJ/MviN, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|375aa|down_7|NZ_CP028109.1_182588_183713_-	TIGR03573, WbuX, N-acetyl sugar amidotransferase	NA|401aa|down_8|NZ_CP028109.1_183709_184912_-	TIGR03588, PseC, UDP-4-amino-4,6-dideoxy-N-acetyl-beta-L-altrosamine transaminase	NA|509aa|down_9|NZ_CP028109.1_184904_186431_-	TIGR03586, PseI, pseudaminic acid synthase
GCF_003019785.1_ASM301978v1	NZ_CP028109	Fusobacterium nucleatum subsp. nucleatum ATCC 23726 chromosome, complete genome	3	901342-901436	2	CRISPRCasFinder	no		cas3,WYL,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,PD-DExK	Orphan	TATGATGAAAGATGATAAAATGATGA	26	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,PD-DExK	NA|159aa|up_6|NZ_CP028109.1_894074_894551_-,NA|197aa|up_2|NZ_CP028109.1_897880_898471_-,NA	NA|100aa|up_9|NZ_CP028109.1_891235_891535_+	COG0851, MinE, Septum formation topological specificity factor [Cell division and chromosome partitioning]	NA|327aa|up_8|NZ_CP028109.1_891618_892599_-	PHA03095, PHA03095, ankyrin-like protein; Provisional	NA|485aa|up_7|NZ_CP028109.1_892615_894070_-	sd00045, ANK, ankyrin repeats	NA|159aa|up_6|NZ_CP028109.1_894074_894551_-	NA	NA|115aa|up_5|NZ_CP028109.1_894682_895027_-	COG3862, COG3862, Uncharacterized protein with conserved CXXC pairs [Function unknown]	NA|422aa|up_4|NZ_CP028109.1_895026_896292_-	pfam07992, Pyr_redox_2, Pyridine nucleotide-disulphide oxidoreductase	NA|477aa|up_3|NZ_CP028109.1_896302_897733_-	COG0579, COG0579, Predicted dehydrogenase [General function prediction only]	NA|197aa|up_2|NZ_CP028109.1_897880_898471_-	NA	NA|439aa|up_1|NZ_CP028109.1_898491_899808_-	COG3593, COG3593, Predicted ATP-dependent endonuclease of the OLD family [DNA replication, recombination, and repair]	NA|218aa|up_0|NZ_CP028109.1_900128_900782_+	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	NA|309aa|down_0|NZ_CP028109.1_901521_902448_+	PRK14018, PRK14018, bifunctional peptide-methionine (S)-S-oxide reductase MsrA/peptide-methionine (R)-S-oxide reductase MsrB	NA|262aa|down_1|NZ_CP028109.1_902478_903264_+	COG4753, COG4753, Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain [Signal transduction mechanisms]	NA|553aa|down_2|NZ_CP028109.1_903250_904909_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|478aa|down_3|NZ_CP028109.1_905027_906461_+	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|516aa|down_4|NZ_CP028109.1_906617_908165_+	cd08520, PBP2_NikA_DppA_OppA_like_21, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|319aa|down_5|NZ_CP028109.1_908161_909118_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|275aa|down_6|NZ_CP028109.1_909114_909939_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|556aa|down_7|NZ_CP028109.1_909935_911603_+	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|117aa|down_8|NZ_CP028109.1_911592_911943_+	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|267aa|down_9|NZ_CP028109.1_911939_912740_+	pfam13649, Methyltransf_25, Methyltransferase domain
GCF_003019785.1_ASM301978v1	NZ_CP028109	Fusobacterium nucleatum subsp. nucleatum ATCC 23726 chromosome, complete genome	4	1292060-1293631	1,3,2	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2	cas3,WYL,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,PD-DExK	Unclear	ATTTATGTATTTCTATATTAGAATTTAAAT,ATTTATGTATTTCTATATTAGAATTTAAAT,ATTTATGTATTTCTATATTAGAATTTAAAT	30,30,30	0	0	NA	NA	NA:NA:NA	23,23,23	23	Unclear	cas3,WYL,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,PD-DExK	NA,NA|69aa|down_0|NZ_CP028109.1_1294162_1294369_-	NA|393aa|up_9|NZ_CP028109.1_1279766_1280945_-	cd05672, M20_ACY1L2-like, M20 Peptidase aminoacylase 1-like protein 2-like, amidohydrolase subfamily	NA|253aa|up_8|NZ_CP028109.1_1281330_1282089_+	cd01411, SIR2H, SIR2H: Uncharacterized prokaryotic Sir2 homologs from several gram positive bacterial species and Fusobacteria; and are members of the SIR2 family of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation	cas6|251aa|up_7|NZ_CP028109.1_1283187_1283940_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas8b2|519aa|up_6|NZ_CP028109.1_1283929_1285486_+	pfam09657, Cas_Csx8, CRISPR-associated protein Csx8 (Cas_Csx8)	cas7|301aa|up_5|NZ_CP028109.1_1285498_1286401_+	TIGR01875, CRISPR-associated_protein_Cas7/Cst2/DevR, CRISPR-associated autoregulator DevR family	cas5|367aa|up_4|NZ_CP028109.1_1286413_1287514_+	TIGR02593, CRISPR-associated_protein_Cas5, CRISPR-associated protein Cas5, N-terminal domain	cas3|813aa|up_3|NZ_CP028109.1_1287602_1290041_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas4|165aa|up_2|NZ_CP028109.1_1290083_1290578_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|331aa|up_1|NZ_CP028109.1_1290589_1291582_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|107aa|up_0|NZ_CP028109.1_1291544_1291865_+	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	NA|69aa|down_0|NZ_CP028109.1_1294162_1294369_-	NA	NA|335aa|down_1|NZ_CP028109.1_1295235_1296240_+	PRK09653, eutD, phosphotransacetylase	NA|399aa|down_2|NZ_CP028109.1_1296290_1297487_+	PRK00180, PRK00180, acetate kinase A/propionate kinase 2; Reviewed	NA|1189aa|down_3|NZ_CP028109.1_1297578_1301145_+	TIGR02176, pyruvate_flavodoxin/ferrodoxin_oxidoreductase, pyruvate:ferredoxin (flavodoxin) oxidoreductase, homodimeric	NA|319aa|down_4|NZ_CP028109.1_1301315_1302272_+	TIGR01771, L-lactate_dehydrogenase, L-lactate dehydrogenase	NA|340aa|down_5|NZ_CP028109.1_1303550_1304570_-	PRK09478, mglC, galactose/methyl galactoside ABC transporter permease MglC	NA|501aa|down_6|NZ_CP028109.1_1304592_1306095_-	PRK10982, PRK10982, galactose/methyl galaxtoside transporter ATP-binding protein; Provisional	NA|342aa|down_7|NZ_CP028109.1_1306184_1307210_-	PRK15395, PRK15395, galactose/glucose ABC transporter substrate-binding protein MglB	NA|316aa|down_8|NZ_CP028109.1_1307406_1308354_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|308aa|down_9|NZ_CP028109.1_1308377_1309301_-	COG0492, TrxB, Thioredoxin reductase [Posttranslational modification, protein turnover, chaperones]
GCF_003019785.1_ASM301978v1	NZ_CP028109	Fusobacterium nucleatum subsp. nucleatum ATCC 23726 chromosome, complete genome	5	1772664-1772911	4,2	CRISPRCasFinder,PILER-CR	no	DinG	cas3,WYL,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,PD-DExK	Type IV-A	TCCATCTAGTTCACCATTTTTATAATT,TCCATCTAGTTCACCATTTTTATAATTTTCTT	27,32	0	0	NA	NA	NA:NA	3,2	3	Orphan	cas3,WYL,csa3,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,PD-DExK	NA,NA|178aa|down_6|NZ_CP028109.1_1780840_1781374_-	NA|399aa|up_9|NZ_CP028109.1_1759770_1760967_-	COG1168, MalY, Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities [Amino acid transport and metabolism]	NA|496aa|up_8|NZ_CP028109.1_1761010_1762498_-	pfam01235, Na_Ala_symp, Sodium:alanine symporter family	NA|183aa|up_7|NZ_CP028109.1_1762665_1763214_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|163aa|up_6|NZ_CP028109.1_1763422_1763911_-	pfam02130, UPF0054, Uncharacterized protein family UPF0054	NA|691aa|up_5|NZ_CP028109.1_1763927_1766000_-	COG1480, COG1480, Predicted membrane-associated HD superfamily hydrolase [General function prediction only]	DinG|821aa|up_4|NZ_CP028109.1_1766017_1768480_-	COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair]	NA|157aa|up_3|NZ_CP028109.1_1768501_1768972_-	COG4807, COG4807, Uncharacterized protein conserved in bacteria [Function unknown]	NA|322aa|up_2|NZ_CP028109.1_1769187_1770153_+	COG3643, COG3643, Glutamate formiminotransferase [Amino acid transport and metabolism]	NA|414aa|up_1|NZ_CP028109.1_1770229_1771471_+	PRK09356, PRK09356, imidazolonepropionase; Validated	NA|213aa|up_0|NZ_CP028109.1_1771488_1772127_+	COG3404, COG3404, Methenyl tetrahydrofolate cyclohydrolase [Amino acid transport and metabolism]	NA|110aa|down_0|NZ_CP028109.1_1773399_1773729_+	pfam08921, DUF1904, Domain of unknown function (DUF1904)	NA|252aa|down_1|NZ_CP028109.1_1773730_1774486_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|603aa|down_2|NZ_CP028109.1_1774574_1776383_-	COG5295, Hia, Autotransporter adhesin [Intracellular trafficking and secretion / Extracellular structures]	NA|569aa|down_3|NZ_CP028109.1_1776578_1778285_-	TIGR03904, putative_radical_SAM_protein_YgiQ, uncharacterized radical SAM protein YgiQ	NA|413aa|down_4|NZ_CP028109.1_1778291_1779530_-	PRK05469, PRK05469, tripeptide aminopeptidase PepT	NA|397aa|down_5|NZ_CP028109.1_1779633_1780824_-	pfam05636, HIGH_NTase1, HIGH Nucleotidyl Transferase	NA|178aa|down_6|NZ_CP028109.1_1780840_1781374_-	NA	NA|205aa|down_7|NZ_CP028109.1_1781402_1782017_-	COG0588, GpmA, Phosphoglycerate mutase 1 [Carbohydrate transport and metabolism]	NA|229aa|down_8|NZ_CP028109.1_1782053_1782740_-	PRK14115, gpmA, 2,3-diphosphoglycerate-dependent phosphoglycerate mutase	NA|240aa|down_9|NZ_CP028109.1_1782870_1783590_+	pfam02661, Fic, Fic/DOC family
