assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002211605.1_ASM221160v1	NZ_CP022122	Fusobacterium nucleatum subsp. nucleatum strain ChDC F317 chromosome, complete genome	1	359420-359741	1	CRISPRCasFinder	no		cas3,WYL,PD-DExK,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,csa3	Orphan	TTATTATCAAAATGGAAAATTAAAAGTAGA	30	0	0	NA	NA	NA	4	4	Orphan	cas3,WYL,PD-DExK,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,csa3	NA|91aa|up_9|NZ_CP022122.1_346872_347145_+,NA|161aa|down_1|NZ_CP022122.1_360500_360983_+,NA|162aa|down_2|NZ_CP022122.1_361151_361637_+,NA|172aa|down_3|NZ_CP022122.1_361690_362206_+	NA|91aa|up_9|NZ_CP022122.1_346872_347145_+	NA	NA|636aa|up_8|NZ_CP022122.1_347157_349065_+	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|812aa|up_7|NZ_CP022122.1_349210_351646_+	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|154aa|up_6|NZ_CP022122.1_351658_352120_+	COG0622, COG0622, Predicted phosphoesterase [General function prediction only]	NA|339aa|up_5|NZ_CP022122.1_352146_353163_+	PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha; Validated	NA|799aa|up_4|NZ_CP022122.1_353176_355573_+	PRK00629, pheT, phenylalanyl-tRNA synthetase subunit beta; Reviewed	NA|244aa|up_3|NZ_CP022122.1_355603_356335_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|190aa|up_2|NZ_CP022122.1_356440_357010_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|339aa|up_1|NZ_CP022122.1_357318_358335_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|246aa|up_0|NZ_CP022122.1_358346_359084_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|169aa|down_0|NZ_CP022122.1_359866_360373_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|161aa|down_1|NZ_CP022122.1_360500_360983_+	NA	NA|162aa|down_2|NZ_CP022122.1_361151_361637_+	NA	NA|172aa|down_3|NZ_CP022122.1_361690_362206_+	NA	NA|220aa|down_4|NZ_CP022122.1_362281_362941_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|169aa|down_5|NZ_CP022122.1_362960_363467_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|330aa|down_6|NZ_CP022122.1_363487_364477_-	COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis, outer membrane]	NA|510aa|down_7|NZ_CP022122.1_364476_366006_-	COG4468, GalT, Galactose-1-phosphate uridyltransferase [Carbohydrate transport and metabolism]	NA|390aa|down_8|NZ_CP022122.1_366005_367175_-	PRK05322, PRK05322, galactokinase; Provisional	NA|506aa|down_9|NZ_CP022122.1_367310_368828_-	COG1288, COG1288, Predicted membrane protein [Function unknown]
GCF_002211605.1_ASM221160v1	NZ_CP022122	Fusobacterium nucleatum subsp. nucleatum strain ChDC F317 chromosome, complete genome	2	360022-360123	2	CRISPRCasFinder	no		cas3,WYL,PD-DExK,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,csa3	Orphan	TTATTATCAAAATGGAAAATTAAAAGTAGA	30	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,PD-DExK,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,csa3	NA,NA|161aa|down_0|NZ_CP022122.1_360500_360983_+,NA|162aa|down_1|NZ_CP022122.1_361151_361637_+,NA|172aa|down_2|NZ_CP022122.1_361690_362206_+	NA|636aa|up_9|NZ_CP022122.1_347157_349065_+	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|812aa|up_8|NZ_CP022122.1_349210_351646_+	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|154aa|up_7|NZ_CP022122.1_351658_352120_+	COG0622, COG0622, Predicted phosphoesterase [General function prediction only]	NA|339aa|up_6|NZ_CP022122.1_352146_353163_+	PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha; Validated	NA|799aa|up_5|NZ_CP022122.1_353176_355573_+	PRK00629, pheT, phenylalanyl-tRNA synthetase subunit beta; Reviewed	NA|244aa|up_4|NZ_CP022122.1_355603_356335_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|190aa|up_3|NZ_CP022122.1_356440_357010_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|339aa|up_2|NZ_CP022122.1_357318_358335_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|246aa|up_1|NZ_CP022122.1_358346_359084_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|245aa|up_0|NZ_CP022122.1_359111_359846_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|161aa|down_0|NZ_CP022122.1_360500_360983_+	NA	NA|162aa|down_1|NZ_CP022122.1_361151_361637_+	NA	NA|172aa|down_2|NZ_CP022122.1_361690_362206_+	NA	NA|220aa|down_3|NZ_CP022122.1_362281_362941_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|169aa|down_4|NZ_CP022122.1_362960_363467_+	COG2849, COG2849, Uncharacterized protein conserved in bacteria [Function unknown]	NA|330aa|down_5|NZ_CP022122.1_363487_364477_-	COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis, outer membrane]	NA|510aa|down_6|NZ_CP022122.1_364476_366006_-	COG4468, GalT, Galactose-1-phosphate uridyltransferase [Carbohydrate transport and metabolism]	NA|390aa|down_7|NZ_CP022122.1_366005_367175_-	PRK05322, PRK05322, galactokinase; Provisional	NA|506aa|down_8|NZ_CP022122.1_367310_368828_-	COG1288, COG1288, Predicted membrane protein [Function unknown]	NA|495aa|down_9|NZ_CP022122.1_369139_370624_-	COG3333, COG3333, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_002211605.1_ASM221160v1	NZ_CP022122	Fusobacterium nucleatum subsp. nucleatum strain ChDC F317 chromosome, complete genome	3	893827-894854	3,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2	cas3,WYL,PD-DExK,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,csa3	Unclear	ATTTATGTATTTCTATATTAGAATTTAAAT,ATTTATGTATTTCTATATTAGAATTTAAAT,ATTTATGTATTTCTATATTAGAATTTAAAT	30,30,30	0	0	NA	NA	NA:NA:NA	15,15,11	15	Unclear	cas3,WYL,PD-DExK,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,csa3	NA,NA|69aa|down_0|NZ_CP022122.1_895385_895592_-	NA|253aa|up_9|NZ_CP022122.1_882826_883585_+	cd01411, SIR2H, SIR2H: Uncharacterized prokaryotic Sir2 homologs from several gram positive bacterial species and Fusobacteria; and are members of the SIR2 family of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation	NA|300aa|up_8|NZ_CP022122.1_884038_884938_+	COG4823, AbiF, Abortive infection bacteriophage resistance protein [Defense mechanisms]	cas6|251aa|up_7|NZ_CP022122.1_884966_885719_+	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	cas8b2|515aa|up_6|NZ_CP022122.1_885708_887253_+	pfam09657, Cas_Csx8, CRISPR-associated protein Csx8 (Cas_Csx8)	cas7|301aa|up_5|NZ_CP022122.1_887265_888168_+	TIGR01875, CRISPR-associated_protein_Cas7/Cst2/DevR, CRISPR-associated autoregulator DevR family	cas5|367aa|up_4|NZ_CP022122.1_888180_889281_+	TIGR02593, CRISPR-associated_protein_Cas5, CRISPR-associated protein Cas5, N-terminal domain	cas3|813aa|up_3|NZ_CP022122.1_889369_891808_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas4|165aa|up_2|NZ_CP022122.1_891850_892345_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|331aa|up_1|NZ_CP022122.1_892356_893349_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|107aa|up_0|NZ_CP022122.1_893311_893632_+	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	NA|69aa|down_0|NZ_CP022122.1_895385_895592_-	NA	NA|335aa|down_1|NZ_CP022122.1_896458_897463_+	PRK09653, eutD, phosphotransacetylase	NA|399aa|down_2|NZ_CP022122.1_897513_898710_+	PRK00180, PRK00180, acetate kinase A/propionate kinase 2; Reviewed	NA|1189aa|down_3|NZ_CP022122.1_898801_902368_+	TIGR02176, pyruvate_flavodoxin/ferrodoxin_oxidoreductase, pyruvate:ferredoxin (flavodoxin) oxidoreductase, homodimeric	NA|319aa|down_4|NZ_CP022122.1_902538_903495_+	TIGR01771, L-lactate_dehydrogenase, L-lactate dehydrogenase	NA|303aa|down_5|NZ_CP022122.1_903515_904424_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|340aa|down_6|NZ_CP022122.1_904467_905487_-	PRK09478, mglC, galactose/methyl galactoside ABC transporter permease MglC	NA|501aa|down_7|NZ_CP022122.1_905509_907012_-	PRK10982, PRK10982, galactose/methyl galaxtoside transporter ATP-binding protein; Provisional	NA|342aa|down_8|NZ_CP022122.1_907101_908127_-	PRK15395, PRK15395, galactose/glucose ABC transporter substrate-binding protein MglB	NA|316aa|down_9|NZ_CP022122.1_908323_909271_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]
GCF_002211605.1_ASM221160v1	NZ_CP022122	Fusobacterium nucleatum subsp. nucleatum strain ChDC F317 chromosome, complete genome	4	1350077-1350324	4,2	CRISPRCasFinder,PILER-CR	no	DinG	cas3,WYL,PD-DExK,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,csa3	Type IV-A	TCCATCTAGTTCACCATTTTTATAATT,TCCATCTAGTTCACCATTTTTATAATTTTCTT	27,32	0	0	NA	NA	NA:NA	3,2	3	Orphan	cas3,WYL,PD-DExK,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,csa3	NA,NA|178aa|down_6|NZ_CP022122.1_1358254_1358788_-	NA|415aa|up_9|NZ_CP022122.1_1335847_1337092_+	pfam13645, YkuD_2, L,D-transpeptidase catalytic domain	NA|450aa|up_8|NZ_CP022122.1_1337203_1338553_+	pfam07613, DUF1576, Protein of unknown function (DUF1576)	NA|178aa|up_7|NZ_CP022122.1_1338549_1339083_+	cd03424, ADPRase_NUDT5, ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate	NA|163aa|up_6|NZ_CP022122.1_1340836_1341325_-	pfam02130, UPF0054, Uncharacterized protein family UPF0054	NA|691aa|up_5|NZ_CP022122.1_1341341_1343414_-	COG1480, COG1480, Predicted membrane-associated HD superfamily hydrolase [General function prediction only]	DinG|821aa|up_4|NZ_CP022122.1_1343431_1345894_-	COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair]	NA|157aa|up_3|NZ_CP022122.1_1345914_1346385_-	COG4807, COG4807, Uncharacterized protein conserved in bacteria [Function unknown]	NA|322aa|up_2|NZ_CP022122.1_1346600_1347566_+	COG3643, COG3643, Glutamate formiminotransferase [Amino acid transport and metabolism]	NA|414aa|up_1|NZ_CP022122.1_1347642_1348884_+	PRK09356, PRK09356, imidazolonepropionase; Validated	NA|213aa|up_0|NZ_CP022122.1_1348901_1349540_+	COG3404, COG3404, Methenyl tetrahydrofolate cyclohydrolase [Amino acid transport and metabolism]	NA|110aa|down_0|NZ_CP022122.1_1350812_1351142_+	pfam08921, DUF1904, Domain of unknown function (DUF1904)	NA|252aa|down_1|NZ_CP022122.1_1351143_1351899_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|603aa|down_2|NZ_CP022122.1_1351987_1353796_-	COG5295, Hia, Autotransporter adhesin [Intracellular trafficking and secretion / Extracellular structures]	NA|569aa|down_3|NZ_CP022122.1_1353991_1355698_-	TIGR03904, putative_radical_SAM_protein_YgiQ, uncharacterized radical SAM protein YgiQ	NA|413aa|down_4|NZ_CP022122.1_1355704_1356943_-	PRK05469, PRK05469, tripeptide aminopeptidase PepT	NA|397aa|down_5|NZ_CP022122.1_1357047_1358238_-	pfam05636, HIGH_NTase1, HIGH Nucleotidyl Transferase	NA|178aa|down_6|NZ_CP022122.1_1358254_1358788_-	NA	NA|206aa|down_7|NZ_CP022122.1_1358841_1359459_-	COG0588, GpmA, Phosphoglycerate mutase 1 [Carbohydrate transport and metabolism]	NA|229aa|down_8|NZ_CP022122.1_1359470_1360157_-	PRK14115, gpmA, 2,3-diphosphoglycerate-dependent phosphoglycerate mutase	NA|240aa|down_9|NZ_CP022122.1_1360287_1361007_+	pfam02661, Fic, Fic/DOC family
GCF_002211605.1_ASM221160v1	NZ_CP022122	Fusobacterium nucleatum subsp. nucleatum strain ChDC F317 chromosome, complete genome	5	2138597-2138788	3	PILER-CR	no		cas3,WYL,PD-DExK,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,csa3	Orphan	AAGACTGGCGCTCTACCAACTGAGCTA	27	0	0	NA	NA	NA	2	2	Orphan	cas3,WYL,PD-DExK,DEDDh,cas6,cas8b2,cas7,cas5,cas4,cas1,cas2,DinG,csa3	NA|125aa|up_1|NZ_CP022122.1_2137497_2137872_-,NA|83aa|up_0|NZ_CP022122.1_2137919_2138168_-,NA|71aa|down_2|NZ_CP022122.1_2142511_2142724_-	NA|859aa|up_9|NZ_CP022122.1_2118678_2121255_+	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB	NA|230aa|up_8|NZ_CP022122.1_2121331_2122021_-	COG2964, COG2964, Uncharacterized protein conserved in bacteria [Function unknown]	NA|546aa|up_7|NZ_CP022122.1_2122232_2123870_+	COG3033, TnaA, Tryptophanase [Amino acid transport and metabolism]	NA|445aa|up_6|NZ_CP022122.1_2123993_2125328_+	COG0733, COG0733, Na+-dependent transporters of the SNF family [General function prediction only]	NA|463aa|up_5|NZ_CP022122.1_2130868_2132257_-	cd01087, Prolidase, Prolidase	NA|1023aa|up_4|NZ_CP022122.1_2132397_2135466_-	COG4625, COG4625, Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain [Function unknown]	NA|176aa|up_3|NZ_CP022122.1_2135725_2136253_-	cd02908, Macro_OAADPr_deacetylase, macrodomain, O-acetyl-ADP-ribose (OAADPr) family	NA|72aa|up_2|NZ_CP022122.1_2137117_2137333_-	COG0584, UgpQ, Glycerophosphoryl diester phosphodiesterase [Energy production and conversion]	NA|125aa|up_1|NZ_CP022122.1_2137497_2137872_-	NA	NA|83aa|up_0|NZ_CP022122.1_2137919_2138168_-	NA	NA|803aa|down_0|NZ_CP022122.1_2138986_2141395_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|346aa|down_1|NZ_CP022122.1_2141406_2142444_-	sd00006, TPR, Tetratricopeptide repeat	NA|71aa|down_2|NZ_CP022122.1_2142511_2142724_-	NA	NA|317aa|down_3|NZ_CP022122.1_2142799_2143750_-	pfam01261, AP_endonuc_2, Xylose isomerase-like TIM barrel	NA|258aa|down_4|NZ_CP022122.1_2143762_2144536_-	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|342aa|down_5|NZ_CP022122.1_2144532_2145558_-	pfam01032, FecCD, FecCD transport family	NA|290aa|down_6|NZ_CP022122.1_2145560_2146430_-	COG0614, FepB, ABC-type Fe3+-hydroxamate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|658aa|down_7|NZ_CP022122.1_2146479_2148453_-	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|162aa|down_8|NZ_CP022122.1_2148752_2149238_-	cd04684, Nudix_Hydrolase_25, Contains a crystal structure of the Nudix hydrolase from Enterococcus faecalis, which has an unknown function	NA|126aa|down_9|NZ_CP022122.1_2149322_2149700_-	TIGR00004, RutC_family_protein, reactive intermediate/imine deaminase
