assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_902381645.1_UHGG_MGYG-HGUT-01347	NZ_LR698955	Fusobacterium nucleatum isolate MGYG-HGUT-01347 chromosome 1	1	944985-946274	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas9,cas1,cas2,csn2,DinG	RT,csa3,cas4,cas9,cas1,cas2,csn2,DinG,WYL,DEDDh,cas3	Type II-A,Type II-B,Type II-C	GTTTGAGAGTAATGTTATTTTAAATAGATTCAAAAC,GTTTGAGAGTAATGTTATTTTAAATAGATTCAAAAC,GTTTGAGAGTAATGTTATTTTAAATAGATTCAAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	18,19,19	19	TypeII-A,TypeII-B,TypeII-C	RT,csa3,cas4,cas9,cas1,cas2,csn2,DinG,WYL,DEDDh,cas3	NA|105aa|up_9|NZ_LR698955.1_930893_931208_+,NA|240aa|up_6|NZ_LR698955.1_932866_933586_-,NA|609aa|up_4|NZ_LR698955.1_936795_938622_+,NA|130aa|down_0|NZ_LR698955.1_946324_946714_-,NA|178aa|down_6|NZ_LR698955.1_950598_951132_+	NA|105aa|up_9|NZ_LR698955.1_930893_931208_+	NA	NA|351aa|up_8|NZ_LR698955.1_931207_932260_+	COG4394, EarP, Elongation-Factor P (EF-P) rhamnosyltransferase EarP [Translation, ribosomal structure and biogenesis]	NA|188aa|up_7|NZ_LR698955.1_932259_932823_+	PRK00529, PRK00529, elongation factor P; Validated	NA|240aa|up_6|NZ_LR698955.1_932866_933586_-	NA	NA|1019aa|up_5|NZ_LR698955.1_933735_936792_+	pfam13676, TIR_2, TIR domain	NA|609aa|up_4|NZ_LR698955.1_936795_938622_+	NA	cas9|1368aa|up_3|NZ_LR698955.1_938928_943032_+	pfam16592, Cas9_REC, REC lobe of CRISPR-associated endonuclease Cas9	cas1|293aa|up_2|NZ_LR698955.1_943056_943935_+	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas2|107aa|up_1|NZ_LR698955.1_943924_944245_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|221aa|up_0|NZ_LR698955.1_944241_944904_+	cd09644, Csn2, CRISPR/Cas system-associated protein Csn2	NA|130aa|down_0|NZ_LR698955.1_946324_946714_-	NA	NA|168aa|down_1|NZ_LR698955.1_946894_947398_-	TIGR01752, Flavodoxin_1	NA|235aa|down_2|NZ_LR698955.1_947411_948116_-	COG1179, COG1179, Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 1 [Coenzyme metabolism]	NA|252aa|down_3|NZ_LR698955.1_948276_949032_-	pfam14025, DUF4241, Protein of unknown function (DUF4241)	NA|240aa|down_4|NZ_LR698955.1_949049_949769_-	COG3177, COG3177, Fic family protein [Function unknown]	NA|229aa|down_5|NZ_LR698955.1_949896_950583_+	PRK14115, gpmA, 2,3-diphosphoglycerate-dependent phosphoglycerate mutase	NA|178aa|down_6|NZ_LR698955.1_950598_951132_+	NA	NA|397aa|down_7|NZ_LR698955.1_951149_952340_+	pfam05636, HIGH_NTase1, HIGH Nucleotidyl Transferase	NA|413aa|down_8|NZ_LR698955.1_952444_953683_+	PRK05469, PRK05469, tripeptide aminopeptidase PepT	NA|567aa|down_9|NZ_LR698955.1_953689_955390_+	TIGR03904, putative_radical_SAM_protein_YgiQ, uncharacterized radical SAM protein YgiQ
GCF_902381645.1_UHGG_MGYG-HGUT-01347	NZ_LR698955	Fusobacterium nucleatum isolate MGYG-HGUT-01347 chromosome 1	2	1488023-1488102	2	CRISPRCasFinder	no		RT,csa3,cas4,cas9,cas1,cas2,csn2,DinG,WYL,DEDDh,cas3	Orphan	TTATCTTCTCCCATTGATAATTA	23	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,cas4,cas9,cas1,cas2,csn2,DinG,WYL,DEDDh,cas3	NA|155aa|up_1|NZ_LR698955.1_1486728_1487193_+,NA	NA|430aa|up_9|NZ_LR698955.1_1477750_1479040_-	COG1593, DctQ, TRAP-type C4-dicarboxylate transport system, large permease component [Carbohydrate transport and metabolism]	NA|157aa|up_8|NZ_LR698955.1_1479060_1479531_-	COG3090, DctM, TRAP-type C4-dicarboxylate transport system, small permease component [Carbohydrate transport and metabolism]	NA|348aa|up_7|NZ_LR698955.1_1479630_1480674_-	cd13669, PBP2_TRAP_TM0322_like, Periplasmic component of TRAP-type C4-dicarboxylate transport system TM0322 from Thermotoga maritima and similar proteins; the type 2 periplasmic binding protein fold	NA|198aa|up_6|NZ_LR698955.1_1480965_1481559_+	cd10033, UDG_like, uncharacterized family of the uracil-DNA glycosylase superfamily	NA|435aa|up_5|NZ_LR698955.1_1481695_1483000_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|225aa|up_4|NZ_LR698955.1_1483002_1483677_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|520aa|up_3|NZ_LR698955.1_1483924_1485484_+	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|274aa|up_2|NZ_LR698955.1_1485743_1486565_-	COG4822, CbiK, Cobalamin biosynthesis protein CbiK, Co2+ chelatase [Coenzyme metabolism]	NA|155aa|up_1|NZ_LR698955.1_1486728_1487193_+	NA	NA|213aa|up_0|NZ_LR698955.1_1487361_1488000_+	COG2885, OmpA, Outer membrane protein and related peptidoglycan-associated (lipo)proteins [Cell envelope biogenesis, outer membrane]	NA|295aa|down_0|NZ_LR698955.1_1488175_1489060_-	COG1210, GalU, UDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane]	NA|207aa|down_1|NZ_LR698955.1_1489072_1489693_-	sd00006, TPR, Tetratricopeptide repeat	NA|637aa|down_2|NZ_LR698955.1_1489721_1491632_-	PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed	NA|208aa|down_3|NZ_LR698955.1_1491854_1492478_-	COG2121, COG2121, Uncharacterized protein conserved in bacteria [Function unknown]	NA|117aa|down_4|NZ_LR698955.1_1492568_1492919_-	PRK00153, PRK00153, YbaB/EbfC family nucleoid-associated protein	NA|579aa|down_5|NZ_LR698955.1_1492987_1494724_-	TIGR00705, Protease_4, signal peptide peptidase SppA, 67K type	NA|212aa|down_6|NZ_LR698955.1_1494940_1495576_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|424aa|down_7|NZ_LR698955.1_1495597_1496869_+	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|370aa|down_8|NZ_LR698955.1_1496886_1497996_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|1023aa|down_9|NZ_LR698955.1_1497995_1501064_+	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]
GCF_902381645.1_UHGG_MGYG-HGUT-01347	NZ_LR698955	Fusobacterium nucleatum isolate MGYG-HGUT-01347 chromosome 1	3	1816326-1816406	3	CRISPRCasFinder	no		RT,csa3,cas4,cas9,cas1,cas2,csn2,DinG,WYL,DEDDh,cas3	Orphan	AGAAGATATGTTCATATAGCAGTGAACA	28	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,cas4,cas9,cas1,cas2,csn2,DinG,WYL,DEDDh,cas3	NA,NA|128aa|down_4|NZ_LR698955.1_1819496_1819880_-,NA|66aa|down_5|NZ_LR698955.1_1819894_1820092_-,NA|241aa|down_7|NZ_LR698955.1_1820774_1821497_-	NA|325aa|up_9|NZ_LR698955.1_1800668_1801643_-	cd06581, TM_PBP1_LivM_like, Transmembrane subunit (TM) of Escherichia coli LivM and related proteins	NA|296aa|up_8|NZ_LR698955.1_1801643_1802531_-	COG0559, LivH, Branched-chain amino acid ABC-type transport system, permease components [Amino acid transport and metabolism]	NA|385aa|up_7|NZ_LR698955.1_1802544_1803699_-	cd06347, PBP1_ABC_LivK_ligand_binding-like, type 1 periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|261aa|up_6|NZ_LR698955.1_1803908_1804691_-	cd05346, SDR_c5, classical (c) SDR, subgroup 5	NA|813aa|up_5|NZ_LR698955.1_1804710_1807149_-	sd00006, TPR, Tetratricopeptide repeat	NA|580aa|up_4|NZ_LR698955.1_1807513_1809253_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|600aa|up_3|NZ_LR698955.1_1809264_1811064_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|86aa|up_2|NZ_LR698955.1_1811131_1811389_-	COG0227, RpmB, Ribosomal protein L28 [Translation, ribosomal structure and biogenesis]	NA|153aa|up_1|NZ_LR698955.1_1813260_1813719_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|624aa|up_0|NZ_LR698955.1_1814408_1816280_+	PRK09765, PRK09765, PTS system 2-O-a-mannosyl-D-glycerate specific transporter subunit IIABC; Provisional	NA|145aa|down_0|NZ_LR698955.1_1816492_1816927_+	cd17493, toxin_TenpN, type III toxin-antitoxin system toxin TenpN and similar proteins	NA|513aa|down_1|NZ_LR698955.1_1816990_1818529_+	PRK00074, guaA, GMP synthase; Reviewed	NA|129aa|down_2|NZ_LR698955.1_1818662_1819049_-	pfam05105, Phage_holin_4_1, Bacteriophage holin family	NA|150aa|down_3|NZ_LR698955.1_1819060_1819510_-	pfam07087, DUF1353, Protein of unknown function (DUF1353)	NA|128aa|down_4|NZ_LR698955.1_1819496_1819880_-	NA	NA|66aa|down_5|NZ_LR698955.1_1819894_1820092_-	NA	NA|170aa|down_6|NZ_LR698955.1_1820151_1820661_-	cd14845, L-Ala-D-Glu_peptidase_like, L-Ala-D-Glu peptidase, also known as L-alanyl-D-glutamate endopeptidase	NA|241aa|down_7|NZ_LR698955.1_1820774_1821497_-	NA	NA|217aa|down_8|NZ_LR698955.1_1822257_1822908_-	pfam10076, DUF2313, Uncharacterized protein conserved in bacteria (DUF2313)	NA|355aa|down_9|NZ_LR698955.1_1822900_1823965_-	pfam04865, Baseplate_J, Baseplate J-like protein
