assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001547855.1_ASM154785v1	NZ_AP013045	Tannerella forsythia KS16	1	1773501-1773611	1	CRISPRCasFinder	no		csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,cas10,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,RT,DinG,cas9	Orphan	GTCAAACCCCGCGAAGGAGAAACGATAGAAAACGT	35	0	0	NA	NA	NA	1	1	Orphan	csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,cas10,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,RT,DinG,cas9	NA|334aa|up_6|NZ_AP013045.1_1765920_1766922_+,NA|67aa|up_2|NZ_AP013045.1_1769381_1769582_-,NA|288aa|up_0|NZ_AP013045.1_1772301_1773165_+,NA	NA|218aa|up_9|NZ_AP013045.1_1758702_1759356_+	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|943aa|up_8|NZ_AP013045.1_1759465_1762294_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|332aa|up_7|NZ_AP013045.1_1764501_1765497_+	pfam03385, STELLO, STELLO glycosyltransferases	NA|334aa|up_6|NZ_AP013045.1_1765920_1766922_+	NA	NA|403aa|up_5|NZ_AP013045.1_1767093_1768302_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|118aa|up_4|NZ_AP013045.1_1768543_1768897_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|153aa|up_3|NZ_AP013045.1_1768914_1769373_-	TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin	NA|67aa|up_2|NZ_AP013045.1_1769381_1769582_-	NA	NA|815aa|up_1|NZ_AP013045.1_1769597_1772042_-	TIGR03385, Coenzyme_A_disulfide_reductase, CoA-disulfide reductase	NA|288aa|up_0|NZ_AP013045.1_1772301_1773165_+	NA	NA|294aa|down_0|NZ_AP013045.1_1773702_1774584_+	cd07498, Peptidases_S8_15, Peptidase S8 family domain, uncharacterized subfamily 15	NA|269aa|down_1|NZ_AP013045.1_1774528_1775335_-	cd07498, Peptidases_S8_15, Peptidase S8 family domain, uncharacterized subfamily 15	NA|179aa|down_2|NZ_AP013045.1_1776050_1776587_+	PRK13949, PRK13949, shikimate kinase; Provisional	NA|620aa|down_3|NZ_AP013045.1_1776646_1778506_+	TIGR03710, OAFO_sf, 2-oxoacid:acceptor oxidoreductase, alpha subunit	NA|339aa|down_4|NZ_AP013045.1_1778507_1779524_+	PRK11867, PRK11867, 2-oxoglutarate ferredoxin oxidoreductase subunit beta; Reviewed	NA|293aa|down_5|NZ_AP013045.1_1779836_1780715_+	COG2177, FtsX, Cell division protein [Cell division and chromosome partitioning]	NA|85aa|down_6|NZ_AP013045.1_1780739_1780994_+	pfam11297, DUF3098, Protein of unknown function (DUF3098)	NA|271aa|down_7|NZ_AP013045.1_1780990_1781803_+	pfam02673, BacA, Bacitracin resistance protein BacA	NA|240aa|down_8|NZ_AP013045.1_1781814_1782534_+	cd02573, PseudoU_synth_EcTruB, Pseudouridine synthase, Escherichia coli TruB like	NA|350aa|down_9|NZ_AP013045.1_1782548_1783598_+	PRK00147, queA, S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional
GCF_001547855.1_ASM154785v1	NZ_AP013045	Tannerella forsythia KS16	2	2414489-2416791	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas5,cas8b4,cas7b,cas4,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2	csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,cas10,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,RT,DinG,cas9	Type III-B,Type III-A,Type III-D,Type III-C	CTTTTAATCGGACTATCATAGAATTGAAAC,CTTTTAATCGGACTATCATAGAATTGAAAC,CTTTTAATCGGACTATCATAGAATTGAAAC	30,30,30	1	1	2416324-2416358	NZ_AP013045.1_2265998-2266032	I-A,II-B,III-A:I-A,II-B,III-A:I-A,II-B,III-A	34,34,34	34	TypeIII-B,TypeIII-A,TypeIII-D,TypeIII-C	csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,cas10,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,RT,DinG,cas9	csx20|122aa|up_2|NZ_AP013045.1_2412606_2412972_+,NA|61aa|down_1|NZ_AP013045.1_2418439_2418622_+	cas4|177aa|up_9|NZ_AP013045.1_2406058_2406589_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas10|481aa|up_8|NZ_AP013045.1_2406635_2408078_+	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csm2gr11|157aa|up_7|NZ_AP013045.1_2408083_2408554_+	pfam03750, Csm2_III-A, Csm2 Type III-A	csm3gr7|213aa|up_6|NZ_AP013045.1_2408563_2409202_+	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm4gr5|332aa|up_5|NZ_AP013045.1_2409201_2410197_+	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm5gr7|383aa|up_4|NZ_AP013045.1_2410209_2411358_+	TIGR01899, cas_TM1807_csm5, CRISPR type III-A/MTUBE-associated RAMP protein Csm5	csx1|415aa|up_3|NZ_AP013045.1_2411359_2412604_+	pfam09455, Cas_DxTHG, CRISPR-associated (Cas) DxTHG family	csx20|122aa|up_2|NZ_AP013045.1_2412606_2412972_+	NA	cas1|339aa|up_1|NZ_AP013045.1_2412996_2414013_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|88aa|up_0|NZ_AP013045.1_2414012_2414276_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|69aa|down_0|NZ_AP013045.1_2417056_2417263_-	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|61aa|down_1|NZ_AP013045.1_2418439_2418622_+	NA	NA|175aa|down_2|NZ_AP013045.1_2419534_2420059_+	cd01055, Nonheme_Ferritin, nonheme-containing ferritins	NA|734aa|down_3|NZ_AP013045.1_2420323_2422525_-	pfam03030, H_PPase, Inorganic H+ pyrophosphatase	NA|379aa|down_4|NZ_AP013045.1_2423185_2424322_-	cd06829, PLPDE_III_CANSDC, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Carboxynorspermidine Decarboxylase	NA|765aa|down_5|NZ_AP013045.1_2424326_2426621_-	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|166aa|down_6|NZ_AP013045.1_2426875_2427373_+	cd13831, HU, histone-like DNA-binding protein HU	NA|119aa|down_7|NZ_AP013045.1_2427547_2427904_-	pfam02152, FolB, Dihydroneopterin aldolase	NA|140aa|down_8|NZ_AP013045.1_2427992_2428412_+	pfam04519, Bactofilin, Polymer-forming cytoskeletal	NA|898aa|down_9|NZ_AP013045.1_2428404_2431098_-	PLN02950, PLN02950, 4-alpha-glucanotransferase
GCF_001547855.1_ASM154785v1	NZ_AP013045	Tannerella forsythia KS16	3	2818573-2818659	3	CRISPRCasFinder	no		csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,cas10,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,RT,DinG,cas9	Orphan	CGGCATAACCATTAACCATTGACAATTAAC	30	0	0	NA	NA	NA	1	1	Orphan	csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,cas10,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,RT,DinG,cas9	NA|76aa|up_8|NZ_AP013045.1_2806376_2806604_-,NA|91aa|down_0|NZ_AP013045.1_2818914_2819187_-,NA|98aa|down_1|NZ_AP013045.1_2819419_2819713_-,NA|149aa|down_6|NZ_AP013045.1_2826848_2827295_-	NA|264aa|up_9|NZ_AP013045.1_2805573_2806365_-	cd08023, GH16_laminarinase_like, Laminarinase, member of the glycosyl hydrolase family 16	NA|76aa|up_8|NZ_AP013045.1_2806376_2806604_-	NA	NA|527aa|up_7|NZ_AP013045.1_2806617_2808198_-	cd16031, G6S_like, unchracterized sulfatase homologous to glucosamine (N-acetyl)-6-sulfatase(G6S, GNS)	NA|192aa|up_6|NZ_AP013045.1_2808467_2809043_+	pfam01923, Cob_adeno_trans, Cobalamin adenosyltransferase	NA|473aa|up_5|NZ_AP013045.1_2809077_2810496_+	cd11646, Precorrin_3B_C17_MT, Precorrin-3B C(17)-methyltransferase (also named CobJ or CbiH)	NA|394aa|up_4|NZ_AP013045.1_2810510_2811692_+	cd11644, Precorrin-6Y-MT, Precorrin-6Y methyltransferase (also named CbiE)	NA|607aa|up_3|NZ_AP013045.1_2811688_2813509_+	cd11641, Precorrin-4_C11-MT, Precorrin-4 C11-methyltransferase (CbiF/CobM)	NA|603aa|up_2|NZ_AP013045.1_2813505_2815314_+	pfam01888, CbiD, CbiD	NA|634aa|up_1|NZ_AP013045.1_2815471_2817373_+	TIGR01134, purF, amidophosphoribosyltransferase	NA|177aa|up_0|NZ_AP013045.1_2817413_2817944_+	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|91aa|down_0|NZ_AP013045.1_2818914_2819187_-	NA	NA|98aa|down_1|NZ_AP013045.1_2819419_2819713_-	NA	NA|1056aa|down_2|NZ_AP013045.1_2819716_2822884_-	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|339aa|down_3|NZ_AP013045.1_2822950_2823967_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|494aa|down_4|NZ_AP013045.1_2824007_2825489_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|340aa|down_5|NZ_AP013045.1_2825635_2826655_+	pfam12833, HTH_18, Helix-turn-helix domain	NA|149aa|down_6|NZ_AP013045.1_2826848_2827295_-	NA	NA|520aa|down_7|NZ_AP013045.1_2827351_2828911_-	cd17346, MFS_DtpA_like, Dipeptide and tripeptide permease A (DtpA)-like subfamily of the Major Facilitator Superfamily of transporters	NA|357aa|down_8|NZ_AP013045.1_2828965_2830036_-	pfam16115, DUF4831, Domain of unknown function (DUF4831)	NA|106aa|down_9|NZ_AP013045.1_2830138_2830456_+	pfam11950, DUF3467, Protein of unknown function (DUF3467)
GCF_001547855.1_ASM154785v1	NZ_AP013045	Tannerella forsythia KS16	4	3115419-3117003	2,4	PILER-CR,CRISPRCasFinder	no	cas2,cas1,cas9	csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,cas10,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,RT,DinG,cas9	Type II-C,Type II-B, or Type II-C?, Type II-B,Type II-A	GTTGTGATTTGCTTGAAAACTACTACCTTTGTAGTATCAACAACAGC,GTTGTGATTTGCTTGAAAACTACTACCTTTGTAGTATCAACAACAGC	47,47	0	0	NA	NA	NA:NA	20,20	20	TypeII-C,TypeII-B,orTypeII-C?,TypeII-B,TypeII-A	csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,cas10,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,RT,DinG,cas9	NA,NA|109aa|down_3|NZ_AP013045.1_3124575_3124902_-	NA|417aa|up_9|NZ_AP013045.1_3101884_3103135_-	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|402aa|up_8|NZ_AP013045.1_3103191_3104397_-	PRK07568, PRK07568, pyridoxal phosphate-dependent aminotransferase	NA|453aa|up_7|NZ_AP013045.1_3104714_3106073_+	TIGR01350, Dihydrolipoyl_dehydrogenase, dihydrolipoamide dehydrogenase	NA|438aa|up_6|NZ_AP013045.1_3106075_3107389_+	cd00609, AAT_like, Aspartate aminotransferase family	NA|198aa|up_5|NZ_AP013045.1_3107504_3108098_+	pfam02622, DUF179, Uncharacterized ACR, COG1678	NA|427aa|up_4|NZ_AP013045.1_3108378_3109659_-	COG2873, MET17, O-acetylhomoserine sulfhydrylase [Amino acid transport and metabolism]	NA|428aa|up_3|NZ_AP013045.1_3109717_3111001_-	COG1668, NatB, ABC-type Na+ efflux pump, permease component [Energy production and conversion / Inorganic ion transport and metabolism]	NA|308aa|up_2|NZ_AP013045.1_3110997_3111921_-	cd03269, ABC_putative_ATPase, ATP-binding cassette domain of an uncharacterized transporter	NA|699aa|up_1|NZ_AP013045.1_3111926_3114023_-	TIGR01391, DNA_primase, DNA primase, catalytic core	NA|376aa|up_0|NZ_AP013045.1_3114211_3115340_-	pfam13358, DDE_3, DDE superfamily endonuclease	cas2|114aa|down_0|NZ_AP013045.1_3117577_3117919_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|311aa|down_1|NZ_AP013045.1_3117925_3118858_-	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas9|1484aa|down_2|NZ_AP013045.1_3118871_3123323_-	pfam18541, RuvC_III, RuvC endonuclease subdomain 3	NA|109aa|down_3|NZ_AP013045.1_3124575_3124902_-	NA	NA|101aa|down_4|NZ_AP013045.1_3124901_3125204_-	pfam04977, DivIC, Septum formation initiator	NA|823aa|down_5|NZ_AP013045.1_3127145_3129614_-	COG0466, Lon, ATP-dependent Lon protease, bacterial type [Posttranslational modification, protein turnover, chaperones]	NA|686aa|down_6|NZ_AP013045.1_3129816_3131874_-	COG1770, PtrB, Protease II [Amino acid transport and metabolism]	NA|282aa|down_7|NZ_AP013045.1_3132030_3132876_-	pfam14257, DUF4349, Domain of unknown function (DUF4349)	NA|697aa|down_8|NZ_AP013045.1_3132917_3135008_-	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|232aa|down_9|NZ_AP013045.1_3135198_3135894_+	PRK00648, PRK00648, Maf-like protein; Reviewed
GCF_001547855.1_ASM154785v1	NZ_AP013045	Tannerella forsythia KS16	5	3286507-3286602	5	CRISPRCasFinder	no		csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,cas10,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,RT,DinG,cas9	Orphan	GTTGTGCCTAAACCACTTACCTCTCACAGG	30	0	0	NA	NA	NA	1	1	Orphan	csm3gr7,csx10gr5,csx19,PD-DExK,WYL,cas3,cas6,cas5,cas8b4,cas7b,cas4,cas10,csm2gr11,csm4gr5,csm5gr7,csx1,csx20,cas1,cas2,DEDDh,RT,DinG,cas9	NA|121aa|up_9|NZ_AP013045.1_3277764_3278127_+,NA|50aa|up_0|NZ_AP013045.1_3286079_3286229_-,NA|1070aa|down_5|NZ_AP013045.1_3291432_3294642_+,NA|78aa|down_8|NZ_AP013045.1_3298022_3298256_+	NA|121aa|up_9|NZ_AP013045.1_3277764_3278127_+	NA	NA|135aa|up_8|NZ_AP013045.1_3278188_3278593_-	pfam12680, SnoaL_2, SnoaL-like domain	NA|105aa|up_7|NZ_AP013045.1_3280332_3280647_+	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|283aa|up_6|NZ_AP013045.1_3280700_3281549_+	PRK11525, dinD, DNA-damage-inducible protein D; Provisional	NA|198aa|up_5|NZ_AP013045.1_3281568_3282162_+	cd03140, GATase1_PfpI_3, Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus	NA|177aa|up_4|NZ_AP013045.1_3282613_3283144_+	pfam13787, HXXEE, Protein of unknown function with HXXEE motif	NA|283aa|up_3|NZ_AP013045.1_3283177_3284026_+	pfam12833, HTH_18, Helix-turn-helix domain	NA|451aa|up_2|NZ_AP013045.1_3284122_3285475_+	cd13143, MATE_MepA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Streptococcus aureus MepA	NA|107aa|up_1|NZ_AP013045.1_3285772_3286093_+	pfam09357, RteC, RteC protein	NA|50aa|up_0|NZ_AP013045.1_3286079_3286229_-	NA	NA|142aa|down_0|NZ_AP013045.1_3286967_3287393_+	pfam11888, DUF3408, Protein of unknown function (DUF3408)	NA|417aa|down_1|NZ_AP013045.1_3287990_3289241_+	COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|208aa|down_2|NZ_AP013045.1_3289256_3289880_+	pfam13182, DUF4007, Protein of unknown function (DUF4007)	NA|414aa|down_3|NZ_AP013045.1_3289866_3291108_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|83aa|down_4|NZ_AP013045.1_3291187_3291436_+	pfam13182, DUF4007, Protein of unknown function (DUF4007)	NA|1070aa|down_5|NZ_AP013045.1_3291432_3294642_+	NA	NA|369aa|down_6|NZ_AP013045.1_3294638_3295745_+	cd01713, PAPS_reductase, This domain is found in phosphoadenosine phosphosulphate (PAPS) reductase enzymes or PAPS sulphotransferase	NA|757aa|down_7|NZ_AP013045.1_3295745_3298016_+	TIGR04095, type_III_restriction_protein_res_subunit, DNA phosphorothioation system restriction enzyme	NA|78aa|down_8|NZ_AP013045.1_3298022_3298256_+	NA	NA|718aa|down_9|NZ_AP013045.1_3298252_3300406_+	pfam13476, AAA_23, AAA domain
