assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000164695.2_ASM16469v2	NC_014643	Rothia dentocariosa ATCC 17931, complete sequence	1	916336-916463	1	CRISPRCasFinder	no		DEDDh,csa3,cas3,DinG,WYL,csb3,csb2gr5,csb1gr7	Orphan	TCTTCAAGCTGTTTCTTACCGTC	23	0	0	NA	NA	NA	2	2	Orphan	DEDDh,csa3,cas3,DinG,WYL,csb3,csb2gr5,csb1gr7	NA,NA|288aa|down_3|NC_014643.1_920220_921084_-,NA|490aa|down_4|NC_014643.1_921434_922904_-,NA|464aa|down_5|NC_014643.1_922915_924307_-	NA|485aa|up_9|NC_014643.1_900221_901676_-	PRK08244, PRK08244, monooxygenase	NA|238aa|up_8|NC_014643.1_902011_902725_+	COG3208, GrsT, Predicted thioesterase involved in non-ribosomal peptide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|208aa|up_7|NC_014643.1_902745_903369_-	COG2091, Sfp, Phosphopantetheinyl transferase [Coenzyme metabolism]	NA|977aa|up_6|NC_014643.1_903519_906450_-	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|88aa|up_5|NC_014643.1_906442_906706_-	smart00823, PKS_PP, Phosphopantetheine attachment site	NA|571aa|up_4|NC_014643.1_906715_908428_-	cd00567, ACAD, Acyl-CoA dehydrogenase	NA|572aa|up_3|NC_014643.1_908439_910155_-	COG1960, CaiA, Acyl-CoA dehydrogenases [Lipid metabolism]	NA|575aa|up_2|NC_014643.1_910151_911876_-	cd05931, FAAL, Fatty acyl-AMP ligase (FAAL)	NA|151aa|up_1|NC_014643.1_913013_913466_-	pfam02537, CRCB, CrcB-like protein, Camphor Resistance (CrcB)	NA|176aa|up_0|NC_014643.1_913450_913978_-	COG0239, CrcB, Integral membrane protein possibly involved in chromosome condensation [Cell division and chromosome partitioning]	NA|234aa|down_0|NC_014643.1_916812_917514_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|522aa|down_1|NC_014643.1_917642_919208_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|248aa|down_2|NC_014643.1_919293_920037_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|288aa|down_3|NC_014643.1_920220_921084_-	NA	NA|490aa|down_4|NC_014643.1_921434_922904_-	NA	NA|464aa|down_5|NC_014643.1_922915_924307_-	NA	NA|258aa|down_6|NC_014643.1_924559_925333_-	pfam05110, AF-4, AF-4 proto-oncoprotein	NA|285aa|down_7|NC_014643.1_925584_926439_+	PRK09772, PRK09772, transcriptional antiterminator BglG; Provisional	NA|330aa|down_8|NC_014643.1_926879_927869_+	PRK15088, PRK15088, PTS system mannose-specific transporter subunits IIAB; Provisional	NA|291aa|down_9|NC_014643.1_927865_928738_+	PRK15065, PRK15065, mannose/fructose/sorbose family PTS transporter subunit IIC
GCF_000164695.2_ASM16469v2	NC_014643	Rothia dentocariosa ATCC 17931, complete sequence	2	1308815-1308924	2	CRISPRCasFinder	no		DEDDh,csa3,cas3,DinG,WYL,csb3,csb2gr5,csb1gr7	Orphan	GGATCGGCAGGCTTCTCCGGTGCTGGTGC	29	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,cas3,DinG,WYL,csb3,csb2gr5,csb1gr7	NA|369aa|up_7|NC_014643.1_1294661_1295768_-,NA|83aa|up_6|NC_014643.1_1296019_1296268_+,NA|318aa|down_5|NC_014643.1_1316788_1317742_-	NA|573aa|up_9|NC_014643.1_1290764_1292483_+	cd08501, PBP2_Lpqw, The substrate-binding domain of mycobacterial lipoprotein Lpqw contains type 2 periplasmic binding fold	NA|573aa|up_8|NC_014643.1_1292853_1294572_+	cd08501, PBP2_Lpqw, The substrate-binding domain of mycobacterial lipoprotein Lpqw contains type 2 periplasmic binding fold	NA|369aa|up_7|NC_014643.1_1294661_1295768_-	NA	NA|83aa|up_6|NC_014643.1_1296019_1296268_+	NA	NA|569aa|up_5|NC_014643.1_1296584_1298291_+	cd08501, PBP2_Lpqw, The substrate-binding domain of mycobacterial lipoprotein Lpqw contains type 2 periplasmic binding fold	NA|692aa|up_4|NC_014643.1_1298399_1300475_-	TIGR01995, beta-glucosides_PTS_EIIBCA, PTS system, beta-glucoside-specific IIABC component	NA|846aa|up_3|NC_014643.1_1300940_1303478_-	cd02609, P-type_ATPase, uncharacterized subfamily of P-type ATPase transporter, similar to uncharacterized Streptococcus pneumoniae exported protein 7, Exp7	NA|254aa|up_2|NC_014643.1_1303748_1304510_-	PRK13548, hmuV, hemin importer ATP-binding subunit; Provisional	NA|362aa|up_1|NC_014643.1_1304509_1305595_-	pfam01032, FecCD, FecCD transport family	NA|337aa|up_0|NC_014643.1_1305607_1306618_-	COG4558, ChuT, ABC-type hemin transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|236aa|down_0|NC_014643.1_1309808_1310516_+	cd19165, HemeO, heme oxygenase in eukaryotes and some bacteria	NA|451aa|down_1|NC_014643.1_1310804_1312157_-	PRK13375, pimE, mannosyltransferase; Provisional	NA|773aa|down_2|NC_014643.1_1312358_1314677_-	PRK01213, PRK01213, phosphoribosylformylglycinamidine synthase subunit PurL	NA|271aa|down_3|NC_014643.1_1314706_1315519_-	PRK03619, PRK03619, phosphoribosylformylglycinamidine synthase subunit PurQ	NA|84aa|down_4|NC_014643.1_1315523_1315775_-	pfam02700, PurS, Phosphoribosylformylglycinamidine (FGAM) synthase	NA|318aa|down_5|NC_014643.1_1316788_1317742_-	NA	NA|432aa|down_6|NC_014643.1_1317751_1319047_-	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|380aa|down_7|NC_014643.1_1319238_1320378_-	PRK00076, recR, recombination protein RecR; Reviewed	NA|958aa|down_8|NC_014643.1_1320420_1323294_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|103aa|down_9|NC_014643.1_1324325_1324634_+	COG5450, COG5450, Transcription regulator of the Arc/MetJ class [Transcription]
GCF_000164695.2_ASM16469v2	NC_014643	Rothia dentocariosa ATCC 17931, complete sequence	3	1585906-1586182	3	CRISPRCasFinder	no		DEDDh,csa3,cas3,DinG,WYL,csb3,csb2gr5,csb1gr7	Orphan	CCTATGATGCGTGTGGGCGTGTGGT	25	0	0	NA	NA	NA	4	4	Orphan	DEDDh,csa3,cas3,DinG,WYL,csb3,csb2gr5,csb1gr7	NA,NA|222aa|down_0|NC_014643.1_1590037_1590703_+,NA|223aa|down_1|NC_014643.1_1590746_1591415_+,NA|221aa|down_3|NC_014643.1_1592758_1593421_+,NA|192aa|down_5|NC_014643.1_1594301_1594877_+,NA|203aa|down_6|NC_014643.1_1595718_1596327_+,NA|225aa|down_8|NC_014643.1_1596772_1597447_+,NA|233aa|down_9|NC_014643.1_1597552_1598251_+	NA|503aa|up_9|NC_014643.1_1569509_1571018_-	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|287aa|up_8|NC_014643.1_1571086_1571947_-	TIGR01207, Glucose-1-phosphate_thymidylyltransferase_1, glucose-1-phosphate thymidylyltransferase, short form	NA|333aa|up_7|NC_014643.1_1572020_1573019_+	COG1088, RfbB, dTDP-D-glucose 4,6-dehydratase [Cell envelope biogenesis, outer membrane]	NA|479aa|up_6|NC_014643.1_1573019_1574456_+	pfam04321, RmlD_sub_bind, RmlD substrate binding domain	NA|719aa|up_5|NC_014643.1_1574445_1576602_+	pfam10131, PTPS_related, 6-pyruvoyl-tetrahydropterin synthase related domain; membrane protein	NA|492aa|up_4|NC_014643.1_1577020_1578496_-	cd07117, ALDH_StaphAldA1, Uncharacterized Staphylococcus aureus AldA1 (SACOL0154) aldehyde dehydrogenase-like	NA|95aa|up_3|NC_014643.1_1578983_1579268_+	PRK05618, PRK05618, 50S ribosomal protein L25/general stress protein Ctc; Reviewed	NA|195aa|up_2|NC_014643.1_1579555_1580140_+	PRK05426, PRK05426, peptidyl-tRNA hydrolase; Provisional	NA|433aa|up_1|NC_014643.1_1580440_1581739_+	TIGR01979, Probable_cysteine_desulfurase, cysteine desulfurases, SufSfamily	NA|156aa|up_0|NC_014643.1_1581776_1582244_+	TIGR01994, Iron-sulfur_cluster_assembly_scaffold_protein_IscU, SUF system FeS assembly protein, NifU family	NA|222aa|down_0|NC_014643.1_1590037_1590703_+	NA	NA|223aa|down_1|NC_014643.1_1590746_1591415_+	NA	NA|57aa|down_2|NC_014643.1_1592581_1592752_+	cd00933, barnase, Barnase, a member of the family of homologous microbial ribonucleases, catalyses the cleavage of single-stranded RNA via a two-step mechanism thought to be similar to that of pancreatic ribonuclease	NA|221aa|down_3|NC_014643.1_1592758_1593421_+	NA	NA|61aa|down_4|NC_014643.1_1594116_1594299_+	pfam00545, Ribonuclease, ribonuclease	NA|192aa|down_5|NC_014643.1_1594301_1594877_+	NA	NA|203aa|down_6|NC_014643.1_1595718_1596327_+	NA	NA|72aa|down_7|NC_014643.1_1596538_1596754_+	pfam00545, Ribonuclease, ribonuclease	NA|225aa|down_8|NC_014643.1_1596772_1597447_+	NA	NA|233aa|down_9|NC_014643.1_1597552_1598251_+	NA
GCF_000164695.2_ASM16469v2	NC_014643	Rothia dentocariosa ATCC 17931, complete sequence	4	1649328-1649424	4	CRISPRCasFinder	no	csa3	DEDDh,csa3,cas3,DinG,WYL,csb3,csb2gr5,csb1gr7	Type I-A	GTTCAGGAAGCAGCACCTGCTGCTC	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,cas3,DinG,WYL,csb3,csb2gr5,csb1gr7	NA,NA|143aa|down_7|NC_014643.1_1659974_1660403_+	NA|810aa|up_9|NC_014643.1_1635514_1637944_-	pfam04471, Mrr_cat, Restriction endonuclease	NA|286aa|up_8|NC_014643.1_1638107_1638965_-	pfam04471, Mrr_cat, Restriction endonuclease	NA|128aa|up_7|NC_014643.1_1639316_1639700_-	pfam13822, ACC_epsilon, Acyl-CoA carboxylase epsilon subunit	NA|533aa|up_6|NC_014643.1_1639795_1641394_-	COG4799, COG4799, Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism]	NA|95aa|up_5|NC_014643.1_1641656_1641941_-	PRK12863, PRK12863, YciI-like protein; Reviewed	NA|329aa|up_4|NC_014643.1_1642099_1643086_+	COG0340, BirA, Biotin-(acetyl-CoA carboxylase) ligase [Coenzyme metabolism]	NA|164aa|up_3|NC_014643.1_1643088_1643580_+	pfam03703, bPH_2, Bacterial PH domain	NA|390aa|up_2|NC_014643.1_1643745_1644915_+	cd07302, CHD, cyclase homology domain	NA|307aa|up_1|NC_014643.1_1644899_1645820_+	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|741aa|up_0|NC_014643.1_1645877_1648100_+	PRK07956, ligA, NAD-dependent DNA ligase LigA; Validated	NA|99aa|down_0|NC_014643.1_1649862_1650159_+	PRK00034, gatC, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatC	NA|532aa|down_1|NC_014643.1_1650168_1651764_+	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|502aa|down_2|NC_014643.1_1651763_1653269_+	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB	NA|558aa|down_3|NC_014643.1_1653470_1655144_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|734aa|down_4|NC_014643.1_1655370_1657572_-	COG1770, PtrB, Protease II [Amino acid transport and metabolism]	NA|299aa|down_5|NC_014643.1_1657851_1658748_+	pfam07751, Abi_2, Abi-like protein	NA|264aa|down_6|NC_014643.1_1659054_1659846_+	cd05233, SDR_c, classical (c) SDRs	NA|143aa|down_7|NC_014643.1_1659974_1660403_+	NA	NA|157aa|down_8|NC_014643.1_1660538_1661009_-	cd08893, SRPBCC_CalC_Aha1-like_GntR-HTH, Putative hydrophobic ligand-binding SRPBCC domain of an uncharacterized subgroup of CalC- and Aha1-like proteins; some contain an N-terminal GntR family winged HTH DNA-binding domain	csa3|107aa|down_9|NC_014643.1_1661005_1661326_-	pfam12840, HTH_20, Helix-turn-helix domain
GCF_000164695.2_ASM16469v2	NC_014643	Rothia dentocariosa ATCC 17931, complete sequence	5	2360567-2360891	1,5,1	CRT,CRISPRCasFinder,PILER-CR	no	csb3,cas3,csb2gr5,csb1gr7	DEDDh,csa3,cas3,DinG,WYL,csb3,csb2gr5,csb1gr7	Unclear	CCCTCAATGAAAGTCACCTGTTCTCACAGGTGAGAC,GAAAGTCACCTGTTCTCACAGGTGAGAC,CCCTCAATGAAAGTCACCTGTTCTCACAGGTGAGA	36,28,35	0	0	NA	NA	NA:NA:NA	4,4,3	4	Unclear	DEDDh,csa3,cas3,DinG,WYL,csb3,csb2gr5,csb1gr7	NA|292aa|up_2|NC_014643.1_2358214_2359090_-,NA|213aa|down_6|NC_014643.1_2372185_2372824_+	NA|196aa|up_9|NC_014643.1_2349429_2350017_-	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|434aa|up_8|NC_014643.1_2350344_2351646_-	PRK01490, tig, trigger factor; Provisional	NA|295aa|up_7|NC_014643.1_2352459_2353344_-	COG0266, Nei, Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]	NA|163aa|up_6|NC_014643.1_2353571_2354060_-	PRK05571, PRK05571, ribose-5-phosphate isomerase B; Provisional	NA|850aa|up_5|NC_014643.1_2354506_2357056_+	TIGR02412, Aminopeptidase_N, aminopeptidase N, Streptomyces lividans type	NA|162aa|up_4|NC_014643.1_2357217_2357703_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|156aa|up_3|NC_014643.1_2357750_2358218_-	cd14771, TrHb2_Mt-trHbO-like_O, Truncated hemoglobins, group 2 (O); Mycobacterium tuberculosis hemoglobin O like	NA|292aa|up_2|NC_014643.1_2358214_2359090_-	NA	NA|299aa|up_1|NC_014643.1_2359124_2360021_+	COG1946, TesB, Acyl-CoA thioesterase [Lipid metabolism]	NA|98aa|up_0|NC_014643.1_2360213_2360507_+	COG2329, COG2329, Uncharacterized enzyme involved in biosynthesis of extracellular polysaccharides [General function prediction only]	csb3|350aa|down_0|NC_014643.1_2361681_2362731_-	cd09764, Csb3_I-U, CRISPR/Cas system-associated RAMP superfamily protein Csb3	cas3|923aa|down_1|NC_014643.1_2362730_2365499_-	TIGR02621, CRISPR-associated_helicase_Cas3, CRISPR-associated helicase Cas3, subtype Dpsyc	csb2gr5|503aa|down_2|NC_014643.1_2365488_2366997_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	csb1gr7|380aa|down_3|NC_014643.1_2366998_2368138_-	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	NA|560aa|down_4|NC_014643.1_2369896_2371576_-	PRK11819, PRK11819, putative ABC transporter ATP-binding protein; Reviewed	NA|138aa|down_5|NC_014643.1_2371756_2372170_+	PRK10250, PRK10250, MmcQ/YjbR family DNA-binding protein	NA|213aa|down_6|NC_014643.1_2372185_2372824_+	NA	NA|294aa|down_7|NC_014643.1_2372877_2373759_-	PRK00026, trmD, tRNA (guanine-N(1)-)-methyltransferase; Reviewed	NA|213aa|down_8|NC_014643.1_2373762_2374401_-	PRK00122, rimM, 16S rRNA-processing protein RimM; Provisional	NA|81aa|down_9|NC_014643.1_2374591_2374834_-	PRK02821, PRK02821, RNA-binding protein
GCF_000164695.2_ASM16469v2	NC_014643	Rothia dentocariosa ATCC 17931, complete sequence	6	2368430-2369637	6,2,2	CRISPRCasFinder,CRT,PILER-CR	no	csb3,cas3,csb2gr5,csb1gr7	DEDDh,csa3,cas3,DinG,WYL,csb3,csb2gr5,csb1gr7	Unclear	CCCTCAATGAAAGTCACCCATTCTCATGGGTGAGAC,CCCTCAATGAAAGTCACCCATTCTCATGGGTGAGAC,CCCTCAATGAAAGTCACCCATTCTCATGGGTGAGAC	36,36,36	0	0	NA	NA	NA:NA:NA	16,16,14	16	Unclear	DEDDh,csa3,cas3,DinG,WYL,csb3,csb2gr5,csb1gr7	NA|292aa|up_6|NC_014643.1_2358214_2359090_-,NA|213aa|down_2|NC_014643.1_2372185_2372824_+,NA|850aa|down_8|NC_014643.1_2376009_2378559_-	NA|850aa|up_9|NC_014643.1_2354506_2357056_+	TIGR02412, Aminopeptidase_N, aminopeptidase N, Streptomyces lividans type	NA|162aa|up_8|NC_014643.1_2357217_2357703_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|156aa|up_7|NC_014643.1_2357750_2358218_-	cd14771, TrHb2_Mt-trHbO-like_O, Truncated hemoglobins, group 2 (O); Mycobacterium tuberculosis hemoglobin O like	NA|292aa|up_6|NC_014643.1_2358214_2359090_-	NA	NA|299aa|up_5|NC_014643.1_2359124_2360021_+	COG1946, TesB, Acyl-CoA thioesterase [Lipid metabolism]	NA|98aa|up_4|NC_014643.1_2360213_2360507_+	COG2329, COG2329, Uncharacterized enzyme involved in biosynthesis of extracellular polysaccharides [General function prediction only]	csb3|350aa|up_3|NC_014643.1_2361681_2362731_-	cd09764, Csb3_I-U, CRISPR/Cas system-associated RAMP superfamily protein Csb3	cas3|923aa|up_2|NC_014643.1_2362730_2365499_-	TIGR02621, CRISPR-associated_helicase_Cas3, CRISPR-associated helicase Cas3, subtype Dpsyc	csb2gr5|503aa|up_1|NC_014643.1_2365488_2366997_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	csb1gr7|380aa|up_0|NC_014643.1_2366998_2368138_-	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	NA|560aa|down_0|NC_014643.1_2369896_2371576_-	PRK11819, PRK11819, putative ABC transporter ATP-binding protein; Reviewed	NA|138aa|down_1|NC_014643.1_2371756_2372170_+	PRK10250, PRK10250, MmcQ/YjbR family DNA-binding protein	NA|213aa|down_2|NC_014643.1_2372185_2372824_+	NA	NA|294aa|down_3|NC_014643.1_2372877_2373759_-	PRK00026, trmD, tRNA (guanine-N(1)-)-methyltransferase; Reviewed	NA|213aa|down_4|NC_014643.1_2373762_2374401_-	PRK00122, rimM, 16S rRNA-processing protein RimM; Provisional	NA|81aa|down_5|NC_014643.1_2374591_2374834_-	PRK02821, PRK02821, RNA-binding protein	NA|144aa|down_6|NC_014643.1_2374836_2375268_-	PRK14520, rpsP, 30S ribosomal protein S16; Provisional	NA|162aa|down_7|NC_014643.1_2375471_2375957_-	cd02883, Nudix_Hydrolase, Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|850aa|down_8|NC_014643.1_2376009_2378559_-	NA	NA|529aa|down_9|NC_014643.1_2378827_2380414_-	PRK10867, PRK10867, signal recognition particle protein; Provisional
