assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001025155.1_ASM102515v1	NZ_AP012322	Bifidobacterium angulatum DSM 20098 = JCM 7096	1	1483173-1494078	1,1,1,2,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	DEDDh,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	Type I-E	GGGATCATCCCCGCGTGTGCGGGGAACAC,GGGATCATCCCCGCGTGTGCGGGGAACAC,GGGATCATCCCCGCGTGTGCGGGGAACAC,GGGATCATCCCCGCGTGTGCGGGGAACAC,GGGATCATCCCCGCGTGTGCGGGGAACACA	29,29,29,29,30	2	2	1487234-1487267|1487228-1487261	NZ_AP012322.1_1805791-1805758|NZ_AP012322.1_1805791-1805758	I-E:I-E:I-E:I-E:I-E	173,178,178,173,173	178	TypeI-E	DEDDh,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	NA,NA	NA|193aa|up_9|NZ_AP012322.1_1464603_1465182_-	pfam05656, DUF805, Protein of unknown function (DUF805)	NA|393aa|up_8|NZ_AP012322.1_1465604_1466783_-	cd01117, YbiR_permease, Putative anion permease YbiR	NA|688aa|up_7|NZ_AP012322.1_1467278_1469342_-	PRK05667, dnaG, DNA primase; Validated	NA|453aa|up_6|NZ_AP012322.1_1469450_1470809_-	cd00430, PLPDE_III_AR, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Alanine Racemase	NA|476aa|up_5|NZ_AP012322.1_1471109_1472537_+	TIGR00909, putative_amino_acid_transporter, amino acid transporter	NA|162aa|up_4|NZ_AP012322.1_1472744_1473230_-	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|162aa|up_3|NZ_AP012322.1_1473441_1473927_-	PRK02260, PRK02260, S-ribosylhomocysteine lyase	NA|648aa|up_2|NZ_AP012322.1_1474062_1476006_-	TIGR01389, recQ, ATP-dependent DNA helicase RecQ	NA|315aa|up_1|NZ_AP012322.1_1476083_1477028_-	pfam04657, DMT_YdcZ, Putative inner membrane exporter, YdcZ	NA|911aa|up_0|NZ_AP012322.1_1477527_1480260_-	cd07197, nitrilase, Nitrilase superfamily, including nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes	cas2|120aa|down_0|NZ_AP012322.1_1494138_1494498_-	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	cas1|347aa|down_1|NZ_AP012322.1_1494491_1495532_-	cd09719, Cas1_I-E, CRISPR/Cas system-associated protein Cas1	cas6e|234aa|down_2|NZ_AP012322.1_1495531_1496233_-	pfam08798, CRISPR_assoc, CRISPR associated protein	cas5|249aa|down_3|NZ_AP012322.1_1496243_1496990_-	cd09645, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|388aa|down_4|NZ_AP012322.1_1497008_1498172_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|214aa|down_5|NZ_AP012322.1_1498229_1498871_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas8e|574aa|down_6|NZ_AP012322.1_1498845_1500567_-	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cas3|1035aa|down_7|NZ_AP012322.1_1500631_1503736_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|245aa|down_8|NZ_AP012322.1_1504019_1504754_+	COG0400, COG0400, Predicted esterase [General function prediction only]	NA|1105aa|down_9|NZ_AP012322.1_1505169_1508484_-	PRK06039, ileS, isoleucyl-tRNA synthetase; Reviewed
GCF_001025155.1_ASM102515v1	NZ_AP012322	Bifidobacterium angulatum DSM 20098 = JCM 7096	2	1715355-1715434	2	CRISPRCasFinder	no		DEDDh,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	Orphan	CCACCGACAACAGCGGCGGCGCC	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	NA,NA|207aa|down_2|NZ_AP012322.1_1718606_1719227_-	NA|335aa|up_9|NZ_AP012322.1_1698490_1699495_-	cd08234, threonine_DH_like, L-threonine dehydrogenase	NA|865aa|up_8|NZ_AP012322.1_1699718_1702313_+	TIGR02100, Glycogen_operon_protein_GlgX_homolog, glycogen debranching enzyme GlgX	NA|164aa|up_7|NZ_AP012322.1_1702537_1703029_-	PRK00132, rpsI, 30S ribosomal protein S9; Reviewed	NA|150aa|up_6|NZ_AP012322.1_1703049_1703499_-	PRK09216, rplM, 50S ribosomal protein L13; Reviewed	NA|724aa|up_5|NZ_AP012322.1_1703789_1705961_-	PRK14507, PRK14507, malto-oligosyltrehalose synthase	NA|214aa|up_4|NZ_AP012322.1_1706732_1707374_-	COG1739, COG1739, Uncharacterized conserved protein [Function unknown]	NA|346aa|up_3|NZ_AP012322.1_1707425_1708463_-	cd12169, PGDH_like_1, Putative D-3-Phosphoglycerate Dehydrogenases	NA|439aa|up_2|NZ_AP012322.1_1708489_1709806_-	pfam09587, PGA_cap, Bacterial capsule synthesis protein PGA_cap	NA|424aa|up_1|NZ_AP012322.1_1710411_1711683_+	cd03586, PolY_Pol_IV_kappa, DNA Polymerase IV/Kappa	NA|392aa|up_0|NZ_AP012322.1_1711801_1712977_+	COG0562, Glf, UDP-galactopyranose mutase [Cell envelope biogenesis, outer membrane]	NA|636aa|down_0|NZ_AP012322.1_1715725_1717633_-	PRK03739, PRK03739, 2-isopropylmalate synthase; Validated	NA|195aa|down_1|NZ_AP012322.1_1717713_1718298_-	sd00037, PASTA, PASTA domain	NA|207aa|down_2|NZ_AP012322.1_1718606_1719227_-	NA	NA|365aa|down_3|NZ_AP012322.1_1719525_1720620_+	PRK08664, PRK08664, aspartate-semialdehyde dehydrogenase; Reviewed	NA|188aa|down_4|NZ_AP012322.1_1720917_1721481_-	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|255aa|down_5|NZ_AP012322.1_1721523_1722288_-	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|201aa|down_6|NZ_AP012322.1_1722431_1723034_-	PRK00076, recR, recombination protein RecR; Reviewed	NA|907aa|down_7|NZ_AP012322.1_1723037_1725758_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|202aa|down_8|NZ_AP012322.1_1725949_1726555_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|360aa|down_9|NZ_AP012322.1_1726612_1727692_-	COG1597, LCB5, Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only]
GCF_001025155.1_ASM102515v1	NZ_AP012322	Bifidobacterium angulatum DSM 20098 = JCM 7096	3	1799722-1799797	3	CRISPRCasFinder	no		DEDDh,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	Orphan	ATGACCGATTCGGCAGAATCGGACA	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	NA,NA	NA|494aa|up_9|NZ_AP012322.1_1786683_1788165_+	COG1113, AnsP, Gamma-aminobutyrate permease and related permeases [Amino acid transport and metabolism]	NA|338aa|up_8|NZ_AP012322.1_1788476_1789490_+	PRK03092, PRK03092, ribose-phosphate diphosphokinase	NA|614aa|up_7|NZ_AP012322.1_1789955_1791797_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|596aa|up_6|NZ_AP012322.1_1791810_1793598_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|169aa|up_5|NZ_AP012322.1_1793938_1794445_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|353aa|up_4|NZ_AP012322.1_1794530_1795589_-	cd19088, AKR_AKR13B1, AKR13B family of aldo-keto reductase (AKR)	NA|65aa|up_3|NZ_AP012322.1_1795916_1796111_+	cd00565, Ubl_ThiS, ubiquitin-like (Ubl) domain found in sulfur carrier protein ThiS	NA|238aa|up_2|NZ_AP012322.1_1796202_1796916_+	PRK08644, PRK08644, sulfur carrier protein ThiS adenylyltransferase ThiF	NA|305aa|up_1|NZ_AP012322.1_1796947_1797862_+	PRK00208, thiG, thiazole synthase; Reviewed	NA|505aa|up_0|NZ_AP012322.1_1798021_1799536_-	COG1113, AnsP, Gamma-aminobutyrate permease and related permeases [Amino acid transport and metabolism]	NA|214aa|down_0|NZ_AP012322.1_1800809_1801451_+	PRK00129, upp, uracil phosphoribosyltransferase; Reviewed	NA|160aa|down_1|NZ_AP012322.1_1801729_1802209_+	pfam02590, SPOUT_MTase, Predicted SPOUT methyltransferase	NA|358aa|down_2|NZ_AP012322.1_1802407_1803481_-	cd08585, GDPD_like_3, Glycerophosphodiester phosphodiesterase domain of uncharacterized bacterial glycerophosphodiester phosphodiesterases	NA|382aa|down_3|NZ_AP012322.1_1803534_1804680_-	PRK05710, PRK05710, tRNA glutamyl-Q(34) synthetase GluQRS	NA|911aa|down_4|NZ_AP012322.1_1805182_1807915_-	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB	NA|275aa|down_5|NZ_AP012322.1_1808207_1809032_-	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]	NA|101aa|down_6|NZ_AP012322.1_1809155_1809458_-	PRK14428, PRK14428, acylphosphatase; Provisional	NA|285aa|down_7|NZ_AP012322.1_1809551_1810406_-	cd00635, PLPDE_III_YBL036c_like, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzymes, YBL036c-like proteins	NA|882aa|down_8|NZ_AP012322.1_1811137_1813783_-	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|332aa|down_9|NZ_AP012322.1_1814101_1815097_+	pfam00582, Usp, Universal stress protein family
GCF_001025155.1_ASM102515v1	NZ_AP012322	Bifidobacterium angulatum DSM 20098 = JCM 7096	4	1905769-1905844	4	CRISPRCasFinder	no		DEDDh,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	Orphan	GTGTCCGATTCGGCAGAATCGGACA	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	NA,NA|336aa|down_7|NZ_AP012322.1_1919444_1920452_-	NA|330aa|up_9|NZ_AP012322.1_1889842_1890832_-	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|480aa|up_8|NZ_AP012322.1_1891024_1892464_+	COG5263, COG5263, FOG: Glucan-binding domain (YG repeat) [General function prediction only]	NA|603aa|up_7|NZ_AP012322.1_1892526_1894335_-	cd06414, GH25_LytC-like, The LytC lysozyme of Streptococcus pneumoniae is a bacterial cell wall hydrolase that cleaves the beta1-4-glycosydic bond located between the N-acetylmuramoyl-N-glucosaminyl residues of the cell wall polysaccharide chains	NA|465aa|up_6|NZ_AP012322.1_1894721_1896116_+	pfam03629, SASA, Carbohydrate esterase, sialic acid-specific acetylesterase	NA|693aa|up_5|NZ_AP012322.1_1896264_1898343_-	cd00637, 7tm_classA_rhodopsin-like, rhodopsin receptor-like class A family of the seven-transmembrane G protein-coupled receptor superfamily	NA|361aa|up_4|NZ_AP012322.1_1898469_1899552_-	cd04185, GT_2_like_b, Subfamily of Glycosyltransferase Family GT2 of unknown function	NA|558aa|up_3|NZ_AP012322.1_1900284_1901958_-	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|370aa|up_2|NZ_AP012322.1_1902255_1903365_+	pfam13240, zinc_ribbon_2, zinc-ribbon domain	NA|217aa|up_1|NZ_AP012322.1_1903554_1904205_+	pfam06941, NT5C, 5' nucleotidase, deoxy (Pyrimidine), cytosolic type C protein (NT5C)	NA|126aa|up_0|NZ_AP012322.1_1904283_1904661_-	pfam02537, CRCB, CrcB-like protein, Camphor Resistance (CrcB)	NA|530aa|down_0|NZ_AP012322.1_1906357_1907947_-	cd01031, EriC, ClC chloride channel EriC	NA|429aa|down_1|NZ_AP012322.1_1908412_1909699_-	PRK01117, PRK01117, adenylosuccinate synthetase; Provisional	NA|356aa|down_2|NZ_AP012322.1_1909910_1910978_-	PRK09197, PRK09197, fructose-bisphosphate aldolase; Provisional	NA|606aa|down_3|NZ_AP012322.1_1911105_1912923_-	pfam13196, DUF4012, Protein of unknown function (DUF4012)	NA|873aa|down_4|NZ_AP012322.1_1913170_1915789_+	COG1511, COG1511, Predicted membrane protein [Function unknown]	NA|759aa|down_5|NZ_AP012322.1_1915785_1918062_+	COG1511, COG1511, Predicted membrane protein [Function unknown]	NA|319aa|down_6|NZ_AP012322.1_1918422_1919379_-	PRK11689, PRK11689, aromatic amino acid efflux DMT transporter YddG	NA|336aa|down_7|NZ_AP012322.1_1919444_1920452_-	NA	NA|314aa|down_8|NZ_AP012322.1_1921449_1922391_-	PRK03072, PRK03072, heat shock protein HtpX; Provisional	NA|482aa|down_9|NZ_AP012322.1_1922790_1924236_-	PLN02852, PLN02852, ferredoxin-NADP+ reductase
