assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000829295.1_ASM82929v1	NZ_AP014658	Bifidobacterium longum strain 105-A	1	161848-162216	1	CRISPRCasFinder	no		csa3,cas3,WYL,DEDDh,cas2,cas1,cas9,casR	Orphan	CGATGACCCGTGGAATCAACCGCA	24	0	0	NA	NA	NA	6	6	Orphan	csa3,cas3,WYL,DEDDh,cas2,cas1,cas9,casR	NA|201aa|up_5|NZ_AP014658.1_155396_155999_+,NA|207aa|down_6|NZ_AP014658.1_169487_170108_+	NA|242aa|up_9|NZ_AP014658.1_151771_152497_+	PRK10847, PRK10847, DedA family protein	NA|317aa|up_8|NZ_AP014658.1_152719_153670_+	TIGR03815, CpaE_hom_Actino, helicase/secretion neighborhood CpaE-like protein	NA|356aa|up_7|NZ_AP014658.1_153676_154744_+	TIGR03819, heli_sec_ATPase, helicase/secretion neighborhood ATPase	NA|219aa|up_6|NZ_AP014658.1_154743_155400_+	COG4965, TadB, Flp pilus assembly protein TadB [Intracellular trafficking and secretion]	NA|201aa|up_5|NZ_AP014658.1_155396_155999_+	NA	NA|96aa|up_4|NZ_AP014658.1_156283_156571_+	pfam14029, DUF4244, Protein of unknown function (DUF4244)	NA|126aa|up_3|NZ_AP014658.1_156576_156954_+	pfam07811, TadE, TadE-like protein	NA|126aa|up_2|NZ_AP014658.1_157004_157382_+	TIGR03816, tadE_like_DECH, helicase/secretion neighborhood TadE-like protein	NA|377aa|up_1|NZ_AP014658.1_157430_158561_+	PRK11914, PRK11914, diacylglycerol kinase; Reviewed	NA|200aa|up_0|NZ_AP014658.1_158758_159358_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|201aa|down_0|NZ_AP014658.1_162491_163094_+	PRK00076, recR, recombination protein RecR; Reviewed	NA|378aa|down_1|NZ_AP014658.1_163090_164224_-	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|428aa|down_2|NZ_AP014658.1_165113_166397_+	pfam13635, DUF4143, Domain of unknown function (DUF4143)	NA|255aa|down_3|NZ_AP014658.1_166757_167522_+	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|189aa|down_4|NZ_AP014658.1_167577_168144_+	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|365aa|down_5|NZ_AP014658.1_168225_169320_-	PRK08664, PRK08664, aspartate-semialdehyde dehydrogenase; Reviewed	NA|207aa|down_6|NZ_AP014658.1_169487_170108_+	NA	NA|527aa|down_7|NZ_AP014658.1_170110_171691_-	cd07383, MPP_Dcr2, Saccharomyces cerevisiae DCR2 phosphatase and related proteins, metallophosphatase domain	NA|639aa|down_8|NZ_AP014658.1_172379_174296_+	PRK03739, PRK03739, 2-isopropylmalate synthase; Validated	NA|728aa|down_9|NZ_AP014658.1_174367_176551_-	COG0744, MrcB, Membrane carboxypeptidase (penicillin-binding protein) [Cell envelope biogenesis, outer membrane]
GCF_000829295.1_ASM82929v1	NZ_AP014658	Bifidobacterium longum strain 105-A	2	279969-280054	2	CRISPRCasFinder	no		csa3,cas3,WYL,DEDDh,cas2,cas1,cas9,casR	Orphan	GACAGCTCCCGCCAGCGGGAGCACTT	26	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,WYL,DEDDh,cas2,cas1,cas9,casR	NA,NA|695aa|down_6|NZ_AP014658.1_288510_290595_+	NA|560aa|up_9|NZ_AP014658.1_267469_269149_+	COG1080, PtsA, Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [Carbohydrate transport and metabolism]	NA|245aa|up_8|NZ_AP014658.1_269382_270117_+	COG0580, GlpF, Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) [Carbohydrate transport and metabolism]	NA|903aa|up_7|NZ_AP014658.1_270200_272909_-	cd02079, P-type_ATPase_HM, P-type heavy metal-transporting ATPase	NA|94aa|up_6|NZ_AP014658.1_273020_273302_+	COG1937, COG1937, Uncharacterized protein conserved in bacteria [Function unknown]	NA|475aa|up_5|NZ_AP014658.1_273344_274769_+	pfam02646, RmuC, RmuC family	NA|210aa|up_4|NZ_AP014658.1_274765_275395_+	PRK00455, pyrE, orotate phosphoribosyltransferase; Validated	NA|283aa|up_3|NZ_AP014658.1_275488_276337_+	cd18096, SpoU-like, SAM-dependent rRNA or tRNA methylase related to SpoU	NA|100aa|up_2|NZ_AP014658.1_276503_276803_+	PRK00034, gatC, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatC	NA|514aa|up_1|NZ_AP014658.1_276806_278348_+	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|500aa|up_0|NZ_AP014658.1_278373_279873_+	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB	NA|355aa|down_0|NZ_AP014658.1_280168_281233_+	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|105aa|down_1|NZ_AP014658.1_281243_281558_+	pfam10611, DUF2469, Protein of unknown function (DUF2469)	NA|618aa|down_2|NZ_AP014658.1_281731_283585_+	cd07061, HP_HAP_like, Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction	NA|538aa|down_3|NZ_AP014658.1_283803_285417_+	PRK07208, PRK07208, hypothetical protein; Provisional	NA|690aa|down_4|NZ_AP014658.1_285622_287692_+	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|130aa|down_5|NZ_AP014658.1_287999_288389_+	PRK09239, PRK09239, chorismate mutase; Provisional	NA|695aa|down_6|NZ_AP014658.1_288510_290595_+	NA	NA|912aa|down_7|NZ_AP014658.1_290820_293556_-	NF000540, alt_ValS, valine--tRNA ligase	NA|509aa|down_8|NZ_AP014658.1_293699_295226_-	cd08494, PBP2_NikA_DppA_OppA_like_6, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|229aa|down_9|NZ_AP014658.1_295287_295974_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]
GCF_000829295.1_ASM82929v1	NZ_AP014658	Bifidobacterium longum strain 105-A	3	1363977-1364057	3	CRISPRCasFinder	no		csa3,cas3,WYL,DEDDh,cas2,cas1,cas9,casR	Orphan	CGCACAGTGAAACCGTCTCATAT	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,WYL,DEDDh,cas2,cas1,cas9,casR	NA|107aa|up_5|NZ_AP014658.1_1354345_1354666_-,NA|289aa|down_0|NZ_AP014658.1_1364957_1365824_+	NA|501aa|up_9|NZ_AP014658.1_1347684_1349187_-	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|355aa|up_8|NZ_AP014658.1_1350765_1351830_-	PRK01045, ispH, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed	NA|319aa|up_7|NZ_AP014658.1_1352054_1353011_+	cd09022, Aldose_epim_Ec_YihR, Aldose 1-epimerase, similar to Escherichia coli YihR	NA|315aa|up_6|NZ_AP014658.1_1353135_1354080_+	cd09022, Aldose_epim_Ec_YihR, Aldose 1-epimerase, similar to Escherichia coli YihR	NA|107aa|up_5|NZ_AP014658.1_1354345_1354666_-	NA	NA|263aa|up_4|NZ_AP014658.1_1354665_1355454_-	PRK09652, PRK09652, RNA polymerase sigma factor RpoE; Provisional	NA|518aa|up_3|NZ_AP014658.1_1355592_1357146_+	PRK00139, murE, UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase; Provisional	NA|332aa|up_2|NZ_AP014658.1_1357267_1358263_+	pfam13480, Acetyltransf_6, Acetyltransferase (GNAT) domain	NA|1226aa|up_1|NZ_AP014658.1_1358381_1362059_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|528aa|up_0|NZ_AP014658.1_1362119_1363703_-	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|289aa|down_0|NZ_AP014658.1_1364957_1365824_+	NA	NA|534aa|down_1|NZ_AP014658.1_1366104_1367706_+	cd01087, Prolidase, Prolidase	NA|174aa|down_2|NZ_AP014658.1_1367848_1368370_+	cd04676, Nudix_Hydrolase_17, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|570aa|down_3|NZ_AP014658.1_1368419_1370129_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|390aa|down_4|NZ_AP014658.1_1370132_1371302_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|364aa|down_5|NZ_AP014658.1_1371303_1372395_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|545aa|down_6|NZ_AP014658.1_1372560_1374195_-	cd08519, PBP2_NikA_DppA_OppA_like_20, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|428aa|down_7|NZ_AP014658.1_1374474_1375758_-	cd00854, NagA, N-acetylglucosamine-6-phosphate deacetylase, NagA, catalyzes the hydrolysis of the N-acetyl group of N-acetyl-glucosamine-6-phosphate (GlcNAc-6-P) to glucosamine 6-phosphate and acetate	NA|271aa|down_8|NZ_AP014658.1_1375813_1376626_-	PRK00443, nagB, glucosamine-6-phosphate deaminase; Provisional	NA|375aa|down_9|NZ_AP014658.1_1376963_1378088_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]
GCF_000829295.1_ASM82929v1	NZ_AP014658	Bifidobacterium longum strain 105-A	4	1736132-1738286	4,1,1	CRISPRCasFinder,CRT,PILER-CR	no	WYL,cas2,cas1,cas9	csa3,cas3,WYL,DEDDh,cas2,cas1,cas9,casR	Type II-B,Type II-C,,Type II-A	GCTGGGAATTAGCATTCACCCTTCTTGATAAGCTTG,GCTGGGAATTAGCATTCACCCTTCTTGATAAGCTTG,GCTGGGAATTAGCATTCACCCTTCTTGATAAGCTTG	36,36,36	0	0	NA	NA	NA:NA:NA	33,33,12	33	TypeII-B,,TypeII-C,TypeII-A	csa3,cas3,WYL,DEDDh,cas2,cas1,cas9,casR	NA|97aa|up_0|NZ_AP014658.1_1735820_1736111_-,NA	NA|1395aa|up_9|NZ_AP014658.1_1720147_1724332_-	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	WYL|362aa|up_8|NZ_AP014658.1_1724409_1725495_+	pfam13280, WYL, WYL domain	NA|340aa|up_7|NZ_AP014658.1_1725728_1726748_+	TIGR03535, DapD_actino, 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase	NA|431aa|up_6|NZ_AP014658.1_1726961_1728254_-	cd06114, EcCS_like, Escherichia coli (Ec) citrate synthase (CS) GltA_like	NA|261aa|up_5|NZ_AP014658.1_1728526_1729309_-	cd01086, MetAP1, Methionine Aminopeptidase 1	NA|327aa|up_4|NZ_AP014658.1_1729498_1730479_-	COG3247, HdeD, Uncharacterized conserved protein [Function unknown]	NA|725aa|up_3|NZ_AP014658.1_1730655_1732830_+	COG3590, PepO, Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]	NA|246aa|up_2|NZ_AP014658.1_1732912_1733650_-	cd04496, SSB_OBF, SSB_OBF: A subfamily of OB folds similar to the OB fold of ssDNA-binding protein (SSB)	NA|605aa|up_1|NZ_AP014658.1_1733925_1735740_-	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|97aa|up_0|NZ_AP014658.1_1735820_1736111_-	NA	cas2|111aa|down_0|NZ_AP014658.1_1738336_1738669_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|302aa|down_1|NZ_AP014658.1_1738665_1739571_-	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas9|1139aa|down_2|NZ_AP014658.1_1739574_1742991_-	pfam18470, Cas9_a, Cas9 alpha-helical lobe domain	NA|473aa|down_3|NZ_AP014658.1_1743224_1744643_-	cd18037, DEXSc_Pif1_like, DEAD-box helicase domain of Pif1	NA|217aa|down_4|NZ_AP014658.1_1744691_1745342_-	PRK05359, PRK05359, oligoribonuclease; Provisional	NA|518aa|down_5|NZ_AP014658.1_1745509_1747063_-	pfam00478, IMPDH, IMP dehydrogenase / GMP reductase domain	NA|428aa|down_6|NZ_AP014658.1_1747115_1748399_-	cd06853, GT_WecA_like, This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins	NA|225aa|down_7|NZ_AP014658.1_1748395_1749070_-	COG0009, SUA5, Putative translation factor (SUA5) [Translation, ribosomal structure and biogenesis]	NA|226aa|down_8|NZ_AP014658.1_1749244_1749922_+	cd03357, LbH_MAT_GAT, Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates	NA|235aa|down_9|NZ_AP014658.1_1750053_1750758_-	COG0410, LivF, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]
GCF_000829295.1_ASM82929v1	NZ_AP014658	Bifidobacterium longum strain 105-A	5	1869888-1869973	5	CRISPRCasFinder	no		csa3,cas3,WYL,DEDDh,cas2,cas1,cas9,casR	Orphan	GGCCCTGAGCGTGCGGGCGCGGA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,WYL,DEDDh,cas2,cas1,cas9,casR	NA|51aa|up_8|NZ_AP014658.1_1859630_1859783_-,NA|30aa|up_5|NZ_AP014658.1_1861976_1862066_-,NA|211aa|up_4|NZ_AP014658.1_1862257_1862890_-,NA|243aa|down_1|NZ_AP014658.1_1872053_1872782_+,NA|98aa|down_6|NZ_AP014658.1_1879931_1880225_+	NA|539aa|up_9|NZ_AP014658.1_1857855_1859472_+	PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed	NA|51aa|up_8|NZ_AP014658.1_1859630_1859783_-	NA	NA|297aa|up_7|NZ_AP014658.1_1860104_1860995_+	cd09278, RNase_HI_prokaryote_like, RNase HI family found mainly in prokaryotes	NA|233aa|up_6|NZ_AP014658.1_1861125_1861824_+	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA	NA|30aa|up_5|NZ_AP014658.1_1861976_1862066_-	NA	NA|211aa|up_4|NZ_AP014658.1_1862257_1862890_-	NA	NA|513aa|up_3|NZ_AP014658.1_1863017_1864556_+	PRK11823, PRK11823, DNA repair protein RadA; Provisional	NA|375aa|up_2|NZ_AP014658.1_1864570_1865695_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|388aa|up_1|NZ_AP014658.1_1865792_1866956_-	PRK03287, truB, tRNA pseudouridine synthase B; Provisional	NA|158aa|up_0|NZ_AP014658.1_1866957_1867431_-	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|356aa|down_0|NZ_AP014658.1_1870778_1871846_-	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|243aa|down_1|NZ_AP014658.1_1872053_1872782_+	NA	NA|348aa|down_2|NZ_AP014658.1_1872822_1873866_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|523aa|down_3|NZ_AP014658.1_1874398_1875967_-	COG3534, AbfA, Alpha-L-arabinofuranosidase [Carbohydrate transport and metabolism]	NA|267aa|down_4|NZ_AP014658.1_1876776_1877577_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|727aa|down_5|NZ_AP014658.1_1877783_1879964_-	pfam14403, CP_ATPgrasp_2, Circularly permuted ATP-grasp type 2	NA|98aa|down_6|NZ_AP014658.1_1879931_1880225_+	NA	NA|304aa|down_7|NZ_AP014658.1_1880221_1881133_+	cd02570, PseudoU_synth_EcTruA, Eukaryotic and bacterial pseudouridine synthases similar to E	NA|181aa|down_8|NZ_AP014658.1_1881214_1881757_-	PRK05591, rplQ, 50S ribosomal protein L17; Validated	NA|332aa|down_9|NZ_AP014658.1_1881856_1882852_-	PRK05182, PRK05182, DNA-directed RNA polymerase subunit alpha; Provisional
GCF_000829295.1_ASM82929v1	NZ_AP014658	Bifidobacterium longum strain 105-A	6	2005747-2005820	6	CRISPRCasFinder	no		csa3,cas3,WYL,DEDDh,cas2,cas1,cas9,casR	Orphan	AAATGACGAACCGGGACAGCGAA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,WYL,DEDDh,cas2,cas1,cas9,casR	NA|59aa|up_0|NZ_AP014658.1_2004727_2004904_+,NA	NA|378aa|up_9|NZ_AP014658.1_1992300_1993434_-	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional	NA|564aa|up_8|NZ_AP014658.1_1993434_1995126_-	PRK12296, obgE, GTPase CgtA; Reviewed	NA|83aa|up_7|NZ_AP014658.1_1995194_1995443_-	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|103aa|up_6|NZ_AP014658.1_1995465_1995774_-	PRK05573, rplU, 50S ribosomal protein L21; Validated	NA|1023aa|up_5|NZ_AP014658.1_1995925_1998994_-	TIGR00757, Ribonuclease_E/G-like_protein, ribonuclease, Rne/Rng family	NA|402aa|up_4|NZ_AP014658.1_1999306_2000512_-	cd05647, M20_DapE_actinobac, M20 Peptidase actinobacterial DapE encoded N-succinyl-L,L-diaminopimelic acid desuccinylase	NA|315aa|up_3|NZ_AP014658.1_2000618_2001563_+	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|483aa|up_2|NZ_AP014658.1_2001738_2003187_-	COG3127, COG3127, Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|365aa|up_1|NZ_AP014658.1_2003183_2004278_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|59aa|up_0|NZ_AP014658.1_2004727_2004904_+	NA	NA|485aa|down_0|NZ_AP014658.1_2005899_2007354_-	PRK00148, PRK00148, Maf-like protein; Reviewed	NA|371aa|down_1|NZ_AP014658.1_2007490_2008603_-	PRK01212, PRK01212, homoserine kinase; Provisional	NA|439aa|down_2|NZ_AP014658.1_2008697_2010014_-	PRK06349, PRK06349, homoserine dehydrogenase; Provisional	NA|531aa|down_3|NZ_AP014658.1_2010174_2011767_-	cd06828, PLPDE_III_DapDC, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Diaminopimelate Decarboxylase	NA|621aa|down_4|NZ_AP014658.1_2011769_2013632_-	COG0018, ArgS, Arginyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]	NA|222aa|down_5|NZ_AP014658.1_2013845_2014511_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|426aa|down_6|NZ_AP014658.1_2014507_2015785_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|311aa|down_7|NZ_AP014658.1_2015871_2016804_+	cd08423, PBP2_LTTR_like_6, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold	NA|418aa|down_8|NZ_AP014658.1_2016897_2018151_+	COG1168, MalY, Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities [Amino acid transport and metabolism]	NA|442aa|down_9|NZ_AP014658.1_2018196_2019522_-	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated
