assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001051015.2_ASM105101v2	NZ_LN824140	Bifidobacterium longum subsp. infantis strain CECT 7210 chromosome I	1	562136-562219	1	CRISPRCasFinder	no		cas3,WYL,DEDDh,casR	Orphan	ATTTCGCGGAAACCGTACGATGA	23	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,DEDDh,casR	NA,NA	NA|113aa|up_9|NZ_LN824140.1_550786_551125_+	cd01276, PKCI_related, Protein Kinase C Interacting protein related (PKCI): PKCI and related proteins belong to the ubiquitous HIT family of hydrolases that act on alpha-phosphates of ribonucleotides	NA|392aa|up_8|NZ_LN824140.1_551143_552319_+	pfam02562, PhoH, PhoH-like protein	NA|183aa|up_7|NZ_LN824140.1_552308_552857_+	PRK00016, PRK00016, metal-binding heat shock protein; Provisional	NA|478aa|up_6|NZ_LN824140.1_552932_554366_+	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|355aa|up_5|NZ_LN824140.1_554367_555432_+	PRK00089, era, GTPase Era; Reviewed	NA|678aa|up_4|NZ_LN824140.1_555484_557518_-	cd05907, VL_LC_FACS_like, Long-chain fatty acid CoA synthetases and Bubblegum-like very long-chain fatty acid CoA synthetases	NA|388aa|up_3|NZ_LN824140.1_557936_559100_+	cd05304, Rubrum_tdh, Rubrum transdehydrogenase NAD-binding and catalytic domains	NA|102aa|up_2|NZ_LN824140.1_559115_559421_+	pfam12769, PNTB_4TM, 4TM region of pyridine nucleotide transhydrogenase, mitoch	NA|475aa|up_1|NZ_LN824140.1_559420_560845_+	pfam02233, PNTB, NAD(P) transhydrogenase beta subunit	NA|346aa|up_0|NZ_LN824140.1_560980_562018_-	COG1073, COG1073, Hydrolases of the alpha/beta superfamily [General function prediction only]	NA|207aa|down_0|NZ_LN824140.1_562446_563067_+	PRK05618, PRK05618, 50S ribosomal protein L25/general stress protein Ctc; Reviewed	NA|376aa|down_1|NZ_LN824140.1_563292_564420_+	PRK13357, PRK13357, branched-chain amino acid aminotransferase; Provisional	NA|300aa|down_2|NZ_LN824140.1_564617_565517_+	PRK12334, PRK12334, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|264aa|down_3|NZ_LN824140.1_565761_566553_+	COG2860, COG2860, Predicted membrane protein [Function unknown]	NA|87aa|down_4|NZ_LN824140.1_566804_567065_-	PRK00239, rpsT, 30S ribosomal protein S20; Reviewed	NA|627aa|down_5|NZ_LN824140.1_567211_569092_+	PRK05433, PRK05433, GTP-binding protein LepA; Provisional	NA|470aa|down_6|NZ_LN824140.1_569091_570501_+	PRK05628, PRK05628, coproporphyrinogen III oxidase; Validated	NA|367aa|down_7|NZ_LN824140.1_570580_571681_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|377aa|down_8|NZ_LN824140.1_571863_572994_+	pfam02595, Gly_kinase, Glycerate kinase family	NA|546aa|down_9|NZ_LN824140.1_573095_574733_-	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]
GCF_001051015.2_ASM105101v2	NZ_LN824140	Bifidobacterium longum subsp. infantis strain CECT 7210 chromosome I	2	1021873-1021953	2	CRISPRCasFinder	no		cas3,WYL,DEDDh,casR	Orphan	CGCACAGTGAAACCGTCTCATAT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,DEDDh,casR	NA|107aa|up_5|NZ_LN824140.1_1012220_1012541_-,NA|284aa|down_0|NZ_LN824140.1_1022853_1023705_+	NA|170aa|up_9|NZ_LN824140.1_1008096_1008606_+	cd00002, YbaK_deacylase, This CD includes cysteinyl-tRNA(Pro) deacylases from Haemophilus influenzae and Escherichia coli and other related bacterial proteins	NA|353aa|up_8|NZ_LN824140.1_1008645_1009704_-	PRK01045, ispH, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed	NA|319aa|up_7|NZ_LN824140.1_1009928_1010885_+	cd09022, Aldose_epim_Ec_YihR, Aldose 1-epimerase, similar to Escherichia coli YihR	NA|315aa|up_6|NZ_LN824140.1_1011010_1011955_+	cd09022, Aldose_epim_Ec_YihR, Aldose 1-epimerase, similar to Escherichia coli YihR	NA|107aa|up_5|NZ_LN824140.1_1012220_1012541_-	NA	NA|263aa|up_4|NZ_LN824140.1_1012540_1013329_-	PRK09652, PRK09652, RNA polymerase sigma factor RpoE; Provisional	NA|518aa|up_3|NZ_LN824140.1_1013467_1015021_+	PRK00139, murE, UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase; Provisional	NA|332aa|up_2|NZ_LN824140.1_1015142_1016138_+	pfam13480, Acetyltransf_6, Acetyltransferase (GNAT) domain	NA|1226aa|up_1|NZ_LN824140.1_1016256_1019934_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|535aa|up_0|NZ_LN824140.1_1019994_1021599_-	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|284aa|down_0|NZ_LN824140.1_1022853_1023705_+	NA	NA|532aa|down_1|NZ_LN824140.1_1024008_1025604_+	cd01087, Prolidase, Prolidase	NA|174aa|down_2|NZ_LN824140.1_1025652_1026174_+	cd04676, Nudix_Hydrolase_17, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|570aa|down_3|NZ_LN824140.1_1026223_1027933_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|390aa|down_4|NZ_LN824140.1_1027936_1029106_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|364aa|down_5|NZ_LN824140.1_1029107_1030199_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|545aa|down_6|NZ_LN824140.1_1030363_1031998_-	cd08519, PBP2_NikA_DppA_OppA_like_20, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|428aa|down_7|NZ_LN824140.1_1032277_1033561_-	cd00854, NagA, N-acetylglucosamine-6-phosphate deacetylase, NagA, catalyzes the hydrolysis of the N-acetyl group of N-acetyl-glucosamine-6-phosphate (GlcNAc-6-P) to glucosamine 6-phosphate and acetate	NA|271aa|down_8|NZ_LN824140.1_1033616_1034429_-	PRK00443, nagB, glucosamine-6-phosphate deaminase; Provisional	NA|375aa|down_9|NZ_LN824140.1_1034766_1035891_+	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]
GCF_001051015.2_ASM105101v2	NZ_LN824140	Bifidobacterium longum subsp. infantis strain CECT 7210 chromosome I	3	1542284-1542369	3	CRISPRCasFinder	no		cas3,WYL,DEDDh,casR	Orphan	GGCCCTGAGCGTGCGGGCGCGGA	23	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,DEDDh,casR	NA|30aa|up_5|NZ_LN824140.1_1534372_1534462_-,NA|211aa|up_4|NZ_LN824140.1_1534653_1535286_-,NA|243aa|down_1|NZ_LN824140.1_1544449_1545178_+	NA|141aa|up_9|NZ_LN824140.1_1529885_1530308_+	pfam02082, Rrf2, Transcriptional regulator	NA|539aa|up_8|NZ_LN824140.1_1530441_1532058_+	PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed	NA|303aa|up_7|NZ_LN824140.1_1532482_1533391_+	cd09278, RNase_HI_prokaryote_like, RNase HI family found mainly in prokaryotes	NA|233aa|up_6|NZ_LN824140.1_1533521_1534220_+	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA	NA|30aa|up_5|NZ_LN824140.1_1534372_1534462_-	NA	NA|211aa|up_4|NZ_LN824140.1_1534653_1535286_-	NA	NA|513aa|up_3|NZ_LN824140.1_1535413_1536952_+	PRK11823, PRK11823, DNA repair protein RadA; Provisional	NA|375aa|up_2|NZ_LN824140.1_1536966_1538091_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|388aa|up_1|NZ_LN824140.1_1538188_1539352_-	PRK03287, truB, tRNA pseudouridine synthase B; Provisional	NA|158aa|up_0|NZ_LN824140.1_1539353_1539827_-	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|356aa|down_0|NZ_LN824140.1_1543174_1544242_-	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|243aa|down_1|NZ_LN824140.1_1544449_1545178_+	NA	NA|348aa|down_2|NZ_LN824140.1_1545218_1546262_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|761aa|down_3|NZ_LN824140.1_1546427_1548710_-	PRK10658, PRK10658, putative alpha-glucosidase; Provisional	NA|305aa|down_4|NZ_LN824140.1_1548822_1549737_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|329aa|down_5|NZ_LN824140.1_1549733_1550720_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|481aa|down_6|NZ_LN824140.1_1550901_1552344_-	COG1653, UgpB, ABC-type sugar transport system, periplasmic component [Carbohydrate transport and metabolism]	NA|740aa|down_7|NZ_LN824140.1_1552312_1554532_-	cd14791, GH36, glycosyl hydrolase family 36 (GH36)	NA|520aa|down_8|NZ_LN824140.1_1554585_1556145_-	COG3534, AbfA, Alpha-L-arabinofuranosidase [Carbohydrate transport and metabolism]	NA|267aa|down_9|NZ_LN824140.1_1558128_1558929_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]
GCF_001051015.2_ASM105101v2	NZ_LN824140	Bifidobacterium longum subsp. infantis strain CECT 7210 chromosome I	4	2121792-2122160	4	CRISPRCasFinder	no		cas3,WYL,DEDDh,casR	Orphan	CGATGACCCGTGGAATCAACCGCA	24	0	0	NA	NA	NA	6	6	Orphan	cas3,WYL,DEDDh,casR	NA|201aa|up_5|NZ_LN824140.1_2115342_2115945_+,NA|99aa|down_2|NZ_LN824140.1_2124530_2124827_+,NA|207aa|down_6|NZ_LN824140.1_2129429_2130050_+,NA|57aa|down_7|NZ_LN824140.1_2130123_2130294_+	NA|242aa|up_9|NZ_LN824140.1_2111779_2112505_+	PRK10847, PRK10847, DedA family protein	NA|317aa|up_8|NZ_LN824140.1_2112665_2113616_+	TIGR03815, CpaE_hom_Actino, helicase/secretion neighborhood CpaE-like protein	NA|356aa|up_7|NZ_LN824140.1_2113622_2114690_+	TIGR03819, heli_sec_ATPase, helicase/secretion neighborhood ATPase	NA|219aa|up_6|NZ_LN824140.1_2114689_2115346_+	COG4965, TadB, Flp pilus assembly protein TadB [Intracellular trafficking and secretion]	NA|201aa|up_5|NZ_LN824140.1_2115342_2115945_+	NA	NA|96aa|up_4|NZ_LN824140.1_2116229_2116517_+	pfam14029, DUF4244, Protein of unknown function (DUF4244)	NA|126aa|up_3|NZ_LN824140.1_2116522_2116900_+	pfam07811, TadE, TadE-like protein	NA|126aa|up_2|NZ_LN824140.1_2116949_2117327_+	TIGR03816, tadE_like_DECH, helicase/secretion neighborhood TadE-like protein	NA|377aa|up_1|NZ_LN824140.1_2117375_2118506_+	PRK11914, PRK11914, diacylglycerol kinase; Reviewed	NA|200aa|up_0|NZ_LN824140.1_2118703_2119303_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|201aa|down_0|NZ_LN824140.1_2122435_2123038_+	PRK00076, recR, recombination protein RecR; Reviewed	NA|378aa|down_1|NZ_LN824140.1_2123034_2124168_-	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|99aa|down_2|NZ_LN824140.1_2124530_2124827_+	NA	NA|255aa|down_3|NZ_LN824140.1_2126699_2127464_+	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|189aa|down_4|NZ_LN824140.1_2127519_2128086_+	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|365aa|down_5|NZ_LN824140.1_2128167_2129262_-	PRK08664, PRK08664, aspartate-semialdehyde dehydrogenase; Reviewed	NA|207aa|down_6|NZ_LN824140.1_2129429_2130050_+	NA	NA|57aa|down_7|NZ_LN824140.1_2130123_2130294_+	NA	NA|489aa|down_8|NZ_LN824140.1_2130384_2131851_+	cd07383, MPP_Dcr2, Saccharomyces cerevisiae DCR2 phosphatase and related proteins, metallophosphatase domain	NA|639aa|down_9|NZ_LN824140.1_2132342_2134259_+	PRK03739, PRK03739, 2-isopropylmalate synthase; Validated
GCF_001051015.2_ASM105101v2	NZ_LN824140	Bifidobacterium longum subsp. infantis strain CECT 7210 chromosome I	5	2240651-2240736	5	CRISPRCasFinder	no		cas3,WYL,DEDDh,casR	Orphan	GACAGCTCCCGCCAGCGGGAGCACTT	26	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,DEDDh,casR	NA,NA|695aa|down_6|NZ_LN824140.1_2249192_2251277_+	NA|560aa|up_9|NZ_LN824140.1_2228151_2229831_+	COG1080, PtsA, Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [Carbohydrate transport and metabolism]	NA|245aa|up_8|NZ_LN824140.1_2230064_2230799_+	COG0580, GlpF, Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) [Carbohydrate transport and metabolism]	NA|903aa|up_7|NZ_LN824140.1_2230882_2233591_-	cd02094, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to human copper-transporting ATPases, ATP7A and ATP7B	NA|94aa|up_6|NZ_LN824140.1_2233702_2233984_+	COG1937, COG1937, Uncharacterized protein conserved in bacteria [Function unknown]	NA|475aa|up_5|NZ_LN824140.1_2234026_2235451_+	pfam02646, RmuC, RmuC family	NA|210aa|up_4|NZ_LN824140.1_2235447_2236077_+	PRK00455, pyrE, orotate phosphoribosyltransferase; Validated	NA|283aa|up_3|NZ_LN824140.1_2236170_2237019_+	cd18096, SpoU-like, SAM-dependent rRNA or tRNA methylase related to SpoU	NA|100aa|up_2|NZ_LN824140.1_2237185_2237485_+	PRK00034, gatC, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatC	NA|514aa|up_1|NZ_LN824140.1_2237488_2239030_+	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|500aa|up_0|NZ_LN824140.1_2239055_2240555_+	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB	NA|355aa|down_0|NZ_LN824140.1_2240850_2241915_+	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|105aa|down_1|NZ_LN824140.1_2241925_2242240_+	pfam10611, DUF2469, Protein of unknown function (DUF2469)	NA|618aa|down_2|NZ_LN824140.1_2242413_2244267_+	cd07061, HP_HAP_like, Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction	NA|538aa|down_3|NZ_LN824140.1_2244485_2246099_+	PRK07208, PRK07208, hypothetical protein; Provisional	NA|690aa|down_4|NZ_LN824140.1_2246304_2248374_+	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|130aa|down_5|NZ_LN824140.1_2248681_2249071_+	PRK09239, PRK09239, chorismate mutase; Provisional	NA|695aa|down_6|NZ_LN824140.1_2249192_2251277_+	NA	NA|912aa|down_7|NZ_LN824140.1_2251502_2254238_-	NF000540, alt_ValS, valine--tRNA ligase	NA|509aa|down_8|NZ_LN824140.1_2254381_2255908_-	cd08494, PBP2_NikA_DppA_OppA_like_6, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|229aa|down_9|NZ_LN824140.1_2255969_2256656_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]
