assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001719085.1_ASM171908v1	NZ_CP013673	Bifidobacterium longum strain 35624, complete genome	1	171510-171878	1	CRISPRCasFinder	no		cas3,WYL,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,casR	Orphan	CGATGACCCGTGGAATCAACCGCA	24	0	0	NA	NA	NA	6	6	Orphan	cas3,WYL,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,casR	NA|201aa|up_5|NZ_CP013673.1_165059_165662_+,NA|91aa|down_2|NZ_CP013673.1_174120_174393_-,NA|238aa|down_7|NZ_CP013673.1_179054_179768_+,NA|170aa|down_9|NZ_CP013673.1_181473_181983_-	NA|242aa|up_9|NZ_CP013673.1_161434_162160_+	PRK10847, PRK10847, DedA family protein	NA|317aa|up_8|NZ_CP013673.1_162382_163333_+	TIGR03815, CpaE_hom_Actino, helicase/secretion neighborhood CpaE-like protein	NA|356aa|up_7|NZ_CP013673.1_163339_164407_+	TIGR03819, heli_sec_ATPase, helicase/secretion neighborhood ATPase	NA|231aa|up_6|NZ_CP013673.1_164370_165063_+	COG4965, TadB, Flp pilus assembly protein TadB [Intracellular trafficking and secretion]	NA|201aa|up_5|NZ_CP013673.1_165059_165662_+	NA	NA|132aa|up_4|NZ_CP013673.1_165838_166234_+	pfam14029, DUF4244, Protein of unknown function (DUF4244)	NA|129aa|up_3|NZ_CP013673.1_166230_166617_+	pfam07811, TadE, TadE-like protein	NA|129aa|up_2|NZ_CP013673.1_166657_167044_+	TIGR03816, tadE_like_DECH, helicase/secretion neighborhood TadE-like protein	NA|377aa|up_1|NZ_CP013673.1_167092_168223_+	PRK11914, PRK11914, diacylglycerol kinase; Reviewed	NA|200aa|up_0|NZ_CP013673.1_168420_169020_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|201aa|down_0|NZ_CP013673.1_172153_172756_+	PRK00076, recR, recombination protein RecR; Reviewed	NA|378aa|down_1|NZ_CP013673.1_172752_173886_-	cd05827, Sortase_C, Sortase domain found in class C sortases	NA|91aa|down_2|NZ_CP013673.1_174120_174393_-	NA	NA|428aa|down_3|NZ_CP013673.1_174773_176057_+	pfam13635, DUF4143, Domain of unknown function (DUF4143)	NA|255aa|down_4|NZ_CP013673.1_176417_177182_+	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|189aa|down_5|NZ_CP013673.1_177237_177804_+	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|365aa|down_6|NZ_CP013673.1_177885_178980_-	PRK08664, PRK08664, aspartate-semialdehyde dehydrogenase; Reviewed	NA|238aa|down_7|NZ_CP013673.1_179054_179768_+	NA	NA|524aa|down_8|NZ_CP013673.1_179770_181342_-	cd07383, MPP_Dcr2, Saccharomyces cerevisiae DCR2 phosphatase and related proteins, metallophosphatase domain	NA|170aa|down_9|NZ_CP013673.1_181473_181983_-	NA
GCF_001719085.1_ASM171908v1	NZ_CP013673	Bifidobacterium longum strain 35624, complete genome	2	289528-289613	2	CRISPRCasFinder	no		cas3,WYL,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,casR	Orphan	GACAGCTCCCGCCAGCGGGAGCACTT	26	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,casR	NA|186aa|up_4|NZ_CP013673.1_284396_284954_+,NA|695aa|down_6|NZ_CP013673.1_298069_300154_+	NA|560aa|up_9|NZ_CP013673.1_277028_278708_+	COG1080, PtsA, Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [Carbohydrate transport and metabolism]	NA|245aa|up_8|NZ_CP013673.1_278941_279676_+	COG0580, GlpF, Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) [Carbohydrate transport and metabolism]	NA|903aa|up_7|NZ_CP013673.1_279759_282468_-	cd02079, P-type_ATPase_HM, P-type heavy metal-transporting ATPase	NA|94aa|up_6|NZ_CP013673.1_282579_282861_+	COG1937, COG1937, Uncharacterized protein conserved in bacteria [Function unknown]	NA|468aa|up_5|NZ_CP013673.1_282924_284328_+	pfam02646, RmuC, RmuC family	NA|186aa|up_4|NZ_CP013673.1_284396_284954_+	NA	NA|283aa|up_3|NZ_CP013673.1_285047_285896_+	cd18096, SpoU-like, SAM-dependent rRNA or tRNA methylase related to SpoU	NA|100aa|up_2|NZ_CP013673.1_286062_286362_+	PRK00034, gatC, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatC	NA|514aa|up_1|NZ_CP013673.1_286365_287907_+	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|500aa|up_0|NZ_CP013673.1_287932_289432_+	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB	NA|346aa|down_0|NZ_CP013673.1_289754_290792_+	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|105aa|down_1|NZ_CP013673.1_290802_291117_+	pfam10611, DUF2469, Protein of unknown function (DUF2469)	NA|607aa|down_2|NZ_CP013673.1_291323_293144_+	cd07061, HP_HAP_like, Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction	NA|538aa|down_3|NZ_CP013673.1_293362_294976_+	PRK07208, PRK07208, hypothetical protein; Provisional	NA|690aa|down_4|NZ_CP013673.1_295181_297251_+	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|130aa|down_5|NZ_CP013673.1_297558_297948_+	PRK09239, PRK09239, chorismate mutase; Provisional	NA|695aa|down_6|NZ_CP013673.1_298069_300154_+	NA	NA|947aa|down_7|NZ_CP013673.1_300379_303220_-	NF000540, alt_ValS, valine--tRNA ligase	NA|489aa|down_8|NZ_CP013673.1_303258_304725_-	cd08494, PBP2_NikA_DppA_OppA_like_6, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|229aa|down_9|NZ_CP013673.1_304846_305533_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]
GCF_001719085.1_ASM171908v1	NZ_CP013673	Bifidobacterium longum strain 35624, complete genome	3	881726-881806	3	CRISPRCasFinder	no		cas3,WYL,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,casR	Orphan	ATATGAGACGGCTTCACTGTGCG	23	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,casR	NA|327aa|up_0|NZ_CP013673.1_879973_880954_-,NA|107aa|down_5|NZ_CP013673.1_891128_891449_+	NA|375aa|up_9|NZ_CP013673.1_867788_868913_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|271aa|up_8|NZ_CP013673.1_869250_870063_+	PRK00443, nagB, glucosamine-6-phosphate deaminase; Provisional	NA|426aa|up_7|NZ_CP013673.1_870118_871396_+	cd00854, NagA, N-acetylglucosamine-6-phosphate deacetylase, NagA, catalyzes the hydrolysis of the N-acetyl group of N-acetyl-glucosamine-6-phosphate (GlcNAc-6-P) to glucosamine 6-phosphate and acetate	NA|545aa|up_6|NZ_CP013673.1_871675_873310_+	cd08519, PBP2_NikA_DppA_OppA_like_20, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|364aa|up_5|NZ_CP013673.1_873479_874571_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|390aa|up_4|NZ_CP013673.1_874572_875742_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|570aa|up_3|NZ_CP013673.1_875745_877455_+	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|174aa|up_2|NZ_CP013673.1_877504_878026_-	cd04676, Nudix_Hydrolase_17, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|532aa|up_1|NZ_CP013673.1_878074_879670_-	cd01087, Prolidase, Prolidase	NA|327aa|up_0|NZ_CP013673.1_879973_880954_-	NA	NA|532aa|down_0|NZ_CP013673.1_882079_883675_+	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|1226aa|down_1|NZ_CP013673.1_883735_887413_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|332aa|down_2|NZ_CP013673.1_887531_888527_-	pfam13480, Acetyltransf_6, Acetyltransferase (GNAT) domain	NA|518aa|down_3|NZ_CP013673.1_888648_890202_-	PRK00139, murE, UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase; Provisional	NA|270aa|down_4|NZ_CP013673.1_890319_891129_+	PRK09652, PRK09652, RNA polymerase sigma factor RpoE; Provisional	NA|107aa|down_5|NZ_CP013673.1_891128_891449_+	NA	NA|315aa|down_6|NZ_CP013673.1_891715_892660_-	cd09022, Aldose_epim_Ec_YihR, Aldose 1-epimerase, similar to Escherichia coli YihR	NA|319aa|down_7|NZ_CP013673.1_892784_893741_-	cd09022, Aldose_epim_Ec_YihR, Aldose 1-epimerase, similar to Escherichia coli YihR	NA|353aa|down_8|NZ_CP013673.1_893965_895024_+	PRK01045, ispH, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed	NA|170aa|down_9|NZ_CP013673.1_895063_895573_-	cd00002, YbaK_deacylase, This CD includes cysteinyl-tRNA(Pro) deacylases from Haemophilus influenzae and Escherichia coli and other related bacterial proteins
GCF_001719085.1_ASM171908v1	NZ_CP013673	Bifidobacterium longum strain 35624, complete genome	4	1566345-1566476	4	CRISPRCasFinder	no		cas3,WYL,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,casR	Orphan	GCGCAACATTCCCGGGAACGCGTCGC	26	0	0	NA	NA	NA	2	2	Orphan	cas3,WYL,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,casR	NA|307aa|up_7|NZ_CP013673.1_1550944_1551865_+,NA|766aa|up_3|NZ_CP013673.1_1559942_1562240_+,NA|172aa|down_8|NZ_CP013673.1_1574809_1575325_-	NA|272aa|up_9|NZ_CP013673.1_1549601_1550417_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|71aa|up_8|NZ_CP013673.1_1550554_1550767_+	cd06290, PBP1_LacI-like, ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors	NA|307aa|up_7|NZ_CP013673.1_1550944_1551865_+	NA	NA|372aa|up_6|NZ_CP013673.1_1552138_1553254_-	pfam10609, ParA, NUBPL iron-transfer P-loop NTPase	NA|921aa|up_5|NZ_CP013673.1_1553430_1556193_-	PRK07956, ligA, NAD-dependent DNA ligase LigA; Validated	NA|1185aa|up_4|NZ_CP013673.1_1556257_1559812_-	sd00006, TPR, Tetratricopeptide repeat	NA|766aa|up_3|NZ_CP013673.1_1559942_1562240_+	NA	NA|590aa|up_2|NZ_CP013673.1_1562374_1564144_+	pfam05935, Arylsulfotrans, Arylsulfotransferase (ASST)	NA|275aa|up_1|NZ_CP013673.1_1564200_1565025_-	cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding cassette component of cobalt transport system	NA|420aa|up_0|NZ_CP013673.1_1565033_1566293_-	cd00609, AAT_like, Aspartate aminotransferase family	NA|256aa|down_0|NZ_CP013673.1_1566569_1567337_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|262aa|down_1|NZ_CP013673.1_1567582_1568368_+	pfam14512, TM1586_NiRdase, Putative TM nitroreductase	NA|341aa|down_2|NZ_CP013673.1_1568409_1569432_-	cd04185, GT_2_like_b, Subfamily of Glycosyltransferase Family GT2 of unknown function	NA|218aa|down_3|NZ_CP013673.1_1569674_1570328_+	COG3601, COG3601, Predicted membrane protein [Function unknown]	NA|811aa|down_4|NZ_CP013673.1_1570458_1572891_+	cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding cassette component of cobalt transport system	NA|102aa|down_5|NZ_CP013673.1_1572962_1573268_+	pfam01985, CRS1_YhbY, CRS1 / YhbY (CRM) domain	NA|122aa|down_6|NZ_CP013673.1_1573366_1573732_-	pfam04020, Phage_holin_4_2, Mycobacterial 4 TMS phage holin, superfamily IV	NA|217aa|down_7|NZ_CP013673.1_1573808_1574459_+	pfam05154, TM2, TM2 domain	NA|172aa|down_8|NZ_CP013673.1_1574809_1575325_-	NA	NA|299aa|down_9|NZ_CP013673.1_1575428_1576325_-	cd05300, 2-Hacid_dh_1, Putative D-isomer specific 2-hydroxyacid dehydrogenase
GCF_001719085.1_ASM171908v1	NZ_CP013673	Bifidobacterium longum strain 35624, complete genome	5	1656102-1666931	1,5,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	cas3,WYL,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,casR	 Type I-U?,Type I-C,Type I-U	ATTTCAATCCACGCACCCCAGTGGGGTGCGAC,ATTTCAATCCACGCACCCCAGTGGGGTGCGAC,ATTTCAATCCACGCACCCCAGTGGGGTGCGAC	32,32,32	1	1	1666865-1666899	NZ_CP013673.1_1858392-1858358	I-C:I-C:I-C	163,163,163	163	TypeI-U?,TypeI-C,TypeI-U	cas3,WYL,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,casR	NA,NA	NA|556aa|up_9|NZ_CP013673.1_1643135_1644803_-	TIGR03688, pupylate_PafA2, proteasome accessory factor PafA2	NA|522aa|up_8|NZ_CP013673.1_1644824_1646390_-	TIGR03689, pup_AAA, proteasome ATPase	NA|233aa|up_7|NZ_CP013673.1_1646450_1647149_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|251aa|up_6|NZ_CP013673.1_1647169_1647922_+	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|771aa|up_5|NZ_CP013673.1_1647961_1650274_+	PRK14873, PRK14873, primosomal protein N'	NA|235aa|up_4|NZ_CP013673.1_1650332_1651037_+	cd02603, HAD_sEH-N_like, N-terminal lipase phosphatase domain of human soluble epoxide hydrolase, Escherichia coli YihX/HAD4 alpha-D-glucose 1-phosphate phosphatase, and related domains, may be inactive	NA|329aa|up_3|NZ_CP013673.1_1651060_1652047_+	PRK00005, fmt, methionyl-tRNA formyltransferase; Reviewed	NA|621aa|up_2|NZ_CP013673.1_1652138_1654001_-	PRK12448, PRK12448, dihydroxy-acid dehydratase; Provisional	NA|95aa|up_1|NZ_CP013673.1_1654202_1654487_+	PRK00392, rpoZ, DNA-directed RNA polymerase subunit omega; Reviewed	NA|407aa|up_0|NZ_CP013673.1_1654764_1655985_+	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	cas2|97aa|down_0|NZ_CP013673.1_1667317_1667608_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|down_1|NZ_CP013673.1_1667693_1668725_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|234aa|down_2|NZ_CP013673.1_1668721_1669423_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas7|284aa|down_3|NZ_CP013673.1_1669461_1670313_-	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8c|653aa|down_4|NZ_CP013673.1_1670316_1672275_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas5|235aa|down_5|NZ_CP013673.1_1672277_1672982_-	cd09651, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|813aa|down_6|NZ_CP013673.1_1672988_1675427_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|74aa|down_7|NZ_CP013673.1_1676105_1676327_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|1104aa|down_8|NZ_CP013673.1_1677049_1680361_-	PRK06039, ileS, isoleucyl-tRNA synthetase; Reviewed	NA|404aa|down_9|NZ_CP013673.1_1680954_1682166_+	COG1168, MalY, Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities [Amino acid transport and metabolism]
GCF_001719085.1_ASM171908v1	NZ_CP013673	Bifidobacterium longum strain 35624, complete genome	6	1892256-1892341	6	CRISPRCasFinder	no		cas3,WYL,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,casR	Orphan	GGCCCTGAGCGTGCGGGCGCGGA	23	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,DEDDh,cas2,cas1,cas4,cas7,cas8c,cas5,casR	NA|116aa|up_8|NZ_CP013673.1_1882026_1882374_+,NA|30aa|up_5|NZ_CP013673.1_1884344_1884434_-,NA|193aa|up_4|NZ_CP013673.1_1884625_1885204_-,NA|274aa|down_1|NZ_CP013673.1_1894328_1895150_+,NA|104aa|down_7|NZ_CP013673.1_1902298_1902610_+	NA|539aa|up_9|NZ_CP013673.1_1880223_1881840_+	PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed	NA|116aa|up_8|NZ_CP013673.1_1882026_1882374_+	NA	NA|329aa|up_7|NZ_CP013673.1_1882376_1883363_+	cd09278, RNase_HI_prokaryote_like, RNase HI family found mainly in prokaryotes	NA|233aa|up_6|NZ_CP013673.1_1883493_1884192_+	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA	NA|30aa|up_5|NZ_CP013673.1_1884344_1884434_-	NA	NA|193aa|up_4|NZ_CP013673.1_1884625_1885204_-	NA	NA|513aa|up_3|NZ_CP013673.1_1885385_1886924_+	PRK11823, PRK11823, DNA repair protein RadA; Provisional	NA|375aa|up_2|NZ_CP013673.1_1886938_1888063_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|388aa|up_1|NZ_CP013673.1_1888160_1889324_-	PRK03287, truB, tRNA pseudouridine synthase B; Provisional	NA|158aa|up_0|NZ_CP013673.1_1889325_1889799_-	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|356aa|down_0|NZ_CP013673.1_1893146_1894214_-	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|274aa|down_1|NZ_CP013673.1_1894328_1895150_+	NA	NA|348aa|down_2|NZ_CP013673.1_1895190_1896234_-	COG1609, PurR, Transcriptional regulators [Transcription]	NA|389aa|down_3|NZ_CP013673.1_1896766_1897933_-	COG3534, AbfA, Alpha-L-arabinofuranosidase [Carbohydrate transport and metabolism]	NA|134aa|down_4|NZ_CP013673.1_1897932_1898334_-	COG3534, AbfA, Alpha-L-arabinofuranosidase [Carbohydrate transport and metabolism]	NA|267aa|down_5|NZ_CP013673.1_1899143_1899944_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|727aa|down_6|NZ_CP013673.1_1900150_1902331_-	pfam14403, CP_ATPgrasp_2, Circularly permuted ATP-grasp type 2	NA|104aa|down_7|NZ_CP013673.1_1902298_1902610_+	NA	NA|304aa|down_8|NZ_CP013673.1_1902606_1903518_+	cd02570, PseudoU_synth_EcTruA, Eukaryotic and bacterial pseudouridine synthases similar to E	NA|181aa|down_9|NZ_CP013673.1_1903599_1904142_-	PRK05591, rplQ, 50S ribosomal protein L17; Validated
