assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000219455.1_ASM21945v1	NC_017221	Bifidobacterium longum subsp. longum KACC 91563, complete sequence	1	279776-279861	1	CRISPRCasFinder	no		casR,cas9,cas1,cas2,WYL,DEDDh,cas3	Orphan	CTCCGCGCCCGCACGCTCAGGGC	23	0	0	NA	NA	NA	1	1	Orphan	casR,cas9,cas1,cas2,WYL,DEDDh,cas3	NA|243aa|up_1|NC_017221.1_276967_277696_-,NA|211aa|down_4|NC_017221.1_286859_287492_+,NA|30aa|down_5|NC_017221.1_287683_287773_+,NA|99aa|down_8|NC_017221.1_289822_290119_+	NA|133aa|up_9|NC_017221.1_266454_266853_+	PRK05309, PRK05309, 30S ribosomal protein S11; Validated	NA|332aa|up_8|NC_017221.1_266933_267929_+	PRK05182, PRK05182, DNA-directed RNA polymerase subunit alpha; Provisional	NA|181aa|up_7|NC_017221.1_268028_268571_+	PRK05591, rplQ, 50S ribosomal protein L17; Validated	NA|304aa|up_6|NC_017221.1_268652_269564_-	cd02570, PseudoU_synth_EcTruA, Eukaryotic and bacterial pseudouridine synthases similar to E	NA|727aa|up_5|NC_017221.1_269785_271966_+	pfam14403, CP_ATPgrasp_2, Circularly permuted ATP-grasp type 2	NA|267aa|up_4|NC_017221.1_272172_272973_+	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|523aa|up_3|NC_017221.1_273782_275351_+	COG3534, AbfA, Alpha-L-arabinofuranosidase [Carbohydrate transport and metabolism]	NA|348aa|up_2|NC_017221.1_275883_276927_+	COG1609, PurR, Transcriptional regulators [Transcription]	NA|243aa|up_1|NC_017221.1_276967_277696_-	NA	NA|356aa|up_0|NC_017221.1_277903_278971_+	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|158aa|down_0|NC_017221.1_282318_282792_+	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|388aa|down_1|NC_017221.1_282793_283957_+	PRK03287, truB, tRNA pseudouridine synthase B; Provisional	NA|375aa|down_2|NC_017221.1_284054_285179_+	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|513aa|down_3|NC_017221.1_285193_286732_-	PRK11823, PRK11823, DNA repair protein RadA; Provisional	NA|211aa|down_4|NC_017221.1_286859_287492_+	NA	NA|30aa|down_5|NC_017221.1_287683_287773_+	NA	NA|233aa|down_6|NC_017221.1_287925_288624_-	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA	NA|297aa|down_7|NC_017221.1_288754_289645_-	cd09278, RNase_HI_prokaryote_like, RNase HI family found mainly in prokaryotes	NA|99aa|down_8|NC_017221.1_289822_290119_+	NA	NA|539aa|down_9|NC_017221.1_290277_291894_-	PRK08010, PRK08010, pyridine nucleotide-disulfide oxidoreductase; Provisional
GCF_000219455.1_ASM21945v1	NC_017221	Bifidobacterium longum subsp. longum KACC 91563, complete sequence	2	454101-456189	2,1,1,2,3	CRISPRCasFinder,CRT,PILER-CR,PILER-CR,PILER-CR	no	cas9,cas1,cas2,WYL	casR,cas9,cas1,cas2,WYL,DEDDh,cas3	Type II-B,Type II-A,Type II-C	CAAGCTTATCAAGAAGGGTGAATGCTAATTCCCAGC,CAAGCTTATCAAGAAGGGTGAATGCTAATTCCCAGC,CAAGCTTATCAAGAAGGGTGAATGCTAATTCCCAGC,CAAGCTTATCAAGAAGGGTGAATGCTAATTCCCAGC,CAAGCTTATCAAGAAGGGTGAATGCTAATTCCCAGCT	36,36,36,36,37	0	0	NA	NA	NA:NA:NA:NA:NA	32,32,26,26,26	32	TypeII-C,TypeII-A,TypeII-B	casR,cas9,cas1,cas2,WYL,DEDDh,cas3	NA,NA|97aa|down_0|NC_017221.1_456209_456500_+	NA|226aa|up_9|NC_017221.1_441095_441773_-	cd03357, LbH_MAT_GAT, Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates	NA|225aa|up_8|NC_017221.1_441947_442622_+	COG0009, SUA5, Putative translation factor (SUA5) [Translation, ribosomal structure and biogenesis]	NA|428aa|up_7|NC_017221.1_442618_443902_+	cd06853, GT_WecA_like, This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins	NA|518aa|up_6|NC_017221.1_443954_445508_+	pfam00478, IMPDH, IMP dehydrogenase / GMP reductase domain	NA|217aa|up_5|NC_017221.1_445675_446326_+	PRK05359, PRK05359, oligoribonuclease; Provisional	NA|473aa|up_4|NC_017221.1_446374_447793_+	cd18037, DEXSc_Pif1_like, DEAD-box helicase domain of Pif1	cas9|1139aa|up_3|NC_017221.1_448026_451443_+	pfam18470, Cas9_a, Cas9 alpha-helical lobe domain	cas1|302aa|up_2|NC_017221.1_451446_452352_+	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas2|108aa|up_1|NC_017221.1_452348_452672_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|395aa|up_0|NC_017221.1_452791_453976_+	pfam00872, Transposase_mut, Transposase, Mutator family	NA|97aa|down_0|NC_017221.1_456209_456500_+	NA	NA|605aa|down_1|NC_017221.1_456581_458396_+	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|246aa|down_2|NC_017221.1_458671_459409_+	cd04496, SSB_OBF, SSB_OBF: A subfamily of OB folds similar to the OB fold of ssDNA-binding protein (SSB)	NA|725aa|down_3|NC_017221.1_459491_461666_-	COG3590, PepO, Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]	NA|327aa|down_4|NC_017221.1_461842_462823_+	COG3247, HdeD, Uncharacterized conserved protein [Function unknown]	NA|261aa|down_5|NC_017221.1_463012_463795_+	cd01086, MetAP1, Methionine Aminopeptidase 1	NA|431aa|down_6|NC_017221.1_464068_465361_+	cd06114, EcCS_like, Escherichia coli (Ec) citrate synthase (CS) GltA_like	NA|340aa|down_7|NC_017221.1_465574_466594_-	TIGR03535, DapD_actino, 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase	WYL|362aa|down_8|NC_017221.1_466827_467913_-	pfam13280, WYL, WYL domain	NA|1395aa|down_9|NC_017221.1_467990_472175_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]
GCF_000219455.1_ASM21945v1	NC_017221	Bifidobacterium longum subsp. longum KACC 91563, complete sequence	3	847799-847879	3	CRISPRCasFinder	no		casR,cas9,cas1,cas2,WYL,DEDDh,cas3	Orphan	ATATGAGACGGCTTCACTGTGCG	23	0	0	NA	NA	NA	1	1	Orphan	casR,cas9,cas1,cas2,WYL,DEDDh,cas3	NA|284aa|up_0|NC_017221.1_846046_846898_-,NA|107aa|down_5|NC_017221.1_857195_857516_+	NA|375aa|up_9|NC_017221.1_833856_834981_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|271aa|up_8|NC_017221.1_835318_836131_+	PRK00443, nagB, glucosamine-6-phosphate deaminase; Provisional	NA|428aa|up_7|NC_017221.1_836186_837470_+	cd00854, NagA, N-acetylglucosamine-6-phosphate deacetylase, NagA, catalyzes the hydrolysis of the N-acetyl group of N-acetyl-glucosamine-6-phosphate (GlcNAc-6-P) to glucosamine 6-phosphate and acetate	NA|545aa|up_6|NC_017221.1_837749_839384_+	cd08519, PBP2_NikA_DppA_OppA_like_20, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|364aa|up_5|NC_017221.1_839552_840644_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|390aa|up_4|NC_017221.1_840645_841815_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|570aa|up_3|NC_017221.1_841818_843528_+	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|174aa|up_2|NC_017221.1_843577_844099_-	cd04676, Nudix_Hydrolase_17, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|532aa|up_1|NC_017221.1_844147_845743_-	cd01087, Prolidase, Prolidase	NA|284aa|up_0|NC_017221.1_846046_846898_-	NA	NA|530aa|down_0|NC_017221.1_848152_849742_+	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|1226aa|down_1|NC_017221.1_849802_853480_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|332aa|down_2|NC_017221.1_853598_854594_-	pfam13480, Acetyltransf_6, Acetyltransferase (GNAT) domain	NA|518aa|down_3|NC_017221.1_854715_856269_-	PRK00139, murE, UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase; Provisional	NA|263aa|down_4|NC_017221.1_856407_857196_+	PRK09652, PRK09652, RNA polymerase sigma factor RpoE; Provisional	NA|107aa|down_5|NC_017221.1_857195_857516_+	NA	NA|315aa|down_6|NC_017221.1_857781_858726_-	cd09022, Aldose_epim_Ec_YihR, Aldose 1-epimerase, similar to Escherichia coli YihR	NA|319aa|down_7|NC_017221.1_858851_859808_-	cd09022, Aldose_epim_Ec_YihR, Aldose 1-epimerase, similar to Escherichia coli YihR	NA|353aa|down_8|NC_017221.1_860032_861091_+	PRK01045, ispH, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed	NA|170aa|down_9|NC_017221.1_861130_861640_-	cd00002, YbaK_deacylase, This CD includes cysteinyl-tRNA(Pro) deacylases from Haemophilus influenzae and Escherichia coli and other related bacterial proteins
GCF_000219455.1_ASM21945v1	NC_017221	Bifidobacterium longum subsp. longum KACC 91563, complete sequence	4	1246426-1246515	4	CRISPRCasFinder	no		casR,cas9,cas1,cas2,WYL,DEDDh,cas3	Orphan	CTCCCCTCAAAGAGGGGAGCCAAGA	25	0	0	NA	NA	NA	1	1	Orphan	casR,cas9,cas1,cas2,WYL,DEDDh,cas3	NA|331aa|up_9|NC_017221.1_1235164_1236157_-,NA|313aa|up_8|NC_017221.1_1236181_1237120_-,NA|537aa|up_7|NC_017221.1_1237134_1238745_-,NA|170aa|up_4|NC_017221.1_1241247_1241757_-,NA|275aa|up_3|NC_017221.1_1242064_1242889_+,NA|155aa|down_1|NC_017221.1_1247971_1248436_-,NA|187aa|down_4|NC_017221.1_1252874_1253435_+,NA|109aa|down_8|NC_017221.1_1262274_1262601_+,NA|63aa|down_9|NC_017221.1_1262657_1262846_-	NA|331aa|up_9|NC_017221.1_1235164_1236157_-	NA	NA|313aa|up_8|NC_017221.1_1236181_1237120_-	NA	NA|537aa|up_7|NC_017221.1_1237134_1238745_-	NA	NA|274aa|up_6|NC_017221.1_1239256_1240078_-	pfam03747, ADP_ribosyl_GH, ADP-ribosylglycohydrolase	NA|75aa|up_5|NC_017221.1_1240167_1240392_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|170aa|up_4|NC_017221.1_1241247_1241757_-	NA	NA|275aa|up_3|NC_017221.1_1242064_1242889_+	NA	NA|184aa|up_2|NC_017221.1_1242991_1243543_-	COG1225, Bcp, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]	NA|286aa|up_1|NC_017221.1_1244503_1245361_-	pfam08282, Hydrolase_3, haloacid dehalogenase-like hydrolase	NA|283aa|up_0|NC_017221.1_1245515_1246364_-	COG1119, ModF, ABC-type molybdenum transport system, ATPase component/photorepair protein PhrA [Inorganic ion transport and metabolism]	NA|417aa|down_0|NC_017221.1_1246592_1247843_-	TIGR02149, glgA_Coryne, glycogen synthase, Corynebacterium family	NA|155aa|down_1|NC_017221.1_1247971_1248436_-	NA	NA|628aa|down_2|NC_017221.1_1248539_1250423_-	cd14244, GH_101_like, Endo-a-N-acetylgalactosaminidase and related glcyosyl hydrolases	NA|548aa|down_3|NC_017221.1_1250639_1252283_-	cd00995, PBP2_NikA_DppA_OppA_like, The substrate-binding domain of an ABC-type nickel/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|187aa|down_4|NC_017221.1_1252874_1253435_+	NA	NA|513aa|down_5|NC_017221.1_1253997_1255536_-	PRK12810, gltD, glutamate synthase subunit beta; Reviewed	NA|1524aa|down_6|NC_017221.1_1255537_1260109_-	PRK11750, gltB, glutamate synthase subunit alpha; Provisional	NA|342aa|down_7|NC_017221.1_1260790_1261816_-	COG3804, COG3804, Uncharacterized conserved protein related to dihydrodipicolinate reductase [Function unknown]	NA|109aa|down_8|NC_017221.1_1262274_1262601_+	NA	NA|63aa|down_9|NC_017221.1_1262657_1262846_-	NA
GCF_000219455.1_ASM21945v1	NC_017221	Bifidobacterium longum subsp. longum KACC 91563, complete sequence	5	1970796-1970880	5	CRISPRCasFinder	no		casR,cas9,cas1,cas2,WYL,DEDDh,cas3	Orphan	AAGGCTCCCGCTGGCGGGAGCTGTC	25	0	0	NA	NA	NA	1	1	Orphan	casR,cas9,cas1,cas2,WYL,DEDDh,cas3	NA|684aa|up_6|NC_017221.1_1960286_1962338_-,NA|323aa|down_3|NC_017221.1_1974927_1975896_+,NA|210aa|down_5|NC_017221.1_1977272_1977902_-	NA|229aa|up_9|NC_017221.1_1954907_1955594_+	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|509aa|up_8|NC_017221.1_1955655_1957182_+	cd08494, PBP2_NikA_DppA_OppA_like_6, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|912aa|up_7|NC_017221.1_1957325_1960061_+	NF000540, alt_ValS, valine--tRNA ligase	NA|684aa|up_6|NC_017221.1_1960286_1962338_-	NA	NA|130aa|up_5|NC_017221.1_1962459_1962849_-	PRK09239, PRK09239, chorismate mutase; Provisional	NA|690aa|up_4|NC_017221.1_1963156_1965226_-	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|538aa|up_3|NC_017221.1_1965431_1967045_-	PRK07208, PRK07208, hypothetical protein; Provisional	NA|618aa|up_2|NC_017221.1_1967263_1969117_-	cd07061, HP_HAP_like, Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction	NA|105aa|up_1|NC_017221.1_1969290_1969605_-	pfam10611, DUF2469, Protein of unknown function (DUF2469)	NA|355aa|up_0|NC_017221.1_1969615_1970680_-	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|500aa|down_0|NC_017221.1_1970975_1972475_-	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB	NA|514aa|down_1|NC_017221.1_1972500_1974042_-	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|100aa|down_2|NC_017221.1_1974045_1974345_-	PRK00034, gatC, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatC	NA|323aa|down_3|NC_017221.1_1974927_1975896_+	NA	NA|283aa|down_4|NC_017221.1_1976330_1977179_-	cd18096, SpoU-like, SAM-dependent rRNA or tRNA methylase related to SpoU	NA|210aa|down_5|NC_017221.1_1977272_1977902_-	NA	NA|475aa|down_6|NC_017221.1_1977898_1979323_-	pfam02646, RmuC, RmuC family	NA|94aa|down_7|NC_017221.1_1979365_1979647_-	COG1937, COG1937, Uncharacterized protein conserved in bacteria [Function unknown]	NA|889aa|down_8|NC_017221.1_1979758_1982425_+	cd02094, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to human copper-transporting ATPases, ATP7A and ATP7B	NA|245aa|down_9|NC_017221.1_1982508_1983243_-	COG0580, GlpF, Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) [Carbohydrate transport and metabolism]
GCF_000219455.1_ASM21945v1	NC_017220	Bifidobacterium longum subsp. longum KACC 91563 plasmid BLNIAS_P1, complete sequence	1	3671-3773	1	CRISPRCasFinder	no			Orphan	AAGGGAGCGAACCGGGGACAAAAAGGGAGCGAAC	34	0	0	NA	NA	NA	1	1	Orphan	casR,cas9,cas1,cas2,WYL,DEDDh,cas3	NA|48aa|up_1|NC_017220.1_901_1045_+,NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|212aa|up_2|NC_017220.1_171_807_+	pfam04796, RepA_C, Plasmid encoded RepA protein	NA|48aa|up_1|NC_017220.1_901_1045_+	NA	NA|567aa|up_0|NC_017220.1_1041_2742_-	pfam03389, MobA_MobL, MobA/MobL family	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
