assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000269965.1_ASM26996v1	NC_017219	Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222 = DSM 20088, complete genome	1	932216-932372	1	CRISPRCasFinder	no	WYL	WYL,c2c9_V-U4,DEDDh,cas14j,cas3	Unclear	GAGCGTACCCGCTCCGAGGCCGACG	25	0	0	NA	NA	NA	2	2	Orphan	WYL,c2c9_V-U4,DEDDh,cas14j,cas3	NA,NA|433aa|down_8|NC_017219.1_947527_948826_+	NA|126aa|up_9|NC_017219.1_919264_919642_-	pfam01910, Thiamine_BP, Thiamine-binding protein	NA|270aa|up_8|NC_017219.1_919705_920515_-	pfam08543, Phos_pyr_kin, Phosphomethylpyrimidine kinase	NA|403aa|up_7|NC_017219.1_920641_921850_+	COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit [Posttranslational modification, protein turnover, chaperones]	NA|918aa|up_6|NC_017219.1_921886_924640_-	PRK09284, PRK09284, thiamine biosynthesis protein ThiC; Provisional	NA|315aa|up_5|NC_017219.1_924722_925667_-	cd01170, THZ_kinase, 4-methyl-5-beta-hydroxyethylthiazole (Thz) kinase catalyzes the phosphorylation of the hydroxylgroup of Thz	NA|491aa|up_4|NC_017219.1_926102_927575_+	PRK04173, PRK04173, glycyl-tRNA synthetase; Provisional	NA|415aa|up_3|NC_017219.1_927719_928964_+	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|404aa|up_2|NC_017219.1_929075_930287_+	PRK09330, PRK09330, cell division protein FtsZ; Validated	NA|160aa|up_1|NC_017219.1_930299_930779_+	pfam04472, SepF, Cell division protein SepF	NA|101aa|up_0|NC_017219.1_930900_931203_+	pfam02325, YGGT, YGGT family	NA|183aa|down_0|NC_017219.1_932743_933292_+	PRK14771, PRK14771, lipoprotein signal peptidase; Provisional	NA|321aa|down_1|NC_017219.1_933291_934254_+	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|267aa|down_2|NC_017219.1_934775_935576_+	pfam03631, Virul_fac_BrkB, Virulence factor BrkB	WYL|349aa|down_3|NC_017219.1_935603_936650_-	pfam13280, WYL, WYL domain	NA|879aa|down_4|NC_017219.1_936733_939370_+	PRK05673, dnaE, DNA polymerase III subunit alpha; Validated	NA|1099aa|down_5|NC_017219.1_939507_942804_+	TIGR02773, ATP-dependent_helicase/deoxyribonuclease_subunit_B, helicase-exonuclease AddAB, AddB subunit	NA|1372aa|down_6|NC_017219.1_942797_946913_+	TIGR02785, ATP-dependent_helicase/nuclease_subunit_A, helicase-exonuclease AddAB, AddA subunit, Firmicutes type	NA|143aa|down_7|NC_017219.1_947057_947486_+	cd02201, FtsZ_type1, Filamenting temperature sensitive mutant Z, type 1	NA|433aa|down_8|NC_017219.1_947527_948826_+	NA	NA|646aa|down_9|NC_017219.1_949622_951560_-	pfam03235, DUF262, Protein of unknown function DUF262
GCF_000269965.1_ASM26996v1	NC_017219	Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222 = DSM 20088, complete genome	2	1257332-1257419	2	CRISPRCasFinder	no		WYL,c2c9_V-U4,DEDDh,cas14j,cas3	Orphan	CCCGCTGGCGGGGGCTCCCGCGCAGC	26	0	0	NA	NA	NA	1	1	Orphan	WYL,c2c9_V-U4,DEDDh,cas14j,cas3	NA|284aa|up_2|NC_017219.1_1254042_1254894_+,NA|132aa|up_1|NC_017219.1_1256672_1257068_+,NA	NA|137aa|up_9|NC_017219.1_1246763_1247174_+	PRK00051, hisI, phosphoribosyl-AMP cyclohydrolase; Reviewed	NA|519aa|up_8|NC_017219.1_1247240_1248797_+	PRK13571, PRK13571, anthranilate synthase component I; Provisional	NA|216aa|up_7|NC_017219.1_1248861_1249509_-	PRK13197, PRK13197, pyrrolidone-carboxylate peptidase; Provisional	NA|318aa|up_6|NC_017219.1_1249556_1250510_-	pfam06166, DUF979, Protein of unknown function (DUF979)	NA|243aa|up_5|NC_017219.1_1250511_1251240_-	pfam06149, DUF969, Protein of unknown function (DUF969)	NA|534aa|up_4|NC_017219.1_1251473_1253075_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|236aa|up_3|NC_017219.1_1253210_1253918_+	pfam00106, adh_short, short chain dehydrogenase	NA|284aa|up_2|NC_017219.1_1254042_1254894_+	NA	NA|132aa|up_1|NC_017219.1_1256672_1257068_+	NA	NA|82aa|up_0|NC_017219.1_1257068_1257314_+	PRK11578, PRK11578, macrolide transporter subunit MacA; Provisional	NA|218aa|down_0|NC_017219.1_1257531_1258185_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|542aa|down_1|NC_017219.1_1258340_1259966_+	pfam11855, DUF3375, Protein of unknown function (DUF3375)	NA|221aa|down_2|NC_017219.1_1259962_1260625_+	pfam13835, DUF4194, Domain of unknown function (DUF4194)	NA|1185aa|down_3|NC_017219.1_1260621_1264176_+	COG4913, COG4913, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1012aa|down_4|NC_017219.1_1264371_1267407_+	PRK00349, uvrA, excinuclease ABC subunit UvrA	NA|789aa|down_5|NC_017219.1_1267557_1269924_+	PRK00558, uvrC, excinuclease ABC subunit UvrC	NA|324aa|down_6|NC_017219.1_1270032_1271004_+	PRK00258, aroE, shikimate 5-dehydrogenase; Reviewed	NA|329aa|down_7|NC_017219.1_1271003_1271990_+	PRK05416, PRK05416, RNase adapter RapZ	NA|317aa|down_8|NC_017219.1_1272188_1273139_+	TIGR00647, DNA_bind_WhiA, DNA-binding protein WhiA	NA|402aa|down_9|NC_017219.1_1273307_1274513_+	PRK00073, pgk, phosphoglycerate kinase; Provisional
GCF_000269965.1_ASM26996v1	NC_017219	Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222 = DSM 20088, complete genome	3	1989775-1989857	3	CRISPRCasFinder	no	cas3	WYL,c2c9_V-U4,DEDDh,cas14j,cas3	Unclear	GCGATCACCACGGACAAGCTGGC	23	0	0	NA	NA	NA	1	1	Unclear	WYL,c2c9_V-U4,DEDDh,cas14j,cas3	NA|115aa|up_7|NC_017219.1_1981069_1981414_+,NA|92aa|up_6|NC_017219.1_1981417_1981693_+,NA|126aa|up_5|NC_017219.1_1981692_1982070_+,NA|200aa|up_4|NC_017219.1_1982085_1982685_+,NA|122aa|up_3|NC_017219.1_1982844_1983210_+,NA|150aa|up_2|NC_017219.1_1983206_1983656_+,NA|326aa|up_0|NC_017219.1_1985905_1986883_+,NA|191aa|down_0|NC_017219.1_1991043_1991616_+,NA|57aa|down_1|NC_017219.1_1991619_1991790_+,NA|176aa|down_2|NC_017219.1_1991864_1992392_+,NA|170aa|down_3|NC_017219.1_1992388_1992898_+,NA|152aa|down_4|NC_017219.1_1992923_1993379_+,NA|132aa|down_5|NC_017219.1_1993405_1993801_+	NA|188aa|up_9|NC_017219.1_1980088_1980652_+	pfam11133, Phage_head_fibr, Head fiber protein	NA|136aa|up_8|NC_017219.1_1980665_1981073_+	pfam09355, Phage_Gp19, Phage protein Gp19/Gp15/Gp42	NA|115aa|up_7|NC_017219.1_1981069_1981414_+	NA	NA|92aa|up_6|NC_017219.1_1981417_1981693_+	NA	NA|126aa|up_5|NC_017219.1_1981692_1982070_+	NA	NA|200aa|up_4|NC_017219.1_1982085_1982685_+	NA	NA|122aa|up_3|NC_017219.1_1982844_1983210_+	NA	NA|150aa|up_2|NC_017219.1_1983206_1983656_+	NA	NA|737aa|up_1|NC_017219.1_1983680_1985891_+	TIGR02675, Mu-like_prophage_FluMu_protein_gp42, tape measure domain	NA|326aa|up_0|NC_017219.1_1985905_1986883_+	NA	NA|191aa|down_0|NC_017219.1_1991043_1991616_+	NA	NA|57aa|down_1|NC_017219.1_1991619_1991790_+	NA	NA|176aa|down_2|NC_017219.1_1991864_1992392_+	NA	NA|170aa|down_3|NC_017219.1_1992388_1992898_+	NA	NA|152aa|down_4|NC_017219.1_1992923_1993379_+	NA	NA|132aa|down_5|NC_017219.1_1993405_1993801_+	NA	NA|411aa|down_6|NC_017219.1_1993861_1995094_+	cd06417, GH25_LysA-like, LysA is a cell wall endolysin produced by Lactobacillus fermentum, which degrades bacterial cell walls by catalyzing the hydrolysis of 1,4-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues	NA|73aa|down_7|NC_017219.1_1995219_1995438_+	pfam16938, Phage_holin_Dp1, Putative phage holin Dp-1	NA|334aa|down_8|NC_017219.1_1995955_1996957_-	COG0248, GppA, Exopolyphosphatase [Nucleotide transport and metabolism / Inorganic ion transport and metabolism]	NA|189aa|down_9|NC_017219.1_1997019_1997586_-	pfam04417, DUF501, Protein of unknown function (DUF501)
GCF_000269965.1_ASM26996v1	NC_017219	Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222 = DSM 20088, complete genome	4	2438382-2438467	4	CRISPRCasFinder	no		WYL,c2c9_V-U4,DEDDh,cas14j,cas3	Orphan	GGCCCTGAGCGTGCGGGCGCGGA	23	0	0	NA	NA	NA	1	1	Orphan	WYL,c2c9_V-U4,DEDDh,cas14j,cas3	NA|58aa|up_8|NC_017219.1_2428122_2428296_-,NA|30aa|up_5|NC_017219.1_2430467_2430557_-,NA|211aa|up_4|NC_017219.1_2430748_2431381_-,NA|243aa|down_1|NC_017219.1_2440580_2441309_+,NA|40aa|down_5|NC_017219.1_2446029_2446149_-	NA|548aa|up_9|NC_017219.1_2426306_2427950_+	PRK08010, PRK08010, pyridine nucleotide-disulfide oxidoreductase; Provisional	NA|58aa|up_8|NC_017219.1_2428122_2428296_-	NA	NA|297aa|up_7|NC_017219.1_2428595_2429486_+	cd09278, RNase_HI_prokaryote_like, RNase HI family found mainly in prokaryotes	NA|233aa|up_6|NC_017219.1_2429616_2430315_+	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA	NA|30aa|up_5|NC_017219.1_2430467_2430557_-	NA	NA|211aa|up_4|NC_017219.1_2430748_2431381_-	NA	NA|513aa|up_3|NC_017219.1_2431508_2433047_+	PRK11823, PRK11823, DNA repair protein RadA; Provisional	NA|375aa|up_2|NC_017219.1_2433061_2434186_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|388aa|up_1|NC_017219.1_2434283_2435447_-	PRK03287, truB, tRNA pseudouridine synthase B; Provisional	NA|159aa|up_0|NC_017219.1_2435448_2435925_-	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|356aa|down_0|NC_017219.1_2439305_2440373_-	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|243aa|down_1|NC_017219.1_2440580_2441309_+	NA	NA|461aa|down_2|NC_017219.1_2442228_2443611_-	cd13585, PBP2_TMBP_like, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose and similar oligosaccharides; possess type 2 periplasmic binding fold	NA|326aa|down_3|NC_017219.1_2443724_2444702_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|304aa|down_4|NC_017219.1_2444701_2445613_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|40aa|down_5|NC_017219.1_2446029_2446149_-	NA	NA|437aa|down_6|NC_017219.1_2446488_2447799_-	pfam13635, DUF4143, Domain of unknown function (DUF4143)	NA|267aa|down_7|NC_017219.1_2448167_2448968_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|727aa|down_8|NC_017219.1_2449172_2451353_-	pfam14403, CP_ATPgrasp_2, Circularly permuted ATP-grasp type 2	NA|304aa|down_9|NC_017219.1_2451573_2452485_+	cd02570, PseudoU_synth_EcTruA, Eukaryotic and bacterial pseudouridine synthases similar to E
GCF_000269965.1_ASM26996v1	NC_017219	Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222 = DSM 20088, complete genome	5	2555303-2555385	5	CRISPRCasFinder	no		WYL,c2c9_V-U4,DEDDh,cas14j,cas3	Orphan	GCGAACCGGGCGGTCCTCGTCAT	23	0	0	NA	NA	NA	1	1	Orphan	WYL,c2c9_V-U4,DEDDh,cas14j,cas3	NA|107aa|up_9|NC_017219.1_2542683_2543004_-,NA	NA|107aa|up_9|NC_017219.1_2542683_2543004_-	NA	NA|231aa|up_8|NC_017219.1_2545180_2545873_-	PRK05424, rplA, 50S ribosomal protein L1; Validated	NA|144aa|up_7|NC_017219.1_2545888_2546320_-	PRK00140, rplK, 50S ribosomal protein L11; Validated	NA|298aa|up_6|NC_017219.1_2546580_2547474_-	PRK05609, nusG, transcription antitermination protein NusG; Validated	NA|76aa|up_5|NC_017219.1_2547503_2547731_-	PRK07597, secE, preprotein translocase subunit SecE; Reviewed	NA|402aa|up_4|NC_017219.1_2547996_2549202_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|378aa|up_3|NC_017219.1_2549292_2550426_-	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional	NA|564aa|up_2|NC_017219.1_2550426_2552118_-	PRK12296, obgE, GTPase CgtA; Reviewed	NA|83aa|up_1|NC_017219.1_2552186_2552435_-	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|103aa|up_0|NC_017219.1_2552457_2552766_-	PRK05573, rplU, 50S ribosomal protein L21; Validated	NA|402aa|down_0|NC_017219.1_2556186_2557392_-	cd05647, M20_DapE_actinobac, M20 Peptidase actinobacterial DapE encoded N-succinyl-L,L-diaminopimelic acid desuccinylase	NA|315aa|down_1|NC_017219.1_2557498_2558443_+	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|149aa|down_2|NC_017219.1_2558694_2559141_+	COG4154, FucU, Fucose dissimilation pathway protein FucU [Carbohydrate transport and metabolism]	NA|256aa|down_3|NC_017219.1_2559387_2560155_-	COG3618, COG3618, Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]	NA|428aa|down_4|NC_017219.1_2560156_2561440_-	cd17394, MFS_FucP_like, Fucose permease and similar proteins of the Major Facilitator Superfamily of transporters	NA|264aa|down_5|NC_017219.1_2561669_2562461_-	PRK08628, PRK08628, SDR family oxidoreductase	NA|428aa|down_6|NC_017219.1_2562601_2563885_-	cd03324, rTSbeta_L-fuconate_dehydratase, Human rTS beta is encoded by the rTS gene which, through alternative RNA splicing, also encodes rTS alpha whose mRNA is complementary to thymidylate synthase mRNA	NA|343aa|down_7|NC_017219.1_2564064_2565093_+	cd06296, PBP1_CatR-like, ligand-binding domain of a LacI-like transcriptional regulator, CatR which is involved in catechol degradation	NA|483aa|down_8|NC_017219.1_2565074_2566523_-	COG3127, COG3127, Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|317aa|down_9|NC_017219.1_2566519_2567470_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein
