assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002173535.1_ASM217353v1	NZ_CP021474	Pediococcus pentosaceus strain SRCM100892 chromosome, complete genome	1	78043-78155	1	CRISPRCasFinder	no	csa3	csa3,DinG,cas3,DEDDh,csn2,cas1,cas9	Type I-A	GGCGTTCAACAATTAGATTCAGGAAGTCA	29	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,cas3,DEDDh,csn2,cas1,cas9,RT	NA,NA	NA|333aa|up_9|NZ_CP021474.1_60903_61902_+	pfam09648, YycI, YycH protein	NA|417aa|up_8|NZ_CP021474.1_62889_64140_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|393aa|up_7|NZ_CP021474.1_66301_67480_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|279aa|up_6|NZ_CP021474.1_68278_69115_-	pfam05067, Mn_catalase, Manganese containing catalase	NA|371aa|up_5|NZ_CP021474.1_69268_70381_+	pfam06772, LtrA, Bacterial low temperature requirement A protein (LtrA)	NA|602aa|up_4|NZ_CP021474.1_70523_72329_+	pfam09972, DUF2207, Predicted membrane protein (DUF2207)	NA|280aa|up_3|NZ_CP021474.1_72747_73587_-	PRK11088, rrmA, 23S rRNA methyltransferase A; Provisional	NA|288aa|up_2|NZ_CP021474.1_73789_74653_+	TIGR00762, DegV, EDD domain protein, DegV family	NA|168aa|up_1|NZ_CP021474.1_74672_75176_+	pfam08876, DUF1836, Domain of unknown function (DUF1836)	csa3|107aa|up_0|NZ_CP021474.1_75267_75588_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|212aa|down_0|NZ_CP021474.1_79190_79826_+	pfam15983, DUF4767, Domain of unknown function (DUF4767)	NA|224aa|down_1|NZ_CP021474.1_80066_80738_+	cd02136, PnbA_NfnB-like, nitroreductase similar to Mycobacterium smegmatis NfnB	NA|238aa|down_2|NZ_CP021474.1_80773_81487_-	COG2188, PhnF, Transcriptional regulators [Transcription]	NA|461aa|down_3|NZ_CP021474.1_81645_83028_+	COG2723, BglB, Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase [Carbohydrate transport and metabolism]	NA|448aa|down_4|NZ_CP021474.1_83042_84386_+	COG1455, CelB, Phosphotransferase system cellobiose-specific component IIC [Carbohydrate transport and metabolism]	NA|329aa|down_5|NZ_CP021474.1_84664_85651_+	TIGR00545, Probable_lipoate-protein_ligase_A, lipoyltransferase and lipoate-protein ligase	NA|373aa|down_6|NZ_CP021474.1_85662_86781_+	TIGR03181, PDH_E1_alph_x, pyruvate dehydrogenase E1 component, alpha subunit	NA|327aa|down_7|NZ_CP021474.1_86783_87764_+	COG0022, AcoB, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [Energy production and conversion]	NA|438aa|down_8|NZ_CP021474.1_87756_89070_+	PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed	NA|469aa|down_9|NZ_CP021474.1_89072_90479_+	PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed
GCF_002173535.1_ASM217353v1	NZ_CP021474	Pediococcus pentosaceus strain SRCM100892 chromosome, complete genome	2	1153887-1153992	2	CRISPRCasFinder	no		csa3,DinG,cas3,DEDDh,csn2,cas1,cas9	Orphan	CAGATGCTCTACCAACTGAGCTAA	24	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,cas3,DEDDh,csn2,cas1,cas9,RT	NA|67aa|up_9|NZ_CP021474.1_1136432_1136633_-,NA|63aa|down_6|NZ_CP021474.1_1165941_1166130_-	NA|67aa|up_9|NZ_CP021474.1_1136432_1136633_-	NA	NA|274aa|up_8|NZ_CP021474.1_1136637_1137459_-	COG0561, Cof, Predicted hydrolases of the HAD superfamily [General function prediction only]	NA|220aa|up_7|NZ_CP021474.1_1137703_1138363_+	COG1428, COG1428, Deoxynucleoside kinases [Nucleotide transport and metabolism]	NA|135aa|up_6|NZ_CP021474.1_1138440_1138845_-	COG0589, UspA, Universal stress protein UspA and related nucleotide-binding proteins [Signal transduction mechanisms]	NA|240aa|up_5|NZ_CP021474.1_1139886_1140606_-	cd02553, PseudoU_synth_RsuA, Pseudouridine synthase, Escherichia coli RsuA like	NA|806aa|up_4|NZ_CP021474.1_1142502_1144920_-	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|211aa|up_3|NZ_CP021474.1_1145264_1145897_+	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|188aa|up_2|NZ_CP021474.1_1145889_1146453_+	pfam06962, rRNA_methylase, Putative rRNA methylase	NA|397aa|up_1|NZ_CP021474.1_1148038_1149229_-	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|578aa|up_0|NZ_CP021474.1_1150200_1151934_-	TIGR02720, Pyruvate_oxidase, pyruvate oxidase	NA|345aa|down_0|NZ_CP021474.1_1159580_1160615_-	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|339aa|down_1|NZ_CP021474.1_1160628_1161645_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|392aa|down_2|NZ_CP021474.1_1161694_1162870_-	cd03817, GT4_UGDG-like, UDP-Glc:1,2-diacylglycerol 3-a-glucosyltransferase and similar proteins	NA|258aa|down_3|NZ_CP021474.1_1162959_1163733_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|575aa|down_4|NZ_CP021474.1_1163805_1165530_-	COG1080, PtsA, Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [Carbohydrate transport and metabolism]	NA|89aa|down_5|NZ_CP021474.1_1165529_1165796_-	PRK13780, PRK13780, phosphocarrier protein HPr; Provisional	NA|63aa|down_6|NZ_CP021474.1_1165941_1166130_-	NA	NA|740aa|down_7|NZ_CP021474.1_1166334_1168554_+	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|121aa|down_8|NZ_CP021474.1_1168608_1168971_-	pfam07843, DUF1634, Protein of unknown function (DUF1634)	NA|282aa|down_9|NZ_CP021474.1_1168970_1169816_-	pfam01925, TauE, Sulfite exporter TauE/SafE
GCF_002173535.1_ASM217353v1	NZ_CP021474	Pediococcus pentosaceus strain SRCM100892 chromosome, complete genome	3	1320035-1320334	3,1,1	CRISPRCasFinder,CRT,PILER-CR	no	csn2,cas1,cas9	csa3,DinG,cas3,DEDDh,csn2,cas1,cas9	Type II-C,Type II-A,Type II-B	GTACTTGAACTTATTGATTTAACACTCTTCTGAAAC,GTACTTGAACTTATTGATTTAACACTCTTCTGAAAC,GTACTTGAACTTATTGATTTAACACTCTTCTGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	4,4,3	4	TypeII-C,TypeII-A,TypeII-B	csa3,DinG,cas3,DEDDh,csn2,cas1,cas9,RT	NA,NA|219aa|down_6|NZ_CP021474.1_1331488_1332145_-,NA|131aa|down_8|NZ_CP021474.1_1333229_1333622_+,NA|116aa|down_9|NZ_CP021474.1_1333853_1334201_-	NA|467aa|up_9|NZ_CP021474.1_1304485_1305886_-	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|372aa|up_8|NZ_CP021474.1_1306555_1307672_-	PRK00578, prfB, peptide chain release factor 2; Validated	NA|787aa|up_7|NZ_CP021474.1_1307753_1310114_-	PRK12904, PRK12904, preprotein translocase subunit SecA; Reviewed	NA|185aa|up_6|NZ_CP021474.1_1310274_1310829_-	COG1544, COG1544, Ribosome-associated protein Y (PSrp-1) [Translation, ribosomal structure and biogenesis]	NA|218aa|up_5|NZ_CP021474.1_1312980_1313634_+	pfam01205, UPF0029, Uncharacterized protein family UPF0029	NA|387aa|up_4|NZ_CP021474.1_1313648_1314809_-	cd06853, GT_WecA_like, This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins	NA|611aa|up_3|NZ_CP021474.1_1314959_1316792_-	pfam13520, AA_permease_2, Amino acid permease	NA|540aa|up_2|NZ_CP021474.1_1317135_1318755_-	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|95aa|up_1|NZ_CP021474.1_1318782_1319067_-	PRK00364, groES, co-chaperonin GroES; Reviewed	NA|217aa|up_0|NZ_CP021474.1_1319251_1319902_+	pfam02517, Abi, CAAX protease self-immunity	csn2|224aa|down_0|NZ_CP021474.1_1320363_1321035_-	cd12218, Csn2, CRISPR/Cas system-associated protein Csn2	cas1|302aa|down_1|NZ_CP021474.1_1321313_1322219_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1360aa|down_2|NZ_CP021474.1_1322426_1326506_-	pfam16592, Cas9_REC, REC lobe of CRISPR-associated endonuclease Cas9	NA|209aa|down_3|NZ_CP021474.1_1326697_1327324_+	pfam09726, Macoilin, Macoilin family	NA|845aa|down_4|NZ_CP021474.1_1327372_1329907_-	cd09601, M1_APN-Q_like, Peptidase M1 aminopeptidase N catalytic domain family which includes aminopeptidase N (APN), aminopeptidase Q (APQ), tricorn interacting factor F3, and endoplasmic reticulum aminopeptidase 1 (ERAP1)	NA|467aa|down_5|NZ_CP021474.1_1329955_1331356_-	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|219aa|down_6|NZ_CP021474.1_1331488_1332145_-	NA	NA|105aa|down_7|NZ_CP021474.1_1332752_1333067_-	pfam13560, HTH_31, Helix-turn-helix domain	NA|131aa|down_8|NZ_CP021474.1_1333229_1333622_+	NA	NA|116aa|down_9|NZ_CP021474.1_1333853_1334201_-	NA
GCF_002173535.1_ASM217353v1	NZ_CP021475	Pediococcus pentosaceus strain SRCM100892 plasmid pPC892-4, complete sequence	1	54075-54151	1	CRISPRCasFinder	no	csa3	csa3	Type I-A	TTTTTCTGTTGCCAGCCAACTATCT	25	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,cas3,DEDDh,csn2,cas1,cas9,RT	NA|83aa|up_9|NZ_CP021475.1_41786_42035_-,NA|66aa|up_8|NZ_CP021475.1_46838_47036_+,NA|93aa|up_6|NZ_CP021475.1_47954_48233_-,NA|104aa|up_4|NZ_CP021475.1_50878_51190_+,NA|189aa|up_1|NZ_CP021475.1_52597_53164_+,NA|30aa|down_4|NZ_CP021475.1_59237_59327_-	NA|83aa|up_9|NZ_CP021475.1_41786_42035_-	NA	NA|66aa|up_8|NZ_CP021475.1_46838_47036_+	NA	NA|249aa|up_7|NZ_CP021475.1_47086_47833_+	COG3177, COG3177, Fic family protein [Function unknown]	NA|93aa|up_6|NZ_CP021475.1_47954_48233_-	NA	NA|688aa|up_5|NZ_CP021475.1_48731_50795_+	pfam03389, MobA_MobL, MobA/MobL family	NA|104aa|up_4|NZ_CP021475.1_50878_51190_+	NA	NA|112aa|up_3|NZ_CP021475.1_51841_52177_+	pfam16943, T4SS_CagC, Cag pathogenicity island, type IV secretory system	NA|121aa|up_2|NZ_CP021475.1_52197_52560_+	TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC, C-terminal domain	NA|189aa|up_1|NZ_CP021475.1_52597_53164_+	NA	NA|224aa|up_0|NZ_CP021475.1_53348_54020_-	pfam18813, PBECR4, phage-Barnase-EndoU-ColicinE5/D-RelE like nuclease4	NA|236aa|down_0|NZ_CP021475.1_54384_55092_+	COG1168, MalY, Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities [Amino acid transport and metabolism]	NA|98aa|down_1|NZ_CP021475.1_55208_55502_+	PRK06934, PRK06934, flavodoxin; Provisional	NA|385aa|down_2|NZ_CP021475.1_56055_57210_+	COG0475, KefB, Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]	NA|523aa|down_3|NZ_CP021475.1_57332_58901_+	cd01031, EriC, ClC chloride channel EriC	NA|30aa|down_4|NZ_CP021475.1_59237_59327_-	NA	NA|446aa|down_5|NZ_CP021475.1_61073_62411_-	COG1249, Lpd, Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes [Energy production and conversion]	NA|105aa|down_6|NZ_CP021475.1_62432_62747_-	TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin	NA|217aa|down_7|NZ_CP021475.1_62901_63552_-	cd03024, DsbA_FrnE, DsbA family, FrnE subfamily; FrnE is a DsbA-like protein containing a CXXC motif	NA|308aa|down_8|NZ_CP021475.1_63567_64491_-	TIGR01292, Thioredoxin_reductase, thioredoxin-disulfide reductase	NA|95aa|down_9|NZ_CP021475.1_64512_64797_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains
