assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001188815.1_ASM118881v1	NZ_CP006792	Yersinia pestis 2944 chromosome, complete genome	1	605592-605801	1,1,1	CRISPRCasFinder,PILER-CR,CRT	no	cas1,cas3f,cas8f,cas5f,cas7f,cas6f	cas3,cas1,cas3f,cas8f,cas5f,cas7f,cas6f,DEDDh,DinG,csa3	Type I-F	TGTTCACTGCCGCACAGGCAGCTTAGAAAA,GTTCACTGCCGCACAGGCAGCTTAGAAAATC,GTTCACTGCCGCACAGGCAGCTTAGAAAA	30,31,29	0	0	NA	NA	I-F:I-F:I-F	3,2,3	3	TypeI-F	cas3,cas1,cas3f,cas8f,cas5f,cas7f,cas6f,DEDDh,DinG,csa3	NA|372aa|up_2|NZ_CP006792.1_600957_602073_+,NA|425aa|up_1|NZ_CP006792.1_602111_603386_+,NA|70aa|down_9|NZ_CP006792.1_617852_618062_+	NA|463aa|up_9|NZ_CP006792.1_592855_594244_-	PRK15414, PRK15414, phosphomannomutase	NA|347aa|up_8|NZ_CP006792.1_594260_595301_-	cd06279, PBP1_LacI-like, ligand-binding domain of an uncharacterized transcription regulator from Corynebacterium glutamicum and its close homologs from other bacteria	NA|429aa|up_7|NZ_CP006792.1_595572_596859_+	COG1653, UgpB, ABC-type sugar transport system, periplasmic component [Carbohydrate transport and metabolism]	NA|292aa|up_6|NZ_CP006792.1_596967_597843_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|275aa|up_5|NZ_CP006792.1_597843_598668_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|352aa|up_4|NZ_CP006792.1_598814_599870_+	cd18612, GH130_Lin0857-like, Glycoside hydrolase family 130 such as Listeria innocua beta-1,2-mannobiose phosphorylase	NA|313aa|up_3|NZ_CP006792.1_599896_600835_+	pfam08950, DUF1861, Protein of unknown function (DUF1861)	NA|372aa|up_2|NZ_CP006792.1_600957_602073_+	NA	NA|425aa|up_1|NZ_CP006792.1_602111_603386_+	NA	NA|415aa|up_0|NZ_CP006792.1_604010_605255_-	COG0668, MscS, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	cas1|327aa|down_0|NZ_CP006792.1_606159_607140_+	TIGR03637, cas1_YPEST, CRISPR-associated endonuclease Cas1, subtype I-F/YPEST	cas3f|1096aa|down_1|NZ_CP006792.1_607136_610424_+	cd09673, Cas3_Cas2_I-F, CRISPR/Cas system-associated protein Cas3/Cas2	cas8f|449aa|down_2|NZ_CP006792.1_610832_612179_+	TIGR02564, conserved_hypothetical_protein, CRISPR type I-F/YPEST-associated protein Csy1	cas5f|317aa|down_3|NZ_CP006792.1_612175_613126_+	cd09676, Csy2_I-F, CRISPR/Cas system-associated RAMP superfamily protein Csy2	cas7f|335aa|down_4|NZ_CP006792.1_613143_614148_+	pfam09615, Cas_Csy3, CRISPR-associated protein (Cas_Csy3)	cas6f|185aa|down_5|NZ_CP006792.1_614158_614713_+	pfam09618, Cas_Csy4, CRISPR-associated protein (Cas_Csy4)	NA|340aa|down_6|NZ_CP006792.1_614790_615810_-	cd19079, AKR_EcYajO-like, Escherichia coli YajO and similar proteins	NA|168aa|down_7|NZ_CP006792.1_615820_616324_-	COG1795, COG1795, Formaldehyde-activating enzyme nesessary for methanogenesis [Energy    production and conversion]	NA|453aa|down_8|NZ_CP006792.1_616370_617729_-	cd17316, MFS_SV2_like, Metazoan Synaptic vesicle glycoprotein 2 (SV2) and related small molecule transporters of the Major Facilitator Superfamily	NA|70aa|down_9|NZ_CP006792.1_617852_618062_+	NA
GCF_001188815.1_ASM118881v1	NZ_CP006792	Yersinia pestis 2944 chromosome, complete genome	2	1479391-1479603	2,2,2	CRISPRCasFinder,CRT,PILER-CR	no	DEDDh	cas3,cas1,cas3f,cas8f,cas5f,cas7f,cas6f,DEDDh,DinG,csa3	Unclear	TTTCTAAGCTGCCTGTGCGGCAGTGAAC,TTTCTAAGCTGCCTGTGCGGCAGTGAAC,CTTACGTTCACTGCCGCACAGGCAGCTTAGAAAAT	28,28,35	0	0	NA	NA	I-F:I-F:I-F	3,3,2	3	Orphan	cas3,cas1,cas3f,cas8f,cas5f,cas7f,cas6f,DEDDh,DinG,csa3	NA,NA|16aa|down_7|NZ_CP006792.1_1487888_1487936_+	NA|426aa|up_9|NZ_CP006792.1_1464970_1466248_-	cd03324, rTSbeta_L-fuconate_dehydratase, Human rTS beta is encoded by the rTS gene which, through alternative RNA splicing, also encodes rTS alpha whose mRNA is complementary to thymidylate synthase mRNA	NA|281aa|up_8|NZ_CP006792.1_1466279_1467122_-	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]	NA|247aa|up_7|NZ_CP006792.1_1467169_1467910_-	cd05368, DHRS6_like_SDR_c, human DHRS6-like, classical (c) SDRs	NA|141aa|up_6|NZ_CP006792.1_1468318_1468741_+	COG3011, COG3011, Predicted thiol-disulfide oxidoreductase [General function    prediction only]	NA|435aa|up_5|NZ_CP006792.1_1468749_1470054_-	cd17319, MFS_ExuT_GudP_like, Hexuronate transporter, Glucarate transporter, and similar transporters of the Major Facilitator Superfamily	NA|1051aa|up_4|NZ_CP006792.1_1471078_1474231_+	pfam11924, IAT_beta, Inverse autotransporter, beta-domain	NA|403aa|up_3|NZ_CP006792.1_1474193_1475402_-	COG3328, COG3328, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|106aa|up_2|NZ_CP006792.1_1475772_1476090_-	PRK05423, PRK05423, DUF496 family protein	NA|370aa|up_1|NZ_CP006792.1_1476523_1477633_-	cd05236, FAR-N_SDR_e, fatty acyl CoA reductases (FARs), extended (e) SDRs	DEDDh|477aa|up_0|NZ_CP006792.1_1477925_1479356_+	PRK11779, sbcB, exonuclease I; Provisional	NA|464aa|down_0|NZ_CP006792.1_1480092_1481484_-	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|364aa|down_1|NZ_CP006792.1_1481942_1483034_-	PRK10586, PRK10586, putative oxidoreductase; Provisional	NA|298aa|down_2|NZ_CP006792.1_1483319_1484213_+	COG1129, MglA, ABC-type sugar transport system, ATPase component [Carbohydrate transport and metabolism]	NA|230aa|down_3|NZ_CP006792.1_1484170_1484860_+	COG1129, MglA, ABC-type sugar transport system, ATPase component [Carbohydrate transport and metabolism]	NA|332aa|down_4|NZ_CP006792.1_1484856_1485852_+	COG1172, AraH, Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components [Carbohydrate transport and metabolism]	NA|97aa|down_5|NZ_CP006792.1_1486240_1486531_+	pfam16695, Tai4, Type VI secretion system (T6SS), amidase immunity protein	NA|276aa|down_6|NZ_CP006792.1_1486685_1487513_-	cd05266, SDR_a4, atypical (a) SDRs, subgroup 4	NA|16aa|down_7|NZ_CP006792.1_1487888_1487936_+	NA	NA|300aa|down_8|NZ_CP006792.1_1488218_1489118_+	PRK00489, hisG, ATP phosphoribosyltransferase; Reviewed	NA|444aa|down_9|NZ_CP006792.1_1489121_1490453_+	PRK00877, hisD, bifunctional histidinal dehydrogenase/ histidinol dehydrogenase; Reviewed
GCF_001188815.1_ASM118881v1	NZ_CP006792	Yersinia pestis 2944 chromosome, complete genome	3	1604549-1604670	3	CRISPRCasFinder	no		cas3,cas1,cas3f,cas8f,cas5f,cas7f,cas6f,DEDDh,DinG,csa3	Orphan	CAATTGATTATATCGATTAATATTGGCATTAAG	33	0	0	NA	NA	NA	1	1	Orphan	cas3,cas1,cas3f,cas8f,cas5f,cas7f,cas6f,DEDDh,DinG,csa3	NA|73aa|up_0|NZ_CP006792.1_1603973_1604192_-,NA	NA|83aa|up_9|NZ_CP006792.1_1596849_1597098_-	PRK07117, PRK07117, acyl carrier protein; Validated	NA|147aa|up_8|NZ_CP006792.1_1597135_1597576_-	pfam13924, Lipocalin_5, Lipocalin-like domain	NA|263aa|up_7|NZ_CP006792.1_1597662_1598451_-	cd05233, SDR_c, classical (c) SDRs	NA|251aa|up_6|NZ_CP006792.1_1598453_1599206_-	PRK07110, PRK07110, polyketide biosynthesis enoyl-CoA hydratase; Validated	NA|240aa|up_5|NZ_CP006792.1_1599207_1599927_-	COG1024, CaiD, Enoyl-CoA hydratase/carnithine racemase [Lipid metabolism]	NA|413aa|up_4|NZ_CP006792.1_1599919_1601158_-	COG3425, PksG, 3-hydroxy-3-methylglutaryl CoA synthase [Lipid metabolism]	NA|246aa|up_3|NZ_CP006792.1_1601173_1601911_-	cd05333, BKR_SDR_c, beta-Keto acyl carrier protein reductase (BKR), involved in Type II FAS, classical (c) SDRs	NA|428aa|up_2|NZ_CP006792.1_1601912_1603196_-	cd00834, KAS_I_II, Beta-ketoacyl-acyl carrier protein (ACP) synthase (KAS), type I and II	NA|175aa|up_1|NZ_CP006792.1_1603185_1603710_-	COG0764, FabA, 3-hydroxymyristoyl/3-hydroxydecanoyl-(acyl carrier protein) dehydratases [Lipid metabolism]	NA|73aa|up_0|NZ_CP006792.1_1603973_1604192_-	NA	NA|249aa|down_0|NZ_CP006792.1_1604922_1605669_+	PRK06172, PRK06172, SDR family oxidoreductase	NA|806aa|down_1|NZ_CP006792.1_1605822_1608240_+	pfam01565, FAD_binding_4, FAD binding domain	NA|110aa|down_2|NZ_CP006792.1_1611883_1612213_+	COG2920, DsrC, Dissimilatory sulfite reductase (desulfoviridin), gamma subunit [Inorganic ion transport and metabolism]	NA|93aa|down_3|NZ_CP006792.1_1612264_1612543_-	PRK14426, PRK14426, acylphosphatase; Provisional	NA|397aa|down_4|NZ_CP006792.1_1612636_1613827_+	PRK15128, PRK15128, 23S rRNA (cytosine(1962)-C(5))-methyltransferase RlmI	NA|106aa|down_5|NZ_CP006792.1_1613887_1614205_+	PRK14129, PRK14129, heat shock protein HspQ; Provisional	NA|139aa|down_6|NZ_CP006792.1_1614313_1614730_-	COG1832, COG1832, Predicted CoA-binding protein [General function prediction only]	NA|227aa|down_7|NZ_CP006792.1_1615045_1615726_+	PRK03641, PRK03641, DUF2057 family protein	NA|155aa|down_8|NZ_CP006792.1_1615904_1616369_+	PRK05234, mgsA, methylglyoxal synthase; Validated	NA|685aa|down_9|NZ_CP006792.1_1616429_1618484_-	PRK11054, helD, DNA helicase IV; Provisional
GCF_001188815.1_ASM118881v1	NZ_CP006792	Yersinia pestis 2944 chromosome, complete genome	4	2523455-2523534	4	CRISPRCasFinder	no		cas3,cas1,cas3f,cas8f,cas5f,cas7f,cas6f,DEDDh,DinG,csa3	Orphan	CAGCCGCAGCCACGCTCAGCCATTC	25	0	0	NA	NA	NA	1	1	Orphan	cas3,cas1,cas3f,cas8f,cas5f,cas7f,cas6f,DEDDh,DinG,csa3	NA,NA|152aa|down_4|NZ_CP006792.1_2530192_2530648_-,NA|157aa|down_5|NZ_CP006792.1_2530674_2531145_-,NA|77aa|down_7|NZ_CP006792.1_2532668_2532899_-	NA|94aa|up_9|NZ_CP006792.1_2510730_2511012_+	COG2960, COG2960, Uncharacterized protein conserved in bacteria [Function unknown]	NA|417aa|up_8|NZ_CP006792.1_2511336_2512587_-	PRK11021, PRK11021, putative transporter; Provisional	NA|477aa|up_7|NZ_CP006792.1_2512623_2514054_-	PRK11316, PRK11316, bifunctional D-glycero-beta-D-manno-heptose-7-phosphate kinase/D-glycero-beta-D-manno-heptose 1-phosphate adenylyltransferase HldE	NA|952aa|up_6|NZ_CP006792.1_2514188_2517044_-	PRK11072, PRK11072, bifunctional [glutamate--ammonia ligase]-adenylyl-L-tyrosine phosphorylase/[glutamate--ammonia-ligase] adenylyltransferase	NA|390aa|up_5|NZ_CP006792.1_2517352_2518522_-	COG3025, COG3025, Uncharacterized conserved protein [Function unknown]	NA|208aa|up_4|NZ_CP006792.1_2518825_2519449_+	PRK10884, PRK10884, SH3 domain-containing protein; Provisional	NA|413aa|up_3|NZ_CP006792.1_2519608_2520847_+	PRK10885, cca, multifunctional CCA addition/repair protein	NA|273aa|up_2|NZ_CP006792.1_2521082_2521901_-	PRK00281, PRK00281, undecaprenyl-diphosphate phosphatase	NA|120aa|up_1|NZ_CP006792.1_2522174_2522534_-	PRK11593, folB, bifunctional dihydroneopterin aldolase/7,8-dihydroneopterin epimerase	NA|217aa|up_0|NZ_CP006792.1_2522641_2523292_+	PRK00220, PRK00220, glycerol-3-phosphate 1-O-acyltransferase PlsY	NA|338aa|down_0|NZ_CP006792.1_2523537_2524551_-	PRK09604, PRK09604, tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex transferase subunit TsaD	NA|72aa|down_1|NZ_CP006792.1_2524956_2525172_+	PRK00270, rpsU, 30S ribosomal protein S21; Reviewed	NA|583aa|down_2|NZ_CP006792.1_2525307_2527056_+	PRK05667, dnaG, DNA primase; Validated	NA|613aa|down_3|NZ_CP006792.1_2527213_2529052_+	PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated	NA|152aa|down_4|NZ_CP006792.1_2530192_2530648_-	NA	NA|157aa|down_5|NZ_CP006792.1_2530674_2531145_-	NA	NA|461aa|down_6|NZ_CP006792.1_2531153_2532536_-	cd00085, HNHc, HNH nucleases; HNH endonuclease signature which is found in viral, prokaryotic, and eukaryotic proteins	NA|77aa|down_7|NZ_CP006792.1_2532668_2532899_-	NA	NA|311aa|down_8|NZ_CP006792.1_2534571_2535504_-	cd08472, PBP2_CrgA_like_3, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator CrgA-like, contains the type 2 periplasmic binding fold	NA|239aa|down_9|NZ_CP006792.1_2535665_2536382_+	COG3531, COG3531, Predicted protein-disulfide isomerase [Posttranslational modification, protein turnover, chaperones]
GCF_001188815.1_ASM118881v1	NZ_CP006792	Yersinia pestis 2944 chromosome, complete genome	5	3089251-3089362	5	CRISPRCasFinder	no		cas3,cas1,cas3f,cas8f,cas5f,cas7f,cas6f,DEDDh,DinG,csa3	Orphan	AAAGCACATTTGAGCAGCGAGTGAAG	26	0	0	NA	NA	NA	1	1	Orphan	cas3,cas1,cas3f,cas8f,cas5f,cas7f,cas6f,DEDDh,DinG,csa3	NA,NA	NA|268aa|up_9|NZ_CP006792.1_3072744_3073548_-	COG0810, TonB, Periplasmic protein TonB, links inner and outer membranes [Cell envelope biogenesis, outer membrane]	NA|443aa|up_8|NZ_CP006792.1_3073612_3074941_-	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|603aa|up_7|NZ_CP006792.1_3075016_3076825_-	COG4618, ArpD, ABC-type protease/lipase transport system, ATPase and permease components [General function prediction only]	NA|206aa|up_6|NZ_CP006792.1_3076990_3077608_-	pfam06438, HasA, Heme-binding protein A (HasA)	NA|831aa|up_5|NZ_CP006792.1_3077829_3080322_-	TIGR01785, Heme/hemopexin_utilization_protein_C, TonB-dependent heme/hemoglobin receptor family protein	NA|458aa|up_4|NZ_CP006792.1_3080920_3082294_-	PRK04833, PRK04833, argininosuccinate lyase; Provisional	NA|258aa|up_3|NZ_CP006792.1_3082460_3083234_-	cd04249, AAK_NAGK-NC, AAK_NAGK-NC: N-Acetyl-L-glutamate kinase - noncyclic (NAGK-NC) catalyzes the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of microbial arginine biosynthesis using the acetylated, noncyclic route of ornithine biosynthesis	NA|335aa|up_2|NZ_CP006792.1_3083402_3084407_-	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|388aa|up_1|NZ_CP006792.1_3084657_3085821_+	PRK05111, PRK05111, acetylornithine deacetylase; Provisional	NA|879aa|up_0|NZ_CP006792.1_3086194_3088831_+	PRK00009, PRK00009, phosphoenolpyruvate carboxylase; Reviewed	NA|452aa|down_0|NZ_CP006792.1_3089572_3090927_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|295aa|down_1|NZ_CP006792.1_3091112_3091997_-	PRK09432, metF, methylenetetrahydrofolate reductase	NA|812aa|down_2|NZ_CP006792.1_3092229_3094665_-	PRK09466, metL, bifunctional aspartate kinase II/homoserine dehydrogenase II; Provisional	NA|106aa|down_3|NZ_CP006792.1_3096204_3096522_+	PRK05264, PRK05264, met regulon transcriptional regulator MetJ	NA|152aa|down_4|NZ_CP006792.1_3096870_3097326_+	COG4682, COG4682, Predicted membrane protein [Function unknown]	NA|72aa|down_5|NZ_CP006792.1_3097868_3098084_-	PRK00019, rpmE, 50S ribosomal protein L31; Reviewed	NA|733aa|down_6|NZ_CP006792.1_3098326_3100525_+	PRK05580, PRK05580, primosome assembly protein PriA; Validated	NA|282aa|down_7|NZ_CP006792.1_3101972_3102818_+	PRK12757, PRK12757, cell division protein FtsN; Provisional	NA|175aa|down_8|NZ_CP006792.1_3102917_3103442_+	PRK05456, PRK05456, ATP-dependent protease subunit HslV	NA|444aa|down_9|NZ_CP006792.1_3103512_3104844_+	PRK05201, hslU, ATP-dependent protease ATPase subunit HslU
GCF_001188815.1_ASM118881v1	NZ_CP006792	Yersinia pestis 2944 chromosome, complete genome	6	4486675-4486945	3,6,3	PILER-CR,CRISPRCasFinder,CRT	no		cas3,cas1,cas3f,cas8f,cas5f,cas7f,cas6f,DEDDh,DinG,csa3	Orphan	GTTCACTGCCGCACAGGCAGCTTAGAAA,GTTCACTGCCGCACAGGCAGCTTAGAAA,GTTCACTGCCGCACAGGCAGCTTAGAAA	28,28,28	1	1	4486886-4486917	NZ_CP006792.1_1143829-1143798	I-F:I-F:I-F	3,4,4	4	Orphan	cas3,cas1,cas3f,cas8f,cas5f,cas7f,cas6f,DEDDh,DinG,csa3	NA|84aa|up_9|NZ_CP006792.1_4472375_4472627_+,NA|194aa|up_5|NZ_CP006792.1_4474537_4475119_-,NA	NA|84aa|up_9|NZ_CP006792.1_4472375_4472627_+	NA	NA|191aa|up_8|NZ_CP006792.1_4472650_4473223_+	TIGR03344, VI_effect_Hcp1, type VI secretion system effector, Hcp1 family	NA|134aa|up_7|NZ_CP006792.1_4473234_4473636_-	pfam06836, DUF1240, Protein of unknown function (DUF1240)	NA|134aa|up_6|NZ_CP006792.1_4473868_4474270_-	pfam06836, DUF1240, Protein of unknown function (DUF1240)	NA|194aa|up_5|NZ_CP006792.1_4474537_4475119_-	NA	NA|82aa|up_4|NZ_CP006792.1_4475791_4476037_+	pfam13693, HTH_35, Winged helix-turn-helix DNA-binding	NA|771aa|up_3|NZ_CP006792.1_4476249_4478562_-	TIGR02073, Includes:_Penicillin-insensitive_transglycosylase, penicillin-binding protein 1C	NA|153aa|up_2|NZ_CP006792.1_4478807_4479266_+	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|1993aa|up_1|NZ_CP006792.1_4479380_4485359_-	COG2373, COG2373, Large extracellular alpha-helical protein [General function prediction only]	NA|183aa|up_0|NZ_CP006792.1_4485765_4486314_-	COG3477, COG3477, Predicted periplasmic/secreted protein [Function unknown]	NA|285aa|down_0|NZ_CP006792.1_4486948_4487803_-	COG1737, RpiR, Transcriptional regulators [Transcription]	NA|509aa|down_1|NZ_CP006792.1_4488140_4489667_+	cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2	NA|642aa|down_2|NZ_CP006792.1_4489701_4491627_+	COG3962, COG3962, Acetolactate synthase [Amino acid transport and metabolism]	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
