assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_009688965.1_ASM968896v1	NZ_AP021875	Desulfosarcina widdelii strain PP31	1	1628064-1628200	1	CRISPRCasFinder	no		cas4,WYL,DinG,RT,DEDDh,cas3,csa3,Cas9_archaeal,PrimPol	Orphan	TTGTCCTTGTAATCGCCGACACCGTCGCCGTCACTGTCCAGCGGGCA	47	0	0	NA	NA	NA	1	1	Orphan	cas4,WYL,DinG,RT,DEDDh,cas3,csa3,Cas9_archaeal,PrimPol	NA|93aa|up_2|NZ_AP021875.1_1625541_1625820_-,NA|173aa|down_4|NZ_AP021875.1_1635507_1636026_+,NA|138aa|down_6|NZ_AP021875.1_1637766_1638180_+,NA|95aa|down_7|NZ_AP021875.1_1638279_1638564_+	NA|819aa|up_9|NZ_AP021875.1_1616145_1618602_+	COG0466, Lon, ATP-dependent Lon protease, bacterial type [Posttranslational modification, protein turnover, chaperones]	NA|575aa|up_8|NZ_AP021875.1_1618598_1620323_+	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]	NA|338aa|up_7|NZ_AP021875.1_1620265_1621279_+	pfam00582, Usp, Universal stress protein family	NA|56aa|up_6|NZ_AP021875.1_1621275_1621443_+	TIGR00320, Desulfoferrodoxin_homolog, desulfoferrodoxin	NA|161aa|up_5|NZ_AP021875.1_1621467_1621950_-	pfam04008, Adenosine_kin, Adenosine specific kinase	NA|600aa|up_4|NZ_AP021875.1_1622045_1623845_-	cd00400, Voltage_gated_ClC, CLC voltage-gated chloride channel	NA|439aa|up_3|NZ_AP021875.1_1623887_1625204_-	PRK08637, PRK08637, hypothetical protein; Provisional	NA|93aa|up_2|NZ_AP021875.1_1625541_1625820_-	NA	NA|273aa|up_1|NZ_AP021875.1_1625985_1626804_-	PRK10334, PRK10334, small-conductance mechanosensitive channel MscS	NA|184aa|up_0|NZ_AP021875.1_1626900_1627452_-	pfam11172, DUF2959, Protein of unknown function (DUF2959)	NA|329aa|down_0|NZ_AP021875.1_1629450_1630437_+	PRK10892, PRK10892, arabinose-5-phosphate isomerase KdsD	NA|444aa|down_1|NZ_AP021875.1_1630472_1631804_+	PRK06830, PRK06830, ATP-dependent 6-phosphofructokinase	NA|444aa|down_2|NZ_AP021875.1_1631820_1633152_-	PRK05749, PRK05749, 3-deoxy-D-manno-octulosonic-acid transferase; Reviewed	NA|665aa|down_3|NZ_AP021875.1_1633148_1635143_-	cd06339, PBP1_YraM_LppC_lipoprotein-like, periplasmic binding component of lipoprotein LppC, an immunodominant antigen	NA|173aa|down_4|NZ_AP021875.1_1635507_1636026_+	NA	NA|428aa|down_5|NZ_AP021875.1_1636185_1637469_+	pfam01116, F_bP_aldolase, Fructose-bisphosphate aldolase class-II	NA|138aa|down_6|NZ_AP021875.1_1637766_1638180_+	NA	NA|95aa|down_7|NZ_AP021875.1_1638279_1638564_+	NA	NA|272aa|down_8|NZ_AP021875.1_1638655_1639471_-	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|175aa|down_9|NZ_AP021875.1_1639455_1639980_-	pfam02674, Colicin_V, Colicin V production protein
GCF_009688965.1_ASM968896v1	NZ_AP021875	Desulfosarcina widdelii strain PP31	2	3413743-3413859	2	CRISPRCasFinder	no		cas4,WYL,DinG,RT,DEDDh,cas3,csa3,Cas9_archaeal,PrimPol	Orphan	AAAACCTGTATCACTCCGATACGGGTTT	28	0	0	NA	NA	NA	1	1	Orphan	cas4,WYL,DinG,RT,DEDDh,cas3,csa3,Cas9_archaeal,PrimPol	NA|142aa|up_7|NZ_AP021875.1_3409078_3409504_-,NA|76aa|up_6|NZ_AP021875.1_3409895_3410123_-,NA|173aa|up_5|NZ_AP021875.1_3410526_3411045_-,NA|201aa|up_4|NZ_AP021875.1_3411739_3412342_-,NA|111aa|up_3|NZ_AP021875.1_3412341_3412674_-,NA|61aa|up_2|NZ_AP021875.1_3412670_3412853_-,NA|130aa|up_0|NZ_AP021875.1_3413163_3413553_-,NA|387aa|down_2|NZ_AP021875.1_3417830_3418991_-,NA|257aa|down_3|NZ_AP021875.1_3418972_3419743_-,NA|77aa|down_4|NZ_AP021875.1_3419861_3420092_+,NA|177aa|down_6|NZ_AP021875.1_3420790_3421321_+,NA|107aa|down_8|NZ_AP021875.1_3422265_3422586_-	NA|256aa|up_9|NZ_AP021875.1_3406387_3407155_+	cd02696, MurNAc-LAA, N-acetylmuramoyl-L-alanine amidase or MurNAc-LAA (also known as peptidoglycan aminohydrolase, NAMLA amidase, NAMLAA, Amidase 3, and peptidoglycan amidase; EC 3	NA|515aa|up_8|NZ_AP021875.1_3407559_3409104_-	COG1078, COG1078, HD superfamily phosphohydrolases [General function prediction only]	NA|142aa|up_7|NZ_AP021875.1_3409078_3409504_-	NA	NA|76aa|up_6|NZ_AP021875.1_3409895_3410123_-	NA	NA|173aa|up_5|NZ_AP021875.1_3410526_3411045_-	NA	NA|201aa|up_4|NZ_AP021875.1_3411739_3412342_-	NA	NA|111aa|up_3|NZ_AP021875.1_3412341_3412674_-	NA	NA|61aa|up_2|NZ_AP021875.1_3412670_3412853_-	NA	NA|74aa|up_1|NZ_AP021875.1_3412945_3413167_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|130aa|up_0|NZ_AP021875.1_3413163_3413553_-	NA	NA|453aa|down_0|NZ_AP021875.1_3414621_3415980_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|366aa|down_1|NZ_AP021875.1_3416731_3417829_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|387aa|down_2|NZ_AP021875.1_3417830_3418991_-	NA	NA|257aa|down_3|NZ_AP021875.1_3418972_3419743_-	NA	NA|77aa|down_4|NZ_AP021875.1_3419861_3420092_+	NA	NA|145aa|down_5|NZ_AP021875.1_3420299_3420734_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|177aa|down_6|NZ_AP021875.1_3420790_3421321_+	NA	NA|161aa|down_7|NZ_AP021875.1_3421403_3421886_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|107aa|down_8|NZ_AP021875.1_3422265_3422586_-	NA	NA|485aa|down_9|NZ_AP021875.1_3422830_3424285_+	COG0469, PykF, Pyruvate kinase [Carbohydrate transport and metabolism]
GCF_009688965.1_ASM968896v1	NZ_AP021875	Desulfosarcina widdelii strain PP31	3	4892763-4892839	3	CRISPRCasFinder	no		cas4,WYL,DinG,RT,DEDDh,cas3,csa3,Cas9_archaeal,PrimPol	Orphan	ATATGACTCTCTATTCCATAGAGA	24	0	0	NA	NA	NA	1	1	Orphan	cas4,WYL,DinG,RT,DEDDh,cas3,csa3,Cas9_archaeal,PrimPol	NA|75aa|up_7|NZ_AP021875.1_4883604_4883829_-,NA|125aa|up_2|NZ_AP021875.1_4889874_4890249_+,NA|172aa|down_1|NZ_AP021875.1_4895419_4895935_-,NA|270aa|down_4|NZ_AP021875.1_4898158_4898968_+,NA|62aa|down_5|NZ_AP021875.1_4899049_4899235_-,NA|313aa|down_7|NZ_AP021875.1_4899830_4900769_+	NA|204aa|up_9|NZ_AP021875.1_4882058_4882670_+	pfam12118, SprA-related, SprA-related family	NA|302aa|up_8|NZ_AP021875.1_4882684_4883590_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|75aa|up_7|NZ_AP021875.1_4883604_4883829_-	NA	NA|898aa|up_6|NZ_AP021875.1_4883940_4886634_+	PRK05755, PRK05755, DNA polymerase I; Provisional	NA|331aa|up_5|NZ_AP021875.1_4886799_4887792_+	cd07023, S49_Sppa_N_C, Signal peptide peptidase A (SppA), a serine protease, has catalytic Ser-Lys dyad	NA|257aa|up_4|NZ_AP021875.1_4887807_4888578_-	pfam05013, FGase, N-formylglutamate amidohydrolase	NA|221aa|up_3|NZ_AP021875.1_4889208_4889871_+	TIGR00229, Sensor_protein_FixL, PAS domain S-box	NA|125aa|up_2|NZ_AP021875.1_4889874_4890249_+	NA	NA|280aa|up_1|NZ_AP021875.1_4890276_4891116_+	TIGR00229, Sensor_protein_FixL, PAS domain S-box	NA|205aa|up_0|NZ_AP021875.1_4891404_4892019_-	cd01192, INT_C_like_3, Uncharacterized site-specific tyrosine recombinase, C-terminal catalytic domain	NA|468aa|down_0|NZ_AP021875.1_4893045_4894449_-	pfam13701, DDE_Tnp_1_4, Transposase DDE domain group 1	NA|172aa|down_1|NZ_AP021875.1_4895419_4895935_-	NA	NA|80aa|down_2|NZ_AP021875.1_4896256_4896496_-	cd01184, INT_C_like_1, Uncharacterized site-specific tyrosine recombinase, C-terminal catalytic domain	NA|437aa|down_3|NZ_AP021875.1_4896492_4897803_-	cd01184, INT_C_like_1, Uncharacterized site-specific tyrosine recombinase, C-terminal catalytic domain	NA|270aa|down_4|NZ_AP021875.1_4898158_4898968_+	NA	NA|62aa|down_5|NZ_AP021875.1_4899049_4899235_-	NA	NA|135aa|down_6|NZ_AP021875.1_4899429_4899834_+	PRK13258, PRK13258, 7-cyano-7-deazaguanine reductase; Provisional	NA|313aa|down_7|NZ_AP021875.1_4899830_4900769_+	NA	NA|324aa|down_8|NZ_AP021875.1_4900791_4901763_-	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|295aa|down_9|NZ_AP021875.1_4901759_4902644_-	pfam08902, DUF1848, Domain of unknown function (DUF1848)
GCF_009688965.1_ASM968896v1	NZ_AP021875	Desulfosarcina widdelii strain PP31	4	5259494-5259645	1	PILER-CR	no		cas4,WYL,DinG,RT,DEDDh,cas3,csa3,Cas9_archaeal,PrimPol	Orphan	TCCTCGCGTCCATCTTTGCAATCA	24	0	0	NA	NA	NA	2	2	Orphan	cas4,WYL,DinG,RT,DEDDh,cas3,csa3,Cas9_archaeal,PrimPol	NA|62aa|up_4|NZ_AP021875.1_5254648_5254834_-,NA|523aa|up_3|NZ_AP021875.1_5254966_5256535_-,NA	NA|187aa|up_9|NZ_AP021875.1_5237820_5238381_+	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|1733aa|up_8|NZ_AP021875.1_5238638_5243837_+	COG1205, COG1205, Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster [General function prediction only]	NA|951aa|up_7|NZ_AP021875.1_5243833_5246686_+	cd18011, DEXDc_RapA, DEXH-box helicase domain of RapA	NA|1663aa|up_6|NZ_AP021875.1_5246682_5251671_+	NF033451, BREX_2_MTaseX, BREX-2 system adenine-specific DNA-methyltransferase PglX	NA|666aa|up_5|NZ_AP021875.1_5251691_5253689_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|62aa|up_4|NZ_AP021875.1_5254648_5254834_-	NA	NA|523aa|up_3|NZ_AP021875.1_5254966_5256535_-	NA	NA|194aa|up_2|NZ_AP021875.1_5256850_5257432_-	pfam01695, IstB_IS21, IstB-like ATP binding protein	NA|489aa|up_1|NZ_AP021875.1_5257590_5259057_-	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|128aa|up_0|NZ_AP021875.1_5259053_5259437_-	pfam13730, HTH_36, Helix-turn-helix domain	NA|128aa|down_0|NZ_AP021875.1_5260235_5260619_+	pfam13384, HTH_23, Homeodomain-like domain	NA|250aa|down_1|NZ_AP021875.1_5260656_5261406_+	pfam00665, rve, Integrase core domain	NA|390aa|down_2|NZ_AP021875.1_5261965_5263135_-	pfam14294, DUF4372, Domain of unknown function (DUF4372)	NA|468aa|down_3|NZ_AP021875.1_5263277_5264681_-	cd01483, E1_enzyme_family, Superfamily of activating enzymes (E1) of the ubiquitin-like proteins	NA|142aa|down_4|NZ_AP021875.1_5264687_5265113_-	pfam14462, Prok-E2_E, Prokaryotic E2 family E	NA|126aa|down_5|NZ_AP021875.1_5265820_5266198_+	PRK08099, PRK08099, multifunctional transcriptional regulator/nicotinamide-nucleotide adenylyltransferase/ribosylnicotinamide kinase NadR	NA|307aa|down_6|NZ_AP021875.1_5266181_5267102_+	pfam06114, Peptidase_M78, IrrE N-terminal-like domain	NA|440aa|down_7|NZ_AP021875.1_5267261_5268581_-	pfam13701, DDE_Tnp_1_4, Transposase DDE domain group 1	NA|169aa|down_8|NZ_AP021875.1_5268887_5269394_+	pfam10543, ORF6N, ORF6N domain	NA|369aa|down_9|NZ_AP021875.1_5269528_5270635_-	pfam13203, DUF2201_N, Putative metallopeptidase domain
