assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003428335.1_ASM342833v1	CP031704	Solimonas sp. K1W22B-7 chromosome, complete genome	1	1774115-1774220	1	CRISPRCasFinder	no		WYL,Cas9_archaeal,cas3,csa3,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DinG,DEDDh	Orphan	CGCCACCGATGGCGCCGCCGATGAT	25	0	0	NA	NA	NA	1	1	Orphan	WYL,Cas9_archaeal,cas3,csa3,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DinG,DEDDh	NA|303aa|up_8|CP031704.1_1768185_1769094_-,NA|142aa|up_7|CP031704.1_1769122_1769548_-,NA|151aa|up_6|CP031704.1_1769685_1770138_+,NA|63aa|up_1|CP031704.1_1773243_1773432_-,NA|139aa|up_0|CP031704.1_1773428_1773845_-,NA	NA|330aa|up_9|CP031704.1_1767170_1768160_-	cd05228, AR_FR_like_1_SDR_e, uncharacterized subgroup of aldehyde reductase and flavonoid reductase related proteins, extended (e) SDRs	NA|303aa|up_8|CP031704.1_1768185_1769094_-	NA	NA|142aa|up_7|CP031704.1_1769122_1769548_-	NA	NA|151aa|up_6|CP031704.1_1769685_1770138_+	NA	NA|90aa|up_5|CP031704.1_1770160_1770430_-	pfam05166, YcgL, YcgL domain	NA|529aa|up_4|CP031704.1_1770456_1772043_-	COG2989, COG2989, Uncharacterized protein conserved in bacteria [Function unknown]	NA|243aa|up_3|CP031704.1_1772144_1772873_+	pfam13645, YkuD_2, L,D-transpeptidase catalytic domain	NA|112aa|up_2|CP031704.1_1772869_1773205_-	cd02214, cupin_MJ1618, Methanocaldococcus jannaschii MJ1618 and related proteins, cupin domain	NA|63aa|up_1|CP031704.1_1773243_1773432_-	NA	NA|139aa|up_0|CP031704.1_1773428_1773845_-	NA	NA|412aa|down_0|CP031704.1_1774374_1775610_-	TIGR01976, am_tr_V_VC1184, cysteine desulfurase family protein, VC1184 subfamily	NA|827aa|down_1|CP031704.1_1775676_1778157_+	pfam07995, GSDH, Glucose / Sorbosone dehydrogenase	NA|71aa|down_2|CP031704.1_1778181_1778394_-	cd16226, EFh_CREC_Calumenin_like, EF-hand, calcium binding motif, found in calumenin, reticulocalbin-1 (RCN-1), reticulocalbin-3 (RCN-3), and similar proteins	NA|386aa|down_3|CP031704.1_1778726_1779884_+	pfam14903, WG_beta_rep, WG containing repeat	NA|294aa|down_4|CP031704.1_1779965_1780847_-	cd08973, BaFpgNei_N_1, Uncharacterized bacterial subgroup of the N-terminal domain of Fpg (formamidopyrimidine-DNA glycosylase, MutM)_Nei  (endonuclease VIII) base-excision repair DNA glycosylases	NA|243aa|down_5|CP031704.1_1781010_1781739_+	cd07983, LPLAT_DUF374-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: DUF374	NA|277aa|down_6|CP031704.1_1781710_1782541_-	cd05327, retinol-DH_like_SDR_c_like, retinol dehydrogenase (retinol-DH), Light dependent Protochlorophyllide (Pchlide) OxidoReductase (LPOR) and related proteins, classical (c) SDRs	NA|309aa|down_7|CP031704.1_1782537_1783464_-	cd07729, AHL_lactonase_MBL-fold, quorum-quenching N-acyl-homoserine lactonase, MBL-fold metallo-hydrolase domain	NA|348aa|down_8|CP031704.1_1783679_1784723_+	cd03469, Rieske_RO_Alpha_N, Rieske non-heme iron oxygenase (RO) family, N-terminal Rieske domain of the oxygenase alpha subunit; The RO family comprise a large class of aromatic ring-hydroxylating dioxygenases found predominantly in microorganisms	NA|237aa|down_9|CP031704.1_1784908_1785619_+	pfam14224, DUF4331, Domain of unknown function (DUF4331)
GCA_003428335.1_ASM342833v1	CP031704	Solimonas sp. K1W22B-7 chromosome, complete genome	2	1935524-1946043	1,2,1,2	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	WYL,Cas9_archaeal,cas3,csa3,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DinG,DEDDh	Type I-E	GTGTTCCCCGCGCACGCGGGGATGAACCG,GTGTTCCCCGCGCACGCGGGGATGAACCG,GTGTTCCCCGCGCACGCGGGGATGAACCGN,GTGTTCCCCGCGCACGCGGGGATGAACCG	29,29,30,29	0	0	NA	NA	I-E:I-E:I-E:I-E	102,172,172,102	172	TypeI-E	WYL,Cas9_archaeal,cas3,csa3,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DinG,DEDDh	NA,NA|141aa|down_2|CP031704.1_1947021_1947444_+	cas3|895aa|up_9|CP031704.1_1926047_1928732_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|512aa|up_8|CP031704.1_1928728_1930264_+	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cse2gr11|172aa|up_7|CP031704.1_1930260_1930776_+	PRK13921, PRK13921, CRISPR-associated Cse2 family protein; Provisional	cas7|385aa|up_6|CP031704.1_1930772_1931927_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|232aa|up_5|CP031704.1_1931931_1932627_+	pfam09704, Cas_Cas5d, CRISPR-associated protein (Cas_Cas5)	cas6e|244aa|up_4|CP031704.1_1932613_1933345_+	cd09727, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	NA|118aa|up_3|CP031704.1_1933344_1933698_+	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|168aa|up_2|CP031704.1_1933681_1934185_+	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	cas1|305aa|up_1|CP031704.1_1934184_1935099_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|107aa|up_0|CP031704.1_1935103_1935424_+	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	NA|144aa|down_0|CP031704.1_1946241_1946673_+	pfam08327, AHSA1, Activator of Hsp90 ATPase homolog 1-like protein	NA|86aa|down_1|CP031704.1_1946763_1947021_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|141aa|down_2|CP031704.1_1947021_1947444_+	NA	NA|152aa|down_3|CP031704.1_1947526_1947982_-	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|419aa|down_4|CP031704.1_1948005_1949262_-	PRK00711, PRK00711, D-amino acid dehydrogenase	NA|337aa|down_5|CP031704.1_1949323_1950334_+	PRK09293, PRK09293, class 1 fructose-bisphosphatase	NA|162aa|down_6|CP031704.1_1950398_1950884_-	sd00006, TPR, Tetratricopeptide repeat	NA|311aa|down_7|CP031704.1_1950895_1951828_-	PRK14299, PRK14299, chaperone protein DnaJ; Provisional	NA|145aa|down_8|CP031704.1_1951947_1952382_-	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|612aa|down_9|CP031704.1_1952439_1954275_-	COG2804, PulE, Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB [Cell motility and secretion / Intracellular trafficking and secretion]
GCA_003428335.1_ASM342833v1	CP031704	Solimonas sp. K1W22B-7 chromosome, complete genome	3	4009556-4009629	3	CRISPRCasFinder	no		WYL,Cas9_archaeal,cas3,csa3,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DinG,DEDDh	Orphan	ACAGGCGGGTCAAGGACCCGCCC	23	0	0	NA	NA	NA	1	1	Orphan	WYL,Cas9_archaeal,cas3,csa3,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DinG,DEDDh	NA|122aa|up_9|CP031704.1_4003085_4003451_-,NA|109aa|up_8|CP031704.1_4003542_4003869_-,NA|73aa|up_7|CP031704.1_4003877_4004096_-,NA|88aa|up_3|CP031704.1_4007718_4007982_-,NA|119aa|up_1|CP031704.1_4008534_4008891_-,NA|170aa|up_0|CP031704.1_4008952_4009462_-,NA|158aa|down_0|CP031704.1_4009637_4010111_-,NA|236aa|down_1|CP031704.1_4010205_4010913_-,NA|60aa|down_2|CP031704.1_4011024_4011204_-,NA|128aa|down_3|CP031704.1_4011481_4011865_-,NA|78aa|down_5|CP031704.1_4013307_4013541_-,NA|673aa|down_6|CP031704.1_4013638_4015657_-,NA|182aa|down_9|CP031704.1_4017825_4018371_-	NA|122aa|up_9|CP031704.1_4003085_4003451_-	NA	NA|109aa|up_8|CP031704.1_4003542_4003869_-	NA	NA|73aa|up_7|CP031704.1_4003877_4004096_-	NA	NA|110aa|up_6|CP031704.1_4004333_4004663_-	pfam15567, Imm35, Immunity protein 35	NA|556aa|up_5|CP031704.1_4004641_4006309_-	pfam15644, Gln_amidase, Papain fold toxin 1, glutamine deamidase	NA|341aa|up_4|CP031704.1_4006584_4007607_-	pfam00891, Methyltransf_2, O-methyltransferase	NA|88aa|up_3|CP031704.1_4007718_4007982_-	NA	NA|130aa|up_2|CP031704.1_4008026_4008416_-	pfam12680, SnoaL_2, SnoaL-like domain	NA|119aa|up_1|CP031704.1_4008534_4008891_-	NA	NA|170aa|up_0|CP031704.1_4008952_4009462_-	NA	NA|158aa|down_0|CP031704.1_4009637_4010111_-	NA	NA|236aa|down_1|CP031704.1_4010205_4010913_-	NA	NA|60aa|down_2|CP031704.1_4011024_4011204_-	NA	NA|128aa|down_3|CP031704.1_4011481_4011865_-	NA	NA|474aa|down_4|CP031704.1_4011861_4013283_-	cd16902, pesticin_lyz, lysozyme-like C-terminal domain of pesticin	NA|78aa|down_5|CP031704.1_4013307_4013541_-	NA	NA|673aa|down_6|CP031704.1_4013638_4015657_-	NA	NA|498aa|down_7|CP031704.1_4015815_4017309_-	PHA02533, 17, large terminase protein; Provisional	NA|136aa|down_8|CP031704.1_4017292_4017700_-	pfam03592, Terminase_2, Terminase small subunit	NA|182aa|down_9|CP031704.1_4017825_4018371_-	NA
GCA_003428335.1_ASM342833v1	CP031704	Solimonas sp. K1W22B-7 chromosome, complete genome	4	4018582-4018665	4	CRISPRCasFinder	no		WYL,Cas9_archaeal,cas3,csa3,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DinG,DEDDh	Orphan	AAAGCCCGGCGCGGGGCCGGGCT	23	0	0	NA	NA	NA	1	1	Orphan	WYL,Cas9_archaeal,cas3,csa3,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DinG,DEDDh	NA|236aa|up_9|CP031704.1_4010205_4010913_-,NA|60aa|up_8|CP031704.1_4011024_4011204_-,NA|128aa|up_7|CP031704.1_4011481_4011865_-,NA|78aa|up_5|CP031704.1_4013307_4013541_-,NA|673aa|up_4|CP031704.1_4013638_4015657_-,NA|182aa|up_1|CP031704.1_4017825_4018371_-,NA|65aa|up_0|CP031704.1_4018367_4018562_-,NA|150aa|down_3|CP031704.1_4021602_4022052_+,NA|87aa|down_5|CP031704.1_4022632_4022893_-,NA|134aa|down_7|CP031704.1_4023714_4024116_-,NA|145aa|down_8|CP031704.1_4024242_4024677_-	NA|236aa|up_9|CP031704.1_4010205_4010913_-	NA	NA|60aa|up_8|CP031704.1_4011024_4011204_-	NA	NA|128aa|up_7|CP031704.1_4011481_4011865_-	NA	NA|474aa|up_6|CP031704.1_4011861_4013283_-	cd16902, pesticin_lyz, lysozyme-like C-terminal domain of pesticin	NA|78aa|up_5|CP031704.1_4013307_4013541_-	NA	NA|673aa|up_4|CP031704.1_4013638_4015657_-	NA	NA|498aa|up_3|CP031704.1_4015815_4017309_-	PHA02533, 17, large terminase protein; Provisional	NA|136aa|up_2|CP031704.1_4017292_4017700_-	pfam03592, Terminase_2, Terminase small subunit	NA|182aa|up_1|CP031704.1_4017825_4018371_-	NA	NA|65aa|up_0|CP031704.1_4018367_4018562_-	NA	NA|309aa|down_0|CP031704.1_4018760_4019687_-	pfam14567, SUKH_5, SMI1-KNR4 cell-wall	NA|189aa|down_1|CP031704.1_4019711_4020278_-	cd04458, CSP_CDS, Cold-Shock Protein (CSP) contains an S1-like cold-shock domain (CSD) that is found in eukaryotes, prokaryotes, and archaea	NA|400aa|down_2|CP031704.1_4020288_4021488_-	pfam14567, SUKH_5, SMI1-KNR4 cell-wall	NA|150aa|down_3|CP031704.1_4021602_4022052_+	NA	NA|126aa|down_4|CP031704.1_4022173_4022551_-	pfam14119, DUF4288, Domain of unknown function (DUF4288)	NA|87aa|down_5|CP031704.1_4022632_4022893_-	NA	NA|177aa|down_6|CP031704.1_4023047_4023578_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|134aa|down_7|CP031704.1_4023714_4024116_-	NA	NA|145aa|down_8|CP031704.1_4024242_4024677_-	NA	NA|171aa|down_9|CP031704.1_4025225_4025738_-	pfam14317, YcxB, YcxB-like protein
GCA_003428335.1_ASM342833v1	CP031704	Solimonas sp. K1W22B-7 chromosome, complete genome	5	4629877-4629990	5	CRISPRCasFinder	no		WYL,Cas9_archaeal,cas3,csa3,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DinG,DEDDh	Orphan	GCGCACCCTACGCCTGACAAAAGGACA	27	0	0	NA	NA	NA	1	1	Orphan	WYL,Cas9_archaeal,cas3,csa3,RT,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DinG,DEDDh	NA,NA|308aa|down_2|CP031704.1_4633988_4634912_+	NA|349aa|up_9|CP031704.1_4619574_4620621_+	cd05283, CAD1, Cinnamyl alcohol dehydrogenases (CAD)	NA|125aa|up_8|CP031704.1_4620730_4621105_-	pfam04241, DUF423, Protein of unknown function (DUF423)	NA|393aa|up_7|CP031704.1_4621101_4622280_-	COG3268, COG3268, Uncharacterized conserved protein [Function unknown]	NA|148aa|up_6|CP031704.1_4622321_4622765_-	pfam07386, DUF1499, Protein of unknown function (DUF1499)	NA|203aa|up_5|CP031704.1_4622733_4623342_-	pfam11006, DUF2845, Protein of unknown function (DUF2845)	NA|697aa|up_4|CP031704.1_4623367_4625458_+	COG4232, COG4232, Thiol:disulfide interchange protein [Posttranslational modification, protein turnover, chaperones / Energy production and conversion]	NA|200aa|up_3|CP031704.1_4625490_4626090_+	cd02969, PRX_like1, Peroxiredoxin (PRX)-like 1 family; hypothetical proteins that show sequence similarity to PRXs	NA|456aa|up_2|CP031704.1_4626234_4627602_-	cd07302, CHD, cyclase homology domain	NA|203aa|up_1|CP031704.1_4627961_4628570_+	COG5340, COG5340, Predicted transcriptional regulator [Transcription]	NA|323aa|up_0|CP031704.1_4628643_4629612_+	pfam02253, PLA1, Phospholipase A1	NA|505aa|down_0|CP031704.1_4630067_4631582_-	pfam13435, Cytochrome_C554, Cytochrome c554 and c-prime	NA|797aa|down_1|CP031704.1_4631594_4633985_+	cd16367, DMSOR_beta_like, uncharacterized subfamily of DMSO Reductase beta subunit family	NA|308aa|down_2|CP031704.1_4633988_4634912_+	NA	NA|601aa|down_3|CP031704.1_4634918_4636721_+	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|647aa|down_4|CP031704.1_4636717_4638658_+	pfam03928, Haem_degrading, Haem-degrading	NA|873aa|down_5|CP031704.1_4638654_4641273_+	COG3170, FimV, Tfp pilus assembly protein FimV [Cell motility and secretion / Intracellular trafficking and secretion]	NA|1270aa|down_6|CP031704.1_4641275_4645085_+	sd00042, LVIVD, LVIVD repeat	NA|213aa|down_7|CP031704.1_4645452_4646091_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|479aa|down_8|CP031704.1_4646123_4647560_+	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|351aa|down_9|CP031704.1_4647565_4648618_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit
