assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000017565.1_ASM1756v1	NC_009719	Parvibaculum lavamentivorans DS-1, complete sequence	1	101247-104452	1,1,1,2,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas2,cas1,cas9,WYL	cas2,cas1,cas9,WYL,PD-DExK,DinG,cas3,csa3,DEDDh	Type II-B,,Type II-C,Type II-A	GCTGCGGATTGCGGCCGTCTCTCGATTTGCTACTCT,GCTGCGGATTGCGGCCGTCTCTCGATTTGCTACTCT,GCTGCGGATTGCGGCCGTCTCTCGATTTGCTACTCT,CGCTGCGGATTGCGGCCGTCTCTCGATTTGCTACTCT,GCTGCGGATTGCGGCCGTCTCTCGATTTGCTACTCT	36,36,36,37,36	0	0	NA	NA	NA:NA:NA:NA:NA	32,48,48,32,32	48	TypeII-B,,TypeII-C,TypeII-A	cas2,cas1,cas9,WYL,PD-DExK,DinG,cas3,csa3,DEDDh	NA,NA|83aa|down_4|NC_009719.1_110800_111049_+,NA|98aa|down_9|NC_009719.1_117697_117991_+	NA|666aa|up_9|NC_009719.1_88269_90267_+	pfam06980, DUF1302, Protein of unknown function (DUF1302)	NA|188aa|up_8|NC_009719.1_90358_90922_-	pfam14514, TetR_C_9, Transcriptional regulator, TetR, C-terminal	NA|369aa|up_7|NC_009719.1_91115_92222_+	cd08278, benzyl_alcohol_DH, Benzyl alcohol dehydrogenase	NA|415aa|up_6|NC_009719.1_92261_93506_+	PRK06025, PRK06025, acetyl-CoA C-acetyltransferase	NA|150aa|up_5|NC_009719.1_93538_93988_+	cd04776, HTH_GnyR, Helix-Turn-Helix DNA binding domain of the regulatory protein GnyR	NA|468aa|up_4|NC_009719.1_94038_95442_+	cd07106, ALDH_AldA-AAD23400, Streptomyces aureofaciens putative aldehyde dehydrogenase AldA (AAD23400)-like	NA|420aa|up_3|NC_009719.1_95539_96799_-	cd06198, FNR_like_3, NAD(P) binding domain of  ferredoxin reductase-like proteins catalyze electron transfer between an NAD(P)-binding sub-domain of the alpha/beta class and a discrete (usually N-terminal) domain, which varies in orientation with respect to the NAD(P) binding domain	NA|118aa|up_2|NC_009719.1_96882_97236_-	pfam04304, DUF454, Protein of unknown function (DUF454)	NA|531aa|up_1|NC_009719.1_97369_98962_-	PRK09275, PRK09275, bifunctional aspartate transaminase/aspartate 4-decarboxylase	NA|562aa|up_0|NC_009719.1_98979_100665_-	TIGR03802, Asp_Ala_antiprt, aspartate-alanine antiporter	cas2|102aa|down_0|NC_009719.1_104497_104803_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|312aa|down_1|NC_009719.1_104850_105786_-	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas9|1038aa|down_2|NC_009719.1_105794_108908_-	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	NA|369aa|down_3|NC_009719.1_109700_110807_+	pfam02486, Rep_trans, Replication initiation factor	NA|83aa|down_4|NC_009719.1_110800_111049_+	NA	NA|538aa|down_5|NC_009719.1_111208_112822_+	smart00857, Resolvase, Resolvase, N terminal domain	NA|167aa|down_6|NC_009719.1_113052_113553_+	pfam07310, PAS_5, PAS domain	NA|334aa|down_7|NC_009719.1_113554_114556_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|963aa|down_8|NC_009719.1_114738_117627_+	PRK05755, PRK05755, DNA polymerase I; Provisional	NA|98aa|down_9|NC_009719.1_117697_117991_+	NA
GCF_000017565.1_ASM1756v1	NC_009719	Parvibaculum lavamentivorans DS-1, complete sequence	2	483981-484083	2	CRISPRCasFinder	no	cas3	cas2,cas1,cas9,WYL,PD-DExK,DinG,cas3,csa3,DEDDh	Unclear	TGTCATCCCGGCGAAAGCCGGGACCCAT	28	0	0	NA	NA	NA	1	1	Unclear	cas2,cas1,cas9,WYL,PD-DExK,DinG,cas3,csa3,DEDDh	NA,NA|91aa|down_1|NC_009719.1_485386_485659_-	NA|146aa|up_9|NC_009719.1_475698_476136_-	COG2050, PaaI, HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis,    transport, and catabolism]	NA|161aa|up_8|NC_009719.1_476132_476615_-	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|156aa|up_7|NC_009719.1_476881_477349_+	pfam07310, PAS_5, PAS domain	NA|465aa|up_6|NC_009719.1_477464_478859_+	pfam00743, FMO-like, Flavin-binding monooxygenase-like	NA|217aa|up_5|NC_009719.1_479011_479662_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|345aa|up_4|NC_009719.1_479755_480790_+	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|117aa|up_3|NC_009719.1_480793_481144_+	PRK09272, PRK09272, hypothetical protein; Provisional	NA|192aa|up_2|NC_009719.1_481401_481977_+	COG2119, COG2119, Predicted membrane protein [Function unknown]	NA|293aa|up_1|NC_009719.1_481967_482846_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|325aa|up_0|NC_009719.1_482965_483940_+	cd08241, QOR1, Quinone oxidoreductase (QOR)	NA|294aa|down_0|NC_009719.1_484209_485091_-	cd05369, TER_DECR_SDR_a, Trans-2-enoyl-CoA reductase (TER) and 2,4-dienoyl-CoA reductase (DECR), atypical (a) SDR	NA|91aa|down_1|NC_009719.1_485386_485659_-	NA	NA|284aa|down_2|NC_009719.1_485704_486556_-	PRK10334, PRK10334, small-conductance mechanosensitive channel MscS	cas3|852aa|down_3|NC_009719.1_486775_489331_+	TIGR04121, ATP-dependent_helicase, DEXH box helicase, DNA ligase-associated	NA|239aa|down_4|NC_009719.1_489408_490125_+	TIGR04123, hypothetical_protein, metallophosphoesterase, DNA ligase-associated	NA|260aa|down_5|NC_009719.1_490295_491075_-	pfam02683, DsbD, Cytochrome C biogenesis protein transmembrane region	NA|266aa|down_6|NC_009719.1_491188_491986_-	pfam09608, Alph_Pro_TM, Putative transmembrane protein (Alph_Pro_TM)	NA|308aa|down_7|NC_009719.1_491982_492906_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|1013aa|down_8|NC_009719.1_493043_496082_-	sd00010, SLR, Sel1-like repeat	NA|361aa|down_9|NC_009719.1_496311_497394_-	cd13604, PBP2_TRAP_ketoacid_lactate_like, Substrate-binding domain of an alpha-keto acid binding Tripartite ATP-independent Periplasmic transporter and related proteins; the type 2 periplasmic-binding protein fold
GCF_000017565.1_ASM1756v1	NC_009719	Parvibaculum lavamentivorans DS-1, complete sequence	3	2451888-2452030	3	CRISPRCasFinder	no		cas2,cas1,cas9,WYL,PD-DExK,DinG,cas3,csa3,DEDDh	Orphan	GCGTCGCGCGTGCCGTGGCCGAT	23	0	0	NA	NA	NA	2	2	Orphan	cas2,cas1,cas9,WYL,PD-DExK,DinG,cas3,csa3,DEDDh	NA,NA|136aa|down_1|NC_009719.1_2452906_2453314_+,NA|85aa|down_4|NC_009719.1_2457974_2458229_-,NA|159aa|down_5|NC_009719.1_2458354_2458831_+	NA|84aa|up_9|NC_009719.1_2442065_2442317_-	TIGR01682, Molybdopterin_synthase_sulfur_carrier_subunit, molybdopterin converting factor, subunit 1, non-archaeal	NA|208aa|up_8|NC_009719.1_2442313_2442937_-	TIGR00560, pgsA, CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase	NA|645aa|up_7|NC_009719.1_2443011_2444946_-	PRK00558, uvrC, excinuclease ABC subunit UvrC	NA|136aa|up_6|NC_009719.1_2445320_2445728_-	smart00905, FolB, Dihydroneopterin aldolase	NA|258aa|up_5|NC_009719.1_2445738_2446512_-	PRK09134, PRK09134, SDR family oxidoreductase	NA|351aa|up_4|NC_009719.1_2446520_2447573_-	COG0530, ECM27, Ca2+/Na+ antiporter [Inorganic ion transport and metabolism]	NA|589aa|up_3|NC_009719.1_2447816_2449583_+	PRK03562, PRK03562, glutathione-regulated potassium-efflux system protein KefC; Provisional	NA|118aa|up_2|NC_009719.1_2449585_2449939_-	COG3450, COG3450, Predicted enzyme of the cupin superfamily [General function prediction only]	NA|407aa|up_1|NC_009719.1_2450069_2451290_+	COG5653, COG5653, Protein involved in cellulose biosynthesis (CelD) [Cell envelope biogenesis, outer membrane]	NA|185aa|up_0|NC_009719.1_2451293_2451848_-	PRK08309, PRK08309, short chain dehydrogenase; Provisional	NA|167aa|down_0|NC_009719.1_2452274_2452775_-	COG3791, COG3791, Uncharacterized conserved protein [Function unknown]	NA|136aa|down_1|NC_009719.1_2452906_2453314_+	NA	NA|459aa|down_2|NC_009719.1_2453502_2454879_+	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|741aa|down_3|NC_009719.1_2455576_2457799_-	PRK05298, PRK05298, excinuclease ABC subunit UvrB	NA|85aa|down_4|NC_009719.1_2457974_2458229_-	NA	NA|159aa|down_5|NC_009719.1_2458354_2458831_+	NA	NA|264aa|down_6|NC_009719.1_2458827_2459619_+	PRK08317, PRK08317, hypothetical protein; Provisional	NA|209aa|down_7|NC_009719.1_2459621_2460248_-	pfam14246, TetR_C_7, AefR-like transcriptional repressor, C-terminal region	NA|205aa|down_8|NC_009719.1_2460273_2460888_-	pfam14246, TetR_C_7, AefR-like transcriptional repressor, C-terminal region	NA|200aa|down_9|NC_009719.1_2461066_2461666_-	TIGR03784, marine_sortase, sortase, marine proteobacterial type
GCF_000017565.1_ASM1756v1	NC_009719	Parvibaculum lavamentivorans DS-1, complete sequence	4	3050634-3050891	4	PILER-CR	no		cas2,cas1,cas9,WYL,PD-DExK,DinG,cas3,csa3,DEDDh	Orphan	AAGCGGAAGCCCGCGGCCAAGAAGAAGCCGGCTGCGAAGAAGACCGCAGCCAAGAAG	57	0	0	NA	NA	NA	2	2	Orphan	cas2,cas1,cas9,WYL,PD-DExK,DinG,cas3,csa3,DEDDh	NA,NA|114aa|down_1|NC_009719.1_3051881_3052223_+	NA|434aa|up_9|NC_009719.1_3039272_3040574_-	PRK09059, PRK09059, dihydroorotase; Validated	NA|328aa|up_8|NC_009719.1_3040570_3041554_-	PRK00856, pyrB, aspartate carbamoyltransferase catalytic subunit	NA|564aa|up_7|NC_009719.1_3041666_3043358_+	PRK11561, PRK11561, isovaleryl CoA dehydrogenase; Provisional	NA|274aa|up_6|NC_009719.1_3043382_3044204_+	pfam04250, DUF429, Protein of unknown function (DUF429)	NA|343aa|up_5|NC_009719.1_3044241_3045270_-	COG3677, COG3677, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|305aa|up_4|NC_009719.1_3045371_3046286_-	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|183aa|up_3|NC_009719.1_3046310_3046859_-	PRK00109, PRK00109, Holliday junction resolvase RuvX	NA|96aa|up_2|NC_009719.1_3046989_3047277_+	PRK00034, gatC, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatC	NA|493aa|up_1|NC_009719.1_3047280_3048759_+	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|495aa|up_0|NC_009719.1_3048788_3050273_+	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB	NA|264aa|down_0|NC_009719.1_3050978_3051770_+	PRK13972, PRK13972, GSH-dependent disulfide bond oxidoreductase; Provisional	NA|114aa|down_1|NC_009719.1_3051881_3052223_+	NA	NA|211aa|down_2|NC_009719.1_3052347_3052980_+	PRK11752, PRK11752, putative S-transferase; Provisional	NA|90aa|down_3|NC_009719.1_3053156_3053426_+	sd00010, SLR, Sel1-like repeat	NA|297aa|down_4|NC_009719.1_3053532_3054423_+	COG1946, TesB, Acyl-CoA thioesterase [Lipid metabolism]	NA|205aa|down_5|NC_009719.1_3054620_3055235_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|298aa|down_6|NC_009719.1_3055290_3056184_+	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|486aa|down_7|NC_009719.1_3056206_3057664_-	pfam13347, MFS_2, MFS/sugar transport protein	NA|234aa|down_8|NC_009719.1_3057737_3058439_+	COG2860, COG2860, Predicted membrane protein [Function unknown]	NA|360aa|down_9|NC_009719.1_3058703_3059783_+	pfam00924, MS_channel, Mechanosensitive ion channel
