assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000020505.1_ASM2050v1	NC_011027	Chlorobaculum parvum NCIB 8327, complete genome	1	87152-88366	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas1,csx15,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx1,cas6	cas1,csx15,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx1,cas6,cas3,csa3,Cas9_archaeal,DEDDh	Type III-B,Type III-A,Type III-D,Type III-C	GTCGCAATACGCTAAAGCGTGCAATGAAATGAAAT,GTCGCAATACGCTAAAGCGTGCAATGAAATGAAAT,AATACGCTAAAGCGTGCAATGAAATGAAAT	35,35,30	0	0	NA	NA	NA:NA:NA	16,16,15	16	TypeIII-B,TypeIII-A,TypeIII-D,TypeIII-C	cas1,csx15,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx1,cas6,cas3,csa3,Cas9_archaeal,DEDDh	NA|98aa|up_0|NC_011027.1_86598_86892_+,NA|105aa|down_6|NC_011027.1_95071_95386_+	NA|344aa|up_9|NC_011027.1_74840_75872_-	pfam02254, TrkA_N, TrkA-N domain	NA|204aa|up_8|NC_011027.1_75908_76520_-	pfam09366, DUF1997, Protein of unknown function (DUF1997)	NA|358aa|up_7|NC_011027.1_76712_77786_-	PRK00591, prfA, peptide chain release factor 1; Validated	NA|454aa|up_6|NC_011027.1_77808_79170_-	TIGR00054, Putative_zinc_metalloprotease_slr1821, RIP metalloprotease RseP	NA|383aa|up_5|NC_011027.1_79336_80485_-	PRK05447, PRK05447, 1-deoxy-D-xylulose 5-phosphate reductoisomerase; Provisional	NA|93aa|up_4|NC_011027.1_80899_81178_-	PRK14445, PRK14445, acylphosphatase; Provisional	NA|704aa|up_3|NC_011027.1_81183_83295_-	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|275aa|up_2|NC_011027.1_83605_84430_+	PRK00125, pyrF, orotidine 5'-phosphate decarboxylase; Reviewed	NA|615aa|up_1|NC_011027.1_84521_86366_+	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|98aa|up_0|NC_011027.1_86598_86892_+	NA	cas1|240aa|down_0|NC_011027.1_88616_89336_-	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	csx15|193aa|down_1|NC_011027.1_89395_89974_+	cd09766, Csx15_I-U, CRISPR/Cas system-associated protein Csx15	cas10|798aa|down_2|NC_011027.1_90204_92598_+	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csm2gr11|158aa|down_3|NC_011027.1_92597_93071_+	pfam03750, Csm2_III-A, Csm2 Type III-A	csm3gr7|290aa|down_4|NC_011027.1_93086_93956_+	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm4gr5|367aa|down_5|NC_011027.1_93969_95070_+	pfam17953, Csm4_C, CRISPR Csm4 C-terminal domain	NA|105aa|down_6|NC_011027.1_95071_95386_+	NA	csm5gr7|521aa|down_7|NC_011027.1_95358_96921_+	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csx1|512aa|down_8|NC_011027.1_96917_98453_+	cd09747, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	cas6|264aa|down_9|NC_011027.1_98479_99271_+	cd09759, Cas6_I-A, CRISPR/Cas system-associated RAMP superfamily protein Cas6
GCF_000020505.1_ASM2050v1	NC_011027	Chlorobaculum parvum NCIB 8327, complete genome	2	331893-331991	2	CRISPRCasFinder	no		cas1,csx15,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx1,cas6,cas3,csa3,Cas9_archaeal,DEDDh	Orphan	GGCACTACGATAACCGGTACGACG	24	0	0	NA	NA	NA	1	1	Orphan	cas1,csx15,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx1,cas6,cas3,csa3,Cas9_archaeal,DEDDh	NA|133aa|up_0|NC_011027.1_330565_330964_+,NA|270aa|down_6|NC_011027.1_340197_341007_+	NA|397aa|up_9|NC_011027.1_321205_322396_+	COG1232, HemY, Protoporphyrinogen oxidase [Coenzyme metabolism]	NA|366aa|up_8|NC_011027.1_322403_323501_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|143aa|up_7|NC_011027.1_323669_324098_+	COG1047, SlpA, FKBP-type peptidyl-prolyl cis-trans isomerases 2 [Posttranslational modification, protein turnover, chaperones]	NA|502aa|up_6|NC_011027.1_324341_325847_+	cd10334, SLC6sbd_u1, uncharacterized bacterial and archaeal solute carrier 6 subfamily; solute-binding domain	NA|336aa|up_5|NC_011027.1_325965_326973_+	cd03408, SPFH_like_u1, Uncharacterized family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily	NA|491aa|up_4|NC_011027.1_326989_328462_+	smart00978, Tim44, Tim44 is an essential component of the machinery that mediates the translocation of nuclear-encoded proteins across the mitochondrial inner membrane	NA|285aa|up_3|NC_011027.1_328503_329358_+	pfam14257, DUF4349, Domain of unknown function (DUF4349)	NA|274aa|up_2|NC_011027.1_329339_330161_+	cd05327, retinol-DH_like_SDR_c_like, retinol dehydrogenase (retinol-DH), Light dependent Protochlorophyllide (Pchlide) OxidoReductase (LPOR) and related proteins, classical (c) SDRs	NA|86aa|up_1|NC_011027.1_330324_330582_-	pfam07277, SapC, SapC	NA|133aa|up_0|NC_011027.1_330565_330964_+	NA	NA|32aa|down_0|NC_011027.1_334649_334745_+	PRK06599, PRK06599, DNA topoisomerase I; Validated	NA|425aa|down_1|NC_011027.1_334781_336056_-	COG3259, FrhA, Coenzyme F420-reducing hydrogenase, alpha subunit [Energy production and conversion]	NA|256aa|down_2|NC_011027.1_336045_336813_-	COG1941, FrhG, Coenzyme F420-reducing hydrogenase, gamma subunit [Energy production and conversion]	NA|275aa|down_3|NC_011027.1_336822_337647_-	cd06221, sulfite_reductase_like, Anaerobic sulfite reductase contains an FAD and NADPH binding module with structural similarity to ferredoxin reductase and sequence similarity to dihydroorotate dehydrogenases	NA|360aa|down_4|NC_011027.1_337700_338780_-	TIGR02910, anaerobic_sulfite_reductase_subunit_A, sulfite reductase, subunit A	NA|413aa|down_5|NC_011027.1_338889_340128_-	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|270aa|down_6|NC_011027.1_340197_341007_+	NA	NA|269aa|down_7|NC_011027.1_341003_341810_+	cd18109, SpoU-like_RNA-MTase, SAM-dependent RNA methylase related to SpoU-TrmH	NA|350aa|down_8|NC_011027.1_341944_342994_+	TIGR01926, peroxid_rel, uncharacterized peroxidase-related enzyme	NA|278aa|down_9|NC_011027.1_343022_343856_-	cd00293, USP_Like, Usp: Universal stress protein family
GCF_000020505.1_ASM2050v1	NC_011027	Chlorobaculum parvum NCIB 8327, complete genome	3	356934-357033	3	CRISPRCasFinder	no		cas1,csx15,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx1,cas6,cas3,csa3,Cas9_archaeal,DEDDh	Orphan	AACCCGACGGACATTCGACTCTTACATTGCC	31	0	0	NA	NA	NA	1	1	Orphan	cas1,csx15,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx1,cas6,cas3,csa3,Cas9_archaeal,DEDDh	NA,NA|189aa|down_5|NC_011027.1_362246_362813_+,NA|102aa|down_9|NC_011027.1_366060_366366_+	NA|809aa|up_9|NC_011027.1_343872_346299_-	pfam13654, AAA_32, AAA domain	NA|251aa|up_8|NC_011027.1_346458_347211_-	COG0861, TerC, Membrane protein TerC, possibly involved in tellurium resistance [Inorganic ion transport and metabolism]	NA|463aa|up_7|NC_011027.1_347507_348896_+	cd07087, ALDH_F3-13-14_CALDH-like, ALDH subfamily: Coniferyl aldehyde dehydrogenase, ALDH families 3, 13, and 14, and other related proteins	NA|260aa|up_6|NC_011027.1_348900_349680_+	cd07205, Pat_PNPLA6_PNPLA7_NTE1_like, Patatin-like phospholipase domain containing protein 6, protein 7, and fungal NTE1	NA|156aa|up_5|NC_011027.1_349915_350383_+	cd04332, YbaK_like, YbaK-like	NA|377aa|up_4|NC_011027.1_350533_351664_+	cd03814, GT4-like, glycosyltransferase family 4 proteins	NA|357aa|up_3|NC_011027.1_351752_352823_-	cd06256, M14_ASTE_ASPA-like, Peptidase M14 Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA)-like; uncharacterized subgroup	NA|478aa|up_2|NC_011027.1_352810_354244_-	pfam04107, GCS2, Glutamate-cysteine ligase family 2(GCS2)	NA|306aa|up_1|NC_011027.1_354501_355419_+	PRK09599, PRK09599, NADP-dependent phosphogluconate dehydrogenase	NA|474aa|up_0|NC_011027.1_355415_356837_+	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|144aa|down_0|NC_011027.1_357120_357552_-	pfam01797, Y1_Tnp, Transposase IS200 like	NA|272aa|down_1|NC_011027.1_357738_358554_-	cd01400, 6PGL, 6PGL: 6-Phosphogluconolactonase (6PGL) subfamily; 6PGL catalyzes the second step of the oxidative phase of the pentose phosphate pathway, the hydrolyzation of 6-phosphoglucono-1,5-lactone (delta form) to 6-phosphogluconate	NA|675aa|down_2|NC_011027.1_358550_360575_-	COG0021, TktA, Transketolase [Carbohydrate transport and metabolism]	NA|218aa|down_3|NC_011027.1_360732_361386_+	pfam10042, DUF2278, Uncharacterized conserved protein (DUF2278)	NA|229aa|down_4|NC_011027.1_361527_362214_+	COG2968, COG2968, Uncharacterized conserved protein [Function unknown]	NA|189aa|down_5|NC_011027.1_362246_362813_+	NA	NA|209aa|down_6|NC_011027.1_363117_363744_+	pfam13645, YkuD_2, L,D-transpeptidase catalytic domain	NA|584aa|down_7|NC_011027.1_363760_365512_+	COG2989, COG2989, Uncharacterized protein conserved in bacteria [Function unknown]	NA|156aa|down_8|NC_011027.1_365521_365989_+	cd07819, SRPBCC_2, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|102aa|down_9|NC_011027.1_366060_366366_+	NA
GCF_000020505.1_ASM2050v1	NC_011027	Chlorobaculum parvum NCIB 8327, complete genome	4	2034621-2034757	4	CRISPRCasFinder	no		cas1,csx15,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx1,cas6,cas3,csa3,Cas9_archaeal,DEDDh	Orphan	TCTCCATGCACTGCGACACCTGG	23	0	0	NA	NA	NA	2	2	Orphan	cas1,csx15,cas10,csm2gr11,csm3gr7,csm4gr5,csm5gr7,csx1,cas6,cas3,csa3,Cas9_archaeal,DEDDh	NA|53aa|up_9|NC_011027.1_2025449_2025608_-,NA|157aa|up_1|NC_011027.1_2032868_2033339_+,NA	NA|53aa|up_9|NC_011027.1_2025449_2025608_-	NA	NA|118aa|up_8|NC_011027.1_2025585_2025939_-	pfam03413, PepSY, Peptidase propeptide and YPEB domain	NA|164aa|up_7|NC_011027.1_2026334_2026826_-	smart00748, HEPN, Higher Eukarytoes and Prokaryotes Nucleotide-binding domain	NA|86aa|up_6|NC_011027.1_2026865_2027123_-	pfam10387, DUF2442, Protein of unknown function (DUF2442)	NA|86aa|up_5|NC_011027.1_2027119_2027377_-	pfam13711, DUF4160, Domain of unknown function (DUF4160)	NA|459aa|up_4|NC_011027.1_2027642_2029019_+	PRK12391, PRK12391, TrpB-like pyridoxal phosphate-dependent enzyme	NA|230aa|up_3|NC_011027.1_2029173_2029863_-	COG3637, COG3637, Opacity protein and related surface antigens [Cell envelope biogenesis, outer membrane]	NA|850aa|up_2|NC_011027.1_2030119_2032669_-	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|157aa|up_1|NC_011027.1_2032868_2033339_+	NA	NA|280aa|up_0|NC_011027.1_2033411_2034251_-	cd05266, SDR_a4, atypical (a) SDRs, subgroup 4	NA|461aa|down_0|NC_011027.1_2034817_2036200_+	COG0641, AslB, Arylsulfatase regulator (Fe-S oxidoreductase) [General function prediction only]	NA|595aa|down_1|NC_011027.1_2036207_2037992_+	pfam02624, YcaO, YcaO cyclodehydratase, ATP-ad Mg2+-binding	NA|612aa|down_2|NC_011027.1_2037988_2039824_+	COG1032, COG1032, Fe-S oxidoreductase [Energy production and conversion]	NA|112aa|down_3|NC_011027.1_2039839_2040175_-	pfam03150, CCP_MauG, Di-haem cytochrome c peroxidase	NA|277aa|down_4|NC_011027.1_2040354_2041185_+	cd07713, DHPS-like_MBL-fold, Methanocaldococcus jannaschii dihydropteroate synthase, Thermoanaerobacter tengcongensis Tflp, and related proteins; MBL-fold metallo hydrolase domain	NA|71aa|down_5|NC_011027.1_2041218_2041431_-	pfam14375, Cys_rich_CWC, Cysteine-rich CWC	NA|484aa|down_6|NC_011027.1_2041450_2042902_-	cd07786, FGGY_EcGK_like, Escherichia coli glycerol kinase-like proteins; belongs to the FGGY family of carbohydrate kinases	NA|270aa|down_7|NC_011027.1_2043054_2043864_+	cd03265, ABC_DrrA, Daunorubicin/doxorubicin resistance ATP-binding protein	NA|282aa|down_8|NC_011027.1_2043799_2044645_+	TIGR01247, drrB, daunorubicin resistance ABC transporter membrane protein	NA|246aa|down_9|NC_011027.1_2044652_2045390_-	TIGR03915, putative_DNA_metabolism_protein, probable DNA metabolism protein
